Infrastructure
Operations & Security
Active Anomalies
5
AI Recommendations
1
Recent Autonomous Actions
47 actions today
Facility Map
DC-Alpha - Hall A
Rack Layout
Impact Chain
CDU-12
83% flow
β
Racks 44-48
5 racks
β
127 GPUs
thermal risk
β
TJ-2847
at risk
Selected: CDU-12
Status
Degrading
Flow Rate
83%
Supply Temp
18.2Β°C
Return Temp
32.4Β°C
Racks Served
44-48
History
Review operational history, incidents, and autonomous actions
Recent Resolved Incidents
INC-2024-0847: CDU-12 Flow Degradation Cascade
INC-2024-0842: Network Congestion Event
INC-2024-0839: Power Distribution Imbalance
Audit Summary
Actions (24h)
47
Actions (7d)
312
Override Rate
2.1%
Configure
Configure control plane behavior, policies, and operational parameters
Autonomy Policies
12 active policies
Business Priorities
8 rules | 47 ranked workloads
Remediation Playbooks
34 active playbooks
Alert Configuration
156 alert rules
Topology Config
4,872 components
On-Call & Escalation
6 teams | 4 on-call