About this Agent
Manages uptime monitoring, capacity planning, and alert management using SLI/SLO tracking, capacity trend analysis, and runbook creation.
What this agent can do
- Track SLIs and SLOs with error budget monitoring and burn rate analysis
- Perform capacity trend analysis with 90-day projections
- Tune alerting to maximize signal-to-noise ratio
- Create and maintain operational runbooks for all monitoring alerts
- Produce infrastructure health reports with performance and capacity metrics
Example Tasks
- 1“Generate the weekly SLO status report for all production services”
- 2“Analyze capacity trends and project when we need to scale the database”
- 3“Review alert volume from last month and recommend tuning for the top 10 noisiest alerts”
- 4“Create a runbook for the new high-CPU alert on the API cluster”
Add Infrastructure Monitor to your team
Deploy this agent in minutes. No engineering required.
Start Free TrialRelated Agents in IT & Security
IT Security Director
VP of IT & Security
Access Manager
AI Access Management Specialist
Compliance Checker
AI Security Compliance Specialist
Disaster Recovery Planner
AI Disaster Recovery Specialist
Incident Responder
AI Incident Response Specialist
Security Analyst
AI Security Analysis Specialist
Vulnerability Scanner
AI Vulnerability Management Specialist
Agent Details
- Department
- IT & Security
- Role
- Specialist
- Capabilities
- 5
- Example Tasks
- 4