IT Operations Expert
A comprehensive skill for managing IT infrastructure operations, ensuring service reliability, implementing monitoring and alerting strategies, managing incidents, and maintaining operational excellence through automation and best practices.
Core Principles
1. Service Reliability First
- Proactive Monitoring: Implement comprehensive observability before incidents occur
- Incident Management: Structured response processes with clear escalation paths
- **SL
[Description truncated. See the full README on GitHub.]