Responsibilities
- Assist in designing, deploying, and refining enterprise-scale observability systems
- Lead the strategic direction, development roadmap, and organizational adoption of monitoring platforms in alignment with business and IT objectives
- Promote industry best practices in observability and advance a culture centered on data-driven decision-making
- Deliver expert-level technical support for observability platforms and integration tools
- Formulate and implement comprehensive observability strategies across the enterprise
- Design and sustain observability architectures for both cloud and on-premises infrastructure
- Automate and integrate monitoring and related tooling, including AWS CloudWatch, Halo ITSM, Apex AIOps, OpenTelemetry, Prometheus, Grafana, Datadog, and Splunk
- Establish and monitor key performance indicators tied to business outcomes; report impact to stakeholders
- Empower engineering teams with self-service observability capabilities
- Manage and prioritize observability-related product backlogs
- Work cross-functionally with IT, security, application development, and business units
- Ensure observability practices comply with data governance and privacy regulations
- Iteratively enhance observability frameworks using feedback and performance insights