This job posting expired on April 1, 2026.
Senior Site Reliability Engineer I - Observability
Job Description
Join the Infrastructure Observability team to build and manage a world-class observability ecosystem. Design and architect solutions with a focus on automation, testability, and reliability.
Responsibilities
- Develop and maintain a distributed monitoring ecosystem
- Design observability solutions with a focus on automation
- Coach and mentor colleagues
- Facilitate collaboration to solve observability challenges
- Maintain open-source and in-house monitoring applications
- Build dashboards and metrics for data-driven problem resolution
Qualifications
- 5+ years of experience with monitoring systems such as Prometheus, New Relic, or Dynatrace
- Strong experience in Go, Python, Java, or Bash
- Experience with Kubernetes and cloud infrastructure (AWS)
- Experience with Terraform