This job posting expired on April 1, 2026.
Job Description
Lead and collaborate with Engineering teams to build an SRE culture and maintain development, staging, and production systems. Manage infrastructure reliability, scalability, and security using SRE principles, including setting up SLOs, tracking error budgets, and conducting Production Readiness Reviews (PRRs).
Responsibilities
- Lead the build-out of an SRE culture and maintain development and production systems
- Automate infrastructure provisioning using Terraform
- Proactively monitor and optimize infrastructure costs
- Maintain API gateways, web servers, and CI/CD pipelines
- Participate in on-call rotations and resolve incidents
- Hire, mentor, and coach junior team members
Qualifications
- BS/MS in Computer Science, IT, or a related technical field
- 6+ years of relevant experience
- Expert programming and scripting skills (Bash and other shell scripting)
- Experience with established cloud platforms such as AWS, Azure, or Google Cloud
- Experience with Kubernetes platforms such as EKS