This job posting has expired

Expired on April 4, 2026

Site Reliability Engineer

On-siteFull-timeOn-site
GCPGKENode.jsMongoDB AtlasKubernetesPub/SubObservability

Job Description

We need a Lead Site Reliability Engineer for our low-code platform preparing to scale to 3,000,000 concurrent users. The role involves transforming a synchronous system into a high-concurrency asynchronous engine using GKE, MongoDB Atlas, and Pub/Sub.

Responsibilities

  • Transition synchronous API flows to Google Cloud Pub/Sub
  • Configure Subscriber-side Flow Control and Kubernetes HPA
  • Isolate heavy Puppeteer/Chrome workloads
  • Build observability using Cloud Monitoring
  • Optimize container footprints using VPA

Qualifications

  • GCP Mastery (GKE, Pub/Sub, Cloud Run)
  • Advanced Node.js event loop management
  • MongoDB Atlas at scale (M60/M80 tiers)

Job Information

Posted

February 3, 2026

Experience Level

lead

Status

Expired