Senior Site Reliability Engineer (SRE)
Estuary
Software Engineering
United States · Remote
Posted on Mar 11, 2026
Estuary combines CDC, stream processing, and declarative config into a unified system for reliable, low-latency pipelines. We seek a Senior SRE to architect resilient multi-cloud infrastructure, mature incident operations, and build automation/tooling.The Role: Bridge backend engineering and operations; architect systems ensuring resilience, security, and performance at enterprise scale; lead improvements from postmortems and incident learnings.What You’ll Do:- Architect for Resilience: Design/manage multi-cloud (AWS, GCP, Azure) for high availability and security- Evolve Incident Operations: On-call rotations, blameless postmortems, lead hardening initiatives- Build Automation & Tooling: Internal tools and CI/CD to reduce toil- Master IaC: Pulumi and Kubernetes to manage infra lifecycle as versioned, tested software- Drive Observability: Prometheus, Grafana, OpenTelemetry for deep-stack monitoring/alerting- Collaborate & Mentor: Establish operational best practices across teamsWhat We’re Looking For:- 8+ years SRE/systems experience operating production-grade distributed systems- Deep Linux, networking (gRPC, TCP/IP), filesystems; strong Go (plus Python/Bash) for automation- Kubernetes expertise with stateful workloads; Pulumi/IaC mastery; process improvement mindset; clear communication under pressureBonus Points: Rust familiarity; CDC/Kafka/Flink background; multi-cloud mastery; startup experienceWhy Estuary? Competitive comp/equity/benefits; flexible remote work; high autonomy; quarterly offsites.