Senior DevOps Engineer
Job Description
Job Summary:
We are looking for an experienced Site Reliability Engineer (SRE) to join our team and drive reliability, scalability, and performance across our production systems.
In this role, you will apply software engineering practices to infrastructure and operations, partnering closely with development teams to build resilient, observable, and automated platforms aligned with defined service level objectives (SLOs).
Key Responsibilities:
Ensure high availability, performance, and scalability of distributed systems
Analyze business requirements and design efficient technical solutions
Build and maintain automated, cloud-native infrastructure
Define and manage SLIs, SLOs, and error budgets
Lead incident management, root cause analysis (RCA), and postmortems
Implement monitoring, alerting, and observability best practices