Site Reliability Engineer Sre
Job Description
Role: Site Reliability Engineer SRE - with strong Datadog observability experience
Location: Glen Allen - Virginia
Job description:
We are seeking Site Reliability Engineers SREs with strong Datadog observability experience to help build and scale a single pane of glass monitoring and observability platform across applications and infrastructure This role will focus on designing actionable dashboards APM synthetic monitoring and ing standards while driving observability as code and automation in partnership with the CloudOps team
The ideal candidate has hands on experience with Datadog and enjoys combining engineering discipline automation and reliability practices to improve system visibility and operational outcomes
Key Responsibilities
Datadog Observability Engineering
Design build and maintain Datadog dashboards for business application and infrastructure visibility single pane of glass
Implement and manage Datadog APM including service maps dependency tracing latencyerror analysis and performance baselines
Configure synthetic monitoring API browser tests to validate availability user journeys SSLDNS health and external dependencies
Create standardized monitors s and SLOs aligned with SRE best practices signal over noise actionable s
Observability Automation IaC
Similar Jobs
Site Reliability Engineer (Sre)
Texas
Site Reliability Engineer
Remote
Site Reliability Engineer (Sre) Architect
Remote
Site Reliability Engineer (Sre)
AZ
Site Reliability Engineer (Sre) Vulnerability Management
Washington