Site Reliability Engineer
Job Description
Title - Site Reliability Engineer (No OPT visas. Seeking GC, USC, and H1 transfer candidates.)
Plano, Texas (onsite)
Full-time
Experience – 5 to 7 years
Keywords:
kafka, redis, docker, grafana, terraform
Job Description:
We are seeking a Site Reliability Engineer (SRE) with 5 to 7 years of experience and strong to expert-level knowledge of the AWS ecosystem to support and operate highly available, scalable cloud platforms. The role requires hands-on expertise across core AWS services and related technologies, including Kafka, Redis, CloudWatch, Kubernetes (EKS), EC2, Secrets Manager, Route53, Lambda, RDS, DynamoDB, and AWS Transfer Family.
The candidate will be responsible for ensuring system reliability, performance, and observability in a production environment, with a strong emphasis on monitoring, automation, and infrastructure scalability. Deep experience with Grafana for metrics and dashboards, Terraform for infrastructure as code, and Docker-based containerization is required.
The ideal candidate is comfortable working in fast-paced production environments, proactively identifying reliability risks, and implementing automated solutions to improve uptime and operational efficiency. Experience with automated build and deployment pipelines (CI/CD) is highly desirable, as it enables continuous delivery and operational consistency across environments.