Cloud Platforms
Job Description
Required:
Experience in systems engineering, DevOps, or site reliability engineering roles
Strong experience with Linux/Unix systems and system internals
Proficiency in one or more programming/scripting languages (Python, Go, Java, Bash)
Experience designing and operating highly available, distributed systems
Strong knowledge of cloud platforms (AWS or GCP) and cloud-native services
Experience with containerization and orchestration (Docker, Kubernetes)
Strong understanding of monitoring, alerting, and logging concepts
Experience defining and managing SLIs, SLOs, and error budgets
Familiarity with incident management, root cause analysis (RCA), and postmortems
Experience integrating security and compliance into operational workflows
Familiarity with observability tools (Prometheus, Grafana, Application Insights, Datadog, Splunk)
Experience operating 24x7 production environments with on-call rotations
Experience with chaos engineering and resiliency testing
Experience with feature flags, canary deployments, and progressive delivery
Strong documentation skills for runbooks, dashboards, and operational standards