
Sr. DevOps Engineer
Job Description
Job Title : Sr. DevOps Engineer
Location :Westlake Village TX(Onsite position)(2 Positions)
Responsibilities:
The Sr DevOps Engineer - AI platform will:
• Design, implement, and manage scalable and resilient infrastructure on AWS.
• Architect and maintain Windows/Linux based environments, ensuring seamless integration with cloud platforms.
• Develop and maintain infrastructure-as-code(IaC) using both AWS Cloudformation/CDK and Terraform/OpenTofu.
• Develop and maintain Configuration Management for Windows & Linux servers using Chef.
• Design, build, and optimize CI/CD pipelines using GitLab CI/CD for .NET applications.
• Integrate and support AI services, including orchestration with AWS Bedrock, Google Agentspace, and other generative AI frameworks, ensuring they can be securely and efficiently consumed by platform services.
• Enable AI/ML workflows by building and optimizing infrastructure pipelines that support large-scale model training, inference, and deployment across AWS and GCP environments.
• Automate model lifecycle management (training, deployment, monitoring) through CI/CD pipelines, ensuring reproducibility and seamless integration with development workflows.
• Collaborate with AI engineering teams to deliver scalable environments, standardized APIs, and infrastructure that accelerate AI adoption at the platform level.
• Implement observability, security, data privacy and cost-optimization strategies specifically for AI workloads, including monitoring and resource scaling for inference services.
• Implement and enforce security best practices across the infrastructure and deployment processes.
• Collaborate closely with development teams to understand their needs and provide DevOps expertise.
• Troubleshoot and resolve infrastructure and application deployment issues.
• Implement and manage monitoring and logging solutions to ensure system visibility and proactive issue detection.
• Clearly and concisely contribute to the development and documentation of DevOps standards and best practices.
• Stay up-to-date with the latest industry trends and technologies in cloud computing, DevOps, and security.
• Provide mentorship and guidance to junior team members.
Qualifications:
• Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
• 5+ years of experience in a DevOps or Site Reliability Engineering (SRE) role.
• 1+ year(s) of experience with AI services & LLMs.
• Extensive hands-on experience with Amazon Web Services (AWS)
• Solid understanding of Windows/Linux Server administration and integration with cloud environments.
• Proven experience with infrastructure-as-code tools, specifically AWS CDK and Terraform.
• Strong experience designing and implementing CI/CD pipelines using GitLab CI/CD.
• Experience deploying and managing .NET applications in cloud environments.
Similar Jobs
Sr. DevOps Engineer
New Jersey
DevOps Engineer
GA
Senior DevOps Engineer
Washington
Senior Application Cloud Sre/DevOps Engineer
New York
DevOps Engineer
Texas