
Mainframe Site Reliability Engineering
Ardor IT SystemsContract
Required Skillset:
Pythonperformance tuningproduction supportDB2capacity planningJCLCICSMQmajor incident managementIMSControl-MIBM z/OSREXXJES2/JES3SMFSDSFCLISTOMEGAMONCA/BMC toolsmonitoring and scheduling tools
Job Description
Role Summary: The Mainframe SRE Lead is responsible for the reliability, availability, performance, and stability of enterprise mainframe systems. This role combines traditional mainframe engineering with modern SRE practices, focusing on automation, monitoring, incident management, and continuous improvement. The lead will guide a team and work closely with application, infrastructure, and operations teams.
Key Responsibilities
- Lead and mentor the Mainframe SRE team and provide technical guidance
- Ensure high availability, reliability, and performance of mainframe platforms (z/OS)
- Define and implement SRE practices including SLIs, SLOs, SLAs, and error budgets
- Drive automation to reduce manual work and improve system stability and recovery
- Manage monitoring, alerting, and observability for mainframe systems
- Lead incident management, root cause analysis, and post-incident reviews
- Partner with application teams to improve performance and deployment reliability
Required Qualifications
- 10+ years of experience in mainframe systems engineering or operations
- Strong hands-on experience with IBM z/OS
- Experience with core mainframe technologies:
- CICS, IMS, DB2
- JES2/JES3, MQ
- SMF, SDSF
- Strong knowledge of performance tuning and capacity planning
- Experience leading production support and major incident management
- Automation and scripting skills (REXX, JCL, CLIST, Python, or similar)
- Experience with monitoring and scheduling tools (OMEGAMON, CA/BMC tools, Control-M)
Preferred Qualifications
- Experience applying SRE principles in mainframe or hybrid environments
- Exposure to DevOps and CI/CD practices
- Knowledge of Linux on Z and cloud integration
- Experience with resilience engineering or fault-tolerant systems
- Prior experience as a technical lead or people manager
Similar Jobs
Site Reliability Engineer
California
Feb 17th, 2026
Senior Mainframe Site Reliability Engineer Application Support
Ohio
Feb 17th, 2026
Azure (Sre) Site Reliability Engineer
DC
Feb 11th, 2026
Site Reliability Engineer
Remote
Feb 11th, 2026
Site Reliability Engineer
New Jersey
Feb 3rd, 2026