Ardor IT Systems

Mainframe Site Reliability Engineering

Ardor IT SystemsContract
Ohio
7 - 9 YearsFeb 17th, 2026
96 ViewsBe an Early Applicant
Required Skillset:
Pythonperformance tuningproduction supportDB2capacity planningJCLCICSMQmajor incident managementIMSControl-MIBM z/OSREXXJES2/JES3SMFSDSFCLISTOMEGAMONCA/BMC toolsmonitoring and scheduling tools

Job Description

Role Summary: The Mainframe SRE Lead is responsible for the reliability, availability, performance, and stability of enterprise mainframe systems. This role combines traditional mainframe engineering with modern SRE practices, focusing on automation, monitoring, incident management, and continuous improvement. The lead will guide a team and work closely with application, infrastructure, and operations teams.

 

Key Responsibilities

  • Lead and mentor the Mainframe SRE team and provide technical guidance
  • Ensure high availability, reliability, and performance of mainframe platforms (z/OS)
  • Define and implement SRE practices including SLIs, SLOs, SLAs, and error budgets
  • Drive automation to reduce manual work and improve system stability and recovery
  • Manage monitoring, alerting, and observability for mainframe systems
  • Lead incident management, root cause analysis, and post-incident reviews
  • Partner with application teams to improve performance and deployment reliability

Required Qualifications

  • 10+ years of experience in mainframe systems engineering or operations
  • Strong hands-on experience with IBM z/OS
  • Experience with core mainframe technologies:
    • CICS, IMS, DB2
    • JES2/JES3, MQ
    • SMF, SDSF
  • Strong knowledge of performance tuning and capacity planning
  • Experience leading production support and major incident management
  • Automation and scripting skills (REXX, JCL, CLIST, Python, or similar)
  • Experience with monitoring and scheduling tools (OMEGAMON, CA/BMC tools, Control-M)

Preferred Qualifications

  • Experience applying SRE principles in mainframe or hybrid environments
  • Exposure to DevOps and CI/CD practices
  • Knowledge of Linux on Z and cloud integration
  • Experience with resilience engineering or fault-tolerant systems
  • Prior experience as a technical lead or people manager

Similar Jobs

Site Reliability Engineer

California

Feb 17th, 2026

Senior Mainframe Site Reliability Engineer Application Support

Ohio

Feb 17th, 2026

Azure (Sre) Site Reliability Engineer

DC

Feb 11th, 2026

Site Reliability Engineer

Remote

Feb 11th, 2026

Site Reliability Engineer

New Jersey

Feb 3rd, 2026