
Senior Solution Architect
Job Description
Senior Solution Architect – HPC, Cloud-Native Systems (ITAR-Restricted Role)
Position Overview
We are seeking an experienced Senior Solution Architect to lead the convergence of traditional High-Performance Computing (HPC) environments with modern cloud-native architectures. This position is designated as ITAR-restricted and requires candidates who are legally authorized to access and handle U.S. export-controlled technical data.
The architect will design, integrate, and optimize large-scale, containerized, hybrid HPC environments using technologies such as Docker, Mirantis, ELK Stack, and advanced batch schedulers. This role requires deep technical leadership, architectural vision, and hands-on experience supporting mission-critical computational workloads in secure, compliant environments.
Core Responsibilities
1. Architecture & Design
Architect end-to-end hybrid cloud solutions integrating Mirantis Container Cloud with dedicated HPC clusters.
Balance performance, elasticity, and compliance requirements across on-prem and cloud environments.
Produce architecture documentation that adheres to ITAR export-control requirements and review practices.
2. HPC Orchestration
Design and implement HPC job scheduling strategies using Slurm, Volcano, LAVA, or similar technologies.
Support deterministic resource allocation for AI/ML analytics, physics simulations, and scientific workloads.
Ensure schedulers meet ITAR-restricted workload isolation and audit requirements.
3. Optimization & Performance Tuning
Apply best practices for high-performance containerization including multi-stage builds, minimal base images, and CPU/GPU/memory tuning.
Implement strategies to minimize overhead, ensure stability, and eliminate noisy-neighbor issues.
4. Centralized Observability
Architect and operate an enterprise-grade ELK Stack (Elasticsearch, Logstash, Kibana) deployment tuned for HPC-scale environments.
Configure Index Lifecycle Management (ILM) policies to handle high log throughput while preserving traceability for compliance audits.
5. Full-Stack Automation
Build IaC-driven automation pipelines using Terraform, Ansible, and GitOps workflows.
Automate deployment of Mirantis Kubernetes Engine (MKE) and integrated HPC schedulers within ITAR-secured environments.
6. CI/CD Automation
Implement robust CI/CD workflows using Jenkins, GitLab CI, Argo Workflows, or similar tools.
Ensure pipelines comply with ITAR policies including artifact access control, secure registries, and encrypted transport.
7. Hybrid Integration
Architect integration between Kubernetes and traditional HPC schedulers.
Enable workloads that require high-speed interconnects such as InfiniBand with RDMA, or that run on GPU-accelerated clusters.
Required Technical Skills
Containers & Mirantis
Expertise in Docker Runtime, Mirantis Kubernetes Engine (MKE), and Lens Desktop.
Deep experience designing containerized workloads for HPC environments.
HPC Schedulers
Hands-on experience with Slurm, PBS, or Kubernetes-native batch schedulers such as Volcano.
Knowledge of hierarchical priority queues, fair scheduling, and resource fairness algorithms.
ELK Stack Mastery
Strong understanding of Logstash pipeline optimization, Elasticsearch shard strategies, and Kibana visualization design.
Performance
Experience with NVIDIA Enroot/Pyxis or equivalent technologies for near bare-metal container performance.
Security & Compliance
Experience implementing secure registry solutions, TLS encryption, RBAC, and identity-driven access controls.
Demonstrated experience supporting compliance frameworks including ITAR and NIST.
Cloud Platforms
Experience with AWS HPC environments including EKS, AWS Batch, FSx for Lustre, and GPU-accelerated EC2 instances.
Experience & Qualifications
10+ years in systems architecture or engineering roles.