MLOps Engineer — AI/ML Systems Deployment (TS/SCI Preferred)
Rackner
Job details
MLOps Engineer — AI/ML Systems Deployment
Location: Dayton, OH preferred
Work Arrangement: On-site preferred; remote may be considered for highly aligned, clearance-ready candidates able to support secure / CAC-enabled environments and travel as needed
Clearance: Active TS/SCI strongly preferred; active Secret may be considered for upgrade
Requirement: U.S. citizenship required
Build and Deploy Real-World AI Systems
Rackner is hiring an MLOps Engineer to move AI/ML systems from prototype → deployment → operational use in a secure, mission-focused environment.
This is not a research role—this is where models become reliable, repeatable, auditable systems that run in real-world conditions.
This role is ideal for engineers who want to:
- Work across AI/ML, Kubernetes, infrastructure, and mission systems
- Own deployed systems, not just experiments
- Build high-demand MLOps expertise in secure and constrained environments
- Deliver technology that is used, trusted, and operational
You will help operationalize AI/ML capabilities where reliability, performance, and trust matter most.
What You’ll Do
Operationalize AI/ML Systems
- Deploy AI/ML models and ML-enabled applications into secure, real-world environments
- Move workflows from experimentation into containerized, repeatable deployment pipelines
- Support batch and real-time inference architectures
- Bridge model development, software engineering, and platform operations
Own the ML Lifecycle
- Build and operate production-grade ML pipelines
- Support model versioning, lineage, reproducibility, and lifecycle governance
- Work with tools such as MLflow, Kubeflow, Airflow, Argo, ClearML, or similar platforms
Build Cloud-Native ML Infrastructure
- Deploy and support Kubernetes-based ML workloads
- Containerize models, pipelines, and services using Docker or similar tools
- Support CI/CD, automation, and repeatable deployment patterns for AI/ML systems
Engineer for Reliability
- Monitor model and system performance after deployment
- Support observability using tools such as Prometheus, Grafana, OpenTelemetry, or similar
- Detect and resolve issues related to latency, reliability, drift, degradation, or resource usage
Support Secure and Constrained Environments
- Help deploy AI/ML systems in secure, CAC-enabled, or constrained environments
- Support limited compute, restricted data, degraded connectivity, and other operational constraints
- Optimize systems for reliability and usability beyond ideal lab conditions
Create Repeatable Systems
- Develop runbooks, deployment documentation, and operational playbooks
- Build systems that can be understood, maintained, and operated by others
What You Bring
Core Experience
- U.S. citizenship
- Background in deploying ML systems, AI-enabled applications, or production software
- Strong programming skills in Python
- Hands-on work with Docker, containers, or containerized deployment
- Familiarity with Kubernetes or cloud-native environments
- Understanding of CI/CD, automation, or pipeline-based delivery
- Clear communication of technical decisions, tradeoffs, and ownership
- Ability to operate in a CAC-enabled or secure environment
Preferred Qualifications
- Active TS/SCI clearance
- Active Secret clearance with eligibility for upgrade
- Familiarity with ML lifecycle tools such as MLflow, Kubeflow, Airflow, Argo, ClearML, or similar
- Background in model serving, inference APIs, or deploying ML systems in production
- Exposure to LLMs, transformer-based models, computer vision, NLP, or applied AI solutions
- Hands-on work with Kubernetes-based ML workloads
- Knowledge of observability and monitoring tools such as Prometheus, Grafana, or OpenTelemetry
- Experience in DoD, defense, intelligence, regulated, or mission-critical settings
- Work in edge, offline, air-gapped, low-bandwidth, D-DIL, or limited-compute environments
Clearance Requirements
- Active TS/SCI clearance strongly preferred
- Candidates with an active Secret clearance may be considered and supported for upgrade
- Candidates without an active clearance must be:
- U.S. citizens
- eligible to obtain and maintain a clearance
- able to work in a CAC-enabled or secure environment
Note: Start timelines and work scope may vary depending on clearance status and program requirements
Who We Are
Rackner is a software consultancy that builds cloud-native solutions for startups, enterprises, and the public sector. We are an energetic, growing team focused on solving complex problems through:
- Distributed systems
- DevSecOps
- AI/ML
- Cloud-native architecture
Our approach is cloud-first, cost-effective, and outcome-driven, delivering systems that scale and perform in real-world environments.
Benefits & Perks
- 100% covered certifications & training aligned to your role
- 401(k) with 100% match up to 6%
- Highly competitive PTO
- Comprehensive Medical, Dental, Vision coverage
- Life Insurance + Short & Long-Term Disability
- Home office & equipment plan
- Industry-leading weekly pay schedule
Apply
If you are an engineer who wants to move from building models or platforms to owning deployed AI/ML systems, we would like to connect.