Archarithms Inc logo
Archarithms Inc

Arcarithm is located in beautiful, downtown Huntsville, AL, one of the fastest growing cities in the U.S.! We cultivate and foster an environment of integrity, open communication, work-life balance, and career development. We are excited to continue to change and improve the world through innovation and technology!

Senior OpenShift Systems Administrator/ DevOps Engineer

Location

United States

Posted

3 days ago

Salary

0

Seniority

Senior

No structured requirement data.

Job Description

Senior OpenShift Systems Administrator/ DevOps Engineer

Archarithms Inc

Role Description Arcarithm is seeking a Senior OpenShift Systems Administrator / DevOps Engineer to support the modernization and sustainment of mission-critical systems for the Air Force at Arnold Engineering Development Complex (AEDC). The selected candidate will be responsible for the administration, maintenance, automation, and optimization of Red Hat OpenShift environments supporting enterprise applications, data platforms, AI/ML solutions, and cloud-native services. - Administer, maintain, and troubleshoot Red Hat OpenShift clusters in development, test, and production environments. - Deploy, manage, and optimize containerized applications using Kubernetes and OpenShift technologies. - Build and maintain CI/CD pipelines supporting automated build, testing, security scanning, and deployment activities. - Develop and maintain Infrastructure as Code (IaC) solutions using tools such as Terraform and Ansible. - Support deployment and operation of cloud-native applications and microservices architectures. - Manage Linux-based infrastructure including Red Hat Enterprise Linux (RHEL) servers and virtualized environments. - Implement and maintain container security best practices, image scanning, vulnerability remediation, and compliance requirements. - Collaborate with cybersecurity teams to support RMF controls, STIG compliance, vulnerability remediation, and security hardening efforts. - Support monitoring, logging, and observability platforms including Prometheus, Grafana, Elastic Stack, or similar technologies. - Troubleshoot application, infrastructure, networking, storage, and container orchestration issues. - Support database deployments and integrations within containerized environments. - Develop automation scripts and tooling using Python, Bash, PowerShell, or similar languages. - Create and maintain technical documentation, architecture diagrams, operational procedures, and deployment runbooks. - Participate in system upgrades, patching activities, disaster recovery planning, and incident response efforts. Qualifications - Bachelor's degree in Computer Science, Information Systems, Engineering, or related field. - 4-10 years of experience administering enterprise Linux environments. - 3+ years of hands-on experience with Kubernetes and/or Red Hat OpenShift. - Experience deploying and managing containerized workloads in production environments. - Experience with Infrastructure as Code (Terraform, Ansible, or similar tools). - Experience building and maintaining CI/CD pipelines. - Strong knowledge of Linux system administration and troubleshooting. - Experience with Git-based source control and DevOps workflows. - Experience supporting cloud or hybrid-cloud environments. - Active Secret Security Clearance. - Security+ certification or ability to obtain within 90 days. Preferred Qualifications - Red Hat OpenShift Administration certification. - Red Hat Certified System Administrator (RHCSA) or RHCE. - Experience supporting Department of Defense environments. - Experience with AWS, Azure Government, OpenStack, or other cloud platforms. - Experience with VMware vSphere, ESXi, and virtualized infrastructure. - Experience supporting AI/ML platforms and containerized data environments. - Familiarity with DevSecOps practices and security automation. - Experience supporting RMF-accredited systems. Desired Technical Skills - Container Platforms: - Red Hat OpenShift - Kubernetes - Docker - Podman - Automation & DevOps: - Terraform - Ansible - Jenkins - GitLab CI/CD - Azure DevOps - Git - Operating Systems: - Red Hat Enterprise Linux (RHEL) - CentOS - Rocky Linux - Ubuntu - Cloud Platforms: - AWS - Azure - OpenStack - Monitoring & Logging: - Prometheus - Grafana - ELK Stack - Splunk - Scripting & Development: - Python - Bash - PowerShell - Security: - STIG Hardening - Vulnerability Management - Security Scanning - RMF Compliance - Container Security Benefits - Comprehensive health insurance options - Generous 401K plan - Competitive salaries - Continuous career growth opportunities - Flexible schedules including remote work - Mentoring and performance incentives

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Role Description Für meinen langjährigen Kunden bin ich auf der Suche nach einem DevOps / Platform Engineer (m/w/d) REMOTE in Festanstellung. - Entwicklung und Implementierung von Infrastruktur als Code (IaC) Lösungen zur Automatisierung der Bereitstellung und Verwaltung von Cloud-Ressourcen. - Überwachung und Optimierung der Systemleistung, um eine hohe Verfügbarkeit und Skalierbarkeit der Plattform sicherzustellen. - Zusammenarbeit mit Entwicklungsteams zur Integration von CI/CD-Pipelines und zur Förderung von Best Practices in der Softwarebereitstellung. - Identifizierung und Behebung von Sicherheitsproblemen innerhalb der Infrastruktur, um den Schutz sensibler Daten zu gewährleisten. - Erstellung und Pflege von Dokumentationen und Leitfäden für die Nutzung und Wartung der Plattform. Qualifications - Sehr gute Kenntnisse in GitLab CI - Fundierte Erfahrung mit Docker und Containerisierung - Praxis in Build- und Release-Automatisierung - Gute Kenntnisse in Kubernetes (Deployments, Debugging, Helm) - Verständnis für automatisierte Tests im CI/CD-Kontext (Integration wünschenswert) - Sehr gute Deutsch- und Englischkenntnisse - Erfahrung mit Deployments auf Debian oder Ubuntu - Kenntnisse in Infrastructure as Code (z. B. Ansible, Terraform) - Python- und/oder Bash-Kenntnisse für Scripting und Tooling - Erfahrung in der Modernisierung von Legacy-Systemen - Operations-Verständnis (Betrieb beim Kunden, Updates, Monitoring, Stabilität)

Germany
ng-voice logo

Deployment Engineer

ng-voice

The Hyperscaling IMS Solution: Infrastructure-agnostic, cost-efficient, automated.

DevOps Engineer3 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

• Working in the Client Deployment Team, guaranteeing smooth delivery of solutions, including high-/low-level design with final technical deployment and integration of the solutions. • Planning, preparing, and setting up lab, test, and production environments for client and partnership deployments. • Installing and configuring cloud-native solutions using Kubernetes. • Automation, automation, automation – focusing on technical deployment and identifying ways to speed up these processes. • Maintaining, monitoring, administering, and updating current deployments, ensuring the stability of solutions. • Helping with incident management, providing 2nd-level support, error analysis, and classification. • Coordinating customer requests and disseminating client information. • Unlocking operational efficiency in deployment and support procedures.

Japan
Bellese Technologies logo

Engineer II, DevOps

Bellese Technologies

Improving the healthcare journey through civic innovation.

DevOps Engineer3 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

• Work as a part of an agile software development devops team to build and maintain critical infrastructure • Participate in planning and design • Complete sprint work collaboratively and punctually • Assist in the development and management of CI/CD processes for efficient code deployment and system updates. • Support the operation and maintenance of cloud infrastructure and services to ensure performance, security, and cost-efficiency. • Contribute to the management of microservices architecture and orchestration utilizing container technologies. • Collaborate with software development teams to troubleshoot and resolve system issues and outages. • Aid in the implementation of best practices for system hardening and configuration management. • Ensuring security compliance and vulnerability remediations remains within SLAs • Participate in on-call rotations to support system operations. • Help document processes and monitor system logs and activity for potential issues

United States
$108.7K - $126.3K / year
Full TimeRemoteTeam 51-200

Role Description We currently have several large-scale projects and are expanding our infrastructure team. Our product is an advanced platform for creating and managing AI agents. It can be deployed directly inside a customer’s infrastructure and delivered as an enterprise solution, while also being available as a SaaS version. Under the hood, there is real-time voice and telephony, GPU and LLM inference, streaming analytics, and all of this runs both in the cloud and on-prem, including in banking environments. There is a lot of infrastructure; it is complex, interesting, and sometimes at the edge of what is possible. That is why we are looking for a strong SRE who, like us, cares about making systems transparent, reliable, and built the right way. This is a role for a strong, independent engineer. A Senior SRE with real influence and a voice in how things are built and operated. You will also handle DevOps tasks for the team, but your main focus and area of expertise should be SRE: reliability, observability, incident management, and performance under load. Qualifications - 5+ years in SRE/DevOps. - Deep, practical understanding of Docker and Kubernetes. - Mature understanding of metrics and alerts. - Practical experience with Prometheus, Alertmanager, and Grafana. - Experience with SLIs/SLOs, reliability management, incident investigation, and postmortems. - Experience with load testing and basic capacity planning. - Python programming skills. - Cloud experience with GCP and/or AWS. - DevOps fundamentals: CI/CD and infrastructure as code. - Ownership mindset. - Strong communication with developers. - Willingness and ability to mentor, teach, and share knowledge. - Analytical mindset. - Proactivity. - Strong attention to detail and reliability. Requirements - Experience using AI agents for routine and recurring tasks (nice to have). - Real-time telephony: SIP, FreeSWITCH, RTP, WebRTC (nice to have). - GPU/ML serving: Triton, vLLM, RunPod, Nebius, Lambda, run:ai, DCGM (nice to have). - Streaming data and analytics: Kafka, ClickHouse (nice to have). - Deep experience with IaC and GitOps (nice to have). - Experience working in isolated and highly secure environments (nice to have). - Experience preparing systems for significant growth in load (nice to have). Responsibilities - Responsible for the reliability of our services: SLIs/SLOs, availability, and identifying and eliminating bottlenecks. - Set up monitoring for services, metrics, alerts, and dashboards. - Build and maintain Grafana dashboards. - Run load testing, analyze results, and provide recommendations. - Investigate incidents, participate in on-call rotations, and write postmortems. - Work closely with developers to communicate and defend your position. - Develop and support Kubernetes-based infrastructure across clouds. - Take part in delivering and supporting the platform for customers. - Mentor colleagues and help raise the engineering bar across the team. Benefits - The team has built award-winning AI products for tech corporations. - Cutting-edge tech stack: Speech Technologies, NLP, Generative AI. - High engineering bar and real ownership. - Fast career progression. - Startup pace with enterprise stability. - Fully remote across Europe. - 21 vacation days + public holidays + 5 sick days. - Private English lessons via Preply.

Europe