Job Closed

This listing is no longer active.

DevOps Engineer – Level 2

DevOps Engineer | Other | Remote | Mid Level | Team 10,001+ | Since 1939 | H1B No Sponsor

Location

Virginia

Posted

83 days ago

Salary

$83.4K - $137.6K / year

Seniority

Mid Level

Bachelor Degree | 2 yrs exp | Experience accepted | English | Ansible | Azure | Java | Jenkins | Linux | NoSQL | Python | SQL

Job Description

DevOps Engineer – Level 2

Northrop Grumman

  • Design and implement systems capable of scaling to meet growing data and computation demands in AI applications.
  • Implement AI infrastructure that is compliant with relevant regulations and security standards, implementing necessary controls.
  • Provide metrics to evaluate the performance and reliability of AI infrastructure and implement improvements based on findings.
  • Track advancements in AI technologies and infrastructure trends and evaluate applicability to the organization’s strategy.
  • Work alongside data scientists, software engineers, and i2 functional teams to provide infrastructure support for new and existing systems.
  • Other duties as assigned.

Job Requirements

  • Bachelor’s degree in computer science, software engineering, information systems or a related field with 2+ years of relevant professional experience – OR – Master’s degree in computer science, software engineering, information systems or a related field with 1+ years of relevant professional experience.
  • Will consider an additional 4+ years of experience in lieu of degree
  • 1+ years’ experience implementing the back-end infrastructure for COTS or SaaS platforms.
  • 1+ years working with tools for automation (Ansible, Argo CD), CI/CD (CodePipeline, Azure DevOps, Jenkins), and configuration management (Git).
  • Hands-on experience with relevant programming languages (e.g., Python, Java, PowerShell/Bash).
  • Experience managing Windows or Linux server environments.
  • Basic knowledge of database technologies (e.g., SQL, NoSQL) and distributed computing systems.
  • Exceptional verbal and written communication skills, with the ability to convey complex technical concepts to non-technical stakeholders.

Benefits

  • Health insurance coverage
  • Life and disability insurance
  • Savings plan
  • Company paid holidays
  • Paid time off (PTO) for vacation and/or personal business

More DevOps Engineer Jobs

Cloud Operations Engineer

Leidos

Leidos is an innovation company rapidly addressing the world’s most vexing challenges in national security and health.

DevOps Engineer | 83 days ago
Full Time | Remote | Team 10,001+ | Since 1969 | H1B Sponsor

  • Monitor and manage day-to-day operations of the hosted application environment
  • Focusing primarily on auditing, monitoring, and "OS and up" configuration
  • Working with a team of experienced engineers and with our external service provider
  • Ensure a smooth application experience for end users
  • Troubleshoot and diagnose issues
  • Directly work with customers to configure new application instances
  • Contribute to the ongoing architecture and design for current and future implementations

United States
$59.2K - $106.9K / year
Job Closed

Role Description

We are looking for a Senior DevOps Engineer with solid Kubernetes experience to join our team and help us design, implement, and maintain scalable, secure, and highly available infrastructure. We are looking for someone proactive, with a continuous-improvement mindset and a passion for automation.

Responsibilities

  • Design, implement, and administer Kubernetes clusters in production environments.
  • Automate CI/CD processes.
  • Manage infrastructure as code (IaC).
  • Monitor, optimize, and ensure high availability of systems.
  • Collaborate with development teams to improve deployments and performance.
  • Implement security and observability best practices.
  • Resolve incidents and propose continuous improvements.

Qualifications

  • 4+ years of experience as a DevOps Engineer.
  • Proven, in-depth Kubernetes experience (required).
  • Solid knowledge of containers (Docker).
  • Experience with CI/CD tools (GitHub Actions, GitLab CI, Jenkins, or others).
  • Experience with cloud providers (AWS, GCP, or Azure).
  • Knowledge of Linux and scripting (Bash, Python, or others).
  • Experience with monitoring and logging (Prometheus, Grafana, ELK, etc.).

Desired Qualifications

  • Experience with Terraform or other IaC tools.
  • Knowledge of security in cloud environments.
  • Technical English (reading documentation).

Benefits

  • 100% remote work from Colombia.
  • Competitive salary range: USD 600 – 850, depending on experience.
  • Job stability and growth opportunities.
  • Collaborative, results-oriented environment.
  • Participation in challenging, high-impact projects.

Colombia
$600 - $850 / month

Senior Software Engineer - DevOps and Infrastructure

Cornelis Networks, Inc.

Cornelis Networks delivers the world’s highest performance scale-out networking solutions for AI and HPC datacenters. Our differentiated architecture seamlessly integrates hardware, software and system level technologies to maximize the efficiency of GPU, CPU and accelerator-based compute clusters at any scale. Our solutions drive breakthroughs in AI & HPC workloads, empowering our customers to push the boundaries of innovation. We are a fast-growing, forward-thinking team of architects, engineers, and business professionals with a proven track record of building successful products and companies. As a global organization, our team spans multiple U.S. states and six countries, and we continue to expand with exceptional talent in onsite, hybrid, and fully remote roles.

DevOps Engineer | 83 days ago
Other | Remote | Team 51-200

Backed by top-tier venture capital and strategic investors, we are committed to innovation, performance and scalability - solving the world’s most demanding computational challenges with our next-generation networking solutions.

We are seeking an experienced Senior Software Engineer to enhance Cornelis Networks' development and testing infrastructure. This role focuses on the design and maintenance of onsite and cloud infrastructure, including CPU and GPU-accelerated systems, automation, and CI/CD pipelines. The ideal candidate will have deep Linux systems expertise and hands-on experience managing server hardware and software stacks in Enterprise IT, Cloud or HPC environments.

Responsibilities

  • Design, build, and maintain CI/CD pipelines using GitHub Actions, including custom workflows, across multiple Linux distributions (RHEL, Rocky, SLES, Ubuntu).
  • Integrate HPC/AI workload and Cornelis software build and test stages into CI pipelines, including driver compatibility checks and GPU-accelerated test suites.
  • Interface with SW teams to validate and test new packages and ensure test hardware environments are provisioned with the correct package versions.
  • Collaborate with developers to help debug, interpret, and communicate testing results, ensuring issues are triaged and resolved efficiently.
  • Monitor pipeline health, troubleshoot failures, and continuously improve build reliability and speed.
  • Administer and maintain onsite Linux-based development and test servers across a heterogeneous multi-distro environment.
  • Perform OS provisioning, patching, and lifecycle management, including kernel module and driver management.
  • Manage hardware and software stacks for both CPU and GPU systems, including installation, upgrade, and troubleshooting of AMD (ROCm) and NVIDIA (CUDA/driver) environments.
  • Maintain driver and package compatibility matrices to ensure consistent environments across development, CI, and test infrastructure.
  • Manage test hardware environments to ensure they are provisioned with the correct packages and configurations for ongoing test campaigns.

Minimum Qualifications

  • 5+ years of experience in DevOps or infrastructure engineering in a Linux-based environment.
  • Strong proficiency with Git and GitHub Actions, including designing and maintaining custom CI/CD workflows.
  • Deep Linux systems mastery, especially across RHEL/Rocky, SLES, and Ubuntu, such as package management (RPM, DEB, zypper, dnf, apt), systemd, kernel module management, and system troubleshooting.
  • Hands-on experience managing server systems, including installation and maintenance of drivers, software, and firmware across multiple Linux distributions.
  • Proficiency in Linux scripting for automation and tooling.
  • Strong troubleshooting and debugging skills across hardware, OS, driver, and application layers.
  • Excellent communication skills and ability to work effectively in a remote, collaborative environment.

Preferred Qualifications

  • Experience with Reframe or similar HPC regression testing frameworks.
  • Experience with HPC, high-performance networking, or RDMA technologies (InfiniBand, Omni-Path, Ethernet RDMA).
  • Familiarity with kernel driver development or kernel module packaging (DKMS, kmod).
  • Experience managing multi-GPU server configurations (e.g., DGX, MI300X-class systems, or similar).
  • Knowledge of GPU monitoring and management tools (nvidia-smi, rocm-smi, DCGM, GPU operator frameworks).
  • Experience with infrastructure-as-code tools such as Ansible.
  • Familiarity with monitoring and observability stacks (Grafana, Loki, Prometheus).
  • Proficiency in Python for automation and tooling.
  • Familiarity with C/C++.
  • Experience in a networking, semiconductor, or systems software company.

Location: This is a remote position for employees residing within the United States.

We offer a competitive compensation package that includes equity, cash, and incentives, along with health and retirement benefits. Our dynamic, flexible work environment provides the opportunity to collaborate with some of the most influential names in the semiconductor industry.

At Cornelis Networks your base salary is only one component of your comprehensive total rewards package. Your base pay will be determined by factors such as your skills, qualifications, experience, and location relative to the hiring range for the position. Depending on your role, you may also be eligible for performance-based incentives, including an annual bonus or sales incentives. In addition to your base pay, you’ll have access to a broad range of benefits, including medical, dental, and vision coverage, as well as disability and life insurance, a dependent care flexible spending account, accidental injury insurance, and pet insurance. We also offer generous paid holidays, 401(k) with company match, and Open Time Off (OTO) for regular full-time exempt employees. Other paid time off benefits include sick time, bonding leave, and pregnancy disability leave.

Cornelis Networks does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.

Cornelis Networks is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, disability status, genetic information, protected veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Location: Austin, Texas (Remote)
Department: Software Engineering
Employment Type: Full-Time

United States
Job Closed

Middle DevOps Engineer

hireforyou.pro

We look forward to receiving your CV and learning more about your experience! Dear Candidates, due to a high volume of applications, only selected candidates will be contacted for interviews. We appreciate your understanding. Thank you for considering a career with us!

DevOps Engineer | 83 days ago

Role Description

We are looking for a Middle DevOps Engineer to join a high-performing engineering team providing infrastructure and DevOps support for fast-growing technology companies across fintech, SaaS, e-commerce, and entertainment. This role is ideal for engineers who enjoy working in a dynamic environment with multiple projects, solving complex infrastructure challenges, and collaborating closely with both internal teams and client engineers.

What You’ll Do

  • Infrastructure Management:
      • Manage hybrid environments including cloud infrastructure and physical servers
      • Work with platforms such as AWS, DigitalOcean, Hetzner, Azure, or GCP
  • Infrastructure as Code:
      • Build and maintain infrastructure using Terraform and Ansible
  • Containerized Environments:
      • Work with Kubernetes in production environments
  • Monitoring & Incident Response:
      • Configure and maintain monitoring and alerting systems (Prometheus, Grafana, ELK/Graylog)
      • Participate in incident analysis and root cause investigations
  • Technical Collaboration:
      • Communicate with engineering teams and participate in architectural discussions

Engineers participate in on-call rotations and may occasionally respond to incidents outside standard hours.

Tech Stack

  • Core Requirements:
      • Linux administration (Debian / Ubuntu / CentOS)
      • Terraform & Ansible
      • Kubernetes (production experience)
      • Cloud infrastructure (AWS / DigitalOcean / Hetzner)
      • Monitoring and alerting systems (Prometheus, Grafana, ELK / Graylog)
  • Nice to Have:
      • Bare metal infrastructure or virtualisation (Proxmox)
      • Experience with distributed databases (ClickHouse, Cassandra, ScyllaDB)
      • Networking knowledge (BGP, VLAN, L2/L3 troubleshooting)
      • Experience with Azure or GCP
      • Automation for hybrid or Windows environments

Qualifications

  • 3+ years of experience in DevOps, infrastructure engineering, or system administration
  • Experience working in fast-paced environments with multiple projects
  • Strong troubleshooting and incident management skills
  • Ability to collaborate with engineers and communicate technical solutions clearly
  • Technical English (B1+) for written communication

Benefits

  • Remote full-time position
  • Market-level compensation
  • 21 days of paid vacation
  • Paid sick leave
  • Support for professional development and certifications

United States + 171 more
All locations: United States | Canada | Brazil | Colombia | Argentina | Chile | Venezuela | Bolivia | Ecuador | French Guiana | Guyana | Paraguay | Peru | Suriname | Uruguay | Mexico | Costa Rica | El Salvador | Guatemala | Honduras | Nicaragua | Panama | Dominican Republic | Puerto Rico | Bahamas | Guadeloupe | Haiti | Jamaica | Martinique | Montserrat | United Kingdom | Germany | France | Estonia | Portugal | Hungary | Poland | Ukraine | Romania | Bulgaria | Czechia | Slovakia | Belarus | Moldova | Sweden | Greece | Belgium | Italy | Ireland | Switzerland | Netherlands | Finland | Malta | Denmark | Lithuania | Croatia | Spain | Austria | Bosnia And Herzegovina | Iceland | Luxembourg | North Macedonia | Montenegro | Norway | Serbia | Slovenia | Albania | Cyprus | Latvia | Monaco | South Africa | Egypt | Algeria | Angola | Benin | Botswana | Burkina Faso | Burundi | Cameroon | Cabo Verde | Central African Republic | Chad | Congo | Côte d'Ivoire | Democratic Republic of the Congo | Equatorial Guinea | Eritrea | Ethiopia | Gabon | Gambia | Ghana | Guinea | Guinea-Bissau | Kenya | Lesotho | Liberia | Libya | Madagascar | Malawi | Mali | Mauritania | Mauritius | Mayotte | Morocco | Mozambique | Namibia | Niger | Nigeria | Réunion | Rwanda | Senegal | Seychelles | Sierra Leone | Somalia | Sudan | Eswatini | Tanzania | Togo | Tunisia | Uganda | Zambia | Zimbabwe | Georgia | Turkey | Israel | United Arab Emirates | Armenia | Azerbaijan | Bahrain | Iraq | Jordan | Kuwait | Lebanon | Oman | Qatar | Saudi Arabia | Palestine | Yemen | India | Japan | Philippines | Pakistan | Thailand | Singapore | Vietnam | Taiwan | Indonesia | Cambodia | Laos | Malaysia | Myanmar | South Korea | China | Afghanistan | Bangladesh | Bhutan | Kazakhstan | Kyrgyzstan | Maldives | Mongolia | Nepal | Sri Lanka | Tajikistan | Turkmenistan | Uzbekistan | Australia | Papua New Guinea | Kiribati | Palau | French Polynesia | Tuvalu | New Zealand
Job Closed