Job Closed
This listing is no longer active.
Fulfilling the promise of precision medicine through quality and innovation.
Staff DevOps Engineer
Location
New Jersey + 2 moreAll locations: New Jersey | Oklahoma | Minnesota
Posted
56 days ago
Salary
$140K - $179K / year
Seniority
Lead
Job Description
Staff DevOps Engineer
Caris Life Sciences
• Serve as a Staff DevOps Engineer specializing in AWS and Kubernetes to design, implement, and optimize scalable, secure cloud-native infrastructure • Lead PoC initiatives, oversee monitoring solutions, and translate SOX compliance into actionable cloud implementation plans • Break down silos by building a comprehensive team knowledge base, ensuring broad support capabilities • Provide technical leadership in cloud migration, security, and DevOps best practices, driving innovation and operational excellence across the organization • Lead the design, implementation, and management of Kubernetes clusters on AWS EKS, ensuring high availability, scalability, and security • Implement and manage advanced features including autoscaling, monitoring, logging, and security policies • Spearhead proof-of-concept (PoC) initiatives for new tools and environments, evaluating their potential benefits for the organization • Manage the full lifecycle of Kubernetes clusters, including regular upgrades, patch management, version control, and performance optimization • Provide expert-level support and guidance to teams for deploying and optimizing applications on Kubernetes, including container orchestration and service mesh implementation • Design and implement monitoring and alerting solutions for applications and infrastructure using CloudWatch, Prometheus, and Datadog • Develop observability standards and dashboards, leveraging AI/AIOps approaches and SRE agents to enable anomaly detection, alert noise reduction, and automated root cause analysis • Develop and maintain Infrastructure as Code (IaC) using tools such as Terraform or AWS CDK, and implement CI/CD pipelines for efficient application deployment and image management • Design and implement security solutions, including the deployment and management of security tools, and translate SOX compliance requirements into actionable implementation plans for cloud environments • Lead initiatives for cloud migration and modernization of legacy applications, collaborating with cross-functional teams to support their cloud and infrastructure needs • Provide technical leadership and mentorship to junior engineers on cloud technologies and DevOps practices, implementing knowledge-sharing initiatives to ensure broad support capabilities across the team • Stay current with emerging AWS services and features, evaluating their potential benefits and optimizing cloud resource utilization and cost-efficiency • Develop and maintain comprehensive documentation, including a team knowledge base, runbooks, and process documentation to eliminate information silos • Proactively identify areas of inefficiency and develop strategic plans for process improvements across the DevOps and cloud infrastructure landscape • Participate in on-call rotations to support critical cloud infrastructure and respond to emergency issues as needed
Job Requirements
- Bachelor's degree in Computer Science, Information Technology, or related field
- 7+ years of experience in DevOps or Site Reliability Engineering roles
- 5+ years of hands-on experience with AWS services and cloud architecture
- 5+ years of hands-on experience with Kubernetes, including deep expertise in cluster management, troubleshooting, and optimization
- Strong proficiency in at least one programming language (e.g., Python, Go, Java)
- Extensive experience with Infrastructure as Code tools (e.g., Terraform, CloudFormation, AWS CDK)
- Deep understanding of containerization technologies (Docker) and orchestration platforms (Kubernetes) including security best practices
- Experience with CI/CD tools and methodologies, particularly GitLab CI and Github actions
- Strong knowledge of networking concepts and implementation in cloud environments
- Excellent problem-solving skills and ability to troubleshoot complex systems
- Proven ability to lead PoC initiatives and evaluate new technologies
- Demonstrated experience in creating and maintaining technical documentation and knowledge bases
- Demonstrated ability to identify operational inefficiencies and develop strategic plans for process improvements in complex cloud and DevOps environments
- Strong analytical skills with the ability to translate technical insights into actionable business recommendations
- Strong communication and mentoring skills, with the ability to effectively transfer knowledge to team members of varying experience levels
- Proficient in Microsoft Office Suite, specifically Word, Excel, Outlook, and general working knowledge of Internet for business use.
Benefits
- Highly competitive and inclusive medical, dental and vision coverage options
- Health Savings Account for medical expenses and dependent care expenses
- Flexible Spending Account to pay for certain out-of-pocket expenses
- Paid time off, including: vacation, sick time and holidays
- 401k match and Financial Planning tools
- LTD and STD insurance coverages, as well as voluntary benefit options
- Employee Assistance Program
- Pet Insurance
- Legal Assistance
- Tuition Assistance
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior Cloud Operations Engineer
MRI TechnologiesProof of U.S. Citizenship is a requirement for this position. Must be able to complete a U.S. government background investigation. MRI Technologies is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status. As we are a Federal Contractor, most positions require the employee to obtain and maintain a U.S. Government background investigation. MRI also completes a pre-screening background check for anyone offered employment.
Role Description MRI Technologies has an exciting opportunity for a Senior Cloud Operations Engineer on the Mission Enabling Services Contract (MESC) at NASA. In this role, you will build and operate Azure cloud infrastructure from the ground up, establishing the landing zones, governance controls, identity integration, and compliance foundations that will carry NASA's mission workloads for years to come. You will work alongside cloud architects, security engineers, and NASA stakeholders, leading the Azure expansion of a production platform that already operates at enterprise scale on GCP. You are the subject-matter expert on Azure infrastructure design and federal compliance requirements, translating deep technical knowledge into a government-grade environment that meets NIST 800-171, CMMC 2.0, and NASA security standards. This is a build role, not a maintenance role. The platform is early-stage, and your decisions now will define how it operates across the life of the program. A typical day might include: - Reviewing Terraform changes for a new Azure subscription configuration - Coordinating with the security team on a Defender for Cloud alert - Designing a new landing zone architecture - Implementing Azure Policy assignments across management groups - Troubleshooting a network connectivity issue between the Azure environment and the existing GCP platform - Building CI/CD pipeline stages for infrastructure provisioning - Updating cost management tagging standards - Drafting runbook documentation for the operations team The work is hands-on, high-trust, and connected to infrastructure that supports NASA mission success. Qualifications - Bachelor's Degree in Computer Science, IT, or equivalent - 6 or more years of relevant cloud engineering experience - Deep experience designing and operating Azure cloud environments at an enterprise or government scale - Terraform (HCL) for Azure infrastructure automation - Azure Active Directory / Entra ID administration and federated identity configuration - Azure Policy, Management Groups, and subscription governance - CI/CD pipeline experience (Azure DevOps, GitLab, or equivalent) - Experience with NIST 800-171 or CMMC 2.0 compliance requirements - Strong networking fundamentals: VNets, NSGs, ExpressRoute/VPN, DNS - Ability to excel in a remote work environment Requirements - Azure Solutions Architect Expert or equivalent certification (preferred) - Experience with multi-cloud environments (Azure + GCP integration) (preferred) - Microsoft Sentinel, Defender for Cloud, or Azure Monitor experience (preferred) - Prior federal government or DoD cloud engineering experience (preferred) - Experience with CUI handling and controlled environment design (preferred) - Familiarity with CMMC 2.0 Level 2 self-assessment or C3PAO audit preparation (preferred) Benefits - Comprehensive benefits package including medical, dental, vision - Company-paid life and disability insurance - Paid time off - 401(k) - Flexible work schedule - Strong career development opportunities working alongside NASA's mission teams Company Description Proof of U.S. Citizenship is a requirement for this position. Must be able to complete a U.S. government background investigation. MRI Technologies is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status. As we are a Federal Contractor, most positions require the employee to obtain and maintain a U.S. Government background investigation. MRI also completes a pre-screening background check for anyone offered employment.
Senior SRE
LiveRampThe leading data enablement platform for the safe, easy, and effective use of data.
• Support and/or own the deployment of global products including setting up production and internal environments • Provide 24/7 first line of Engineering support (via follow the sun teams in all regions) for any issues related to global product deployment, availability and internal operations support. • Drive effective resolutions of core product issues with Engineering teams • Setup and maintain Infrastructure & Product Reliability monitoring and alerting • Maintain and enhance CI/CD Tooling and Terraform scripts in support of the mission in close collaboration with DevOps team • Maintain and enhance Engineering Operational Documentation for supported products. • Provide expertise to build and maintain products operational documentation and setting up product SRE practices • Experience working with real-time and NoSQL Databases such as SingleStore DB, ScyllaDB, Cassandra or Dynamodb • Optimize the performance and cost of the systems and rightsize Kubernetes containers. • Work in close collaboration with SRE team members and Engineering organizations based in California, Paris, Nantong, Singapore, Australia and others.
Analista de Sistemas de Automação – DevOps – Pleno
Lanlink Informática Ltda.Tecnologias que transformam
• Utilizar ferramentas de gerenciamento de configuração para manter a consistência dos ambientes; • Utilizar ferramentas como Docker, Kubernetes, dentre outras, para gerenciar configurações, containers e orquestração de ambientes; • Automatizar a criação, configuração e escalabilidade de servidores, clusters e recursos em nuvem ou on-premises, garantindo flexibilidade e eficiência; • Diagnosticar problemas em ambientes de produção, realizando correções rápidas e prevenindo futuras ocorrências; • Garantir que os ambientes estejam seguros, com controle de acesso, criptografia e conformidade com normas de segurança da informação; • Realizar testes para validar melhorias, identificar gargalos e assegurar a estabilidade sob diferentes condições de uso; • Manter registros atualizados de pipelines, configurações, procedimentos e melhorias realizadas; • Trabalhar em conjunto com equipes técnicas para alinhar estratégias de automação, integração e entrega contínua, promovendo uma cultura DevOps no ambiente de infraestrutura de TI.
• Promover entrega contínua, criar guias e GoldenPaths, definir padrões de versionamento e segurança. • Conduzir workshops práticos para integração e uso das ferramentas internas. • Configurar pipelines CI/CD, desenvolver código para provisionamento (IaC), automatizar deploys e otimizar containers. • Integrar testes e ferramentas de qualidade, implementar práticas de segurança, participar de auditorias. • Atender incidentes e solicitações de infraestrutura cloud, monitorar aplicações e pipelines para alta disponibilidade. • Coletar métricas, revisar projetos para performance, segurança e custos, avaliar novas práticas e ferramentas.



