Job Closed

This listing is no longer active.

Lead DevOps Engineer

DevOps Engineer | Full Time | Remote | Senior | Team 10,001+ | Since 1912 | H1B Sponsor

Location

New York

Posted

66 days ago

Salary

$139K - $190K / year

Seniority

Senior

Job Description

Paramount

  • Design, implement, and manage scalable and reliable infrastructure for online inference services
  • Optimize Kubernetes-based deployments for low-latency model serving and real-time personalization
  • Automate CI/CD pipelines to streamline the deployment of ML models and services
  • Develop observability and monitoring solutions using tools like Prometheus, New Relic, and OpenTelemetry
  • Ensure high availability, security, and performance of real-time inference APIs
  • Work with ML engineers and backend teams to integrate inference models efficiently into production
  • Implement autoscaling strategies for inference workloads based on traffic patterns and model demand
  • Manage Pub/Sub and event-driven architectures to enable real-time messaging and engagement analytics
  • Optimize model-serving infrastructure using Redis, Memcached, and other caching strategies
  • Debug and tackle production issues related to latency, scaling, and reliability

Job Requirements

  • 4+ years of experience in DevOps, Site Reliability Engineering (SRE), or Cloud Infrastructure Engineering
  • Solid experience with Kubernetes and container orchestration
  • Hands-on experience with CI/CD tools such as GitHub Actions, Jenkins, and ArgoCD
  • Experience working with real-time inference and ML model deployment
  • Deep knowledge of Google Cloud Platform (GCP), AWS, or Azure
  • Expertise in infrastructure as code (IaC) using Terraform or Helm
  • Experience with message queues and event-driven architectures (Pub/Sub, Kafka, etc.)
  • Proficiency in monitoring and logging solutions (New Relic, Prometheus, OpenTelemetry, etc.)
  • Deep scripting skills in Python, Bash, or Go for automation

Benefits

  • Medical
  • Dental
  • Vision
  • 401(k) plan
  • Life insurance coverage
  • Disability benefits
  • Tuition assistance program
  • Paid time off

More DevOps Engineer Jobs

Full Time | Remote | Team 10,001+ | Since 1978 | H1B No Sponsor

  • Develop and apply automation/testing processes for system-level software in cloud environments (AWS/Azure/GCP)
  • Liaise with internal engineering and support teams to plan, design, and code tests for new features, software deployment, recreation of bugs/cases, etc.

Ukraine
Job Closed

Role Description

Tommy’s is looking for a DevOps engineer who is excited to work with modern technologies and grow into a senior-level contributor while making a meaningful impact. The ideal candidate enjoys collaborating with cross-functional development teams to build and support consumer-facing digital products used by customers, owners, and operators across the U.S. and internationally. You’re motivated by innovation, take initiative, and want a clear path to increased responsibility within a company that values continuous improvement and modern, digital experiences.

Responsibilities

  • Own the selection, implementation, and evolution of development tools and infrastructure.
  • Lead and own the release process for each application, from CI/CD pipeline design through production deployment.
  • Design and build CI/CD pipelines that minimize manual intervention and accelerate delivery.
  • Collaborate with developers to continuously improve development processes and relay findings from release issues.
  • Champion code quality by leading constructive, detail-oriented code reviews.
  • Establish and maintain quality gates across multiple projects as part of the release process.
  • Author clear technical release notes for each product release.
  • Proactively identify and mitigate security vulnerabilities across the infrastructure.
  • Build and refine observability and alerting to catch issues before they become outages.
  • Manage and optimize cloud hosting environments for reliability and cost efficiency.
  • Other duties as assigned; duties and responsibilities may change at any time with or without notice.

Qualifications

  • BS/MS in Computer Science or a related field.
  • 5+ years of hands-on DevOps or infrastructure engineering experience.
  • Proficient with Docker and containerized workflows.
  • Strong working experience with AWS services (ECS, CloudWatch, Aurora RDS, Secrets Manager, S3, CloudFront, EC2, IAM, etc.).
  • Experience with Azure environments and Azure DevOps.
  • Node.js, TypeScript, and Python programming experience.
  • Deep understanding of CI/CD concepts and pipeline design.
  • RDS management experience, particularly PostgreSQL.
  • Strong security mindset with experience implementing infrastructure security best practices.
  • Experience with Infrastructure as Code, particularly Terraform.
  • A self-starter who takes full ownership of projects and drives them to completion.
  • Technically savvy and proficient in Microsoft Office; experience with database systems a plus.
  • Excellent written and oral communication skills.
  • Process-oriented and strong collaborator with ability to communicate and manage well at all levels of the organization and across various departments.
  • Strong organizational and time management skills; ability to multitask and prioritize workload.
  • Highly adaptable with strong problem-solving and critical thinking skills; ability to exercise good judgment and make sound data-backed decisions.
  • High level of integrity and dependability with a strong sense of urgency and results-orientation.
  • Views customer care as high priority; exhibits a positive can-do attitude.
  • Displays a strong initiative and drive to identify gaps and fill them.

Work Environment and Physical Demands

  • This job operates in a professional office environment. Office hours are Monday through Friday from 8:00am - 5:00pm.
  • This role routinely uses standard office equipment such as computers, phones, photocopiers, filing cabinets and operates primarily indoors with limited to no travel expectation.
  • Work and commute in all weather conditions.
  • Able to effectively communicate, listen, detect, converse with, discern, convey, express oneself and exchange information.
  • Able to walk, bend, twist, turn, stoop, climb steps, reach with hands, and use hands and fingers.
  • Work in a fast-paced environment that often requires multitasking.
  • Move about inside the office to access standard office equipment.
  • Constantly operate a computer and other office productivity machinery such as keyboard, copy machine and printer.
  • Remain in a stationary position 50%+ of the time, alternating between sitting and standing.
  • Ability to move and lift up to 30 pounds.
  • Drive between company locations and/or vendors or suppliers as needed while on the job.

Benefits

  • Base pay and eligibility for annual profit-sharing bonus.
  • Full insurance package including Health, Dental, Vision, Life, Disability, Employee Assistance.
  • Dependent Care FSA with on-site Daycare options.
  • 401k match and complimentary financial planning services.
  • Paid time off and paid holidays.
  • Opportunity for continued education and tuition assistance.
  • Valuable learning and development program.
  • Significant ability to grow internally for motivated and strong performing team members.
  • Fun, energetic, family-oriented work culture with an emphasis on team member morale.
  • Growing nationwide brand/presence.

United States

Senior TechOps – DevOps Engineer

Reply

Reply designs and implements innovative solutions in the areas of Digital Services, Technology, and Consulting.

DevOps Engineer | 66 days ago
Contract | Remote | Team 10,001+ | Since 1996 | H1B Sponsor

  • Support a global project ensuring stability and performance of a live production environment
  • Maintain and improve cloud infrastructure
  • Support deployments and handle operational challenges in a dynamic setting
  • Collaborate with distributed teams

Brazil

Senior Site Reliability Engineer

Runlayer

The Simpler, Safer Way to Connect MCPs

DevOps Engineer | 66 days ago
Full Time | Remote | Team 11-50 | H1B No Sponsor

  • Own reliability and performance of our cloud infrastructure across AWS (ECS, Aurora, CloudWatch) and GCP
  • Manage and optimize Kubernetes clusters and container orchestration
  • Drive database reliability engineering, including performance tuning and scaling
  • Build and maintain CI/CD pipelines for rapid, safe deployments
  • Run incident response and on-call rotations
  • Partner with product engineers to design scalable, resilient systems

United States