Job Closed

This listing is no longer active.

Prompt Therapy Solutions Inc

Senior DevOps Engineer, Infrastructure, MLOps

DevOps EngineerDevOps EngineerOther Remote SeniorTeam 11-50H1B No SponsorCompany Site LinkedIn

Location

United States

Posted

149 days ago

Salary

$180K - $200K / year

Seniority

Senior

Bachelor Degree5 yrs expEnglishAnsible AWS Azure Docker GCP Grafana Kubernetes Prometheus Python Terraform

Job Description

• Design, implement, and manage highly available infrastructure for our cloud-based platforms (AWS). • Work with our internal engineering teams to architect and support AI/ML infrastructure, specifically managing AWS Lambda infrastructure, and some legacy SageMaker environments for model training, hosting, and inference. • Create and automate robust deployment pipelines using CI/CD tools (GitLab / GitHub Actions) for both web applications and machine learning models. • Build, maintain, and scale containerized applications with Docker and ECS/Fargate. • Implement MLOps best practices to streamline the transition of models from development to production. • Ensure system scalability and reliability through proactive monitoring, logging, and automated alerting. • Collaborate with both Product Engineers and Data Scientists to optimize performance, security, and infrastructure costs. • Manage and evolve our Infrastructure as Code (IaC) footprint.

Job Requirements

5+ years of experience in a DevOps or infrastructure role.
Expert knowledge of cloud platforms such as AWS, GCP and Azure
Strong experience with containerization technologies (Docker, ECS / Kubernetes).
Proven track record of designing and managing complex CI/CD pipelines.
Experience with MLOps workflows (model versioning, retraining pipelines, or feature stores).
Hands-on experience with monitoring and logging tools (Datadog, Prometheus, Grafana, MLflow).
Expertise in scripting languages (Python is a must, along with Bash, Go, etc.).
Proficiency with infrastructure automation tools (Terraform, Ansible, or CloudFormation).
Excellent communication skills and the ability to bridge the gap between traditional DevOps and Data Science teams.

Benefits

Competitive salaries
Remote/hybrid environment
Potential equity compensation for outstanding performance
Flexible PTO
Company-wide sponsored lunches
Company paid disability and life insurance benefits
Company paid family and medical leave
Medical, dental, and vision insurance benefits
Discounted pet insurance
FSA/DCA and commuter benefits
401k
Credits for online fitness classes/gym memberships
Recovery suite at HQ – includes a cold plunge, sauna, and shower

Related Categories

DevOps Engineer

Related Job Pages

Remote Python Jobs (US)More Remote Jobs

More DevOps Engineer Jobs

Junior DevOps Engineer

Cprime, Inc

Cprime enables our clients to turn ideas into action faster.

DevOps Engineer149 days ago

Contract RemoteTeam 501-1,000Since 2003H1B No Sponsor

Company Site LinkedIn

• Assist in building, maintaining, and improving CI/CD pipelines in GitLab, with exposure to SAP platforms. • Support the automation of deployment, monitoring, and remediation processes under the guidance of senior engineers. • Help document CI/CD pipelines, templates, and operational procedures. • Participate in implementing DevSecOps best practices, including basic security and compliance checks. • Collaborate with infrastructure, security, and application teams to support on-prem (VMware) and cloud environments (GCP / Kubernetes). • Monitor pipelines and systems, troubleshoot issues, and escalate when necessary. • Continuously learn DevOps tools, cloud technologies, and automation practices.

Ansible AWS Azure GCP Kubernetes Linux Terraform VMware

View details: Junior DevOps Engineer

Hungary

Apply

Job Closed

Senior Site Reliability Engineer – SRE

Moonlite

DevOps Engineer149 days ago

Other RemoteTeam 1-10

Company Site LinkedIn

• Design, build, and operate production Kubernetes clusters on bare-metal infrastructure. • Implement and operate custom Kubernetes networking solutions. • Develop and maintain custom Kubernetes operators and controllers. • Deploy and optimize NVIDIA GPU operators and custom scheduling logic for GPU workloads. • Build deep integrations between Kubernetes and underlying infrastructure. • Design and implement automation using Terraform, Ansible, Helm, and custom operators. • Manage production bare-metal infrastructure across multiple regions ensuring high availability, fault tolerance, and graceful degradation. • Build comprehensive monitoring, logging, and alerting using Prometheus, Grafana, and ELK stack. • Identify and resolve performance bottlenecks across infrastructure domains.

Ansible DNS Grafana Kubernetes Linux Prometheus Python Terraform

View details: Senior Site Reliability Engineer – SRE

Illinois

$165K - $225K / year

Apply

Job Closed

DevOps Engineer

AAPC

Advancing the Business of Healthcare

DevOps Engineer149 days ago

Other RemoteTeam 51-200Since 1988H1B No Sponsor

Company Site LinkedIn

• Continuously assess technology hosted in the public cloud against industry standards and security compliance. • Streamline Infrastructure platforms to 100% “everything as code" in the public cloud. • Write code once and reuse it as much as possible for Infra as code, CICD pipelines, templates, etc. • Implements end-to-end automated CICD practices for build, scan, packaging, test, and deployment. • Drive Containerization across application workloads to leverage native cloud features and scalability. • Achieve 100% compliance on all infrastructure vulnerabilities and package vulnerabilities. • Drive 100% self-service and reusable automation across stakeholders for platform requests. • Add instrumentation across Infrastructure for monitoring and alerting on internal problems. • Build processes and diagnostic tools to troubleshoot, maintain, and optimize Infrastructure. • Adopt continuous learning of modern data engineering practices. • Maintain industry standards through incremental adoption of new technology and best practices. • Create and continuously maintain high-quality documentation for platform and DevOps practices.

AWS Azure Docker Python Terraform

View details: DevOps Engineer

United States

Apply

Job Closed

Deployment Engineer

Cyngn

Autonomous Vehicle solutions and retrofits for industrial use cases across logistics, material handling, and mining.

DevOps Engineer150 days ago

Other RemoteTeam 51-200H1B Sponsor

Company Site LinkedIn

• Lead end-to-end deployment of autonomous robotic systems at customer facilities. • Conduct site surveys and assess automation readiness, infrastructure constraints, and ODD requirements. • Generate 3D and semantic maps and validate localization and navigation performance. • Install, configure, calibrate, and commission robotic systems in live production environments. • Train customer operators, maintenance teams, and site stakeholders; manage handoff to support. • Work directly with the Customer Solutions Engineering team to take their designs in a handoff and perform a full implementation and betterment of these designs. • User Acceptance Testing (UAT): Define and lead the final testing phase during the implementation, proving to the customer that the system meets all safety and throughput Key Performance Indicators (KPIs). • Perform the handoff and orientation of the vehicle with the customer as well as part of the handoff. • Act as the primary technical point of contact during deployments and early operations. • Work directly with customer IT teams to configure networks, resolve firewall/port issues, and ensure reliable connectivity. • Run regular performance reviews with customers, using data to drive operational improvements and adoption. • Head the deployment of the routes on site, and coordinate with the customer to make sure these routes remain optimized overtime. • Analyze robot logs, sensor data, and system metrics using tools such as Foxglove, RViz, RQT_Bag, and PlotJuggler. • Diagnose hardware, software, perception, and infrastructure issues in the field. • Own incident resolution across Tier 1–3 support, ensuring fast MTTR and clear root-cause documentation. • Monitor fleet health and KPIs using Grafana and internal dashboards. • Provide structured feedback from the field to Product, Engineering, QA, and Perception teams. • Support validation activities including FoV studies, data collection, and perception audits. • Collaborate with OEM partners to integrate robotics hardware and software into new vehicle platforms. • Build and maintain scalable deployment playbooks, checklists, and Jira-based workflows.

Grafana Linux

View details: Deployment Engineer

United States

$90K - $112K / year

Apply

Job Closed

Senior DevOps Engineer, Infrastructure, MLOps

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Junior DevOps Engineer

Senior Site Reliability Engineer – SRE

DevOps Engineer

Deployment Engineer