Job Closed

This listing is no longer active.

DistroKid logo
DistroKid

We're the easiest way for music creators to get music into Spotify, Apple Music, and all major streaming services.

Senior Systems Operations Engineer

DevOps EngineerDevOps EngineerFull TimeRemoteSeniorTeam 51-200Since 2013H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

70 days ago

Salary

$155K - $170K / year

Seniority

Senior

Job Description

Senior Systems Operations Engineer

DistroKid

• Design, deploy, and manage scalable and highly available cloud infrastructure on AWS, with deep expertise in core services (EC2, EKS, S3, RDS, IAM, VPC, and beyond). • Develop and maintain disaster recovery plans leveraging AWS capabilities for backup and replication to ensure business continuity. • Collaborate with engineering and security teams to improve infrastructure health, security, and long-term scalability. • Design reusable Terraform/OpenTofu modules following DRY principles and organizational standards; implement module versioning and lifecycle strategies. • Direct the migration of manual infrastructure to code; establish patterns and best practices for IaC adoption across the team. • Implement IaC testing strategies, including validation, linting, and integration testing, using tools such as Terraform-Compliance or Checkov. • Architect and maintain complex Bitbucket pipeline configurations for multi-environment IaC deployments; implement pipeline security best practices. • Implement AIOps practices, leveraging AI tools to enhance monitoring, incident response, and predictive alerting. • Use AI-assisted development and operations tools (e.g., Cursor, Claude) to accelerate troubleshooting, code review, and documentation generation. • Evaluate and implement AI-powered automation to reduce operational toil, improve repeatability, and scale platform capabilities. • Define and implement SLOs for services; guide and/or participate in incident response and conduct blameless postmortems. • Implement chaos engineering practices to proactively identify system weaknesses before they impact production. • Build and maintain comprehensive monitoring solutions using tools such as CloudWatch and Datadog to track performance and drive optimization. • Develop automation scripts and tools in Python, Bash, or similar languages to streamline operations and eliminate manual toil. • Build self-service capabilities for development teams to reduce cognitive load and enable developer autonomy across the organization. • Guide the solution architecture and end-to-end implementation of DistroKid’s first Internal Developer Portal (IDP). • Define the IDP roadmap and success criteria in partnership with engineering leadership; establish golden paths, service catalogs, and self-service workflows that reduce deployment friction and accelerate developer productivity. • Drive adoption of the IDP across engineering teams; gather feedback, iterate on the platform, and measure impact through developer experience metrics and reduced time-to-deploy. • Guide cost optimization initiatives; implement rightsizing recommendations, reserved-capacity strategies, and tagging standards for cost allocation. • Monitor and optimize AWS resource usage; select appropriate services and configurations to meet performance requirements cost-effectively. • Direct planning, decision-making, and execution for infrastructure projects; own workstreams end-to-end. • Partner cross-functionally with engineering, security, and product teams; communicate impact in terms of company strategy and OKRs. • Provide technical mentorship to junior and mid-level engineers; invest in team growth and foster a culture of continuous learning. • Maintain and contribute to infrastructure documentation, runbooks, and architectural decision records to ensure knowledge sharing and operational consistency.

Job Requirements

  • Bachelor’s degree in Computer Science, Information Technology, a related field, or equivalent practical experience.
  • 5+ years of experience in systems operations, platform engineering, or DevOps with a focus on cloud infrastructure and containerized environments.
  • Proven production experience with AWS services (EC2, EKS, S3, RDS, IAM, VPC, API Gateway, Event Bridge, etc) and Kubernetes.
  • 5+ years of hands-on experience with Infrastructure as Code tools, specifically Terraform and/or OpenTofu, including module design, state management, remote backends, and IaC testing.
  • Strong knowledge of Linux/Unix administration, systems, and shell scripting.
  • Proficiency in Python, Go, or similar programming languages.
  • Experience with CI/CD pipelines for infrastructure deployments (Bitbucket Pipelines, Jenkins, or similar).
  • Experience with monitoring and observability tools (Prometheus, Grafana, CloudWatch, or Datadog).
  • Demonstrated experience implementing or working with AIOps tools, practices, or AI-assisted operations in a professional context.
  • Experience using AI-assisted development tools (e.g., Cursor, Warp, Claude, or similar) to accelerate engineering work.

Benefits

  • Retirement plans (401k, SIPP, etc.)
  • Health insurance
  • Generous paid time off
  • Parental leave
  • Home office allowance
  • Flexible work schedules
  • Paid and discounted subscriptions
  • Regular engagement activities

Related Categories

Related Job Pages

More DevOps Engineer Jobs

OtherRemoteTeam 5,001-10,000H1B Sponsor

Join the team leading the next evolution of virtual care. At Teladoc Health, you are empowered to bring your true self to work while helping millions of people live their healthiest lives. Here you will be part of a high-performance culture where colleagues embrace challenges, drive transformative solutions, and create opportunities for growth. Together, we’re transforming how better health happens. Summary of Position The Principal Platform Engineer (DevOps / Developer Experience) is a senior individual contributor who accelerates platform delivery by pairing strong software engineering with deep platform/operations expertise. This role sets the technical bar, works collaboratively across cross teams, and delivers reusable patterns that improve delivery speed, reliability and developer productivity. Essential Duties and Responsibilities Accelerate Top Priorities - Act as a technical “force multiplier” on the highest-priority initiatives; clarify approach, resolve ambiguity, and drive work to completion with high quality and pragmatic trade-offs. - Reduce cross-team friction by defining clear interfaces, breaking work into deliverable increments, and enabling parallelization through strong architecture boundaries. Raising Engineering Standards - Establish and model best practices for engineering excellence: design docs/RFCs, architecture reviews, code review discipline, and effective automated testing strategies. - Drive API-first and “platform as a product” behaviors: define and promote consistent platform interfaces that reduce bespoke integrations and siloed solutions. Build Paved Roads - Create reusable platform capabilities (templates/modules/golden paths) that reduce reinvention and speed up delivery for teams. - Drive automation opportunities (including agentic/AI-enabled workflows) that improve operational and delivery efficiency. Improve Operational Excellence - Lead cross-cutting improvements that enhance stability and reduce toil: observability standards, alert hygiene, incident learning loops, and resilience patterns. - Partner with operations and platform stakeholders to measurably improve reliability outcomes and reduce operational drag on platform delivery teams. Partner and Mentor - Coach senior/staff engineers by pairing on real work, running reviews, and teaching pragmatic system-level thinking. - Set clear examples of technical leadership, collaboration, and accountability without formal people management responsibility. On-call Participation - Participate in the on-call rotation and contribute to restoration, root cause learning, and prevention. Required Qualifications - Bachelor’s degree in Computer Science, Engineering, or a related technical field. - 15+ years of hands-on software engineering designing, building, testing, deploying and operating large-scale distributed systems in cloud-native environments. - 5+ years operating at Staff or Principal scope, leading multi-quarter, cross-team technical initiatives that span 3+ teams and deliver organization-level outcomes. - 8+ years of experience designing and operating microservices-based systems, including API design and versioning, authentication and authorization frameworks (e.g. OAuth, OIDC, IAM), and Infrastructure-as-Code (e.g. Terraform, Cloudformation, ARM) - Deep hands-on experience (5+ years) in at least three of the following: Kubernetes and container orchestration platforms, public cloud infrastructure (AWS/Azure/GCP), CI/CD systems and deployment automation, Infrastructure-as-Code and configuration management, and production operations, reliability tooling and on-call systems. - Demonstrated ownership of production systems supporting business-critical workloads, including participation in incident response, post-incident reviews, and reliability improvements at scale. Preferred Qualifications - Proven ability to operate as a self-directed technical leader, navigating ambiguity, defining problem spaces, and driving clarity and alignment across multiple teams. - Demonstrated success influencing technical direction across globally distributed teams and multiple levels of the organization without formal authority. - Strong written and verbal communication skills, with the ability to translate complex technical concepts for engineering, product and executive audiences. - Experience designing or evolving internal platforms or self-service capabilities that materially improve developer experience, delivery throughput, or operational efficiency. - Strong background in observability (metrics, logs, traces), incident management, and reliability practices, with a track record of improving system health and reducing operational toil. - Deep understanding of performance optimization, system resilience, and observability in high-scale production environments. - Experience working in regulated industries such as healthcare or fintech, including familiarity with compliance-driven architectural and security considerations. - Familiarity with healthcare data standards (e.g. FHIR, HL7) and platform security best practices. The base salary range for this position is $180,000 - $210,000. In addition to a base salary, this position is eligible for a performance bonus and benefits (subject to eligibility requirements) listed here: Teladoc Health Benefits 2026. Total compensation is based on several factors including, but not limited to, type of position, location, education level, work experience, and certifications. This information is applicable for all full-time positions. We follow a Flexible Vacation Policy, intended for rest, relaxation, and personal time. All time off must be approved by your manager prior to use. You will also receive 80 hours of Paid Sick, Safe, and Caregiver Leave annually. This applies to full-time positions only. If you are applying for a part-time role, your recruiter can provide additional details. As part of our hiring process, we verify identity and credentials, conduct interviews (live or video), and screen for fraud or misrepresentation. Applicants who falsify information will be disqualified. Teladoc Health will not sponsor or transfer employment work visas for this position. Applicants must be currently authorized to work in the United States without the need for visa sponsorship now or in the future. Why join Teladoc Health? - Teladoc Health is transforming how better health happens. Learn how when you join us in pursuit of our impactful mission. - Chart your career path with meaningful opportunities that empower you to grow, lead, and make a difference. - Join a multi-faceted community that celebrates each colleague’s unique perspective and is focused on continually improving, each and every day. - Contribute to an innovative culture where fresh ideas are valued as we increase access to care in new ways. - Enjoy an inclusive benefits program centered around you and your family, with tailored programs that address your unique needs. - Explore candidate resources with tips and tricks from Teladoc Health recruiters and learn more about our company culture by exploring #TeamTeladocHealth on LinkedIn. As an Equal Opportunity Employer, we never have and never will discriminate against any job candidate or employee due to age, race, religion, color, ethnicity, national origin, gender, gender identity/expression, sexual orientation, membership in an employee organization, medical condition, family history, genetic information, veteran status, marital status, parental status, or pregnancy). In our innovative and inclusive workplace, we prohibit discrimination and harassment of any kind. Teladoc Health respects your privacy and is committed to maintaining the confidentiality and security of your personal information. In furtherance of your employment relationship with Teladoc Health, we collect personal information responsibly and in accordance with applicable data privacy laws, including but not limited to, the California Consumer Privacy Act (CCPA). Personal information is defined as: Any information or set of information relating to you, including (a) all information that identifies you or could reasonably be used to identify you, and (b) all information that any applicable law treats as personal information. Teladoc Health’s Notice of Privacy Practices for U.S. Employees’ Personal information is available at this link.

United States
$180K - $210K / year
Banner Health logo

DevOps Engineer IV

Banner Health

Making health care easier, so life can be better.

DevOps Engineer70 days ago
OtherRemoteTeam 10,001+Since 1999H1B Sponsor

Department Name: Cloud Platforms/Infrastructure Work Shift: Day Job Category: Information Technology Estimated Pay Range: $53.63 - $89.38 / hour, based on location, education, & experience.In accordance with State Pay Transparency Rules. Health care is constantly changing, and at Banner Health, we are at the front of that change. We are leading health care to make the experience the best it can be. We want to change the lives of those in our care – and the people who choose to take on this challenge. If changing health care for the better sounds like something you want to be part of, we want to hear from you. The Digital Business Technology team is responsible for enabling technology to that enhances consumer, Patient, Provider and Employee experiences across Banner Health. The Digital Business Technology team takes pride in being obsessed with enabling self-service, eliminating time-consuming transactional and manual tasks, and implementing innovative solutions to solve complex problems. This can be a remote position if you live in the AZ or CO only. Your pay and benefits (Total Rewards) are important components of your Journey at Banner Health. Banner Health offers a variety of benefit plans to help you and your family. We provide health and financial security options, so you can focus on being the best at what you do and enjoying your life. Within Banner Health Corporate, you will have the opportunity to apply your unique experience and expertise in support of a nationally-recognized healthcare leader. We offer stimulating and rewarding careers in a wide array of disciplines. Whether your background is in Human Resources, Finance, Information Technology, Legal, Managed Care Programs or Public Relations, you'll find many options for contributing to our award-winning patient care. POSITION SUMMARY This position is a highly experienced individual contributor in Development Operations. The position will lead a project team to perform the tasks necessary to analyze, design, configure, implement and support PaaS solutions, related services, processes, applications, and integrations. This will involve influencing IT functional areas, product owners and vendors to develop detailed design, execution and troubleshooting of strategic solutions in support of these systems. The position leads the efforts of resolving application and configuration issues/concerns, providing ongoing analysis of performance, implementation of approved changes, and ensuring continual service improvements. Will be responsible for Architect Cloud services that span storage, security, networking, and compute cloud capabilities. Responsible for all aspects of application production support, deployment and monitoring and develop tools to support these activities. Leads mission critical applications and associated platforms, ensuring the highest levels of availability, security, performance and stability are always maintained. Designs and builds tools and solutions with a strong bias towards automating as many aspects of support as possible to reduce or eliminate trivial support activities. CORE FUNCTIONS 1. Anticipates internal and external business challenges and recommends best practices to improve services, processes or products. Manages projects or programs. Recognized as an expert within the organization and within their field or function. 2. Solves unique and complex problems that have a broad impact on the business. Presents complex ideas, anticipates potential objectives and persuades others to adopt a different point of view. 3. Develops innovative services, technologies, processes or products that address current and future customer problems or needs. Interacts primarily with customers, peers, peers’ managers, patients and physicians across the organization. 4. Makes decisions with general functional, company and industry guidelines. May manage budget for large and/or complex projects or programs. MINIMUM QUALIFICATIONS Bachelor’s degree or equivalent working knowledge. Must have in-depth knowledge of concepts within job function as would normally be obtained in eight to twelve years' work experience developing Enterprise Applications. Must possess strong knowledge of programming and cloud technology. Needs experience in medium scale project planning. Successful candidate will have skills to mentor less experienced team members. Requires strong communication and presentation skills to explain and resolve complex technical issues to technical and non-technical audiences. Requires ability to influence and interact across facilities and at various levels. As is typical in this industry, variable shifts and hours and carrying/responding to a pager may be required. PREFERRED QUALIFICATIONS Cloud platform Certifications: AZ-301, AZ-400, and AZ-500. Significant development and operations / engineering experience with the ability to apply that knowledge to solve complex problems. Three to four years' experience implementing Enterprise Cloud Solutions. Strong working knowledge of Java/C++/C#/.Net Core, hardware environment, and use of program logic. Additional related education and/or experience preferred. Anticipated Closing Window (actual close date may be sooner): 2026-07-17 EEO Statement: EEO/Disabled/Veterans Our organization supports a drug-free work environment. Privacy Policy: Privacy Policy

United States
$54 - $89 / hour
Job Closed
Cross Border Talents logo

Consultant AI DevOps Engineer

Cross Border Talents

🌎 Your international recruitment partner for hard to find professionals and jobs all over the globe.

DevOps Engineer70 days ago
Full TimeRemoteTeam 201-500Since 2013H1B No Sponsor

Role Description Join a leading data & AI consultancy delivering enterprise AI/ML/GenAI solutions, with a strong focus on analytics, forecasting, and AI transformation. This role combines hands-on engineering + consulting mindset, focusing on deploying LLMs/SLMs in production, building scalable infrastructure, and advising stakeholders. This is not just DevOps — this is AI infrastructure + client-facing impact + technical leadership. Location: Remote | Poland Compensation: PLN 230,400 – 249,600 / year Job Type: Full-Time Key Areas: AI | GenAI | MLOps | Azure | Consulting What You'll Do - Design and deploy AI/ML/GenAI systems at scale - Lead infrastructure and automation for: - Model lifecycle - Inference pipelines - Build secure, scalable Azure environments - Own CI/CD pipelines for AI workloads - Optimize performance, cost, and monitoring - Act as technical advisor to stakeholders and clients - Mentor engineers and contribute to best practices Tech Stack - Azure - Kubernetes - Terraform - Ansible - Python - Bash / PowerShell - CI/CD (Azure DevOps, GitHub Actions, Jenkins) - Linux - macOS Qualifications - 5+ years in DevOps / Cloud Engineering - Proven experience in: - AI/ML/GenAI systems - Deploying LLMs/SLMs in production - Strong Azure expertise (key requirement) - Advanced scripting (Python + Bash/PowerShell) - Deep knowledge of: - IaC (Terraform / Ansible) - CI/CD pipelines - Experience with Kubernetes & scalable systems Bonus - Consulting / client-facing experience - Mentoring / leadership exposure - Multi-cloud experience (AWS / GCP) - Azure certifications Compensation & Benefits - PLN 230,400 – 249,600 annually - 110+ trainings + Udemy access - Strong internal growth opportunities - Workation + flexible work model - Inclusive, DEI-driven culture Location & Eligibility - Remote within Poland - Must be based in Poland - Must have valid work authorization - No visa sponsorship Ideal Candidate Profile - Has deployed LLMs/SLMs in production (not just API usage) - Strong in Azure + automation + infrastructure design - Comfortable in client-facing environments - Advisory roles - Balances hands-on engineering + communication skills

Poland
PLN230.4K - PLN249.6K / year
Job Closed
Cross Border Talents logo

Consultant AI DevOps Engineer | LLM Deployment & Azure

Cross Border Talents

🌎 Your international recruitment partner for hard to find professionals and jobs all over the globe.

DevOps Engineer70 days ago
Full TimeRemoteTeam 201-500Since 2013H1B No Sponsor

Role Description Join a leading data & AI consultancy delivering enterprise AI/ML/GenAI solutions, with a strong focus on analytics, forecasting, and AI transformation. This role combines hands-on engineering and a consulting mindset, focusing on deploying LLMs/SLMs in production, building scalable infrastructure, and advising stakeholders. This is not just DevOps — this is AI infrastructure, client-facing impact, and technical leadership. What You'll Do - Design and deploy AI/ML/GenAI systems at scale - Lead infrastructure and automation for: - Model lifecycle - Inference pipelines - Build secure, scalable Azure environments - Own CI/CD pipelines for AI workloads - Optimize performance, cost, and monitoring - Act as technical advisor to stakeholders and clients - Mentor engineers and contribute to best practices Tech Stack - Azure - Kubernetes - Terraform - Ansible - Python - Bash / PowerShell - CI/CD (Azure DevOps, GitHub Actions, Jenkins) - Linux - macOS Qualifications - 5+ years in DevOps / Cloud Engineering - Proven experience with AI/ML/GenAI systems - Deploying LLMs/SLMs in production - Strong Azure expertise (key requirement) - Advanced scripting (Python + Bash/PowerShell) - Deep knowledge of: - IaC (Terraform / Ansible) - CI/CD pipelines - Experience with Kubernetes & scalable systems Bonus - Consulting / client-facing experience - Mentoring / leadership exposure - Multi-cloud experience (AWS / GCP) - Azure certifications Compensation & Benefits - PLN 230,400 – 249,600 annually - 110+ trainings + Udemy access - Strong internal growth opportunities - Workation + flexible work model - Inclusive, DEI-driven culture Location & Eligibility - Remote within Poland - Must be based in Poland - Must have valid work authorization - No visa sponsorship Ideal Candidate Profile - Has deployed LLMs/SLMs in production (not just API usage) - Strong in Azure, automation, and infrastructure design - Comfortable in client-facing environments and advisory roles - Balances hands-on engineering and communication skills

Poland
PLN230.4K - PLN249.6K / year
Job Closed