Job Closed

This listing is no longer active.

Business Wire logo
Business Wire

Global Leader in News Content Distribution

Senior DevOps Engineer

DevOps EngineerDevOps EngineerFull TimeRemoteSeniorTeam 501-1,000Since 1961H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

19 days ago

Salary

$160K - $175K / year

Seniority

Senior

Job Description

Senior DevOps Engineer

Business Wire

• Dive deep into transforming our IaC to support cross region resilient AWS infrastructure. • Develop and maintain Infrastructure as Code (Terraform), ensuring reusable modules, environment consistency, and automated provisioning of AWS infrastructure. • Operate and support Kubernetes (EKS) platforms, including cluster lifecycle management, workload deployment, scaling, and container runtime best practices. • Design, implement, and operate standardized CI/CD platforms using GitHub Actions and GitOps tooling (ArgoCD), supporting multi-environment deployments, rollback, and automation. • Implement GitOps deployment patterns and declarative configuration management. • Partner with engineering teams to integrate deployment standards into the application lifecycle. • Produce and maintain documentation and usage standards. • Provide technical mentorship and guidance on DevOps tooling and practices. • Participate in on-call rotation as required.

Job Requirements

  • 10+ years of experience in DevOps, platform engineering, or cloud infrastructure roles.
  • Expertise with Infrastructure as Code using Terraform and CloudFormation.
  • Enterprise experience creating and supporting resilient systems across multiple locations, working within defined RPO and RTOs.
  • Strong hands-on experience building and operating CI/CD platforms (GitHub Actions or equivalent).
  • Deep hands-on experience with Kubernetes and Amazon EKS, including deployment, scaling, and lifecycle management.
  • Strong AWS foundational knowledge (IAM, VPC, EC2, S3, RDS, Lambda, CloudWatch).
  • Experience implementing GitOps workflows using ArgoCD or similar tools.
  • Experience with containerization using Docker.
  • Scripting and automation skills (Bash, Python).

Benefits

  • Ability to work remotely
  • Excellent health benefits that begin on your first day of employment
  • $100 monthly fitness allotment, a tuition reimbursement program, and enhanced mental health resources
  • 401(k) plan with generous company match, and annual profit sharing contribution (subject to company performance)
  • PTO, Floating Holidays, Wellness Day Off, Birthday Day Off, and more!

Related Categories

Related Job Pages

More DevOps Engineer Jobs

AVANTTi logo

DevOps Analyst – Specialist

AVANTTi

Agilidade | Senioridade | Transparência | Responsabilidade Social

DevOps Engineer19 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

• Define and evolve DevOps architecture in complex environments • Develop and maintain CI/CD pipelines (GitLab, Jenkins, Azure DevOps) • Automate infrastructure using Terraform, Ansible and Packer • Manage and operate environments on AWS, Azure and Google Cloud Platform • Deploy and administer solutions with Docker, Kubernetes and OpenShift • Perform advanced troubleshooting in Linux, Windows and cloud environments • Implement Infrastructure as Code (IaC) and GitOps practices • Create and evolve monitoring and observability strategies (Prometheus, Grafana, Datadog, New Relic, Kibana) • Ensure high availability, performance and scalability of systems • Support development teams with DevOps best practices • Lead technical initiatives and serve as a technical reference for the team

Brazil
TechInsights logo

Senior Site Reliability Engineer – Remote UK

TechInsights

The most trusted source of semiconductor analysis and market information

DevOps Engineer19 days ago
Full TimeRemoteTeam 201-500Since 1989H1B No Sponsor

• Own SLOs, SLIs, and error budgets for all production services; drive error budget discipline across engineering • Design reliability patterns for AI agent pipelines: LLM observability, tool-use tracking, failure detection, and graceful degradation • Architect for blast radius containment — agent failures must have bounded customer impact through isolation, circuit breaking, and rapid recovery • Mature our Canada Central/West active-active architecture toward 24-hour RTO with full regional failover • Lead incident response and post-incident reviews that produce durable fixes; maintain DR procedures through regular testing • Serve as the primary reliability liaison to Software and AI Engineering, translating requirements into actionable standards • Partner with AI Engineering on compute provisioning, model serving, inference latency, and workload isolation • Own CI/CD pipeline strategy (Bitbucket Pipelines, GitHub Actions) — set standards, optimize deployment frequency, and ensure teams can ship confidently • Drive IDP adoption and enable teams on SRE practices: on-call readiness, SLO definition, runbook development, and self-service tooling • Represent reliability in architectural discussions; surface risk before it's committed to design • Own the service catalog — a living inventory of all services, AI agents, dependencies, ownership, and SLOs • Operate Datadog as the single pane of glass for service health, infrastructure, and agentic pipeline telemetry • Build golden path templates in Backstage and/or Atlassian Compass so teams ship reliably without routine SRE involvement • Apply AIOps in Datadog to automate anomaly detection, incident triage, and remediation recommendations • Own infrastructure as code via Terraform and GitOps; enforce IaC policy in partnership with Trust Assurance • Own FinOps visibility into AWS cost segments; model cloud cost impact as AI/ML workloads scale • Formally mentor junior and intermediate SRE engineers, with accountability for their technical growth and career progression • Build AI-assisted automation to progressively reduce toil and scale the team's operational capacity

United Kingdom
£77.6K - £82.2K / year
Aubrant Digital logo

Senior DevOps Engineer

Aubrant Digital

Creating Digital Businesses

DevOps Engineer19 days ago
ContractRemoteTeam 51-200Since 2013H1B No Sponsor

Role Description As a Senior Data Engineer, you will design, build, and tune the data layer that powers our clients' mission-critical applications on Azure and SQL Server. You will own complex query performance, indexing strategy, and concurrency design, and you will architect the data access patterns that connect application code to the database through linq2db, LINQ-to-SQL, Entity Framework, and ADO.NET. You will partner with application engineers, architects, and product teams to deliver high-throughput, low-latency data solutions, and you will mentor others on database design, query optimization, and modern data engineering practices. Qualifications - A database craftsperson who treats query performance, indexing, and concurrency as first-class engineering concerns rather than afterthoughts. - A clear communicator who can explain execution plans, locking behavior, and data access trade-offs to engineers, architects, and product stakeholders. - Comfortable operating with ambiguity, capable of profiling production workloads and proposing concrete solutions backed by evidence. - A mentor who raises the bar for the team through code review, query review, and pattern guidance. - Customer-obsessed and outcome-focused, balancing delivery speed with the long-term health and scalability of the data platform. Requirements - Bachelor's Degree in Computer Science or a related discipline, or equivalent experience; MUST be proficient in written and spoken English (85%). - 5 to 8 years of professional data engineering or back-end engineering experience with a strong database focus. - Expert-level proficiency in SQL on SQL Server 2019+, including complex queries, window functions, set-based operations, query plan analysis, indexing strategy, statistics, RCSI, isolation levels, and Change Data Capture. - Expert-level proficiency in database performance tuning, including bottleneck identification, index design, query rewrites, and concurrency design under production load. - Strong proficiency in C# data access using linq2db, LINQ-to-SQL (DBML), Entity Framework, and ADO.NET; ability to choose the appropriate tool for each scenario and avoid ORM performance pitfalls. - Strong proficiency in Python for data engineering tasks, scripting, and automation. - Hands-on experience with Azure data services (Azure SQL, storage, networking, security) and deploying production data workloads in Azure. - Experience with database CI/CD, schema versioning, and migration tooling. - Solid Git, code review discipline, and familiarity with modern engineering practices including testing and observability. - Experience with Azure Data Factory, Synapse, or other Azure analytics services is a plus. - Experience designing event-driven or streaming data architectures is a plus. - Excellent analytical and problem-solving skills; strong communication, collaboration, customer orientation, innovation mindset, and adaptability under ambiguity. Company Description

Finland
Optum logo

SCCM Configuration Manager

Optum

Optum, part of the UnitedHealth Group family of businesses, is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together. At Optum, we support your well-being with an understanding team, extensive benefits and rewarding opportunities. By joining us, you’ll have the resources to drive system transformation while we help you take care of your future. We recognize the power of connection to drive change, improve efficiency and make a difference in health care. Join a team where your skills and ideas can make an impact and where collaboration is key to creating technology that produces healthier outcomes.

DevOps Engineer19 days ago
Full TimeRemoteTeam 160,000Since 2011

Requisition Number: 2354959 Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together. As an I O Engineering Consultant, you will deliver reliable and secure device management services in a Dual UEM environment using Microsoft Endpoint Configuration Manager (MECM/SCCM), Microsoft Intune, and Omnissa Workspace ONE UEM. You will own day-to-day endpoint engineering and operational outcomes across application lifecycle (packaging to deployment), OS servicing, patch compliance, vulnerability remediation, and endpoint configuration. The role is execution-focused with solid troubleshooting depth and disciplined ITSM practices. You will routinely participate in major incident bridges/war rooms, ensuring service restoration within agreed SLAs and contributing to measurable MTTR reduction. You will also contribute to AI adoption by proposing and piloting practical ideations (e.g., ticket triage assist, automated log parsing, runbook generation) to improve reliability, speed, and user experience, while adhering to organizational security, privacy, and data handling standards. Primary Responsibilities: - Platform Operations, Health & Availability - Administer MECM components (collections, boundaries, deployments, content distribution, client health monitoring, reporting) - Administer Intune (enrollment, configuration profiles, compliance policies, app assignments, RBAC) and support coexistence with MECM - Administer Workspace ONE UEM (profiles, Smart Groups, compliance, app deployments) and support coexistence with Intune (Dual UEM) - Monitor platform health and availability: enrollment success, policy/app success rates, client health, content distribution health, and remediation backlogs - Maintain device inventory accuracy and operational hygiene; perform routine health checks and standard validations - Application Lifecycle, OS Servicing & Operations - Package and deploy applications (MSI/EXE/MSIX/Win32) with detection methods, dependencies, supersedence, and phased ring deployments - Support OS deployment and servicing (Task Sequences, feature updates, in-place upgrades), driver/BIOS packaging coordination, and rollback plans - Execute post-deployment validation and remediation for failed installs, policy conflicts, and provisioning issues - Patching, Vulnerability Remediation & Compliance - Operate software update/patching via MECM (ADRs, maintenance windows, deployment schedules, compliance reporting) and coordinate exception handling - Support vulnerability remediation initiatives by deploying security patches and approved mitigations within defined timelines - Track compliance posture and drive remediation for non-compliant endpoints; support dashboards and reporting - Troubleshooting, ITSM & War Rooms - Troubleshoot endpoint deployment and client issues using logs and diagnostics (e.g., AppEnforce/AppDiscovery, WUAHandler, CAS, CCMExec) - Work within ITSM processes (Incident/Problem/Change) including change documentation, risk assessment, and post-change validation - Participate in major incident war rooms/bridges; provide technical inputs, implement mitigations, and document restoration actions - Track and drive SLA adherence; contribute to MTTR reduction via faster triage, known error documentation, and repeat-incident prevention - Automation & AI Adoption / Ideation - Create and maintain PowerShell-based operational automations (health checks, compliance reporting, remediation scripts) - Propose AI-assisted improvements (e.g., Copilot-assisted draft runbooks, KB articles, ticket summarization, and log pattern identification) - Pilot small-scale AI/automation ideations with measurable success criteria (time saved, reduced rework, improved compliance) - Ensure AI usage adheres to organizational security, privacy, and data handling policies (no sensitive data leakage) - Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regard to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so Required Qualifications: - Undergraduate degree or equivalent experience - Core Technical Skills (Primary) - MECM/SCCM: application model, collections, deployments, boundaries, content distribution, client health, reporting - Microsoft Intune: enrollment methods, configuration/compliance policies, Win32 app deployment, troubleshooting, RBAC - Omnissa Workspace ONE UEM: minimum 1 year; administrative depth (profiles, Smart Groups, app distribution, compliance policies, device troubleshooting) - Dual UEM operations: support coexistence, minimize policy conflicts, and ensure consistent end-user experience - Operations, Health & Availability - Health monitoring and remediation: client health, enrollment success, policy/app deployment success, content distribution health - Service reliability ownership: stability, availability, and operational hygiene - Patching & Vulnerability Operations - MECM patching operations: ADRs, deployment rings, maintenance windows, compliance reporting - Vulnerability remediation support: execute remediation plans and validate compliance within required timelines - Automation & Scripting - PowerShell scripting for automation and remediation; familiarity with scheduled tasks and basic safe rollout practices - Ability to create repeatable runbooks and operational checklists - ITSM & Operational Excellence - Working knowledge of Incident/Problem/Change management; ability to write clear change plans and validation steps - Experience working in war rooms and providing concise technical updates for rapid restoration - Ownership mindset for SLA adherence and MTTR reduction - AI Adoption & Continuous Improvement - Comfort using approved AI tools to accelerate documentation, analysis, and knowledge management - Ability to identify repetitive tasks suitable for automation or AI assistance and propose improvements At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone - of every race, gender, sexuality, age, location and income - deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission. Optum is a drug-free workplace. © 2026 Optum Global Solutions (Philippines) Inc. All rights reserved.

Philippines