Job Closed

This listing is no longer active.

CVS Health

Bringing our heart to every moment of your health.

Principal Cloud Infrastructure Engineer

Infrastructure Engineer | Full Time | Remote | Lead | Team 10,001+ | Since 1963 | H1B No Sponsor

Location

United States

Posted

9 days ago

Salary

$144.2K - $288.4K / year

Seniority

Lead

Job Description

Role Description

We are looking for a Principal Engineer to lead our AWS Cloud Engineering team, owning the Amazon Web Services platform for the enterprise. This is a foundational platform role: you are the AWS technical authority, setting architectural direction, establishing engineering standards, and ensuring the platform is secure, scalable, and built to last. You lead from the front. You design the systems others build on, mentor the engineers around you, and hold the line on quality and best practices. You bring deep AWS expertise, a platform-owner mindset, and the leadership presence to align engineers and stakeholders around a shared technical vision.

This role demands a cloud-first thinker who ensures cloud solutions meet business needs efficiently while prioritizing Infrastructure as Code (IaC) to create repeatable, automated deployments. You need a proven track record of architecting cloud environments from scratch. You'll drive cloud transformation initiatives across all CSPs, with a focus on AWS platforms, while ensuring every design decision considers security, reliability, and scalability. This is not a hands-off leadership role: you write code, review designs, and stay close to the work.

Major Responsibilities

AWS Platform Ownership

- Own the enterprise AWS platform end-to-end, including the AWS Organizations structure and account hierarchy, while collaborating with several teams to ensure the platform is stable and compliant.
- Define and maintain the AWS Landing Zone, including AWS Control Tower, Service Control Policies (SCPs), billing controls, and account vending patterns, as the foundation all product teams build on.
- Serve as the final technical authority on AWS architecture decisions, reviewing designs for scalability, security, and operational excellence before they reach production.
- Build self-service platform capabilities that enable product engineering teams to move fast without compromising standards.

Technical Team Leadership

- Lead the AWS cloud engineering team as the technical anchor: set direction, conduct design reviews, unblock engineers, and drive delivery on platform initiatives.
- Establish and enforce engineering standards: IaC patterns, naming conventions, tagging strategy, branching models, and deployment practices.
- Mentor engineers at all levels, building depth on the team and raising the bar on what “excellence” looks like in cloud engineering.
- Partner with architecture, security, operations, and business stakeholders to translate enterprise requirements into platform capabilities.

Infrastructure as Code & Automation

- Design and own the Terraform framework for all AWS resource provisioning: reusable modules, remote state management via S3/DynamoDB, pipeline integration, and policy guardrails.
- Build and maintain CI/CD pipelines using AWS CodePipeline, CodeBuild, GitHub Actions, and Amazon ECR for both platform infrastructure and application teams.
- Write production-quality automation to extend platform functionality, integrate AWS APIs, and eliminate operational toil.
- Implement policy-as-code using OPA, AWS Config Rules, and Service Control Policies to enforce governance at scale without manual gatekeeping.

Networking, Security & Compliance

- Architect and operate AWS networking: VPC design, VPC Lattice, AWS PrivateLink, Transit Gateway, AWS WAF, Shield Advanced, NAT Gateway, and hybrid connectivity via AWS Direct Connect and Site-to-Site VPN.
- Own the enterprise security posture on AWS: IAM Roles for Service Accounts (IRSA), ECR Image Signing, AWS Secrets Manager, least-privilege IAM design, and SIEM/CSPM integration (AWS Security Hub, Prisma Cloud, or Wiz).
- Drive continuous automated compliance across applicable regulatory frameworks (HIPAA, PCI, SOC 2) so controls are enforced in real time, not discovered at audit.
- Integrate observability (Amazon CloudWatch, AWS X-Ray, Datadog, and SLO/SLI frameworks) as a first-class platform capability across all workloads.

Platform Strategy & Continuous Improvement

- Own the AWS platform roadmap, evaluating new AWS services and capabilities and making deliberate decisions about what the enterprise adopts and when.
- Incorporate FinOps practices across the platform: Reserved Instances, Savings Plans, rightsizing, AWS Budgets alerting, and cost allocation as engineering disciplines, not afterthoughts.
- Research and pilot emerging AWS capabilities (Amazon Bedrock, EKS Auto Mode, Amazon Q Developer), evaluating their fit for enterprise adoption.
- Foster a culture of operational excellence: blameless postmortems, runbook-driven operations, and continuous improvement cycles that make the platform more reliable over time.

Qualifications

- 10+ years in cloud and infrastructure engineering with 5+ years of deep, hands-on AWS experience at enterprise scale.
- Proven ownership of an AWS Organization: account hierarchy, Billing, Service Control Policies, IAM, and multi-account governance in production.
- Demonstrated technical leadership: you have led a platform team or major enterprise cloud initiative, set technical direction, and grown engineers around you.
- Deep AWS expertise required across:
  - Compute & Containers: EKS (Managed + Auto Mode), ECS/Fargate, EC2, Auto Scaling Groups
  - Networking: VPC, VPC Lattice, AWS PrivateLink, Transit Gateway, AWS WAF, Shield Advanced, Direct Connect
  - Data & Messaging: Amazon Redshift, SNS/SQS, S3, AWS Glue, Kinesis, Amazon MWAA
  - Security: IAM, IRSA, AWS Security Hub, ECR Image Signing, Secrets Manager, VPC Endpoints
  - IaC & Automation: Terraform (modules, remote state, OPA), AWS CodePipeline, AWS Config, CloudFormation
  - Observability: Amazon CloudWatch, AWS X-Ray, Datadog, SLO/SLI design, PagerDuty integration
  - Languages: Python, Go, and Terraform

Preferred Qualifications

- AWS Certified Solutions Architect – Professional (strongly preferred)
- AWS Certified DevOps Engineer – Professional
- HashiCorp Terraform Associate or Professional certification
- Experience in regulated industries applying HIPAA, PCI-DSS, or FedRAMP controls on AWS
- Familiarity with AWS Outposts, EKS Anywhere, and multi-cloud connectivity patterns
- Experience with Amazon Bedrock, SageMaker, and MLOps patterns on AWS

Education

- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent demonstrated experience.

Pay Range

The typical pay range for this role is: $144,200.00 - $288,400.00. This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company’s equity award program.

Benefits

- Comprehensive and competitive mix of pay and benefits.
- Medical, dental, and vision coverage.
- Paid time off.
- Retirement savings options.
- Wellness programs and other resources, based on eligibility.

Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve, and we are committed to fostering a workplace where every colleague feels valued and feels that they belong.

More Infrastructure Engineer Jobs

Senior Azure Infrastructure Engineer

Sparq

In the age of AI, differentiation isn’t in what you build - it’s in the problems you choose to solve and the outcomes they unlock. That’s why we help leaders cut through the noise, focus on what matters most, and solve it right the first time. By fusing problem-first thinking, deep technical craftsmanship, and fast, flawless delivery, we de-risk transformation and deliver solutions that stick, scale, and prove their worth. With a deep bench of end-to-end technologists, architects, and engineers, no challenge is too complex and no solution is half-built. At Sparq, your mission is our mission. We’re modular by design - meeting you where you are and accelerating you toward where you’re meant to be. We don’t just guide - we climb with you. Embedded alongside your teams, we chart the course, build with precision, and navigate complexity until your outcomes are achieved. Built to solve, not just to build.

Role Description

Why you will enjoy Mondays again:

- Opportunity to work alongside a talented, diverse team in a collaborative and creative environment
- Ongoing investment in your growth through training, mentorship, and industry certifications
- Hands-on exposure to cutting-edge technologies across multiple industries
- Recognition programs and employee awards within an inclusive culture that celebrates great work, including our AI Innovators Program
- Remote or hybrid work options
- Competitive salary + bonus opportunities
- Robust benefits package, matching 401(k) plan, and substantial PTO
- Tuition reimbursement

A Day in the Life:

- Design, build, and maintain secure Azure cloud infrastructure within client-owned Azure tenants, supporting environments that require private networking, controlled access, and no-public-endpoint architecture.
- Develop and manage Infrastructure as Code using Bicep or ARM templates, ensuring changes follow peer-review, version control, and audit-trail best practices.
- Configure and support Azure services including Functions Premium, Virtual Networks, Private Endpoints, Key Vault, Entra ID, Managed Identities, and scoped RBAC permissions.
- Build and enforce CI/CD pipelines using Azure DevOps and/or GitHub Actions to support reliable infrastructure deployments, environment consistency, and secure release workflows.
- Provision and manage Azure PostgreSQL, Blob Storage, and related access controls to ensure data services are secure, properly scoped, and aligned with client architecture standards.
- Actively leverage AI tools to streamline tasks, improve the quality of your work, and share best practices with teammates while continuously seeking new ways to integrate AI into your everyday workflows.
- Implement observability and security monitoring through Azure Monitor, Log Analytics, and Microsoft Sentinel integration to improve visibility, alerting, and operational readiness.
- Partner with security, application development, and client infrastructure teams to troubleshoot issues, review cloud architecture decisions, and support secure cloud delivery across complex Azure environments.
- Participate in design reviews, code reviews, and deployment planning sessions to ensure infrastructure solutions are scalable, compliant, and aligned with enterprise cloud governance expectations.

Qualifications

- Consultative approach and problem solving skills to successfully align digital solutions with long-term business goals of the client
- Hands-on experience using AI tools to enhance daily work, or a strong desire to do so, including a willingness to experiment, learn, and champion AI adoption within your role
- Commitment to understanding and exceeding client expectations
- Ability to perform project oversight and execution of deliverables
- Flexibility to adapt within a high-growth organization
- Ability to lead, mentor and motivate those around them
- Hunger for continuous learning and professional development
- Intellectual curiosity to provide creative solutions
- Full understanding of the software development life cycle
- Ability to positively impact fellow colleagues through effective leadership, presentations, coaching, etc.
- Desire to work in a team environment
- Good interpersonal, written and verbal communication skills

Requirements

- C2C is not available
- Must be authorized to work in the U.S. without sponsorship

Benefits

- Robust benefits package, matching 401(k) plan, and substantial PTO
- Tuition reimbursement

Equal Employment Opportunity Policy

Sparq is proud to offer equal employment opportunity without regard to age, color, disability, gender, gender identity, genetic information, marital status, military status, national origin, race, religion, sexual orientation, veteran status, or any other legally protected characteristic. We are committed to providing equal employment opportunities and believe in an inclusive workplace. If you require reasonable accommodations to participate in the job application or interview process, please let us know by contacting recruiting@teamsparq.com.

United States

Infrastructure Engineer

Orcrist Technologies GmbH

Pioneering Future Technologies with Advanced AI and Data Analytics

Full Time | Remote | Team 11-50 | H1B No Sponsor

• Design, size, provision, and operate bare-metal GPU server fleets across on-prem and air-gapped environments (firmware/BIOS, BMC via Redfish/IPMI, OS, drivers) with zero-touch provisioning (PXE/iPXE, MAAS/Metal3/Tinkerbell) and automation (Ansible/Salt, Terraform/Pulumi).
• Own the NVIDIA GPU stack end to end: drivers, CUDA, GPU Operator, Container Toolkit, MIG, and DCGM, tuned for inference throughput, latency, and utilization.
• Build the bare-metal substrate Kubernetes runs on: node lifecycle, container runtime, GPU device plugins, node feature discovery, and kernel/NUMA tuning.
• Engineer data-center networking and resilient storage (VLANs/switching, RDMA, Ceph/ZFS/NVMe) sized to scale without replacing the core, with encryption at rest.
• Partner with ML and MLOps on on-prem inference serving (Triton, KServe, vLLM): model deployment, GPU scheduling and sharing, and performance tuning.
• Plan and run on-site build-outs: rack integration, power/UPS and cooling sizing, commissioning, capacity planning, runbooks, and operator handover.

Germany

Principal Infrastructure Architect

DigitalOcean

The cloud ☁️ of choice for developers, startups, and growing digital businesses around the world.

Full Time | Remote | Team 1,001-5,000 | Since 2011 | H1B Sponsor

• Defining and owning DigitalOcean's hardware architectures for next-generation compute environments across our CPU, GPU, Storage, and Infrastructure Server SKU products, navigating shifts in technology, networking topologies, and AI/ML requirements.
• Serving as DigitalOcean's primary technical stakeholder for infrastructure technologies, partnering with SMEs in the Product and Hardware Engineering organization.
• Providing leadership and guidance as a primary stakeholder in the hardware selection and qualification process.
• Evangelizing emerging technical concepts to senior leadership to inform product and business strategies.
• Architecting end-to-end server designs with vendors and partner teams to support a wide variety of CPU, GPU, and other emerging workloads.
• Architecting end-to-end storage topologies with vendor teams to support diverse workloads across performance, capacity, and archival tiers, moving from direct-attached to disaggregated and software-defined storage.
• Leading proof-of-concept (PoC) initiatives for emerging infrastructure technologies and transitioning successful pilots into global deployment standards.
• Mentoring engineers at all levels and contributing to a culture of technical excellence, inclusivity, and impact.
• Representing DigitalOcean in the broader community by attending conferences and contributing to papers and presentations.

California
$184K - $231K / year
Full Time | Remote | Team 201-500 | Since 2013 | H1B No Sponsor

Role Description

Architect the backbone of a high-scale microservices ecosystem and empower engineering teams to deploy with confidence and speed! We’re looking for a Staff Cloud Infrastructure Engineer to join our Product Development team!

In this role, you will…

- Maintain the core infrastructure platform and standard building blocks that our developers use to run, build, and deploy their code.
- Guide engineering teams through common bottlenecks by identifying structured, consistent ways to solve them.
- Take ownership of our SLAs, making sure the platform stays secure, available, and ready to scale.
- Serve as the technical "go-to" person for the organization, bridging the gap between high-level infrastructure and business requirements.

Your day-to-day work and responsibilities include…

- Digging into incident trends to figure out what’s actually breaking and tackling the root cause.
- Driving improvements in our cloud setups to catch anomalies and performance lags before they become issues.
- Leading architecture design sessions to keep our system resilient as we grow.
- Keeping our CI/CD pipelines moving as smoothly as possible.
- Mentoring the team and sharing knowledge to make sure everyone is up to speed on best practices.
- Participating in the on-call rotation.

Qualifications

- A heavy background in running large-scale distributed applications in the cloud.
- A strong grasp of networking and Linux fundamentals and a proven ability to use Linux tools to automate everything.
- Hands-on experience with IaC tools like Terraform, Pulumi, or CloudFormation.
- Extensive experience with monitoring and observability tooling and frameworks like Grafana.
- Experience managing Big Data stacks such as Cassandra, Kafka, Spark, or Databricks.
- The capacity to recognize and prioritize technical challenges while diligently working towards their resolution.

Benefits

- Attractive pay structure, including equity.
- Great work equipment, and home office allowance for those working in our fully remote locations.
- Annual personal learning budget.
- Sports allowance.
- Benefits vary depending on location.

Company Description

Supermetrics builds an end-to-end marketing intelligence platform, with 15% of global advertising spend reported through our products. We help marketers turn their data into insights that improve business results and predict the best next step. Our technology streamlines marketing data for over 200,000 businesses through a network of agencies and customers like Shopify, HubSpot, and Nestlé. Since our founding in 2013, we’ve grown from a one-person shop to a key player in the industry, and we’re just getting started!

We're a team of 400+ growth-minded people from diverse backgrounds. Together, we make a multicultural, resourceful, and collaborative team. Supermetrics operates on trust, transparency, and a keen customer focus. Forward-looking and action-oriented, we work hard to raise the bar in our industry. As team players, we help each other and win together. We're hiring for a diverse, skilled, and collaborative team and building an inclusive workplace where everyone is treated fairly and respectfully.

Our commitment to transparent hiring: At Supermetrics, we leverage AI in our early recruitment stages to efficiently match your skills to our roles and provide a frictionless scheduling experience. While AI helps us operate faster and fairer, all final hiring decisions and deep-dive interviews are conducted by our human team members.

Worldwide