Principal Site Reliability Engineer

DevOps EngineerDevOps EngineerFull TimeRemoteLeadTeam 10,001+Since 2020H1B No SponsorCompany SiteLinkedIn

Location

Virginia

Posted

2 days ago

Salary

$107.5K - $204.5K / year

Seniority

Lead

Job Description

Principal Site Reliability Engineer

RTX

• Spend your days working to automate and improve reliability and continue to push the ARINCDirect infrastructure forward, ensuring it is resilient and reproducible. • Be responsible for service availability, performance, monitoring, incident response, and capacity planning. • Create, improve, and manage environments to ensure decisions on resource allocation, problem identification, and capacity planning are based on accurate data-driven insights. • Maintain a physical infrastructure using Linux • Help facilitate a push towards Kubernetes and declarative infrastructure • Impact technology decision and direction to grow and support the ARINCDirect platform. • Collaborate closely with fellow SREs on your team and extend your collaboration across other teams and disciplines to design dependable and scalable solutions and services. • Identify, implement, and champion process improvements to enhance productivity, collaboration, and delivery efficiency, while ensuring alignment with company goals and industry best practices.

Job Requirements

  • Typically requires a degree in Science, Technology, Engineering or Mathematics (STEM) and minimum 8 years prior relevant experience or an Advanced Degree in a related field and minimum 5 years of experience or in absence of a degree, 12 years of relevant experience.
  • Must be authorized to work in the U.S. without sponsorship now or in the future.
  • Experience as a SRE, Platform Engineer, or related position within a Linux or UNIX environment working on large, complex infrastructures and/or projects using Docker and Kubernetes solutions
  • Experience automating configuration and infrastructure with tools such as Saltstack, Ansible, Terraform or other declarative languages.
  • Experience with hardware; including servers, network switches, & cabling.
  • Experience managing infrastructure using GitOps with continuous delivery (CD) pipelines.
  • Established proficiency in at least one (ideally more) of the following: Python, Linux Shell (bash, awk, sed).
  • Experience with PostgreSQL, or equivalent RDBMS and SQL in general.
  • Familiarity with Cloud infrastructure, ideally AWS.
  • Understanding of SRE principles including building observability solutions and exposing metrics to inform SLO's and KPI's.
  • Understanding of how IT infrastructure services work, including: DNS, DHCP, LDAP, NFS.
  • Understanding of network segmentation, routing and VPNs.

Benefits

  • Medical, dental, and vision insurance
  • Three weeks of vacation for newly hired employees
  • Generous 401(k) plan that includes employer matching funds and separate employer retirement contribution, including a Lifetime Income Strategy option
  • Tuition reimbursement program
  • Student Loan Repayment Program
  • Life insurance and disability coverage
  • Optional coverages you can buy: pet insurance, home and auto insurance, additional life and accident insurance, critical illness insurance, group legal, ID theft protection
  • Birth, adoption, parental leave benefits
  • Ovia Health, fertility, and family planning
  • Adoption Assistance
  • Autism Benefit
  • Employee Assistance Plan, including up to 10 free counseling sessions
  • Healthy You Incentives, wellness rewards program
  • Doctor on Demand, virtual doctor visits
  • Bright Horizons, child and elder care services
  • Teladoc Medical Experts, second opinion program
  • And more!

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Full TimeRemoteTeam 1-10H1B Sponsor

• Provide technical leadership for complex network projects in datacenter and cloud environments • Design and implement connectivity solutions for high-tech platforms at global scale • Define standards and best practices for network infrastructure • Manage network resources with a focus on automation and infrastructure as code • Perform advanced troubleshooting and handle critical incident response • Conduct technical reviews of change procedures • Research and evaluate new technologies and services

Brazil
Full TimeRemoteTeam 201-500Since 2018H1B No Sponsor

• Analyze complex problems and demonstrate strong problem-solving skills; • Collaborate with cross-functional teams to propose pragmatic solutions to real-world problems and take ownership of end-to-end implementation; • Design, build, and operate the shared infrastructure platform to support Swile’s growth, new products, and international expansion; • Continuously improve the developer experience by treating the platform as a product: simplify workflows, shorten time-to-market, and automate everything that improves engineers’ productivity; • Train, coach, and support software engineers to make teams autonomous in DevOps and SRE practices, enabling our "you build it, you run it" philosophy at scale; • Ensure platform security and compliance while keeping costs under control; • Understand the business context to deliver relevant solutions to users and clients; • Stay current with new technologies and architectures and exercise sound judgment about their applicability at Swile; • Provide constructive, high-quality feedback in code reviews and mentor peers to promote technical growth within the team.

Brazil
Full TimeRemoteTeam 1,001-5,000Since 2002H1B Sponsor

• DevOps duties include creating and maintaining online distribution repos • Developing Linux-based packages and patches • Developing and testing end-user installation strategies • Automating build and deployment processes • Contributing to developing, operating, and maintaining DevSecOps processes, tools, and an automation pipeline • Ensuring the DevSecOps environment facilitates designing, implementing, deploying, and executing automated tests • Continuously investigating new technologies to improve productivity, quality, ease of use, and maintenance. • Automating DevOps pipelines, software builds, and software tests. • Authoring and maintaining automation deployment and monitoring scripts • Developing and maintaining security controls and systems • Managing virtualized and containerized assets • Creating virtual and physical network configurations, servers, communications, and test and developer engineering support • Linux software package creation and deployment and for creation and maintenance of the on-line distribution repository • Design, integration, test, and initial deployment of a variety of applications in an enterprise Kubernetes environment using modern tools and patterns • Responsible for creating infrastructure as code environments for complex virtualized networks • Train others to perform similar DevSecOps and automation tasks • Maintain and evangelize content to support a corporate library of knowledge.

United States
Lazer Technologies logo

Senior Infrastructure/DevOps Engineer, Fintech

Lazer Technologies

A digital product studio designed to help successful enterprises bring ideas to market faster and more successfully.

DevOps Engineer2 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

• Quickly implement and adapt infrastructure using Terraform, Pulumi, or other major IaC tools. • Docker is critical. Deeply understand how to design, build, and optimize secure, multi-stage Dockerfiles. • Design, build, and manage robust CI/CD pipelines to automate testing, building, and deployment across environments. • Provision and manage foundational services. Deep expertise in one major provider is required, transferable to the other. • Expertise in at least one major container platform: EKS, GKE, ECS, Fargate, or Cloud Run. (Kubernetes is highly valued, particularly EKS or GKE.) • Know when to use load balancers, VPNs for secure connectivity, and private VPCs for isolation. Apply subnetting, routing, VPC peering, and NAT gateways to build secure systems. • S3 (AWS) or Cloud Storage (GCP). • RDS (AWS) or CloudSQL (GCP). • Deploy event-driven components using AWS Lambda, GCP Cloud Functions, or equivalents. • Protect PII; apply encryption, secrets management, network firewalls, and web application firewalls (AWS WAF, GCP Cloud Armor) following security best practices. • Write high-quality automation and tooling in Go, Python, Node.js, or Bash for client-specific operational challenges. • Ensure robust monitoring and high system uptime.

Canada