Job Closed

This listing is no longer active.

Virtru logo
Virtru

Respect the people. Respect the data. Virtru equips you to protect your data anywhere and everywhere it's shared.

Site Reliability Engineer

DevOps EngineerDevOps EngineerOtherRemoteMid LevelTeam 51-200Since 2012H1B No SponsorCompany SiteLinkedIn

Location

District Of Columbia + 1 moreAll locations: District Of Columbia | Washington

Posted

90 days ago

Salary

$125K - $155K / year

Seniority

Mid Level

Bachelor Degree2 yrs expEnglishAWSGCPKubernetesLinuxPython

Job Description

Site Reliability Engineer

Virtru

• Help build and maintain cloud infrastructure using Infrastructure as Code • Contribute to CI/CD pipelines and delivery automation that development teams rely on • Support Kubernetes workloads: troubleshoot issues, implement improvements, and pick up platform best practices as you go • Collaborate on GitOps workflows and platform improvements • Work on observability and monitoring, including dashboards, alerting, and making production easier to understand • Help expand and improve the team's self-service engineering platform

Job Requirements

  • 2-5 years of experience in SRE, DevOps, infrastructure, or a related engineering role
  • Experience with cloud platforms (AWS, GCP, or similar)
  • Familiarity with containers and Kubernetes (production experience is a plus)
  • Comfort writing automation in a language like Python or Go
  • Understanding of Linux fundamentals
  • Clear communication skills and the ability to explain technical concepts to different audiences
  • Curiosity and a desire to learn

Benefits

  • A Flexible PTO policy — we strongly encourage you to take time off (in addition to 14 holidays) to ensure that you are getting the proper time needed to unplug and recharge.
  • A $1,500 annual Learning & Development Stipend focused on providing you the resources to continually learn and professionally grow.
  • Frequent company-sponsored team celebrations that provide ample opportunities to connect with teammates and be social!
  • Access to an Employee Assistance Program
  • Access to Headspace, a mental health app tailored to your specific needs.
  • A flat 3% contribution to your retirement account
  • A high degree of flexibility — Have an appointment, errand, or family emergency to take care of? Hop to it! We give you the time and space to take care of you and your own first.
  • Competitive compensation
  • Generous parental, medical, and bereavement policies
  • 401K contribution and stock options
  • Full medical, dental, and vision benefits
  • New Hire Swag and IT Welcome boxes
  • Structured semi-annual 360° performance reviews

Related Categories

Related Job Pages

More DevOps Engineer Jobs

OneStream logo

Senior Cloud DevOps Engineer

OneStream

OneStream is how today’s Finance teams can go beyond just reporting on the past and Take Finance Further™ by steering the business to the future. It’s the only enterprise finance platform that unifies financial and operational data, embeds AI for better decisions and productivity, and empowers the CFO to become a critical driver of business strategy and execution. Our vision is to be the operating system for modern finance, digitizing core financial functions and empowering the CFO to become a critical driver of business strategy.

DevOps Engineer90 days ago

Senior Cloud DevOps Engineer – Advanced Networking/Azure    Location: Remote, USA  Employment Type: Full-Time  Compensation: $140,000.00 - $160,000.00 (Range applies to US candidates only) + Benefits/Variable Comp/Equity - Range may vary based on experience.   Benefits Offered: Vision, Medical, Life, Dental, 401K    Summary  OneStream Software is a hyper-growth SaaS company focused on financial and operational data analytics for the largest companies in the world. We host our software on the Microsoft Azure cloud in many regions around the world using a variety of Azure technologies. This position focuses on the implementation of automations used to deliver, manage, and secure our cloud environments. Ideal for those staying at the forefront of technology and automating infrastructure deployments. This vital role within Cloud Services requires knowledge and experience designing, implementing, and monitoring scalable and secure cloud networking architecture in the FedRAMP space. The employee is expected to work well in a small team and willing to share responsibilities with other team members as needed. He or she will interact with internal staff, managers, and customers to implement and maintain IT operations. A passion for technology and learning, and the ability to grow others are vital for success in this role.    Primary Duties and Responsibilities  - Lead the design, continuous monitoring, implementation, and security operations of Azure cloud solutions, ensuring they meet industry best practices and comply with FedRAMP High, IL4 requirements.  - Lead team in developing modular Infrastructure-as-Code utilizing Terraform, PowerShell, ARM, Bicep, and YAML languages.   - Lead projects of moderate complexity to completion.  - Sustain a high level of reliability for key automated systems.   - Leads teams to define, estimate, and implement requirements for new automations or services of moderate complexity needing development.   - Stay up to date with the latest Azure and FedRAMP regulatory changes and industry trends, advising teams on potential impacts and necessary adjustments.  - Update technical documentation, workflows, and knowledge base articles.  - Provide feedback in pull requests and peer coding reviews.  - Solid knowledge in focused areas of OneStream Software.  - Participate in on-call rotation to support production systems. - Assist in efforts to debug the problems which arise in production.  - Ability to mentor others in several technical areas. - Understanding practical use of FedRAMP/SOC controls to assist Compliance and Security teams.  Required Education and Experience  - BS/BA in computer science, engineering, or technology-related field (or equivalent work experience).  - 8+ years of cloud infrastructure experience.   - 2+ years of compliance programs and security control sets such as NIST SP 800-53, FedRAMP High, IL4, as applied to cloud SaaS, PaaS, and IaaS environments.  - Expert knowledge of:   - VNets/vWAN, subnets, UDRs, routing, peering.   - ExpressRoute, VPN Gateway, Private Link/Endpoint.   - Azure Firewall, NSG/ASG, WAF, Application Gateway, Web Application Firewalls.    - Hands-on experience implementing network design and firewall configurations, as it pertains to connecting to government networks (BCAP) utilizing Azure Firewall and/or Palo Alto.  - Hands on experience implementing IPv6 routing and strict egress filtering strategies.  - Ability to translate DISA STIGs and NIST controls into enforceable network guardrails and evidence artifacts.  - Advanced understanding of Infrastructure-As-Code concepts and tooling (Terraform, CloudFormation templates, Bicep or ARM templates) on Microsoft Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP).   - Deep knowledge of Configuration Management/Orchestration utilities such as Ansible, PowerShell DSC, Chef, and Puppet.   - Advanced understanding of cloud concepts including elasticity, security, and identity management.   - Well versed familiarity with Agile Development methodologies utilizing Jira or Azure DevOps Boards.   - Strong understanding of Azure Kubernetes Services (AKS) with container-based deployment skills or other platforms such as OpenShift, GKS, EKS.   - Proficient knowledge in Software Development Lifecycles.  - 8+ years of hands-on experience with the following technologies, tools, and concepts:   - Automating processes using PowerShell, Bash, CLI, REST APIs, python, ARM Templates or other scripting languages.  - Comfortable leveraging source control tools such as Git, BitBucket, or GitHub.   - Microsoft Azure, Amazon Web Services (AWS) or Google Cloud (GCP).  - Microsoft Windows 11, Windows Server, IIS, Microsoft SQL Server, Active Directory.    Preferred Education and Experience  - Experience working for a cloud service provider (CSP), managed service provider (MSP), or SaaS provider.   - 8+ years of relevant Azure experience deploying and managing leveraging Infrastructure-as-Code (IAC) concepts.   - Microsoft Windows Server 2016-2022, IIS, Microsoft SQL Server, Azure Active Directory.   - Debian, Ubuntu, or other flavors of the Linux operating systems.  - Any certifications such as Microsoft Certified:  Azure Administrator Associate (AZ-103, AZ-104), Azure Solutions Architect Expert (AZ-300, AZ-301), CCNP, CCIE, CISSP, Azure DevOps Engineer Expert (AZ-400), Certified Kubernetes Administrator (CKE), CISSP, Information Technology Infrastructure Library (ITIL) Foundation, Microsoft Certified Professional (MCP), CompTIA Security+/Network+ is a plus.    Knowledge, Skills, and Abilities  - Deal well with ambiguous/undefined problems.  - Ability to self-motivate and work independently.  - Strong organizational and prioritization skills.  - Ability to find and apply effective solutions to emerging problems and challenges.  - Strong attention to detail.   - Ability to estimate your efforts.  - Ability to get up to speed quickly with modern technologies and services.   - Work well in a fast-paced environment.   - Ability to multitask on a variety of projects.   - Comfortable communicating with all levels of management.   - Experience with OneStream Software not required.    Travel (remove if not applicable)  - No travel is required.    Who We Are  OneStream is how today’s Finance teams can go beyond just reporting on the past and Take Finance Further™ by steering the business to the future. It’s the only enterprise finance platform that unifies financial and operational data, embeds AI for better decisions and productivity, and empowers the CFO to become a critical driver of business strategy and execution. Our vision is to be the operating system for modern finance, digitizing core financial functions and empowering the CFO to become a critical driver of business strategy. To learn more visit www.onestream.com.    Why Join The OneStream Team  - Transparency around corporate structure, salary, and benefits.  - Core value of customer success.  - Variety of project work (not industry-specific).   - Strong culture and camaraderie.  - Multiple training opportunities.    Benefits at OneStream    OneStream employees are passionate, hardworking individuals who go above and beyond to keep our customers happy and follow through on our mission statement. They consistently deliver the best and in turn, we make every effort to keep them cared for and happy. A sample of the benefits we provide are:  - Excellent Medical Plan.  - Dental & Vision Insurance.  - Life Insurance.  - Short & Long Term Disability.  - Vacation Time.  - Paid Holidays.  - Professional Development.  - Retirement Plan.    All candidates must be legally authorized to work for any company in the country where this position is located without sponsorship.  OneStream is an Equal Opportunity Employer.    #LI-TO1 #LI-REMOTE

United States
Job Closed
Hatch IT logo

Site Reliability Engineer

Hatch IT

CardioOne partners with independent cardiologists to provide innovative solutions that improve patient outcomes and reduce costs. Their platform helps their physician partners thrive in today’s fee-for-service environment and prepare for success in value-based care. In February 2024, they partnered with WindRose Health Investors as well as top physician services and payor executives to grow their team and invest in their next phase of growth. CardioOne offers a magnificent work environment, good working conditions, and competitive pay. They take pride in creating a culture of employee engagement that translates into an exemplary patient experience. Join them in their mission to positively impact US cardiology.

DevOps Engineer90 days ago

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description CardioOne is seeking a highly skilled Site Reliability Engineer (SRE) to ensure the reliability, scalability, security, and performance of their production systems and services. The SRE will bridge the gap between software development and operations, implementing automation, monitoring, and best practices to enable rapid, reliable delivery of applications. You will report directly to the Senior Director of Engineering. - Ensure high availability, scalability, and performance of production systems. - Implement and maintain SLIs, SLOs, and SLAs for critical services. - Conduct capacity planning and performance tuning. - Automate infrastructure provisioning using IaC tools such as Terraform, Terragrunt, and Ansible. - Develop automation to minimize manual operations and improve deployment workflows. - Build CI/CD pipelines to support rapid and reliable deployments. - Design and maintain monitoring, logging, and alerting systems (Datadog). - Participate in on-call rotations and lead incident response efforts. - Perform root-cause analysis and develop postmortems to prevent recurring issues. - Manage cloud infrastructure (AWS, Azure) and container orchestration platforms (Kubernetes, ECS). - Optimize system architecture for reliability and fault tolerance. - Implement best practices for security, networking, and service resilience. - Work closely with development teams to design reliable microservices and distributed systems. - Advocate for SRE principles and drive operational excellence across engineering teams. - Mentor engineers on reliability practices, tooling, and automation strategies. Qualifications - Bachelor’s degree in Computer Science, Engineering, or equivalent experience. - 3–7 years of experience in SRE, DevOps, or Systems Engineering roles. - Strong proficiency with Linux systems and shell scripting. - Experience with cloud platforms (AWS, Azure). - Hands-on experience with Kubernetes/ECS and container technologies (Docker). - Proficiency in at least one programming language: Python or Java. - Experience with CI/CD pipelines and DevOps tooling. - Strong understanding of distributed systems, networking, and security fundamentals. - Strong analytical and problem-solving skills. - Excellent communication and cross-team collaboration. - Ability to thrive in fast-paced, high-stakes environments. - A mindset focused on continuous improvement and operational excellence. Requirements - Experience with observability stacks (OpenTelemetry). - Knowledge of database management (PostgreSQL). - Experience with configuration management tools (Ansible, Chef, Puppet). - Familiarity with zero-downtime deployments and chaos engineering practices. Benefits - Medical, dental, and vision insurance. - 401(k) plan with a match for eligible employees. - PTO (Personal Time Off) and sick time for full-time employees. Company Description CardioOne partners with independent cardiologists to provide innovative solutions that improve patient outcomes and reduce costs. Their platform helps their physician partners thrive in today’s fee-for-service environment and prepare for success in value-based care. In February 2024, they partnered with WindRose Health Investors as well as top physician services and payor executives to grow their team and invest in their next phase of growth. CardioOne offers a magnificent work environment, good working conditions, and competitive pay. They take pride in creating a culture of employee engagement that translates into an exemplary patient experience. Join them in their mission to positively impact US cardiology.

United States + 171 moreAll locations: United States | Canada | Brazil | Colombia | Argentina | Chile | Venezuela | Bolivia | Ecuador | French Guiana | Guyana | Paraguay | Peru | Suriname | Uruguay | Mexico | Costa Rica | El Salvador | Guatemala | Honduras | Nicaragua | Panama | Dominican Republic | Puerto Rico | Bahamas | Guadeloupe | Haiti | Jamaica | Martinique | Montserrat | United Kingdom | Germany | France | Estonia | Portugal | Hungary | Poland | Ukraine | Romania | Bulgaria | Czechia | Slovakia | Belarus | Moldova | Sweden | Greece | Belgium | Italy | Ireland | Switzerland | Netherlands | Finland | Malta | Denmark | Lithuania | Croatia | Spain | Austria | Bosnia And Herzegovina | Iceland | Luxembourg | North Macedonia | Montenegro | Norway | Serbia | Slovenia | Albania | Cyprus | Latvia | Monaco | South Africa | Egypt | Algeria | Angola | Benin | Botswana | Burkina Faso | Burundi | Cameroon | Cabo Verde | Central African Republic | Chad | Congo | Côte D'ivoire | Democratic Republic of the Congo | Equatorial Guinea | Eritrea | Ethiopia | Gabon | Gambia | Ghana | Guinea | Guinea-bissau | Kenya | Lesotho | Liberia | Libya | Madagascar | Malawi | Mali | Mauritania | Mauritius | Mayotte | Morocco | Mozambique | Namibia | Niger | Nigeria | Réunion | Rwanda | Senegal | Seychelles | Sierra Leone | Somalia | Sudan | Eswatini | Tanzania | Togo | Tunisia | Uganda | Zambia | Zimbabwe | Georgia | Turkey | Israel | United Arab Emirates | Armenia | Azerbaijan | Bahrain | Iraq | Jordan | Kuwait | Lebanon | Oman | Qatar | Saudi Arabia | Palestine | Yemen | India | Japan | Philippines | Pakistan | Thailand | Singapore | Vietnam | Taiwan | Indonesia | Cambodia | Laos | Malaysia | Myanmar | South Korea | China | Afghanistan | Bangladesh | Bhutan | Kazakhstan | Kyrgyzstan | Maldives | Mongolia | Nepal | Sri Lanka | Tajikistan | Turkmenistan | Uzbekistan | Australia | Papua New Guinea | Kiribati | Palau | French Polynesia | Tuvalu | New Zealand
Job Closed
Visa logo

Manager, Site Reliability Engineer – Platform

Visa

Based in Foster City, California, Visa is a global payments technology organization. Visa was founded in 1958, coinciding with Bank of America’s launch of the

DevOps Engineer90 days ago

• Act as the technical owner of the Platform Squad, defining, driving, and enforcing platform standards across the full lifecycle (design, rollout, upgrades, and decommissioning) for: Cloud infrastructure Kubernetes Service Mesh. • Ensure platform components are designed and operated according to SRE principles, focusing on reliability, scalability, and operational simplicity. • Drive architectural decisions with a sustainable platform vision, balancing innovation, security, and operational stability. • Define, build, and continuously improve operational processes for internal and external consumers, including: Platform onboarding and adoption Change management and release processes Incident, problem, and escalation management. • Act as a point of escalation for complex platform incidents and reliability risks, participating in on-call rotations as needed. • Ensure platform operations comply with internal controls, audit requirements, and security standards. • Establish and own platform observability standards, ensuring consistent implementation of Golden Signals: Latency Traffic Errors Saturation. • Define and track platform SLIs, SLOs, and error budgets in partnership with internal consumers. • Use metrics and operational data to drive prioritization, reliability improvements, and capacity planning decisions. • Foster a collaborative, servant-leadership culture that enables squads to self-serve while maintaining guardrails. • Collaborate closely with application engineering teams, other SRE squads, and stakeholders across security, compliance, and architecture. • Promote knowledge sharing through strong documentation and enablement around platform usage and best practices. • Provide technical mentorship and guidance to platform engineers, supporting engineering excellence and growth. • Support the Squad Manager in planning, prioritization, and execution of platform initiatives. • Ensure work is visible, well-documented, and aligned with broader SRE and company objectives.

Brazil
Job Closed
Visa logo

Staff Site Reliability Engineer – DevOps

Visa

Based in Foster City, California, Visa is a global payments technology organization. Visa was founded in 1958, coinciding with Bank of America’s launch of the

DevOps Engineer90 days ago

• Lead the implementation and optimization of CI/CD pipelines • Develop and maintain Infrastructure as Code (IaC) scripts to automate infrastructure provisioning and management • Identify and implement automation opportunities to improve efficiency and reduce manual effort • Ensure best practices in CI/CD and IaC to promote consistency, repeatability, and compliance • Maintain CI/CD resilience by avoiding unplanned or uncommunicated changes • Serve as an example of diligence and reliability to the team • Make high-impact technical contributions recognized by the team and organization • Write effective post-mortem documentation for internal and external stakeholders • Mentor and provide constructive feedback to engineers across the company • Review pull requests and source code, focusing on improving CI/CD and automation practices • Serve as a consultant for engineers from different squads • Solve complex and unknown problems under pressure • Support and participate in On-Call rotations • Stay up-to-date with the latest technology trends in CI/CD and automation • Lead and execute Proof of Concepts (POCs) to introduce new technologies to the team

Brazil
Job Closed