Job Closed

This listing is no longer active.

Inspira Financial provides health, wealth, retirement, and benefits solutions that strengthen and simplify the health and wealth journey. With more than 7 million clients, representing over $62 billion in assets, Inspira works with thousands of employers, plan sponsors, recordkeepers, TPAs, and other institutional partners — helping the people they care about plan, save, and invest for a brighter future. Inspira relentlessly pursues better outcomes for all with our automatic rollover services, health savings accounts, emergency savings funds, custody services, and more. Learn more at inspirafinancial.com.

Reliability Engineer (Remote)

DevOps EngineerDevOps EngineerFull Time Remote Mid LevelTeam 1,537Since 2000Company Site

Location

Illinois

Posted

81 days ago

Salary

$62K - $107K / year

Seniority

Mid Level

Bachelor Degree5 yrs expEnglishAWS Azure Shell Chef Datadog Docker GCP Jenkins Kubernetes PowerShell Python Terraform VMware

Job Description

Join Us! Take the next step in your journey at Inspira Financial. You will help businesses and individuals thrive today, tomorrow, and into retirement. Become part of a company that is people centric and client obsessed in every interaction; a community of forward-thinking individuals focused on driving results to deliver our mission with an unwavering commitment to integrity. Join us as we strengthen and simplify the health and wealth journey -- relentlessly pursuing better outcomes for all. We believe in finding the best talent! While some roles are based at one of our office locations, remote roles can sit in any of the following states: AL, AZ, FL, GA, IA, IL, IN, MI, MN, MO, NC, NE, PA, SC, TN, TX, UT, VA and WV. Remote status and role locations are subject to change. Relocation is not provided. Employees within a 90-minute radius of our Oak Brook, IL headquarters are required to adhere to the company in-office work guidelines of 4 days per month minimum from 10 am to 2 pm (1 of the 4 days must be a Monday or Friday). This requirement does not apply to support specialist positions. Don't meet every single requirement? Here at Inspira Financial, we believe there is no "perfect" candidate and want to encourage applying even if all the requirements listed aren't met. Our goal is to build an authentic workplace by valuing diversity in our candidates. We work to ensure that our team reflects the diversity of the businesses and clients we serve. We are always looking to expand our growing team with dynamic and enthusiastic individuals. If you enjoy a collaborative, fun environment that champions career development, Inspira Financial is the place for you! We look forward to receiving your application! Check out this Inspira Financial video to learn more about our company! Inspira Financial provides health, wealth, retirement, and benefits solutions that strengthen and simplify the health and wealth journey. With more than 7 million clients, representing over $62 billion in assets, Inspira works with thousands of employers, plan sponsors, recordkeepers, TPAs, and other institutional partners -- helping the people they care about plan, save, and invest for a brighter future. Inspira relentlessly pursues better outcomes for all with our automatic rollover services, health savings accounts, emergency savings funds, custody services, and more. Learn more at inspirafinancial.com . We have been recognized for our remarkable growth on lists such as Crain's Fast 50 and Inc. 5000, and for our outstanding workplace culture and benefits with Built In's 2025 Best Places to Work and Gallagher's 2022 Best-In-Class Employer awards. Job Summary & Responsibilities The Reliability Engineer (RE) will report to the Reliability Engineering Manager in the Technology Department. RE will work closely with engineering, security, and infrastructure teams to ensure Inspira's systems are highly available, scalable, and secure. RE will play a crucial role in deployments, incident response, system reliability, and performance optimization, while also contributing to long reliability-term infrastructure strategies. Working within a team environment, the RE participates directly IT realiabilityin solution creation, providing hands-on support as well as operational support and training. This individual must be creative, client focused, solutions-driven, organized, and have the ability to thrive in a dynamic environment. - Partner with the Engineering and Security teams to create, implement and apply SRE principles, processes, and controls. - Build & support Site Reliability function & participate in building tools to monitor and report system KPIs. - Monitoring of Platform and Environment with tools such as Datadog, Azure Monitor, etc. - Configure and Support the Disaster Recovery and Business Resumption Plan as it relates to the backup and restoration of the technology infrastructure. - Ensure run books are updated on a regular basis - Utilize programming skills to design and develop programs or scripts for various repetitive functions - Contribute to long-term infrastructure strategies and reliability improvements. - Performs all duties with a focus on goals of Inspira, which includes risk mitigation - Support inbound calls/emails, maintaining tickets within the issue tracking application related to Infrastructure Support - Crosstrain other team members to facilitate coverage - Other duties as assigned Preferred Qualifications EDUCATION: - Bachelor's degree in computer science or equivalent experience - Certifications preferred: AZ-900, Datadog Fundamentals EXPERIENCE AND SKILLS: - Minimum 3 years of experience in Information Technology - 3+ years of role specific experience - Minimum of 3 years of experience with: - Experience with IaC tools such as Terraform, bash scripting, etc - Experience supporting Containerization Platforms such as K8s and Docker - Experience working with Automation tools such as ADO, Jenkins, and Chef - Experience working with Observability tools such as Datadog and Azure Monitor - Knowledge of principles such as SLIs, SLOs, and error budgets. - Familiarity with observability concepts beyond monitoring, such as distributed tracing and log correlation. - Knowledge of Virtual Machines and Container concepts. - Knowledge of Security as it relates to Cloud Environments including the Shared Security Model - Scripting languages such as Powershell, Bash, Python, etc. - Experience with Cloud Services Azure (preferred), Google or AWS - Experience with BDR solutions such as Veeam, VMWare Site Recovery, and Azure Backup/Site Recovery - Ability to work independently with minimal supervision - Must have excellent written and verbal communication skills - Strong analytical skills, follow-up capability, and problem-solving ability - Ability to conduct research into hardware and software issues and products as required - Ability to effectively prioritize and execute tasks in a high-pressure environment - Ability to use strong interpersonal and presentation skills to share ideas, solutions, and strong working relationships with business units including non-technical users, technical leads, and developers - Experience working with a ticketing system and internal clients - Ability to respond to emails and text messages after hours to resolve critical issues - Must possess strong skills in personal diplomacy and client service while consistently demonstrating a high level of motivation, commitment to teamwork, professionalism and trustworthiness - Strong vendor management skills - Highly self-motivated and directed - Experience in a high availability environment preferred - Knowledge of ITIL/ITSM practices and framework preferred OTHER REQUIREMENTS: - Infrequent travel - Ability to provide personal transportation from time to time. - Ability to work overtime. - Prolonged periods of sitting at a desk and working on a computer Compensation & Benefits $91,000-$107,000 per year

Benefits

401(K) matching, Dedicated diversity and inclusion staff, Dental insurance, Disability insurance, Volunteer in local community, Family medical leave, Flexible work schedule, Generous parental leave, Generous PTO, Health insurance, Job training & conferences, Open door policy, Life insurance, Paid volunteer time, Open office floor plan, Paid holidays, Paid sick days, Onsite office parking, Partners with nonprofits, Performance bonus, Pet insurance, Promote from within, Lunch and learns, Remote work program, Team based strategic planning, Continuing education available during work hours, Tuition reimbursement, Vision insurance, Wellness programs, Diversity employee resource groups, Hiring practices that promote diversity

Related Categories

DevOps Engineer

Related Job Pages

DevOps Engineer Jobs in Illinois Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More DevOps Engineer Jobs

NC - DevOps Engineer - 231

Thaloz

We help companies achieve their goals and expand their business through technology.

DevOps Engineer81 days ago

Other RemoteTeam 51-200H1B No Sponsor

Company Site LinkedIn

We are looking for a highly skilled DevOps Engineer with deep expertise in container orchestration and cloud infrastructure. You will be responsible for designing, deploying, and maintaining scalable, reliable, and secure infrastructure across AWS environments. You will work closely with development, security, and platform teams to accelerate software delivery and ensure operational excellence. Responsibilities: • Design, deploy, and manage containerized workloads using Amazon ECS (Elastic Container Service) and Amazon EKS (Elastic Kubernetes Service). • Build and maintain CI/CD pipelines to automate software delivery workflows. • Develop and manage Docker container images, registries (ECR), and container lifecycle best practices. • Implement Infrastructure as Code (IaC) using tools such as Terraform, CloudFormation, or CDK. • Monitor, troubleshoot, and optimize cloud infrastructure performance, availability, and cost. • Enforce security best practices across containerized environments (IAM roles, network policies, secrets management). • Collaborate with software engineers to containerize applications and migrate workloads to ECS/EKS. • Manage Kubernetes cluster configurations, namespaces, Helm charts, and service mesh integrations. • Define and maintain observability standards using tools like CloudWatch, Prometheus, Grafana, or Datadog. • Participate in on-call rotations and incident response processes.

AWS Amazon ECS Amazon EKS Kubernetes Docker Terraform Amazon IAM Helm Amazon CloudWatch Prometheus Grafana Datadog Amazon EC2 GitHub Actions Jenkins GitLab CI Python Argo CD Istio

View details: NC - DevOps Engineer - 231

Brazil

Apply

Job Closed

DevSecOps Engineer

Deel

Deel is a financial services company that has developed a payroll system for remote teams, connecting localized payments and compliance in the convenience of one platform. The priv

DevOps Engineer81 days ago

Full Time Remote

• Develop and maintain automated security tools and processes to identify vulnerabilities, perform code analysis, monitor systems and conduct security testing. This includes integrating security scanners, static code analysis tools, and vulnerability assessment tools into the CI/CD pipeline. • Work with infrastructure and operations teams to design and implement secure cloud infrastructure, network architecture, and deployment processes. This involves ensuring proper access controls, encryption, and monitoring are in place. • Implement security monitoring tools and processes to proactively identify and respond to security events and anomalies. This includes log analysis, intrusion detection, and system monitoring. • Foster collaboration and communication between development, operations, and security teams. Act as a liaison to ensure that security requirements are understood and integrated into the development process. • Assist in compliance assessments and audits to ensure adherence to regulatory requirements and industry standards. Collaborate with auditors and provide necessary documentation and evidence of security controls.

Docker JavaScript Kubernetes Python TypeScript

View details: DevSecOps Engineer

Brazil

Apply

Job Closed

Principal Engineer - Release Engineering

Fastly

Founded in 2001, Fastly is a privately-held internet company offering the Fastly Edge Cloud platform, a content delivery network that helps digital businesses s

DevOps Engineer81 days ago

Full Time Remote

Company Site

Role Description We are looking for a Principal Release Engineer to join Fastly’s Release Engineering team. The Release Engineer is responsible for the set-up, maintenance, and ongoing development of continuous build/integration and deployment infrastructure. In this role, you will create and maintain fully automated CI build processes for multiple environments, including our global edge cache fleet, internal applications, and applications hosted in AWS and GCP. The ideal candidate will care deeply about providing other engineers with a seamless release experience and have a deep understanding of what engineers care about, how they ship code, and what world-class delivery infrastructure looks like. Responsibilities - Design, build, and operate release tooling across building, packaging, signing, artifact management, and deploying software. - Drive initiatives that make our engineers happier and more productive by reducing lead time for changes. - Collaborate with development and SRE teams to develop policies, standards, guidelines, governance, and related guidance for CI/D operations. - Support developers with build automation, merge resolution, CI, test automation, deployment based on tools usage and policies, standards. - Troubleshoot issues along the CI/D pipeline. - Participate in on-call support rotation. Qualifications - 10+ years of experience. - Ability to excel within an "Agile" environment (i.e. user stories, sprints, iterative development, continuous integration, continuous delivery, shared ownership, test-driven development, etc.). - Deep expertise in at least one of the following languages: Ruby, Python, Go. - Expertise with automation tools such as Jenkins, GitHub Actions, or Dagger. - Strong written and verbal communication skills. - Experience with Infrastructure-as-Code frameworks such as Chef, Terraform, Ansible, etc. - Familiarity with Varnish, Nginx, or other cache and proxy servers. - Knowledge of source code control management systems and configuration management (i.e. Git, GitHub, etc.) and code branching/merging strategies. - Experience with Linux and containerization, particularly with Docker & orchestration platforms like Kubernetes. - Experience with a Cloud-based environment, particularly AWS and/or GCP. Both would be ideal! - Good understanding of quality control and test automation in agile-based continuous integration environments. - Experience with Omnibus and/or Debian packaging. - Experience with artifact repositories such as Artifactory or Sonatype Nexus. - Some experience with SQL and relational databases administration (i.e. Oracle, MySQL). - Open source license tracking, auditing, and reporting. Requirements - This position will require you to be available during core business hours and occasional nights and weekends as needed for on-call support. Benefits - We care about you. Fastly works hard to create a positive environment for our employees, and we think your life outside of work is important too. - We offer a comprehensive benefits package designed to meet your needs. Our offerings may vary depending on the country where you work and are subject to change. Company Description - Fastly helps people stay better connected with the things they love. - Fastly’s edge cloud platform enables customers to create great digital experiences quickly, securely, and reliably. - Fastly’s customers include many of the world’s most prominent companies, including GitHub, Yelp, Paramount, and JetBlue. - We're building a more trustworthy Internet.

AWS GCP Ruby Python Jenkins GitHub Actions Terraform Ansible Nginx Git Linux Docker Kubernetes Debian SQL Oracle Database MySQL

View details: Principal Engineer - Release Engineering

United States

$133K - $159K / year

Apply

Site Reliability Engineer - Observability

Cluepoints

At CluePoints, we’re redefining how clinical trials are run. As the premier provider of Risk-Based Quality Management (RBQM) and Data Quality Oversight software, we harness advanced statistics, artificial intelligence, and machine learning to ensure the quality, accuracy, and integrity of clinical trial data, helping life sciences organizations bring safer, more effective treatments to patients faster. Ambitious, fast-growing technology scale-up Dynamic and diverse international team representing more than 20 nationalities Culture of collaboration, flexibility, and continuous learning Values of Care, Passion, and Smart Disruption Mission to create smarter ways to run efficient clinical trials and deliver AI-powered insights that improve human outcomes worldwide

DevOps Engineer81 days ago

Full Time RemoteTeam 201-500

Role Description The Site Reliability Engineer, Observability & RUM is responsible for improving end-to-end observability across our platforms and customer-facing applications, with a particular focus on frontend and Real User Monitoring (RUM). This role combines core SRE practices with ownership of monitoring, logging, tracing, alerting, and user-experience telemetry in production. - Help evolve observability capabilities across Azure and Kubernetes environments. - Improve incident detection and diagnosis. - Support decisions around managed versus self-managed observability tooling. - Partner closely with Engineering, Support, QA, and Security teams to ensure systems ship with actionable telemetry, dashboards, alerts, and operational runbooks. Qualifications - 5+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or Observability Engineering roles. - Strong hands-on experience with observability and monitoring platforms, including several of the following: Elastic, Grafana, Prometheus, OpenTelemetry, Sentry, monitoring agents, and managed APM/observability platforms. - Experience implementing and supporting Real User Monitoring (RUM) and frontend/application observability in production environments. - Ability to work across frontend, backend, and platform teams to improve telemetry, alerting, and incident diagnosis. - Experience evaluating or operating managed observability platforms and understanding the trade-offs versus self-managed stacks. Requirements - (Nice to have) Experience supporting ML, AI, or LLM-backed services in production (RAG, LangSmith, Arize Phoenix, LangChain, LangGraph, Azure OpenAI, OpenAI, or Anthropic APIs). Company Description At CluePoints, we’re redefining how clinical trials are run. As the premier provider of Risk-Based Quality Management (RBQM) and Data Quality Oversight software, we harness advanced statistics, artificial intelligence, and machine learning to ensure the quality, accuracy, and integrity of clinical trial data, helping life sciences organizations bring safer, more effective treatments to patients faster. - Ambitious, fast-growing technology scale-up. - Diverse international team representing more than 20 nationalities. - Culture of collaboration, flexibility, and continuous learning. - Values of Care, Passion, and Smart Disruption. - Mission to create smarter ways to run efficient clinical trials and deliver AI-powered insights that improve human outcomes worldwide.

Observability / Monitoring Azure Kubernetes Grafana Prometheus Sentry AI / ML AI LLM Phoenix LangChain OpenAI API

View details: Site Reliability Engineer - Observability

Poland

Apply

Job Closed

Reliability Engineer (Remote)

Job Description

Benefits

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

NC - DevOps Engineer - 231

DevSecOps Engineer

Principal Engineer - Release Engineering

Site Reliability Engineer - Observability