Job Closed
This listing is no longer active.
Flex is a collection of smart and easy-to-use tools that pair with Open Dental to supercharge your patient engagement.
Site Reliability Engineer
Location
United States
Posted
84 days ago
Salary
0
Seniority
Senior
Job Description
Site Reliability Engineer
Flex Dental Solutions
• Own the availability, performance, and resilience of our production systems. • Partner closely with engineering, product, and leadership to reduce operational risk. • Proactively monitor application health and performance across cloud infrastructure (AWS). • Lead incident response, including triage, mitigation, root cause analysis (RCA). • Lead and participate in disaster recovery drills and security incident simulations. • Build and maintain Infrastructure as Code (IaC) using AWS-native tooling. • Collaborate with development teams to improve CI/CD reliability. • Work closely with stakeholders and product teams to ensure technical reliability aligns with business needs. • Reduce operational toil through automation, tooling, and process improvements. • Champion best practices across security, availability, performance, and incident response.
Job Requirements
- 3+ years of experience in Site Reliability Engineering, DevOps, or a closely related role.
- Strong hands-on experience operating production systems in AWS (EC2, ECS, RDS, IAM, CloudWatch).
- Experience implementing Infrastructure as Code (CloudFormation, CDK, or Terraform).
- Proficiency in Node.js or Python for automation and operational tooling.
- Experience with Docker and container-based deployments (ECS preferred; Kubernetes a plus).
- Strong understanding of MySQL operations, backups, and performance monitoring.
- Proficiency with Git-based workflows and CI/CD systems.
Benefits
- Health insurance
- 401(k) matching
- Flexible work hours
- Paid time off
- Professional development opportunities
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Manage complex cloud-based architectures using AWS services like EC2, Lambda, RDS, S3, EFS, and VPC to support scalable, high-performance game development workflows. • Implement test automation and ensure pipeline efficiency across platforms. • Develop and enforce best practices for configuration management, infrastructure as code (IaC), and version control. • Define SLAs, SLOs and SLIs, runbooks, and other monitoring and incident response patterns, including automation and notifications. • Ensure OSE’s CI/CD cloud infrastructure is scalable, reliable, performant, and secure. • Work with game teams to build-out, maintain, and evolve the company’s build, test, and deployment systems and pipelines (CI/CD and test automation). • Architect and optimize continuous integration and deployment pipelines for building, testing, and deploying PC/console games using Jenkins, Git, and Perforce. • Design and implement custom metrics and dashboards to monitor performance, availability, and reliability of game servers, CI/CD pipelines, and backend services, ensuring operational transparency and actionable insights. • Profile and optimize system resource usage (CPU, memory, disk I/O, and network) for both game servers and CI/CD systems, ensuring efficient resource allocation and minimizing bottlenecks in the pipeline and production environments. • Implement log aggregation and analysis systems using AWS CloudWatch Logs to centralize logs across services, making troubleshooting and incident response efficient. • Beyond gathering functional requirements, work diligently to surface quality attributes and other non functional specifications. • Develop and maintain concise software and technical design documents. • Work well as part of a collaborative and flexible team. • Stay up-to-date with industry trends and emerging technologies in DevOps and gaming, recommending improvements to enhance our development processes.
Senior DevOps Engineer – Azure, AKS
opinov8Globally recognized digital and engineering solutions partner.
• Architect and implement scalable, production-grade infrastructure on Azure and AKS; • Lead infrastructure delivery for an ML platform (TimescaleDB, message queues, autoscaling); • Design and own CI/CD pipelines and deployment strategies; • Set the bar for infrastructure quality - changes are tested, validated, and documented before sign-off; • Establish best practices across the team.
Senior DevOps/Platform Engineer
opinov8Globally recognized digital and engineering solutions partner.
• Design, develop, and maintain core platform components and infrastructure using Infrastructure as Code (IaC) principles (e.g., Terraform, CloudFormation, Ansible). • Build and operate self-service tools and workflows for application deployment, monitoring, logging, and scaling. • Implement and manage containerization technologies (e.g., Docker, Kubernetes) and orchestration platforms. • Develop and maintain CI/CD pipelines that are efficient, secure, and scalable. • Implement and enforce platform security best practices and integrate security tools into the platform. • Design and implement robust monitoring, alerting, and observability solutions for the platform. • Troubleshoot and resolve platform-related issues, collaborating with development teams as needed. • Contribute to the definition and evolution of our platform architecture and roadmap. • Stay up-to-date with the latest trends and technologies in platform engineering and cloud-native development. • Work effectively under pressure and balance competing tasks in a rapidly changing environment. • Participate in on-call rotation during day PST hours.
Senior DevOps Engineer – AWS, GCP
opinov8Globally recognized digital and engineering solutions partner.
• Design, develop, and maintain core platform components and infrastructure using Infrastructure as Code (IaC) principles (e.g., Terraform, CloudFormation, Ansible). • Build and operate self-service tools and workflows for application deployment, monitoring, logging, and scaling. • Implement and manage containerization technologies (e.g., Docker, Kubernetes) and orchestration platforms. • Develop and maintain CI/CD pipelines that are efficient, secure, and scalable. • Implement and enforce platform security best practices and integrate security tools into the platform. • Design and implement robust monitoring, alerting, and observability solutions for the platform. • Troubleshoot and resolve platform-related issues, collaborating with development teams as needed. • Contribute to the definition and evolution of our platform architecture and roadmap. • Stay up-to-date with the latest trends and technologies in platform engineering and cloud-native development. • Work effectively under pressure and balance competing tasks in a rapidly changing environment. • Participate in on-call rotation during day PST hours.


