Job Closed
This listing is no longer active.
A strangely human digital agency
Senior DevOps Engineer
Location
United States
Posted
7 days ago
Salary
$115K - $155K / year
Seniority
Senior
Job Description
Oddball
- Build, maintain, and improve CI/CD pipelines using GitHub Actions to support application development teams across the program
- Manage and optimize AWS cloud infrastructure supporting secure, scalable software delivery
- Build and maintain infrastructure as code using Terraform and/or CloudFormation
- Support a shift-left approach to security by embedding automated scanning, policy checks, and compliance validation into delivery pipelines
- Monitor system health, performance, and security using CloudWatch and related tooling
- Collaborate with development teams to improve developer experience and streamline deployment workflows
- Support federal security and compliance requirements including FISMA and NIST 800-53
Job Requirements
- Applicants must be authorized to work in the United States. In alignment with federal contract requirements, certain roles may also require U.S. citizenship and the ability to obtain and maintain a federal background investigation and/or a security clearance.
- Hands-on experience building and maintaining CI/CD pipelines in production environments
- Proficiency with AWS cloud services and infrastructure management
- Experience with infrastructure as code tools such as Terraform or CloudFormation
- Experience with GitHub Actions or similar pipeline tooling
- Familiarity with containerization using Docker, Kubernetes, or ECS
- Solid scripting skills in Python or Bash
- Understanding of federal security frameworks including FISMA and NIST 800-53 is a plus
- Thrives in a remote, collaborative Agile environment and genuinely enjoys working closely with a cross-functional team
- Communicates clearly and openly, whether documenting infrastructure decisions or collaborating with engineers on pipeline improvements
Benefits
- Fully remote
- Annual stipend
- Comprehensive Benefits Package
- Company Match 401(k) plan
- Flexible PTO, Paid Holidays
More DevOps Engineer Jobs
Senior Reliability Engineer
Kohl's
It’s no secret that our associates love #LifeAtKohls and we know you will too.
- Ensure the resilience and availability of Kohl’s systems and applications
- Collaborate closely with development teams
- Contribute to architectural designs
- Conduct risk assessments and design for failure
- Implement robust monitoring and failover mechanisms
- Drive error budget and Service Level Objective (SLO) adoption across products
- Drive incident response efforts, perform root cause analysis and implement preventative measures to enhance system reliability
- Establish consistent practices that elevate Kohl’s operational excellence through automation and process improvements
- Follow software lifecycle and drive reliability, observability, and efficiency across product teams within an assigned domain
- Identify repeated toil and find opportunities for automation and risk reduction
- On-call on a rotation to respond to production incidents and conduct blameless retros and root-cause analyses (RCAs) to drive a culture of continuous improvements
- Proactively identify failures before they cause outages using chaos engineering techniques such as edge cases, failure modes and design review
- Advise on capacity planning and provide continuous assessments on systems behavior and consumption
- Work with product managers to identify and prioritize work for reliability best practices (i.e., leveraging SLIs/SLOs/Error Budgets)
- Mentor and assist engineers on the team
DevOps Engineer
TransUnion
TransUnion is a global information and insights company that makes trust possible by ensuring that each consumer is reliably and safely represented in the marketplace. We do this by having an accurate and comprehensive picture of each person. This picture is grounded in our legacy as a credit reporting agency which enables us to tap into both credit and public record data; our data fusion methodology that helps us link, match and tap into the awesome combined power of that data; and our knowledgeable and passionate team, who stewards the information with expertise, and in accordance with local legislation around the world. Because of our work, organizations can better understand consumers in order to make more informed decisions, and earn their trust through great, personalized experiences, and the proactive extension of the right opportunities, tools and offers. In turn, consumers can be confident that their data identities will result in the opportunities they deserve. We make trust possible, so businesses and consumers can transact with confidence and achieve great things. We call this Information for Good®. It’s our purpose, and what drives us every day.
- Design, implement, and manage cloud infrastructure (primarily on GCP or similar platforms)
- Deploy, monitor, and maintain applications on Kubernetes environments (GKE or similar)
- Automate infrastructure provisioning and configuration using tools such as Ansible or equivalent
- Build and manage CI/CD pipelines for efficient and automated deployments
- Ensure high availability, scalability, and security of cloud-based systems
- Monitor system performance, troubleshoot issues, and support production environments
- Collaborate closely with development teams to improve deployment processes and system reliability
- Implement logging, monitoring, and alerting solutions (e.g., Cloud Monitoring, Prometheus, Grafana)
- Manage containerized applications using Docker and Kubernetes
- Maintain and improve Infrastructure as Code (IaC) practices
Senior Site Reliability Engineer
Omilia - Conversational Intelligence
Omilia is the leading provider of Natural Language Understanding enabled IVR & natural dialogue interaction solutions.
- Ensure platform reliability and availability across production and pre-production environments through proactive monitoring, alerting, and automation.
- First response for incidents, contribute to problem management and root cause analysis.
- Supporting the development team's effort towards reliability, creating a solid reliability culture within the development lifecycle.
- Develop troubleshooting documentation for production support resources.
- Collaborate with Engineering teams to develop optimised and productive runbooks, operational documentation and automation of operational tasks.
- Collaborate with development and cloud engineering teams to embed reliability and performance into the software delivery lifecycle.
- Design, implement, and evolve observability solutions (metrics, logs, traces, dashboards) using tools such as Prometheus, Grafana, and ELK.
- Participate in on-call rotations and continuously improve alert quality and response processes.
- Champion a culture of reliability, performance, and continuous improvement across teams.
Role Description
This is a hands-on, get-it-done engineer who keeps ZIRO’s software running smoothly from code to production and everything in between. You take complex, inherited systems and turn them into clean, scalable, well-documented infrastructure that teams actually love to use. You build and improve CI/CD pipelines, monitoring, and release processes that make shipping software fast, safe, and repeatable. At the end of the day, your work empowers engineers to move quickly with confidence and ensures our platforms are stable, reliable, and ready for anything.
What you will do
- Build, scale, and maintain a consistent and repeatable release process for products managed by the R&D Ops team
- Build and maintain infrastructure to support ZIRO’s SaaS products with high availability (failover/disaster recovery)
- Plan and maintain infrastructure to support CI/CD pipeline and automated testing
- Write and update documentation such as runbooks/playbooks for operations to follow
- Act as the point of escalation for operations and engineering
- Provide support and escalation for the CI/CD pipeline
- Build and maintain monitoring and management systems for systems managed by the R&D Ops team
Qualifications
- 5–8+ years in DevOps, SRE, or Infrastructure roles supporting SaaS or enterprise platforms
- Strong experience designing and maintaining CI/CD pipelines, automated testing, and release workflows
- Deep knowledge of cloud infrastructure (Azure/AWS) including high availability and disaster recovery
- Hands-on experience with Infrastructure as Code tools (e.g., Terraform, Ansible) and system automation
- Proven ability to troubleshoot production issues, build monitoring/alerting systems, and partner closely with engineering and ops teams
Benefits
- Flexible, take what you need PTO 🏖️
- Competitive wages 💵
- Company sponsored health, vision and dental plans ⚕️
- Fully remote roles 💻
- Home office budget 📎
- Company sponsored social events 🤩
Company Description
ZIRO is a leader in Unified Communications, helping customers deliver modern voice through Teams Phone and Microsoft 365. We help companies migrate, automate and manage their phone systems with industry-leading technology and decades of expertise with enterprise calling. Our platform simplifies the complex, helps IT teams move faster, and delivers a unified experience to every user. We’re a growing, people-first team that values accountability, courage, innovation, passion, selflessness and good judgment. We are on a mission to make every conversation count, and you can be too!




