Job Closed
This listing is no longer active.
Providing Insights That Elevate Potential
Staff DevOps Engineer
Location
United States
Posted
171 days ago
Salary
$170K - $175K / year
Seniority
Lead
Job Description
Staff DevOps Engineer
Riverside Insights
• Own strategy and implementation for hosting .NET, Python, Node.js, and React applications on AWS using Infrastructure-as-Code (Terraform). • Automate AWS infrastructure and optimize for scalability, observability, and cost. • Support SDLC deployments across all environments through production. • Manage and migrate data stores (Postgres, SQL Server, Oracle) to cloud-native AWS solutions. • Lead tooling improvements for modern CI/CD pipelines. • Prioritize timelines and deliverables across the engineering ecosystem. • Mentor and coach team members while fostering a collaborative DevOps culture. • Partner with engineering and product leaders to analyze requirements and deliver high-quality solutions.
Job Requirements
- Proven experience as a staff engineer, tech lead, or engineering manager with coaching and mentoring responsibilities.
- Expertise in AWS multi-account, multi-region deployments and Infrastructure-as-Code (Terraform).
- Strong knowledge of Kubernetes, auto-scaling strategies, and cost optimization.
- Hands-on experience with CI/CD pipelines (e.g., GitHub Actions), containerization, and observability tools (CloudWatch, DataDog).
- Familiarity with security audits and certifications (e.g., SOC2).
- Proficiency in scripting (bash and/or Python).
- Experience with EdTech platforms or similar SaaS environments (preferred).
- Advanced knowledge of AWS services and emerging cloud technologies (preferred).
- Familiarity with legacy-to-cloud migration strategies (preferred).
- Strong collaboration skills and ability to thrive in fast-paced, ambiguous environments (preferred).
Benefits
- Medical, Dental, and Vision plans
- Company paid basic life and AD and D insurance
- Company paid long-term disability
- Paid Parental Leave
- Supplemental life insurance options
- Company paid Employee Assistance Program (EAP)
- Retirement plan with discretionary company matching
- Flexible Spending Account (FSA) and Health Savings Account (HSA) options
- Premium subscription to Calm for employee and dependents
- 33 days of company paid time off (PTO, Holidays, Wellness Days)
- Quarterly Focus Days
- Flexible work arrangements
- Tuition Reimbursement Program
- Company orientation and 30, 60, 90 Day Onboarding
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Specialist
Tech Minds AgencyA Team of Tech Experts Driving Business Success: Web/Mobile Development, Digital Marketing, and Skill-Enhancing Courses
• Design and implement scalable, secure, and resilient cloud-native applications using Azure Service. • Design and manage Azure Data Lake environments for large-scale data ingestion, processing, and analytics. • Design and implement CI/CD pipelines using Azure DevOps, GitHub Actions, or Jenkins • Develop and deploy cloud applications using Azure services like App Services, Functions, AKS, and Logic Apps • Automate infrastructure provisioning with tools like Terraform, ARM templates, or Bicep • Monitor and optimize cloud environments using Azure Monitor, Application Insights, and Log Analytics • Collaborate with development and operations teams to streamline release cycles and improve system reliability • Troubleshoot and resolve issues in cloud infrastructure and application deployments
• Own end-to-end deployment, publishing, and configuration for iOS and Android mobile applications • Manage App Store Connect and Google Play Console workflows, including signing, provisioning, and compliance • Automate mobile build and release processes to improve consistency and reduce manual effort • Coordinate closely with Engineering, Product, and Professional Services teams to ensure smooth releases • Design, build, and maintain Ansible automation for deployments, APIs, IIS configuration, certificate rotation, and environment standardization • Use Terraform to provision and manage infrastructure in a repeatable, auditable manner • Reduce configuration drift by establishing infrastructure-as-code as the source of truth • Create reusable automation patterns that support both mobile and backend systems • Operate and tune IIS in Windows-based production environments, including performance optimization and safe restarts • Support containerized workloads (Docker/Kubernetes) and help guide their adoption as part of the platform’s future state • Contribute to CI/CD pipeline improvements that support reliable, predictable deployments
Senior ML Infrastructure – DevOps Engineer
Pathwaypathway.com - The smartest way to build Data Products
• Design, operate, and scale GPU and CPU clusters for ML training and inference (Slurm, Kubernetes, autoscaling, queueing, quota management). • Automate infrastructure provisioning and configuration using infrastructure‑as‑code (Terraform, CloudFormation, cluster‑tooling) and configuration management. • Build and maintain robust ML pipelines (data ingestion, training, evaluation, deployment) with strong guarantees around reproducibility, traceability, and rollback. • Implement and evolve ML‑centric CI/CD: testing, packaging, deployment of models and services. • Own monitoring, logging, and alerting across training and serving: GPU/CPU utilization, latency, throughput, failures, and data/model drift (Grafana, Prometheus, Loki, CloudWatch). • Work with terabyte‑scale datasets and the associated storage, networking, and performance challenges. • Partner closely with ML engineers and researchers to productionize their work, translating experimental setups into robust, scalable systems. • Participate in on‑call rotation for critical ML infrastructure and lead incident response and post‑mortems when things break.
• Design, implement, and manage CI/CD pipelines using GitHub Actions, GitOps, GCP-native tools, Azure DevOps, and Jenkins. • Develop, automate, and maintain infrastructure as code across multi-cloud environments (GCP, Azure; Kubernetes, Cloud Run, AKS, GKE). • Build, deploy, and operate containerized services, ensuring health, security, scalability, and high availability. • Integrate tools, platforms, and APIs to streamline operations across AI/ML and virtual agent ecosystems. • Develop and maintain automation scripts and tooling using Python, Shell, and Java/Groovy where appropriate. • Monitor system and application health, implement proactive alerting, and continuously improve reliability and performance. • Troubleshoot infrastructure, networking, and deployment issues across cloud services and application stacks. • Collaborate with engineering and product teams to optimize release processes and platform reliability. • Promote DevOps best practices and maintain clear documentation for system architecture, operations, and troubleshooting.



