Job Description
DevOps Engineer
Thrill
• Build and maintain production infrastructure in AWS. • Manage Linux servers. • Operate Kubernetes clusters. • Administer and optimize PostgreSQL databases. • Operate monitoring & observability. • Be part of the on-call rotation for the infrastructure components. • Ownership of the CI/CD process. • Work on improving infrastructure and application security. • Manage CloudFlare, WAF, and DDoS protection solutions to improve our stance in this area.
Job Requirements
- 7–10 years of Infra/DevOps experience (minimum 5 years).
- Hands‑on AWS, Kubernetes, Postgres, Linux.
- Strong networking + monitoring/observability experience.
- Python skills and Pulumi knowledge are a plus.
- Understanding or interest in infrastructure & application security and threat mitigation.
- English B2.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Apply SRE principles to Customer Success • Detect issues commonly occurring in the platform • Proactively find improvements in the platform • Work on escalations and longer-running, more complex technical cases • Assist those using the Supabase platform with complex and/or long-running issues • Deliver on synchronous and asynchronous engagements with Supabase customers • Serve as an internal champion for the platform and how customers use it.
Site Reliability Engineer – AI & ML Infrastructure, Kubernetes, Terraform
DeepgramBuilding foundational AI for speech transcription and understanding.
• Architect and maintain our core computing platform using Kubernetes on AWS and on-premise, providing a stable, scalable environment for all applications and services. • Develop and manage our entire infrastructure using Infrastructure-as-Code (IaC) principles with Terraform, ensuring our environments are reproducible, versioned, and automated. • Design, build, and optimize our AI/ML job scheduling and orchestration systems, integrating Slurm with our Kubernetes clusters to efficiently manage GPU resources. • Provision, manage, and maintain our on-premise bare metal server infrastructure for high-performance GPU computing. • Implement and manage the platform's networking (CNI, service mesh) and storage (CSI, S3) solutions to support high-throughput, low-latency workloads across hybrid environments. • Develop a comprehensive observability stack (monitoring, logging, tracing) to ensure platform health, and create automation for operational tasks, incident response, and performance tuning. • Collaborate with AI researchers and ML engineers to understand their infrastructure needs and build the tools and workflows that accelerate their development cycle. • Automate the life cycle of single-tenant, managed deployments
Senior DevOps Software Engineer
eClinical SolutionsWe bring people and data together to support tomorrow’s breakthroughs
• Design, develop, test, and deploy scalable, secure, and highly interactive web applications • Own and evolve core platform modules • Influence application and system architecture • Lead by example through clean, well-tested code • Collaborate closely with Product Management, QA, and other engineers • Provide technical mentorship and guidance to other engineers • Diagnose and resolve complex production issues • Ensure solutions meet eClinical Solutions quality standards
Senior Site Reliability Engineer
ZscalerZscaler helps leading organizations in 180+ countries securely transform their networks and applications for a mobile and cloud-first world. Founded in 2008, th
• Expertly navigate networking principles, firewalls, and load balancing solutions to ensure robust infrastructure performance • Partner with Software Engineering and Infrastructure teams to design, implement, and deploy comprehensive end-to-end monitoring solutions • Execute seamless patches and upgrades, ensuring all administrative tools and utilities remain current and high-performing • Proactively monitor applications and services, participating in an on-call rotation to resolve issues and implement strategic prevention measures • Troubleshoot complex technical challenges and provide clear, candid communication regarding issues and their resolutions.




