Job Closed

This listing is no longer active.

Makro PRO

Makro PRO is an exciting new digital venture by the iconic Makro. Our proud purpose is to build a technology platform that will help make business possible for restaurant owners, hotels, and independent retailers, and open the door for sellers. We welcome bold, energetic, and thoughtful people who share our belief in collaboration, diversity, excellence, and putting customers at the heart of our work.

- Clear focus
- Diverse workplace (our members are from around the world!)
- Non-hierarchical and agile environment
- Growth opportunity and career path

DevOps / SRE

Location

Worldwide

Posted

7 days ago

Salary

0

Seniority

Mid Level

Job Description

Role Description

Own ARIP's operational substrate: infrastructure-as-code (on the Terraform standard of Data Engineering & Advanced Analytic (DEAA)), CI/CD pipelines with eval-gate enforcement, per-agent observability, per-agent cost meter, FinOps, and incident response. Inherit the production environment from the Databricks contractor in Q1 2027 and harden it for Wave 3 scale across 15 agents and 5 suites. Remote candidates outside of Thailand are welcome to apply.

- Adopt DEAA's Terraform standard for all ARIP infrastructure; author ARIP-specific modules (agent runtime, vector DB, KG database); run weekly drift detection, with zero unmanaged production resources
- Build ARIP CI/CD pipelines on DEAA's spine with eval-gate enforcement: no agent reaches production without eval-pass; ≤1hr deployment lead time, ≤15min rollback time
- Implement per-agent cost meter end-to-end (LLM tokens, vector DB queries, model inference) and surface it to DEAA's GenAI Cost Dashboard (DTB-51)
- Stand up the ARIP on-call rotation; author runbooks for every production agent and service; lead incident response; MTTR < 60min for P0/P1
- Implement the ARIP cost tagging policy (team / domain / environment / agent / suite / persona) aligned with DEAA's standard; report monthly to the ARIP Cost Review
- Execute the Databricks contractor hand-over in Q1 2027: inherit IaC, runbooks, observability; refactor to Lotus's standards

Qualifications

- 5+ years of SRE / DevOps experience with production ownership of AI / data-intensive or agent-based platforms
- Expert-level Terraform at enterprise scale: modules, state management, drift detection, environment promotion; Terraform Associate / Professional preferred
- CI/CD for ML/AI services: GitLab CI/CD or comparable with eval-gate integration; cloud (Azure preferred); AZ-500 helpful
- OpenTelemetry + Langfuse (or equivalent) for LLM observability in production; FinOps: tagging policies, per-invocation cost attribution for LLM systems
- Incident response: on-call, post-mortems, runbook authorship at senior level; rollback orchestration with quarterly game days
- Calibre: Senior DevOps / SRE from Agoda, Grab, Shopee, LINE MAN Wongnai, SCBX, KBank, or AI-native infra teams

More DevOps Engineer Jobs

Senior DevOps Engineer

Ping Identity

Identity Security for the Global Enterprise

DevOps Engineer | 7 days ago
Full Time | Remote | Team 1,001-5,000 | Since 2002 | H1B No Sponsor

• Manage customer environments and ensure their uptime, performance and security.
• Fulfill service requests related to customer environments to perform normal day to day operational tasks. This can include configuration changes and access requests.
• Onboard new customers by setting up their environments, configuring standard settings and working with the customer to set up connectivity.
• Upgrade and audit existing customer environments to ensure they are up-to-date with the latest software versions, patches, and security standards.
• Monitor the environment for issues and work with the support organization to troubleshoot in the event of issues.
• Collaborate with the development team to deploy new releases to customer environments.
• Document all procedures and maintain an up-to-date knowledge base for other team members.
• Report to the PingOne Advanced Services Software Development Manager.

United Kingdom

Senior Azure Cloud DevOps Engineer

Carrum Health

Carrum Health is a healthcare company that partners with employers to provide employees access to high-quality medical care through a network of top providers. Carrum Health aims t

DevOps Engineer | 7 days ago

Title: Senior Azure Cloud DevOps Engineer
Location: United States
Job type: Full Time
Department: Engineering
Work type: Remote

Job Description:

At Carrum, we are transforming how we pay for, deliver and experience healthcare. If you are passionate about changing healthcare and want to finally get rid of surprise bills, poor quality, and high prices, while thriving in an entrepreneurial, cutting-edge environment, we would love to connect with you.

In 2014 Carrum reinvented the Centers of Excellence (COE) category in digital health. Today, 95% of the US population lives within 50 miles of a Carrum COE and our providers rank in the top 10% nationally. Our team's execution has been recognized by the venture community and we've raised more than $96M in aggregate from investors like OMERS, Tiger Global Management and Wildcat Ventures. Our impact has been externally proven in a 2021 RAND Corporation study and featured as a Harvard Business School (HBS) case study.

As a Senior Azure Cloud DevOps Engineer, you will be primarily responsible for bridging development and operations through collaboration, infrastructure as code (IaC), automation, and systems maintenance. This role requires excellent communication skills to work with stakeholders, understand their needs, and convey progress. The ideal candidate will excel at designing and implementing best-practice IaC solutions, automating the deployment, scaling, and management of application infrastructure, and maintaining systems to meet company needs. Additionally, you will lead incident response and root cause analysis efforts to ensure quick issue resolution and prevention.

Carrum is at an inflection point where we are modernizing our cloud architecture and building a scalable, compliant Azure foundation for future enterprise growth. You will lead infrastructure projects end-to-end, from problem definition and technical design through ticket creation and implementation, rollout, documentation, and operational ownership. Beyond these core responsibilities, you will be expected to integrate security best practices into our infrastructure lifecycle to ensure compliance with data protection regulations. You will also be responsible for comprehensive documentation of technical procedures, configurations, and deployed solution architectures.

This role reports directly to the Senior DevOps Engineering Manager.

The salary range for this role is $160K-$200K, depending on geography and level of experience, plus possible equity and an annual bonus.

You're excited about this opportunity because you will...

- Work with a diverse group of people from a wide variety of backgrounds that value inclusion and openness
- Hold yourself and others accountable for spending that extra 10% on a project to deliver great documentation in addition to the functionality itself
- Relish working with your team and cross-functionally with members from other teams to both prevent and solve problems that impact our patients
- Be a peer leader and expert on our cloud-based infrastructure platform that powers Carrum Health, and take it to the next level of maturity while keeping it dead simple
- Design for simplicity and intelligence
- Take ownership of the code you develop and release your work product frequently into the production environment
- Write great documentation so that others know how to best utilize the components you build, and to remind your future self
- Channel your inner product manager through well-written stories to inform our backlog and create great products. At times, create your own tickets.

We're excited about you because...

- 7+ years of experience working with cloud infrastructure, CI/CD pipelines, and/or site reliability engineering
- 5+ years of experience working with container orchestration tools (e.g. AKS, EKS) as well as the supporting deployment models and monitoring/observability of them
- Expertise in launching fullstack applications and data ecosystems leveraging technology such as: Github Actions, Azure Terraform Modules, Azure Landing Zones, Azure Networking, Azure App Insights, Azure FrontDoor, Azure PLDMC Services, and Azure Data Factory
- You have previous experience scaling platforms on Azure with a thorough understanding of reference architectures, showcasing your ability to design and implement cloud solutions that align with industry best practices
- Expertise in implementing DevOps culture, fostering collaboration between development and operations teams
- Experienced in designing monitoring solutions around cloud infrastructure management
- A passion for working with Engineering teams to help them identify where and when problems can be solved with cloud infrastructure and pipeline automation
- Experience optimizing system reliability, scalability, and performance through DevOps methodologies
- You are motivated by identifying and prioritizing problems as much as you are solving them, and you are comfortable sharing your recommendations with team members and leaders
- Preferred industry certifications include: AZ-400: Microsoft Certified: DevOps Engineer Expert, Certified Kubernetes Administrator (CKA), Certified Kubernetes Application Developer (CKAD), Docker Certified Associate, HashiCorp Terraform Associate, and security certifications related to Kubernetes, Terraform, and cloud technologies. Please list valid, verifiable certification numbers.
- Your previous platforms have good uptime and don't require the CEO to call us to tell us everything is down, with monitoring tools like New Relic, DataDog, and CloudWatch
- You have interpersonal skills and are empathetic, courteous, and friendly
- You have experience in the healthcare space

Other benefits:

- Stock option plan
- Flexible schedules and remote work
- Chicago and San Francisco offices available
- Self-managed vacation days, within reason
- Paid parental leave
- Health, vision, and dental insurance
- 401K retirement plan

About Carrum

We're a health tech company that brings value-based care to the masses. We help employers deliver a memorable patient experience, immediately lower healthcare costs, and drive better outcomes, and we achieve this through the power of technology and human-centered design. Since launching in 2014, we've partnered with Fortune 500 employers and top hospitals across the nation. We've been recognized by Harvard Business School and featured in TechCrunch, The Los Angeles Times, Washington Post, and Modern Healthcare. We believe we're only scratching the surface of our opportunity and we're looking for incredible people like you to help us realize our full impact.

Carrum Health is an equal opportunity employer and encourages all applicants from every background and life experience.

United States
$160K - $200K / year

Senior Cloud Platform, DevSecOps Engineer

Peraton Corporation

Peraton Corporation, a national security company headquartered in Herndon, Virginia, supplies solutions for mission-critical programs and systems. Founded in 2017, Peraton's missio

DevOps Engineer | 7 days ago

Provide expertise in AWS cloud architecture and migration, manage large-scale Kubernetes platforms, design reusable Infrastructure as Code modules, and implement DevSecOps practices to ensure security and compliance in cloud operations.

Remote
Full Time | Remote | Team 10,001+ | H1B No Sponsor

• Design and implement stable runtime environments based on Kubernetes (RKE2 / Rancher)
• Build and maintain end-to-end CI/CD pipelines using GitLab CI/CD and Nexus
• Orchestrate complex data pipelines and batch workloads in Apache Airflow (KubernetesPodOperator)
• Implement enterprise-grade security standards, including: HashiCorp Vault integration, Secrets management, RBAC policies
• Configure and maintain monitoring and alerting for ML systems (Prometheus, Grafana, ELK)
• Collaborate closely with Data Engineering and Data Science teams to optimize container resource usage

Poland