Job Closed

This listing is no longer active.

Sezzle

Financially empowering the next generation of consumers.

DevOps Engineer II

DevOps Engineer | Full Time | Remote | Senior | Team 201-500 | Since 2016 | H1B Sponsor

Location

Mexico

Posted

68 days ago

Salary

$2.7K - $6K / month

Seniority

Senior

Bachelor Degree | 3 yrs exp | English | Kubernetes | Linux | Shell Scripting

Job Description

• Act as a technical partner to developers: debugging pipelines, build failures, and deployment/configuration issues.
• Diagnose CI/CD failures across GitLab, Helm, Kubernetes, and container build systems.
• Work closely with engineers across the company to understand pain points and opportunities for tooling enhancements.
• Leverage AI tooling to help yourself and other teams be more productive in pipeline development and troubleshooting.
• Measure and optimize the performance, reliability, and maintainability of CI/CD pipelines and platform services.
• Drive initiatives to reduce waste in the development lifecycle.
• Design and evolve Kubernetes-based deployment workflows using GitLab, Helm, and best-practice DevOps patterns.
• Own platform and infrastructure components that accelerate development.

Job Requirements

At least 3 years of DevOps, platform, or full-stack software engineering experience.
Practical experience with Kubernetes, Helm, and containerized deployment workflows.
Comfort with shell scripting and automation in Linux-based environments.
Familiarity with AI-based dev tooling, such as Claude Code, Codex, Gemini CLI, Cursor.
BS in Computer Science or similar degree, or equivalent work experience.

More DevOps Engineer Jobs

Senior DevOps Engineer – Infrastructure & Systems

FlyingEye

Professional drones. Training. Repair & maintenance. Security, surveillance, photogrammetry, LIDAR, agriculture

DevOps Engineer | 68 days ago

Full Time | Remote | Team 11-50 | Since 2009 | H1B No Sponsor

• Design and maintain Docker / Docker Compose environments
• Automate installations and deployments (scripts, templates, best practices)
• Define operating procedures and runbooks
• Take part in design and deployment on IaaS / cloud environments
• Implement backup, restore, and high-availability strategies
• Administer and evolve internal tooling: self-hosted GitLab, runners, registry
• Structure and evolve CI/CD (GitLab CI; GitHub Actions depending on context)
• Deploy and maintain an observability stack (Prometheus / Grafana / Loki or equivalent)
• Handle networking topics: routing, firewalls, proxies, DNS
• Apply pragmatic hardening (access, services, ports, logs, secrets)

Cloud DNS Docker Grafana Linux Prometheus

France

Job Closed

Principal Site Reliability Engineer - Observability

Elastic

Self-described as the leading platform for search-powered solutions, Elastic helps organizations, their customers, and their employees find what they need faster while protecting a

DevOps Engineer | 68 days ago

Full Time Remote

Role Description

We're looking for a Principal Site Reliability Engineer to join the Observability Solution as part of the team building the next generation of Infrastructure Observability experiences leveraging the new Search AI and agentic capabilities.

- Collaborate with product management, product design, customers, and multiple teams across Elastic (especially our own SRE teams) in defining and evolving the end-to-end InfraObs experiences that enable both human and agentic users.
- Deliver and continually evolve the experiences leveraging the Elastic Platform capabilities and coding agents.
- Be a contact point for other teams within Elastic, including helping Support with difficult cases or aligning with teams providing the foundations for developing integrations.
- Consult with the Elastic Stack engineers on designing new features.
- Foster a culture of mutual respect, collaboration, and consensus-based decision-making.
- Be an awesome person to work with, somebody who sincerely empathizes with others.

Qualifications

- Engineers with an SRE background and experience operating large-scale production services with the help of Observability tools.
- Proficiency operating production infrastructure in K8s and at least one of the three major CSPs.
- Proficiency using Observability tools.
- Ability to work with a high level of autonomy, tackling projects and guiding them from beginning to end.
- Ability to use AI coding agents in the delivery workflow.
- Excellent verbal and written communication skills.

Requirements

- Experience as a user of the Elastic Stack (bonus points).

Benefits

- Competitive pay based on the work you do here and not your previous salary.
- Health coverage for you and your family in many locations.
- Ability to craft your calendar with flexible locations and schedules for many roles.
- Generous number of vacation days each year.
- We match up to $2000 (or local currency equivalent) for financial donations and service.
- Up to 40 hours each year to use toward volunteer projects you love.
- Minimum of 16 weeks of parental leave.

Spain

Site Reliability Engineer

Fortyx

AI powered cybersecurity assistant, combining threat prevention and cyber awareness training in real time.

DevOps Engineer | 68 days ago

Full Time | Remote | Team 1-10 | H1B No Sponsor

• Collaborate with software engineering and operations teams to design, build, and maintain cloud-based infrastructure using AWS and Terraform
• Implement and enhance infrastructure-as-code (IaC) practices using Terraform to ensure reproducibility and scalability of infrastructure components
• Develop and maintain monitoring solutions to proactively identify performance bottlenecks, system outages, and other potential issues
• Participate in incident response and root cause analysis efforts to drive continuous improvement and prevent future incidents
• Optimise system performance, reliability, and cost efficiency through continuous monitoring, performance tuning, and capacity planning
• Identify opportunities to automate manual processes and improve system resilience
• Utilise Python or Bash scripting to create and maintain automation tools for various operational tasks and deployments
• Implement and improve continuous integration and continuous deployment (CI/CD) pipelines
• Collaborate with security teams to implement best practices for securing cloud infrastructure and services
• Ensure compliance with relevant industry standards and regulations
• Support CI/CD pipelines for application deployments and updates
• Contribute to the design and implementation of deployment strategies that promote zero-downtime releases
• Maintain clear and up-to-date documentation for infrastructure configurations, processes, and incident resolution procedures
• Participate in knowledge sharing with team members to enhance overall expertise and skill sets

AWS Cloud EC2 Python Terraform

United Kingdom

Manager, Site Reliability Engineer

Tempo Software

Adaptive SPM for AI-Accelerated Innovation | Modular Solutions, Compounding Value | 30,000+ Customers

DevOps Engineer | 68 days ago

Full Time | Remote | Team 201-500 | H1B No Sponsor

• Lead, mentor, and grow a team of Site Reliability Engineers, focusing on career development, performance management, and hiring.
• Define the team's roadmap and strategy for platform reliability, scaling, and operational efficiency.
• Provide technical oversight and direction for the design and implementation of key infrastructure projects, including CI/CD pipelines and automation for build, release, and deployment processes.
• Partner closely with engineering teams and product managers to ensure the reliability and performance requirements of new products and features are met.
• Oversee the maintenance and continuous improvement of the AWS-based platform to ensure it scales effectively.
• Drive the adoption of AI tooling to enhance SRE productivity and introduce intelligent automation of SRE processes.
• Champion SRE best practices, including error budget management, effective on-call rotations, incident response, and post-mortem processes.

AWS Cloud Kubernetes

Canada
