Job Closed
This listing is no longer active.
Financially empowering the next generation of consumers.
DevOps Engineer II
Location
Mexico
Posted
68 days ago
Salary
$2.7K - $6K / month
Seniority
Senior
Job Description
DevOps Engineer II
Sezzle
• Act as a technical partner to developers—debugging pipelines, build failures, and deployment/configuration issues. • Diagnose CI/CD failures across GitLab, Helm, Kubernetes, and container build systems. • Work closely with engineers across the company to understand pain points and opportunities for tooling enhancements. • Leverage AI tooling to help yourself and other teams be more productive in pipeline development and troubleshooting. • Measure and optimize the performance, reliability, and maintainability of CI/CD pipelines and platform services. • Drive initiatives to reduce waste in the development lifecycle. • Design and evolve Kubernetes-based deployment workflows using GitLab, Helm, and best-practice DevOps patterns. • Own platform and infrastructure components that accelerate development.
Job Requirements
- At least 3 years of DevOps, platform, or full-stack software engineering experience.
- Practical experience with Kubernetes, Helm, and containerized deployment workflows.
- Comfort with shell scripting and automation in Linux-based environments.
- Familiarity with AI-based dev tooling, such as Claude Code, Codex, Gemini CLI, Cursor.
- BS in Computer Science or similar degree, or equivalent work experience.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Ingénieur(e) DevOps – Infrastructure & Systèmes Senior
FlyingEyeDrones Professionnels. Formation. Réparation & entretien. Sécurité, Surveillance, Photogrammétrie, LIDAR, Agriculture
• Concevoir et maintenir les environnements Docker / Docker Compose • Automatiser les installations et déploiements (scripts, templates, bonnes pratiques) • Définir des procédures d’exploitation et des runbooks • Participer à la conception et au déploiement sur des environnements IaaS / Cloud • Mettre en place des stratégies de sauvegarde, restauration et haute disponibilité • Administrer et faire évoluer les outils internes : GitLab self-hosted, runners, registry • Structurer et faire évoluer la CI/CD (GitLab CI ; GitHub Actions selon contexte) • Déployer et maintenir une stack d’observabilité (Prometheus / Grafana / Loki ou équivalent) • Gérer les sujets réseau : routage, pare-feu, proxy, DNS • Appliquer un hardening pragmatique (accès, services, ports, journaux, secrets)
Principal Site Reliability Engineer - Observability
ElasticSelf-described as the leading platform for search-powered solutions, Elastic helps organizations, their customers, and their employees find what they need faster while protecting a
Role Description We're looking for a Principal Site Reliability Engineer to join the Observability Solution as part of the team building the next generation of Infrastructure Observability experiences leveraging the new Search AI and agentic capabilities. - Collaborate with product management, product design, customers, and multiple teams across Elastic (especially our own SRE teams) in defining and evolving the end-to-end InfraObs experiences that enable both human and agentic users. - Deliver and continually evolve the experiences leveraging the Elastic Platform capabilities and coding agents. - Be a contact point for other teams within Elastic, including helping Support with difficult cases or aligning with teams providing the foundations for developing integrations. - Consult with the Elastic Stack engineers on designing new features. - Foster a culture of mutual respect, collaboration, and consensus-based decision-making. - Be an awesome person to work with, somebody who sincerely empathizes with others. Qualifications - Engineers with an SRE background and experience operating large-scale production services with the help of Observability tools. - Proficiency operating production infrastructure in K8s and at least one of the three major CSPs. - Proficiency using Observability tools. - Ability to work with a high level of autonomy, tackling projects and guiding them from beginning to end. - Ability to use AI coding agents in the delivery workflow. - Excellent verbal and written communication skills. Requirements - Experience as a user of the Elastic Stack (bonus points). Benefits - Competitive pay based on the work you do here and not your previous salary. - Health coverage for you and your family in many locations. - Ability to craft your calendar with flexible locations and schedules for many roles. - Generous number of vacation days each year. - We match up to $2000 (or local currency equivalent) for financial donations and service. - Up to 40 hours each year to use toward volunteer projects you love. - Minimum of 16 weeks of parental leave.
Site Reliability Engineer
FortyxAI powered cybersecurity assistant, combining threat prevention and cyber awareness training in real time.
• Collaborate with software engineering and operations teams to design, build, and maintain cloud-based infrastructure using AWS and Terraform • Implement and enhance infrastructure-as-code (IaC) practices using Terraform to ensure reproducibility and scalability of infrastructure components • Develop and maintain monitoring solutions to proactively identify performance bottlenecks, system outages, and other potential issues • Participate in incident response and root cause analysis efforts to drive continuous improvement and prevent future incidents • Optimise system performance, reliability, and cost efficiency through continuous monitoring, performance tuning, and capacity planning • Identify opportunities to automate manual processes and improve system resilience • Utilise Python or Bash scripting to create and maintain automation tools for various operational tasks and deployments • Implement and improve continuous integration and continuous deployment (CI/CD) pipelines • Collaborate with security teams to implement best practices for securing cloud infrastructure and services • Ensure compliance with relevant industry standards and regulations • Support CI/CD pipelines for application deployments and updates • Contribute to the design and implementation of deployment strategies that promote zero-downtime releases • Maintain clear and up-to-date documentation for infrastructure configurations, processes, and incident resolution procedures • Participate in knowledge sharing with team members to enhance overall expertise and skill sets
Manager, Site Reliability Engineer
Tempo SoftwareAdaptive SPM for AI-Accelerated Innovation | Modular Solutions, Compounding Value | 30,000+ Customers
• Lead, mentor, and grow a team of Site Reliability Engineers, focusing on career development, performance management, and hiring. • Define the team's roadmap and strategy for platform reliability, scaling, and operational efficiency. • Provide technical oversight and direction for the design and implementation of key infrastructure projects, including CI/CD pipelines and automation for build, release, and deployment processes. • Partner closely with engineering teams and product managers to ensure the reliability and performance requirements of new products and features are met. • Oversee the maintenance and continuous improvement of the AWS-based platform to ensure it scales effectively. • Drive the adoption of AI tooling to enhance SRE productivity and introduce intelligent automation of SRE processes. • Champion SRE best practices, including error budget management, effective on-call rotations, incident response, and post-mortem processes.




