Job Closed

This listing is no longer active.

Everseen

Inventing with Heart.

DevOps Engineer – III

DevOps Engineer | Full Time | Remote | Senior | Team 1,001-5,000 | Since 2007 | H1B No Sponsor

Location

Australia

Posted

119 days ago

Seniority

Senior

Job Description

Everseen: A leader in vision AI solutions for the world’s leading retailers.

The role: As a DevOps Engineer III, you will be part of the L3 support team for Operations across Edge/on-prem and cloud, owning complex incidents end-to-end: triage, deep-dive debugging, root-cause analysis, remediation, and follow-ups.

A good understanding of our product, its components, and their interactions is essential for troubleshooting and problem remediation. Strong Linux administration (primarily RHEL, plus Ubuntu) and OpenShift/Kubernetes expertise are essential.

To reduce Ops toil, you will build targeted automations (Python, Bash, Ansible) and automate new and existing SOPs (Standard Operating Procedures) used by Operations.

You will execute safe cloud deployments and upgrades via GitOps and IaC pipelines (Flux, Ansible, Terraform) on AKS and GKE, coordinating validation and rollback plans, and contribute to the maintenance of existing GitLab CI/CD pipelines together with the DevOps engineering teams.

You will design and continuously refine Alertmanager rules and standardize actionable Grafana dashboards with Operations, ensuring effective use of Prometheus metrics and logs (Grafana Alloy, Thanos).

Beyond day-to-day operations, you’ll apply deep DevOps, CI/CD, and infrastructure automation expertise, drive best practices, share knowledge through workshops and mentoring, write and maintain documentation and SOPs, test infrastructure, and collaborate across teams to optimize systems and workflows.

Job Requirements

  • 4+ years in DevOps-related roles with a strong focus on automation.
  • Proficient in DNS, routing, container communication, firewalls, reverse-proxying, load-balancing, and edge-to-cloud communication and troubleshooting.
  • Strong system administration skills, required for troubleshooting OS-level outages and for deploying and troubleshooting Everseen’s containerized Edge application in customer networks.
  • Extensive experience with Azure (or GCP), including fully automated infrastructure and deployment.
  • Experience with monitoring and optimizing cloud costs.
  • Proven experience in implementing and managing CI/CD pipelines (GitLab CI/CD preferred) and excellent knowledge of Git and associated workflows (e.g., Gitflow).
  • Proven experience with monitoring, logging, and alerting tools and stacks.
  • Excellent scripting skills in Bash and Python.
  • Advanced knowledge of Kubernetes and OpenShift, including cluster management, orchestration and auto-scaling, and deployments using Helm charts and GitOps.
  • Proven experience with microservices architecture and related deployment strategies.
  • Expertise with Terraform modules.
  • Deep experience with Ansible, including writing complex playbooks, roles, and using Ansible Vault for secrets management.
  • Strong understanding of DevSecOps principles and experience implementing security best practices within CI/CD pipelines.
  • Capable of engaging in technical discussions with stakeholders, leading DevOps projects, and mentoring and coaching team members.

Benefits

  • Health insurance
  • Retirement plans
  • Paid time off
  • Flexible work arrangements
  • Professional development opportunities
