Talent Hackers

Top talent from the fastest-growing continent on earth.

Senior DevOps Engineer

DevOps Engineer, Full Time, Remote, Senior, Team 11-50, Since 2024, H1B No Sponsor

Location

South Africa

Posted

19 days ago

Salary

0

Seniority

Senior

Job Description

  • Build and maintain observability, alerting, and triage systems
  • Improve system reliability and incident response
  • Establish and manage multi-stage environments (dev, staging, prod)
  • Strengthen infrastructure security across IAM, networking, and secrets management
  • Design and implement CI/CD pipelines with automated testing and deployment
  • Support SOC 2 compliance by implementing monitoring, access controls, and audit-ready infrastructure
  • Develop OAuth-based authentication and enable client-specific SSO integrations
  • Improve performance and efficiency through infrastructure/database optimization
  • Enhance job scheduling, alerting, and internal tooling to increase engineering efficiency

Job Requirements

  • 5+ years of DevOps experience (preferably in a startup environment)
  • Demonstrated hands-on experience working with AWS
  • Experience with Kubernetes, Terraform, Docker, and CI/CD
  • Strong (C-level) English communication skills
  • Ability to work full-time on EST hours

More DevOps Engineer Jobs

Site Reliability Engineer

Review ALL

We are your Recruitment Team!

DevOps Engineer, 19 days ago
Full Time, Remote, Team 11-50, Since 2023, H1B No Sponsor

  • Own reliability for our global bare metal fleet: monitoring, alerting, incident response, post-mortems
  • Build and maintain internal tooling: NetBox (infrastructure source of truth), Python/Go services
  • Drive automation for hardware lifecycle: provisioning, decommissioning, firmware updates, network changes
  • Collaborate with platform engineers on the provisioning stack
  • Participate in on-call rotation

Brazil

Senior DevOps Engineer

HostPapa

Let Papa take care of you!

DevOps Engineer, 19 days ago
Full Time, Remote, Team 51-200, Since 2006, H1B No Sponsor

  • Design, evolve, and operate scalable and elastic cloud architectures for multi-tenant SaaS platforms
  • Continuously challenge and improve existing infrastructure and architectural decisions to remove performance, scalability, and operability bottlenecks
  • Design and maintain cloud-native and hybrid solutions, integrating cloud platforms with on-prem systems when required
  • Build, maintain, and improve CI/CD pipelines that enable fast, safe, and repeatable deployments
  • Promote and enforce Infrastructure as Code (IaC) practices using Terraform
  • Automate provisioning, configuration, scaling, and recovery to reduce manual operational effort
  • Improve deployment strategies in collaboration with SRE teams to increase reliability and predictability
  • Design and operate containerized platforms using Docker and Kubernetes
  • Support and evolve microservices architectures, ensuring deployment safety, isolation, and scalability
  • Operate and support production and pre-production environments and troubleshoot complex infrastructure issues
  • Participate in incident response and on-call rotations when required, working with SREs to reduce operational toil
  • Maintain clear and up-to-date documentation for infrastructure, pipelines, and operational procedures
  • Partner closely with engineering teams to improve developer experience, delivery velocity, and platform reliability
  • Support other tasks or projects as assigned to meet team and business needs

Canada

Senior Site Reliability Engineer

TechInsights

The most trusted source of semiconductor analysis and market information

DevOps Engineer, 19 days ago
Full Time, Remote, Team 201-500, Since 1989, H1B No Sponsor

  • Own SLOs, SLIs, and error budgets for all production services; drive error budget discipline across engineering
  • Design reliability patterns for AI agent pipelines: LLM observability, tool-use tracking, failure detection, and graceful degradation
  • Architect for blast radius containment: agent failures must have bounded customer impact through isolation, circuit breaking, and rapid recovery
  • Mature our Canada Central/West active-active architecture toward 24-hour RTO with full regional failover
  • Lead incident response and post-incident reviews that produce durable fixes; maintain DR procedures through regular testing
  • Serve as the primary reliability liaison to Software and AI Engineering, translating requirements into actionable standards
  • Partner with AI Engineering on compute provisioning, model serving, inference latency, and workload isolation
  • Own CI/CD pipeline strategy (Bitbucket Pipelines, GitHub Actions): set standards, optimize deployment frequency, and ensure teams can ship confidently
  • Drive IDP adoption and enable teams on SRE practices: on-call readiness, SLO definition, runbook development, and self-service tooling
  • Represent reliability in architectural discussions; surface risk before it's committed to design
  • Operate Datadog as the single pane of glass for service health, infrastructure, and agentic pipeline telemetry
  • Extend observability to AI workloads: LLM latency, token consumption, agent completion rates, and pipeline throughput
  • Build golden path templates in Backstage and/or Atlassian Compass so teams ship reliably without routine SRE involvement
  • Own infrastructure as code via Terraform and GitOps; enforce IaC policy in partnership with Trust Assurance
  • Own FinOps visibility into AWS cost segments; model cloud cost impact as AI/ML workloads scale
  • Formally mentor junior and intermediate SRE engineers, with accountability for their technical growth and career progression
  • Build AI-assisted automation to progressively reduce toil and scale the team's operational capacity

Poland
zł18.8K - zł20K / year

Lead Mission Engineer (DevOps)

STR

STR makes the world a safer place by developing technology and applying it to solve emerging national security challenges.

DevOps Engineer, 19 days ago
Full Time, Remote, Team 800, Since 2010

About the Team:

Information processing and sense-making systems are the lifeblood of national security efforts, facilitating an understanding of the global situation, strategic planning, and tactical execution. The reliability, accessibility, and sophistication of these systems can determine the outcomes of conflicts before they begin. STR’s Analytics and Command & Control (AC2) Division focuses on developing actionable, advanced technology solutions to provide asymmetric advantages within the information domain.

The Mission Applications (MA) Group within the AC2 Division specializes in ensuring the technology that STR creates produces outsized mission impact. Staff within the MA group have a deep understanding of missions and technology needs for national security with skills targeting transition of technology to operational use, including product management, UX Product implementation, software integration & mission engineering, platform engineering, DevOps, and program management. MA staff combine with other science and engineering staff from the various STR research and development groups to form dedicated product teams focused on accelerating the operational transition of cutting-edge software technologies.

The Role:

As a Lead Mission Engineer (DevOps) in the Mission Applications Group, you will develop and deploy software essential for a specific, real-world objective or "mission" in partnership with our customers. You have: 1) expertise in developing and deploying software solutions in complex and high-security environments, 2) deep domain knowledge in the complex missions executed across the Department of Defense, and 3) expertise in modern frameworks, building full stack applications, and integrating APIs for deployment in high-security environments. You will work as a leader of software teams to design, develop, and maintain infrastructure and tooling to deploy software in operational use. You have experience managing relationships with customer environments while simultaneously leading engineering teams to ensure the delivered product meets the highest standards of quality, reliability, and performance.

What you will do:

  • Supporting cloud (e.g., AWS) toolsets in unclassified environments
  • Configuring and maintaining multiple CI/CD environments at different classification levels, focusing on developer productivity, application performance, security monitoring, and alerting
  • Deployment and Field Testing: Traveling to integration sites to deploy, test, and triage software in secure operational environments
  • Running and managing incident response and post-outage reviews to improve deployment, monitoring, and alerting processes
  • Coaching team members on DevOps best practices while continuously helping to improve processes

Who You Are:

  • Active Top Secret Security Clearance with SCI eligibility, for which U.S. citizenship is required by the U.S. government
  • BS in Computer Science or related technical field with at least 7 years of work experience. Equivalent experience will be considered
  • Experience deploying software for DoD or IC missions
  • A passion for crafting user-centric interfaces and a keen eye for design details
  • Demonstrated strong knowledge of modern technologies such as React, TypeScript, and Python
  • Knowledge of performance-focused tools and techniques for rendering large datasets and building responsive, smooth interfaces
  • A demonstrated ability to adopt new languages, libraries, and technologies
  • Organized, detail-oriented, and with an ability to work both independently and collaboratively
  • Demonstrated experience as an effective communicator to both technical and non-technical audiences

Even Better:

  • Active TS/SCI security clearance
  • Advanced Degree in Computer Science, Information Technology, or related technical field
  • Prior US Military Service with experience in joint planning, large force battle monitoring, and fluidity with large force debrief best practices
  • Located in San Diego, CA, Denver, CO, Arlington, VA, Boston, MA, or willing to relocate

Pay Information

Full-Time Salary Range: $175,000 - $240,000

The salary range listed is based on external market data. Offers are based on factors such as, but not limited to, the candidate’s experience, education, training, key skills/critical skills, security clearances, and prevailing market and business conditions.

STR is a growing technology company with locations near Boston, MA, Arlington, VA, near Dayton, OH, Melbourne, FL, and Carlsbad, CA. We specialize in advanced research and development for defense, intelligence, and national security in: cyber; next generation sensors, radar, sonar, communications, and electronic warfare; and artificial intelligence algorithms and analytics to make sense of the complexity that is exploding around us.

STR is committed to creating a collaborative learning environment that supports deep technical understanding and recognizes the contributions and achievements of all team members. Our work is challenging, and we go home at night knowing that we pushed the envelope of technology and made the world safer.

STR is not just any company. Our people, culture, and attitude along with their unique set of skills, experiences, and perspectives put us on a trajectory to change the world. We can't do it alone, though - we need fellow trailblazers. If you are one, join our team and help to keep our society safe! Visit us at www.str.us for more info.

STR is an equal opportunity employer. We are fully dedicated to hiring the most qualified candidate regardless of race, color, religion, sex (including gender identity, sexual orientation and pregnancy), marital status, national origin, age, veteran status, disability, genetic information or any other characteristic protected by federal, state or local laws. If you need a reasonable accommodation for any portion of the employment process, email us at appassist@str.us and provide your contact info.

Pursuant to applicable federal law and regulations, positions at STR require employees to obtain national security clearances and satisfy the requirements for compliance with export control and other applicable laws.

Virginia
$157K - $224K / year