Job Closed

This listing is no longer active.

Nerdy Dragon

Senior DevOps Engineer – Full Time Contractor

DevOps EngineerDevOps EngineerContract Remote SeniorTeam 1-10Since 2008H1B No SponsorCompany Site LinkedIn

Location

Costa Rica

Posted

140 days ago

Salary

Seniority

Senior

Bachelor Degree4 yrs expEnglishAWS Amazon EC2 Kubernetes Python Ruby SDLC Terraform

Job Description

• Creation and management of AWS cloud infrastructure and configuration as well as ensuring systems and services are leveraging best practices, security, and standards. • Strong experience and expertise with managing, securing, and scaling Kubernetes. • Provide strong partnership with development teams to develop internal platforms and services to enable expediency and self-service for development teams. • Provide partnership with other teams such as financial, legal, and operations. • Provide, maintain, and optimize the build process to support continuous integration. • Investigates, debugs and drives improvements to the engineering and build automation process. • Support and improve efficiency and effectiveness of tools (CI/CD, automated testing, automation and release). • Work cross-functionally to assess risk and help deliver countermeasures that protect customers and company data • Ensure that the SaaS services and associated infrastructure maintain required levels of security, availability, reliability, scalability, and performance to meet SLAs. • Build incident management, operational monitoring, and alerting capabilities to proactively report, troubleshoot, and fix problems. • With the team, own vendor management for existing and new vendors the team leverages. • Regular on-call rotations for team owned services and infrastructure.

Job Requirements

4+ years of industry experience enabling developers through DevOps
Industry experience with security and vulnerability of cloud infrastructure (prevention, detection, and remediation).
Strong experience building and maintaining AWS (Amazon Web Services) infrastructure (EC2, ELB, ALB, VPC, IAM, Route53, Lambda, Kinesis, CloudWatch, etc.).
Experience deploying and managing infrastructure with Terraform
Experience with various programming languages (e.g. Python, Ruby, Golang, shell).
Strong experience with pipeline tools such as GitHub Actions.
Diligent mentality, robust sense of responsibility/accountability and strong verbal/written communication, documentation, and collaboration skills.
Demonstrated experience collaborating with a team to develop and enhance amazing platforms and services.
AI-native developer productivity, comfortable using and securing AI tools in the SDLC (code assistants, automation, prompt/tool orchestration)

Benefits

Competitive USD Compensation: Enjoy a market-leading rate paid in U.S. dollars.
100% Remote (Home Country Only): Work from anywhere in your home country—no relocation required, no borders crossed.
Flexible Time Off: Our flexible PTO lets you recharge on your own terms and when you need it the most.
Local Holiday Pay: We honor your nation’s official holidays with paid time off—celebrate what matters to you.
Continuous Learning: Get a free, all-inclusive learning membership for you and your household—including 1-on-1 tutoring hours, unlimited on-demand classes, and access to our full suite of learning products and services.
Supercharge with AI: Gain exclusive access to cutting-edge AI tools that boost your productivity, making you feel almost super-human (cape not included).
Feedback-Rich, Collaborative Culture: Tap into regular training, peer reviews, and a team that treats every team member as a vital collaborator and owner in our success.
Make a Global Impact: Your expertise fuels an innovative platform used by learners around the world—be part of something transformative.

Related Categories

DevOps Engineer

Related Job Pages

Remote Python Jobs (US)More Remote Jobs

More DevOps Engineer Jobs

Senior Site Reliability Engineer

JUUL Labs

An electronic cigarette company, JUUL Labs is the creator of the JUUL e-cigarette, which uses nicotine salts found in leaf-based tobacco. Founded to improve the

DevOps Engineer141 days ago

Other Remote

Company Site

• A Senior Site Reliability Engineer (SRE) is expected to own the operational stability and performance of Juul’s hybrid cloud infrastructure (Nutanix, AWS/GCP). • This involves leading automation efforts, architecting for reliability, and acting as the final escalation point for critical incidents to ensure the platform is scalable and efficient. • Design, deploy, and maintain enterprise-scale Nutanix AHV clusters and Prism Central for multi-cluster management. • Expert-level proficiency with Nutanix CLI (nCLI and acli) for advanced operations, troubleshooting, and automation. • Develop automation scripts using Nutanix REST APIs, Python SDK, PowerShell, and Terraform for infrastructure-as-code. • Design disaster recovery solutions using Leap, Protection Domains, cross-cluster replication, and metro clustering. • Lead L3 troubleshooting using advanced diagnostics, log analysis (CVM, Genesis), NCC health checks, and cluster service resolution.

AWS GCP Kubernetes Python TCP/IP Terraform

View details: Senior Site Reliability Engineer

United States

$150K - $184K / year

Apply

Job Closed

Cloud Operations Engineer

CyberSheath

Assess, Implement, Manage (AIM™)

DevOps Engineer141 days ago

Other RemoteTeam 51-200Since 2012H1B No Sponsor

Company Site LinkedIn

• Provision and deliver computer systems and services, both on-premise and cloud hosted solutions • Regularly perform migrations to Office 365 (email, SharePoint, OneDrive, Teams), server migrations to Azure, and implement various Azure technologies • Design and deliver secure cloud solutions in Office 365 and Azure • Architecture, design, system evaluation and analysis, and infrastructure assessments • Deploy and maintain tools and monitoring agents such as endpoint protection, vulnerability management, log collection, multifactor authentication, RMM, etc. • Track all activities, detailed case notes, and time entries within the Service Desk Ticketing system • Work with internal CyberSheath stakeholders to ensure key tasks and timeline are identified, on time, and on budget • Provide timely updates and feedback to the internal team and clients regarding project status activities being on-track. • Drive technical implementation tasks to completion by budgeted deadlines • Proactively communicate with clients to ensure requests are properly addressed • Collaborate with team members to troubleshoot onboarding implementation issues as they arise • Other duties as assigned.

Azure TCP/IP

View details: Cloud Operations Engineer

United States

$110K - $130K / year

Apply

Job Closed

Senior Site Reliability Engineer

Zeta Global

We deliver better experiences for consumers and better results for your brand.

DevOps Engineer141 days ago

Other RemoteTeam 1,001-5,000Since 2007H1B Sponsor

Company Site LinkedIn

• Design, implement, and manage SLOs, SLIs, and error budgets, ensuring reliability aligns with user expectations and business objectives. • Develop production-grade software to enhance system reliability and reduce manual toil through automation. • Implement and optimize observability solutions using tools like OpenTelemetry, with a focus on high-cardinality metrics, distributed tracing, and actionable insights. • Drive postmortem processes and lead in-depth root cause analyses for incidents, ensuring lessons learned are effectively applied to prevent recurrence. • Define and monitor MTTx metrics (MTTA, MTTR, MTTF), using them to guide system improvements and measure reliability progress. • Design and participate in Chaos Engineering exercises. • Collaborate with engineering teams to design systems with reliability and scalability in mind, incorporating capacity planning, resiliency patterns, and modern deployment strategies (e.g., Canary, Blue-Green). • Lead design reviews for alerting strategies, ensuring effective signal-to-noise ratios in monitoring and incident management. • Advocate for and implement best practices in incident response and system design to achieve optimal uptime and performance.

AWS Distributed Systems Docker Grafana Jenkins Kubernetes Microservices Prometheus Python Terraform

View details: Senior Site Reliability Engineer

United States

$140K - $170K / year

Apply

Job Closed

Manager, Site Reliability Engineering

Veeam Software

Your Single Backup and Data Management Platform for Cloud, Virtual and Physical

DevOps Engineer141 days ago

Full Time RemoteTeam 1,001-5,000Since 2006H1B Sponsor

Company Site LinkedIn

• Hire, onboard, and grow your SRE team; coach career development and performance • Foster a psychologically safe, blameless culture that favors learning over blame and emphasizes engineering over firefighting • Ensure a sustainable operational coverage; monitor on-call health and workload • Track and cap toil so engineers spend the majority of time on project work that reduces future toil • Establish and operationalize SLIs/SLOs and error budgets with service owners; run reliability reviews and hold teams accountable to outcomes • Define reliability standards, runbooks, readiness checklists, and alerting patterns (including SLO-based alerting) • Partner with product/EMs to align reliability work with service goals and customer experience, not as a gate but as an enabler • Ensure incident response readiness; lead/coordinate major incidents; drive fast, high-quality postmortems and systemic fixes • Measure MTTR, change failure rate, SLO posture, and repeat-incident reduction; publish learning broadly • Lead software-first reliability investments: observability, deployment safety (canary/blue-green), resilience testing/chaos, and self-service guardrails • Drive platform improvements (IaC, CI/CD, Kubernetes) and internal tools that scale operations and improve developer experience

Azure Grafana Kubernetes Prometheus Terraform

View details: Manager, Site Reliability Engineering

Czechia

Apply

Job Closed

Senior DevOps Engineer – Full Time Contractor

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Senior Site Reliability Engineer

Cloud Operations Engineer

Senior Site Reliability Engineer

Manager, Site Reliability Engineering