GoDaddy is a web services platform that helps individuals and businesses worldwide start, grow, and manage their online presence. GoDaddy employs team members a

Site Reliability Engineer

DevOps Engineer, Full Time, Remote, Mid Level

Location

India

Posted

5 days ago

Seniority

Mid Level

Job Description

Role Description

The Senior Site Reliability Engineer – Datacenter is a technical leader responsible for designing, delivering, and optimising enterprise-scale data centre (DC) infrastructure. This role focuses on building scalable, secure, and efficient systems that support business growth and sustainability goals. Working closely with multi-functional teams, you will drive infrastructure standards, improve reliability and performance, and identify opportunities for innovation and cost optimisation. You will help shape long-term infrastructure strategy, including capacity planning and adoption of new technologies.

As a technical lead, you will guide medium to large-scale data centre projects end-to-end, aligning solutions with business needs and industry-standard methodologies. You will contribute to decisions around architecture, resilience, and scalability while supporting teams in delivering high-quality outcomes.

In this role, you will also lead complex, multi-team initiatives, ensuring smooth execution across planning, implementation, and continuous improvement. You will assist in managing risks, supporting compliance with security and regulatory standards, and driving operational excellence across global environments.

- Build scalable and efficient server infrastructure solutions that meet current and future business needs, with a focus on cost optimisation and innovation.
- Partner with multi-functional teams to transform business and product requirements into clear, actionable technical specifications.
- Evaluate, select, and manage server hardware and vendors, including maintaining supplier relationships and procurement documentation (e.g., BOMs).
- Plan and monitor capacity (power, space, cooling, network), analyse utilisation trends, and drive improvements in efficiency and rack/data centre usage.
- Ensure infrastructure security and compliance by applying established guidelines, partnering with security teams, and addressing vulnerabilities proactively.
- Lead operational excellence through troubleshooting, documentation, reporting, collaborator communication, automation, and continuous process improvement.

Qualifications

- 8+ years of demonstrated experience in at least two areas of data centre infrastructure (e.g., power, cooling, cabling, or physical build) with a solid understanding of Layer 1 environments.
- Working knowledge of power and capacity concepts (e.g., kW, amps, PUE) and server hardware fundamentals, including components and performance considerations.
- Proficiency in core infrastructure and system concepts, such as storage, RAID, CPU, memory, operating systems, file systems, and networking basics.
- Hands-on experience with hardware troubleshooting, diagnostics, and system configuration, including OS installs and core services (e.g., DNS, DHCP, SSH).
- Ability to build strong relationships across multiple functions and communicate clearly with both technical and non-technical collaborators.
- Comfortable with scripting or automation using at least one language (e.g., Python, Bash, PowerShell, or Java).
- Experience using AI-powered tools (e.g., GitHub Copilot, Claude) to automate tasks, improve efficiency, or support infrastructure build and analysis.
- Ability to contribute to process improvements, automation, and documentation to improve team efficiency and consistency.

Requirements

- Exposure to advanced data centre and networking environments, including deeper knowledge of performance optimisation and scalability.
- Familiarity with BIOS, BMC, and RAID controller configurations and managing hardware at a low level.
- Experience supporting enterprise-scale customers or environments, with an understanding of reliability and operations guided by service agreements.
- Flexibility to travel between data centre locations (if required) and support distributed infrastructure operations.

Benefits

- Paid time off.
- Retirement savings (e.g., 401k, pension schemes).
- Bonus/incentive eligibility.
- Equity grants.
- Participation in our employee stock purchase plan.
- Competitive health benefits.
- Other family-friendly benefits including parental leave.

More DevOps Engineer Jobs

DevOps – Pleno, Sr

Seekerh

Connecting Talent to Your Business's Challenges

DevOps Engineer, 5 days ago

Full Time, Remote, Team 1-10, Since 2019, H1B No Sponsor

• Ensure the availability, scalability, and resilience of cloud environments.
• Work on automation, CI/CD, advanced troubleshooting, monitoring, incident management, and continuous improvement.
• Implement IaC, support development teams, and propose architectural improvements.

Ansible, AWS, Cloud, Docker, Grafana, Jenkins, Kubernetes, Linux, Prometheus, Python, Terraform

Brazil

Senior SRE

Raízen

Reshaping the future of energy

DevOps Engineer, 5 days ago

Full Time, Remote, Team 10,001+, Since 2011, H1B No Sponsor

• Support distributed environments on Azure and AWS, working directly on the advancement of corporate observability using tools such as Grafana, Prometheus, OpenTelemetry, Loki, Tempo and Zabbix;
• Troubleshoot critical environments, perform incident analysis, and define dashboards, alerts and operational metrics;
• Work closely with development, architecture and infrastructure teams to ensure application quality, availability and performance;
• Drive the evolution of automation practices, GitOps, monitoring of Kubernetes environments (AKS/EKS) and continuous platform improvement.

AWS, Azure, Cloud, DNS, Docker, Grafana, Kubernetes, Linux, Prometheus, Python

Brazil

Lead SRE – Observability

athenahealth

We provide network-enabled services, mobile apps, and data-driven insights to hospitals and medical organizations.

DevOps Engineer, 5 days ago

Full Time, Remote, Team 5,001-10,000, Since 1997, H1B Sponsor

• Build and operate scalable observability and telemetry platforms that process logs, metrics, traces, and events across production environments
• Support monitoring, alerting, and instrumentation strategies that improve service visibility and operational insight
• Partner with engineering teams to strengthen telemetry collection and overall observability
• Design resilient, automated infrastructure and platform services that improve reliability, scalability, and efficiency
• Develop Infrastructure as Code and automation solutions that reduce toil and improve consistency
• Lead technical initiatives from architecture through implementation with attention to performance, reliability, security, and maintainability
• Troubleshoot complex production issues involving distributed systems, Linux infrastructure, networking, cloud services, and telemetry pipelines
• Participate in incident response and on-call processes
• Help drive operational excellence, root cause analysis, and continuous improvement
• Mentor engineers on SRE best practices, observability strategy, and scalable systems design
• Contribute to long-term platform strategy and reliability improvements.

AWS, Cloud, Distributed Systems, ElasticSearch, Grafana, Kafka, Linux, Prometheus, Python, Terraform, Go

Massachusetts

$143K - $243K / year

AI Evaluator - Software Expert

Mercor

Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives.

Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows.

Cincinnatus is a legal entity separate from Mercor. While opportunities may be discovered through Mercor's platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus.

Equal Employment Opportunity

Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

DevOps Engineer, 5 days ago

Part Time, Remote, H1B No Sponsor

Role Description

- Evaluate AI-generated artifacts against domain-specific quality rubrics.
- Identify factual, aesthetic, and presentation errors in documents, spreadsheets, and slide decks.
- Provide clear, structured written feedback to improve AI model outputs.
- Apply deep subject-matter expertise to grade outputs for accuracy and rigor.
- Work independently and asynchronously to meet deadlines.

Qualifications

- Must-Have: 5+ years of relevant professional experience in Software / AI / IT / data.
- Native or professional fluency in English.
- Highly proficient in Microsoft Office and Google Workspace, especially Slides.
- Preferred: Advanced degree (Master's or higher) from a reputable institution.

Company Description

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Worldwide

$80 - $120 / hour
