GoDaddy is a web services platform that helps individuals and businesses worldwide start, grow, and manage their online presence. GoDaddy employs team members a
Site Reliability Engineer
Location
India
Posted
5 days ago
Seniority
Mid Level
Job Description
Site Reliability Engineer
GoDaddy
Role Description

The Senior Site Reliability Engineer – Datacenter is a technical leader responsible for designing, delivering, and optimising enterprise-scale DC infrastructure. This role focuses on building scalable, secure, and efficient systems that support business growth and sustainability goals. Working closely with multi-functional teams, you will drive infrastructure standards, improve reliability and performance, and see opportunities for innovation and cost optimisation. You will help shape long-term infrastructure strategy, including capacity planning and adoption of new technologies.

As a technical lead, you will guide medium to large-scale data centre projects end-to-end, aligning solutions with business needs and industry standard methodologies. You will contribute to decisions around architecture, resilience, and scalability while supporting teams in delivering high-quality outcomes.

In this role, you will also lead complex, multi-team initiatives, ensuring smooth execution across planning, implementation, and continuous improvement. You will assist in managing risks, supporting compliance with security and regulatory standards, and driving operational excellence across global environments.

- Build scalable and efficient server infrastructure solutions that meet current and future business needs, with a focus on cost optimisation and innovation.
- Partner with multi-functional teams to transform business and product requirements into clear, actionable technical specifications.
- Evaluate, select, and manage server hardware and vendors, including maintaining supplier relationships and procurement documentation (e.g., BOMs).
- Plan and monitor capacity (power, space, cooling, network), analyse utilisation trends, and drive improvements in efficiency and rack/datacentre usage.
- Ensure infrastructure security and compliance by applying established guidelines, partnering with security teams, and addressing vulnerabilities proactively.
- Lead operational excellence through troubleshooting, documentation, reporting, collaborator communication, automation, and continuous process improvement.

Qualifications

- 8+ years of demonstrated experience in at least two areas of data centre infrastructure (e.g., power, cooling, cabling, or physical build) with a solid understanding of Layer 1 environments.
- Working knowledge of power and capacity concepts (e.g., kW, amps, PUE) and server hardware fundamentals, including components and performance considerations.
- Proficiency in core infrastructure and system concepts, such as storage, RAID, CPU, memory, operating systems, file systems, and networking basics.
- Hands-on experience with hardware troubleshooting, diagnostics, and system configuration, including OS installs and core services (e.g., DNS, DHCP, SSH).
- Ability to build strong relationships across multiple functions and communicate clearly with both technical and non-technical collaborators.
- Comfortable with scripting or automation using at least one language (e.g., Python, Bash, PowerShell, or Java).
- Experience using AI-powered tools (e.g., GitHub Copilot, Claude) to automate tasks, improve efficiency, or support infrastructure build and analysis.
- Ability to contribute to process improvements, automation, and documentation to improve team efficiency and consistency.

Requirements

- Exposure to advanced data centre and networking environments, including deeper knowledge of performance optimisation and scalability.
- Familiarity with BIOS, BMC, and RAID controller configurations and managing hardware at a low level.
- Experience supporting enterprise-scale customers or environments, with an understanding of reliability and operations guided by service agreements.
- Flexibility to travel between data centre locations (if required) and support distributed infrastructure operations.

Benefits

- Paid time off.
- Retirement savings (e.g., 401k, pension schemes).
- Bonus/incentive eligibility.
- Equity grants.
- Participation in our employee stock purchase plan.
- Competitive health benefits.
- Other family-friendly benefits including parental leave.
More DevOps Engineer Jobs
• Ensure the availability, scalability, and resilience of cloud environments.
• Work with automation, CI/CD, advanced troubleshooting, monitoring, incident management, and continuous improvement.
• Implement IaC, support development teams, and propose architectural improvements.
• Support distributed environments on Azure and AWS, working directly on the advancement of corporate observability using tools such as Grafana, Prometheus, OpenTelemetry, Loki, Tempo and Zabbix.
• Troubleshoot critical environments, perform incident analysis, and define dashboards, alerts and operational metrics.
• Work closely with development, architecture and infrastructure teams to ensure application quality, availability and performance.
• Drive the evolution of automation practices, GitOps, monitoring of Kubernetes environments (AKS/EKS) and continuous platform improvement.
Lead SRE – Observability
athenahealth
We provide network-enabled services, mobile apps, and data-driven insights to hospitals and medical organizations.
• Build and operate scalable observability and telemetry platforms that process logs, metrics, traces, and events across production environments.
• Support monitoring, alerting, and instrumentation strategies that improve service visibility and operational insight.
• Partner with engineering teams to strengthen telemetry collection and overall observability.
• Design resilient, automated infrastructure and platform services that improve reliability, scalability, and efficiency.
• Develop Infrastructure as Code and automation solutions that reduce toil and improve consistency.
• Lead technical initiatives from architecture through implementation with attention to performance, reliability, security, and maintainability.
• Troubleshoot complex production issues involving distributed systems, Linux infrastructure, networking, cloud services, and telemetry pipelines.
• Participate in incident response and on-call processes.
• Help drive operational excellence, root cause analysis, and continuous improvement.
• Mentor engineers on SRE best practices, observability strategy, and scalable systems design.
• Contribute to long-term platform strategy and reliability improvements.
AI Evaluator - Software Expert
Mercor
Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives.

Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows.

Cincinnatus is a legal entity separate from Mercor. While opportunities may be discovered through Mercor's platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus.

Equal Employment Opportunity

Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.
Role Description

- Evaluate AI-generated artifacts against domain-specific quality rubrics.
- Identify factual, aesthetic, and presentation errors in documents, spreadsheets, and slide decks.
- Provide clear, structured written feedback to improve AI model outputs.
- Apply deep subject-matter expertise to grade outputs for accuracy and rigor.
- Work independently and asynchronously to meet deadlines.

Qualifications

- Must-Have: 5+ years of relevant professional experience in Software / AI / IT / data.
- Native or professional fluency in English.
- Highly proficient in Microsoft Office and Google Workspace, especially Slides.
- Preferred: Advanced degree (Master's or higher) from a reputable institution.

Company Description

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.



