Job Closed

This listing is no longer active.

Senior Site Reliability Engineer

DevOps EngineerDevOps EngineerOtherRemoteSeniorTeam 5,001-10,000H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

94 days ago

Salary

0

Seniority

Senior

No structured requirement data.

Job Description

Senior Site Reliability Engineer

Akamai Technologies

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description Do you enjoy collaborating with teams to solve complex challenges? Do you enjoy solving large scale distributed content delivery challenges? Join our critical Platform and Reliability Engineering Team! The Platform & Reliability Engineering team is responsible for defining, measuring, & optimizing the key performance indicators of delivery customers. Your expertise in software engineering and systems administration will be instrumental in building robust and resilient infrastructure. In this role, you'll play a pivotal role in shaping the future of our products. You'll collaborate with product teams from the earliest stages of development to ensure the reliability, scalability, and performance of our systems. You'll define key performance indicators (KPIs), advance the state of monitoring, alerting and operational responses, and investigate complex performance issues. As a Senior Site Reliability Engineer, you will be responsible for: - Working on Internet technologies to improve the performance, availability, and scalability of large distributed content delivery systems. - Engaging in collaborative efforts with cross-functional teams, including Product & engineering, to define and establish measurable SLIs and SLOs. - Providing technical expertise and feedback to ensure system designs and implementations meet reliability and performance requirements. - Monitoring platform availability and performance, debug issues by leveraging data analysis skills and implement corrective actions to avoid recurrence. - Developing and implementing automation solutions to improve operational efficiency and reduce toil. - Participating in design reviews and providing technical guidance to ensure designs meet requirements for scalability, performance, and robustness. Qualifications - 5 years of relevant experience and a Bachelor's degree in Computer Science or its equivalent. - Familiarity with Internet protocols (DNS/HTTP/TLS/TCP etc.). - Experience utilizing Oracle SQL for data integrity checks, root cause analysis of data anomalies, and the development of data reports. - Proficiency in Scripting languages (Python, bash, JavaScript etc.). - Experience with monitoring and alerting systems (e.g., Prometheus, Grafana, ADBMS, Datadog), including metric collection, alerting, dashboarding, and troubleshooting. - Fluency working in a UNIX/Linux computing environment. Benefits - Flexible working options through FlexBase, allowing 95% of employees to choose to work from home, the office, or both. - Opportunities to grow, flourish, and achieve great things. - Benefits surrounding all aspects of your life, including health, finances, family, work-life balance, and personal endeavors. - Industry-leading benefits including healthcare, 401K savings plan, company holidays, vacation (PTO), sick time, parental leave, and an employee assistance program. Company Description Akamai powers and protects life online. Leading companies worldwide choose Akamai to build, deliver, and secure their digital experiences, helping billions of people live, work, and play every day. With the world's most distributed compute platform—from cloud to edge—we make it easy for customers to develop and run applications, while we keep experiences closer to users and threats farther away.

Job Requirements

  • 5 years of relevant experience and a Bachelor's degree in Computer Science or its equivalent.
  • Familiarity with Internet protocols (DNS/HTTP/TLS/TCP etc.).
  • Experience utilizing Oracle SQL for data integrity checks, root cause analysis of data anomalies, and the development of data reports.
  • Proficiency in Scripting languages (Python, bash, JavaScript etc.).
  • Experience with monitoring and alerting systems (e.g., Prometheus, Grafana, ADBMS, Datadog), including metric collection, alerting, dashboarding, and troubleshooting.
  • Fluency working in a UNIX/Linux computing environment.

Benefits

  • Flexible working options through FlexBase, allowing 95% of employees to choose to work from home, the office, or both.
  • Opportunities to grow, flourish, and achieve great things.
  • Benefits surrounding all aspects of your life, including health, finances, family, work-life balance, and personal endeavors.
  • Industry-leading benefits including healthcare, 401K savings plan, company holidays, vacation (PTO), sick time, parental leave, and an employee assistance program.

Related Categories

Related Job Pages

More DevOps Engineer Jobs

DevOps Engineer94 days ago
Full TimeRemoteTeam 1,001-5,000Since 1964H1B No Sponsor

• Configure, and administer cloud and hybrid infrastructure and applications. • Define configuration management and deployment strategies. • Ensure availability and stability of production environments both cloud and on premises. • Develop, and design software automation and scripts to orchestrate cloud and virtualization technologies with defined scope, schedule and expectations with a focus on Operations. • Provides subject matter expert technical support to customers using Cloud Platform products, solutions and APIs with a focus on Operations. • Design and implement DevSecOps capabilities (e.g. observability, continuous monitoring, tractability). • Work with the development teams in resolving software and other related problems associated with the cloud deployment infrastructure and the code deployed onto platforms such as Database and/or Middleware. • Manage day-to-day deployment and maintenance activities. • Analyze and tune information systems performances. • Security administration and monitoring, in adherence to the security rules.

Belgium
Job Closed
Five9 logo

Senior Site Reliability Engineer, SRE

Five9

Helping Companies Bring Joy to CX.

DevOps Engineer94 days ago
Full TimeRemoteTeam 1,001-5,000Since 2001H1B Sponsor

• Focus on modernizing application deployments • Tackle technical debt and enhance legacy Linux-based systems • Create internal tools to improve system management and automate operational tasks • Build out observability stack • Establish meaningful SLIs and achieve reliability targets

India
Job Closed
OtherRemoteTeam 501-1,000Since 1998H1B Sponsor

Position Summary:  Telestream is a seeking a DevOps Engineer to ensure seamless collaboration between our Software Development and IT Operations teams.  Your extensive experience and technical expertise in CI/CD pipelines, infrastructure automation and cloud platforms, will allow you to excel in this role.

United States
ITRex Group logo

Senior Build & Release Engineer

ITRex Group

We turn AI ambition into working systems — Generative AI, data, and everything in between

DevOps Engineer94 days ago
Full TimeRemoteTeam 201-500Since 2009H1B No Sponsor

• Lead the design, architecture, and management of CI/CD pipelines using GitHub Actions (and similar tools), ensuring fast, reliable, and reproducible software delivery • Implement and enforce test-driven deployment systems, integrating automated testing, validation, and monitoring to maintain code quality and accelerate feedback cycles • Containerize applications and microservices with Docker, optimize image builds, and manage deployment pipelines for distributed environments • Oversee the build, packaging, and publishing lifecycle for JavaScript, TypeScript, and C++ packages, including versioning, semantic tagging, and NPM or internal registry publication • Develop and maintain cross-platform build pipelines using CMake, NPM or equivalent tools, ensuring consistent compilation and release workflows across web, desktop, and mobile • Automate end-to-end release processes, including tagging, building, signing, and distributing mobile, web, and desktop applications • Define and manage Infrastructure as Code (IaC) to provision and maintain reliable, scalable, and secure infrastructure environments • Collaborate closely with development, QA, and operations teams to troubleshoot deployment issues, optimize performance, and improve release reliability • Continuously improve observability and feedback loops, leveraging monitoring and alerting systems to maintain operational excellence

Poland
Job Closed