Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world.

Site Reliability Engineer

DevOps EngineerDevOps EngineerFull Time Remote Mid Level

Location

Poland + 1 more

Posted

72 days ago

Salary

PLN154.5K - PLN305.5K / year

Seniority

Mid Level

No structured requirement data.

Job Description

Role Description We are looking for a highly motivated, self-driven, and dedicated Site Reliability Engineer possessing hands-on experience with: - Experience building and running reliable and fault-tolerant production cloud systems at scale on AWS. - Coding infrastructure automation with Terraform, Terragrunt, Packer, CI/CD, and knowing how to use configuration management systems like Ansible. - Hands-on experience with Linux/Unix operating systems internals, file systems, system tuning, administration, and networking. - Deep experience in microservice technologies, container orchestration, and continuous deployment (Kubernetes, Docker, Helm, GitOps with Flux). - Experience in designing, building, maintaining production services, and troubleshooting large-scale distributed systems. - Experience with technologies like Apache Kafka, Apache Storm, Apache Flink, Apache Airflow and Spark, Postgres, Redis, Elasticsearch, Arango, Cassandra. - Experience with observability tools and methodology (monitoring, logging, tracing, SLOs/SLIs) for detecting and diagnosing issues in advance before causing service impact or performance degradation. - Possess strong programming skills in Shell, Python, Golang and/or Ruby. - Deliver efficiently and effectively. - Strong problem-solving and debugging skills with a high sense of ownership. Responsibilities: - Engage in and improve the whole lifecycle of services - from inception and design, through to deployment, operation, and refinement. - Support development of services from planning phase before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews. - Provide technical leadership and guidance to other team members on managing availability and performance of mission critical services, on building automation to prevent problem recurrence, and building automated responses for non-exceptional service conditions. - Maintain services once they are living by measuring and monitoring availability, latency, and overall system health. - Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity. - Capacity planning the growth of cloud infrastructure. - Improve operational processes such as deployments and upgrades. - Manage execution of project priorities, deadlines, and deliverables. - Be on an on-call rotation to respond to incidents that impact platform availability. - Use your on-call shift to prevent incidents from happening. - Experience in incident response, including conducting post-mortems and implementing lessons learned, enhances system reliability. Qualifications - 10+ years of engineering or systems experience. - Experience leveraging cloud architecture, applying site reliability principles, and/or demonstrating sensitivity to operational concerns. - Strong understanding of network design and architecture. - Scaling and managing distributed systems. - Significant experience with monitoring and observability platforms. - Demonstrated ability to debug, fix, and optimize code. - Troubleshooting skills across network, application, and distributed services layers. - The ability to learn quickly and adapt to new technologies is essential. - Excellent communications skills, both verbal and written. Benefits - Health & Wellbeing: We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing. - Personal & Professional Development: We invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have. - Unconditional Inclusion: We are unconditionally inclusive in the way we work and celebrate individual uniqueness.

Related Categories

DevOps Engineer

Related Job Pages

Remote Full-time Jobs (US)More Remote Jobs

More DevOps Engineer Jobs

Senior Software Engineer, DevOps/Infrastructure

Hotel Engine

Innovating business travel with a free-to-use hotel booking platform.

DevOps Engineer72 days ago

Full Time RemoteTeam 201-500Since 2018H1B No Sponsor

Company Site LinkedIn

• Lead the technical evolution of the control plane — not just keep it running, but decide where it goes next. • Design and ship platform primitives (Terraform modules, pipeline templates, account/networking patterns) that feature teams adopt because they're better than the alternative — not because they're mandated. • Partner with embedded infra engineers to identify recurring friction across verticals and turn it into self-serve capability. • Own the boring-but-critical work: AWS Organization hygiene, CI/CD reliability, vendor contracts and integration health, incident response on shared infrastructure. • Mentor across the discipline. We hold infra standups twice a week — that's where you'll teach, learn, and stay aligned with the embedded engineers. • Reduce the SDLC step-function count. Every new approval gate, every "ask infra first" workflow is a tax — your job is to lower it.

AWS Cloud SDLC Terraform

View details: Senior Software Engineer, DevOps/Infrastructure

United States

$121.4K - $168K / year

Apply

DevOps Engineer, GitHub Migration Projects

Atmosera

Solution Enablement, Solution Management, Solution Training - Atmosera is the Apps, Data, and Azure Expert

DevOps Engineer72 days ago

Contract RemoteTeam 51-200H1B No Sponsor

Company Site LinkedIn

• Migrate source code repositories from: GitHub Enterprise Server, BitBucket, or GitLab to GitHub Enterprise Cloud. • Interface with client engineers and leadership to gather requirements and provide updates. • Build and document a migration path, including tooling and automation. • Collaborate with project management to plan and execute migration waves. • Analyze existing GitHub Enterprise Server configurations and repository metadata to ensure accurate migration. • Troubleshoot issues related to file sizes, compatibility, network connectivity, or permissions during migration. • Build and maintain scripts to support acceleration of migration activities, interfacing with required API from the source control management systems. • Build GitHub Actions workflows to support migration efforts.

Cloud Python

View details: DevOps Engineer, GitHub Migration Projects

Costa Rica

Apply

Job Closed

DevOps – Mid/Senior

Cappta

Alcance novos horizontes conectando seu negócio a nossa Plataforma White Label de Tecnologia e Serviços Financeiros 🚀

DevOps Engineer72 days ago

Full Time RemoteTeam 51-200Since 2011H1B No Sponsor

Company Site LinkedIn

• Administration and maintenance of cloud environments; • Management of VMs, networking, security, DNS, certificates, and load balancers; • Troubleshooting production environments; • Support for compliance and maintenance of technical requirements related to PCI DSS; • Implementation and adjustment of cloud security controls; • Management of containerized environments (Docker, Kubernetes, Rancher); • Support for CI/CD pipelines and routine automation; • Performance and availability monitoring; • Automation via scripts and/or Infrastructure as Code; • Technical support to internal teams ensuring infrastructure best practices; • Assist in security and development team processes.

Cloud DNS Docker Kubernetes Linux MongoDB MySQL Puppet Python Terraform

View details: DevOps – Mid/Senior

Brazil

Apply

Job Closed

Senior DevOps Engineer

Cakto

DevOps Engineer72 days ago

Full Time RemoteTeam 51-200Since 2023H1B No Sponsor

Company Site LinkedIn

• Design and maintain scalable, resilient cloud architecture • Build CI/CD pipelines with a focus on performance and reliability • Implement infrastructure as code with standards and governance • Ensure full observability (logs, metrics, tracing) • Respond to critical incidents and conduct post-mortems • Optimize infrastructure cost and performance • Work closely with the engineering team to evolve the platform

AWS Azure Cloud Google Cloud Platform Grafana Kubernetes Linux Prometheus Terraform

View details: Senior DevOps Engineer

Brazil

Apply

Site Reliability Engineer

Job Description

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Senior Software Engineer, DevOps/Infrastructure

DevOps Engineer, GitHub Migration Projects

DevOps – Mid/Senior

Senior DevOps Engineer