Job Closed

This listing is no longer active.

Playson logo
Playson

BIG PLAY IS ON

Principal Site Reliability Engineer – Platform Tribe

DevOps EngineerDevOps EngineerFull TimeRemoteLeadTeam 201-500Since 2012H1B No SponsorCompany SiteLinkedIn

Location

Ukraine

Posted

114 days ago

Salary

0

Seniority

Lead

Job Description

Principal Site Reliability Engineer – Platform Tribe

Playson

• Manage day-to-day alerts, system checks, and issue escalation as necessary. • Provide 24x7 on-call support for critical SaaS events. • Document issues and remediation steps. • Proactively create monitors within the EKS/K8s ecosystem. • Deploy to EKS/K8s cluster using Terraform and Helm/Flux. • Enhance infrastructure health by implementing checks and scripts to address known issues. • Maintain and develop deployment code. • Implement/integrate new technologies into our Cloud Infrastructure. • Collaborate with other teams to provide top-notch support and assistance. • Prioritize customer focus in planning deployments/updates, ensuring minimal impact. • Conduct RCA and take necessary corrective actions to prevent issue recurrence. • Assign alert-related actions to the appropriate team after investigation. • Handle support requests for environment-specific actions.

Job Requirements

  • Proficiency in Kubernetes (deployment, scaling, troubleshooting)
  • Experience with configuration management tools like FluxCD/ArgoCD
  • Strong experience with issue processing (RCA, Postmortems)
  • Familiarity with AWS, Terraform, Docker, CI/CD
  • Experience with monitoring tools like DataDog, Prometheus, Grafana, and logging solutions like Elasticsearch, Logstash, and Kibana (ELK Stack) or AWS CloudWatch
  • Strong understanding of networking concepts and protocols
  • Proficiency in at least one scripting language (e.g., Python, NodeJS, Go)
  • Proficiency in Git or other version control systems
  • Familiarity with incident response and management tools like PagerDuty, Opsgenie, or VictorOps
  • Ownership, proactiveness, persistence, and passion for maintaining a high-traffic online platform.

Benefits

  • Competitive Salary and annual performance/salary reviews
  • Realistic and transparent Bonus system (15-20%), paid quarterly
  • Unlimited paid vacation leave & paid sick leave
  • Flexible work schedule to accommodate your needs
  • 100% Remote
  • Medical Insurance for you +1
  • Financial Support for Life Events & Extended Parental Leave
  • Paid professional development courses and trainings
  • B2B contracts

Related Categories

Related Job Pages

More DevOps Engineer Jobs

OtherRemoteTeam 51-200H1B No Sponsor

• Own and evolve our AWS infrastructure across ECS/Fargate and EKS (Kubernetes), including RDS/Postgres, S3, IAM, and VPC. • Own and evolve CI/CD pipelines (GitHub Actions, Argo CD) and Infrastructure-as-Code (Terraform). • Set up alerting, log aggregation, and performance dashboards (e.g., CloudWatch, Datadog, or Open Telemetry). • Implement secure-by-default practices; support SOC 2 / HIPAA readiness. • Write Python/Bash scripts for backups, monitoring, deployment hooks, etc. • Own Postgres operational concerns including tuning, access control, backups, and zero-downtime migrations. • Design and own our EKS platform, leading workload migrations, defining cluster architecture, autoscaling strategy (Karpenter), GitOps workflows (Argo CD), and reliability standards. • Make and document infrastructure decisions; define best practices for reliability, security, and cost.

Virginia

Staff Member, Systems Reliability Engineering

MEMX

MEMX is an exchange operator and market technology platform dedicated to delivering transparent, efficient, and cost-effective securities trading services designed to revolutionize

DevOps Engineer114 days ago

• Responsible for providing support of MEMX exchange platforms including on-call, respond to incidents and support triaging the issue • Help isolate and resolve unplanned system outages • Work with cross-functional teams to support the availability of all MEMX exchange platforms. This includes market operations, systems, networking and development teams • Help improve operational processes (such as deployments and upgrades) by identifying areas which need improvement • Document every action so that the findings turn into repeatable actions which eventually can be automated • Debug issues as they arise, across the different services and interaction points • Enhance monitoring and alerting based on symptoms • Run nightly processes that are essential to exchange operations. We automate as much as possible but there are processes that require a level of manual input and attention

Sri Lanka
Espresso Systems logo

DevOps Engineer

Espresso Systems

something's brewing ☕️

DevOps Engineer114 days ago
OtherRemoteTeam 11-50H1B Sponsor

Espresso Systems builds foundational infrastructure to power tomorrow’s internet, where digital assets are able to move across chains as easily as info flows across the web. It’s the lead developer of the Espresso Network, the first base layer built from the ground up to provide rollups with the functionality they’ve always needed but never had: fast, secure finality for users’ transactions and seamless composability with other rollups. Rollups today rely on infrastructure that wasn’t built with them in mind, resulting in their isolation and the fragmentation of users, developers and liquidity within the Ethereum ecosystem. This prevents rollups from achieving the seamless composability that’s essential to web3's vision. Espresso solves this by providing rollups with secure, real-time visibility into what’s happening on all integrated chains, including their own, empowering the apps to execute crosschain interactions immediately without waiting for slow settlement or trusting centralized sequencers. The Espresso Network is currently live on mainnet in its initial release. In addition to fast confirmations and data availability, rollups seeking further decentralization can also opt to use the Espresso Network as a decentralized sequencer. As the first purpose-built base layer for rollups, Espresso supports a fast-growing ecosystem of interconnected chains regardless of tech stack, VM, or settlement layer—from established chains like Celo and ApeChain to emerging app-specific chains attracting their first users. At Espresso Systems, we work with leading teams developing rollups and innovating around interoperability, including Offchain Labs, Polygon, Caldera, AltLayer, Cartesi, Across, Hyperlane, and beyond—all united in our mission to build a unified, composable ecosystem where rollups are free to achieve their full potential regardless of where they choose to settle. The Espresso team comprises a diverse and passionate group of contributors from around the world. We are developers, designers, and researchers who have contributed in academia, open-source communities, policymaking, and beyond. We have raised roughly $60 million from leading investors in technology and crypto, including a16z, Greylock Partners, Electric Capital, Sequoia Capital, and Polychain Capital. As a DevOps Engineer, you will assist the development team in building infrastructure to support production of the sequencer software, as well as build tooling for the deployment of test networks. Responsibilities Monitoring and management of cloud environments (AWS, Azure, GCP) Assisting with CI/CD pipelines and general code management practices Develop and maintenance of tooling for deployment of Espresso Systems services Requirements Benefits Fully remote with flexible hours Work alongside the brightest minds in the crypto space Competitive salary + equity package Regular team off-sites to international locations Unlimited vacation policy Top-tier health, dental, and vision coverage for US employees

United States
Zenith Health logo

Build Your Own Job Description

Zenith Health

Zenith Health is building the platform to transform real pregnancy experiences into evidence – establishing a foundation for data-driven decisions, improved care, and better outcomes. Our mission is for every pregnancy health question to have an answer informed by real evidence, not guesswork or anecdotes.

DevOps Engineer114 days ago

About the company At Zenith Health, we're changing the way that real-world pregnancy data is captured and used to improve maternal and infant health outcomes. We're building a platform that empowers pregnant individuals to access existing information and evidence on health outcomes in pregnancies, as well as contribute their experiences to a growing body of evidence. There are  millions of real-world data points being generated every day across the 6+ million pregnancies happening each year — ranging from medication safety to lifestyle and dietary choices — and we believe these hold the key to advancing maternal and infant health outcomes. Our aim is to bridge key existing information gaps by better leveraging this real-world data, and improve the maternal health evidence base to drive more informed decision-making in pregnancy. We’re passionate about transforming the way pregnancy and maternal health data is captured, understood, and used to improve patient health, and we are backed by Shaper Capital on our mission. By joining Zenith, you’ll be part of a collaborative, passionate team of people who are smart, nice, and can get things done. If you’re driven by a desire to make a tangible impact on patients and medical research alike, and are excited to be part of a high-growth startup, we’d love to hear from you. About the role If you’re excited by our mission and want to build with us, but don’t see a role that’s the perfect fit for you – submit your interest here and tell us what you’re exceptional at. We are looking to expand our team across software engineering, data science, product, partnerships, and marketing in the near future, so if Zenith’s mission sounds compelling, we’d love to connect. We’re eager to work with people who feel passionately about our problem space and align with our values, so regardless of your functional area of expertise let us know below how you can contribute!

New York