Job Closed

This listing is no longer active.

Peec AI logo
Peec AI

Helping companies get discovered on AI search

SRE / Platform Engineer

DevOps EngineerDevOps EngineerFull TimeRemoteSeniorTeam 11-50Since 2025H1B No SponsorCompany SiteLinkedIn

Location

Europe

Posted

105 days ago

Salary

€100K - €150K / year

Seniority

Senior

Job Description

SRE / Platform Engineer

Peec AI

• Own the reliability, scalability, and performance of Peec AI’s core systems and infrastructure • Design, build, and maintain the tooling, automation, and monitoring that keep our services fast, secure, and highly available • Partner closely with product and engineering teams to ensure new features are reliable, observable, and easy to operate from day one • Develop and refine incident response practices, ensuring issues are triaged quickly and resolved with minimal user impact • Proactively identify and address bottlenecks, single points of failure, and operational inefficiencies across the stack • Champion operational excellence and a culture of reliability, driving best practices across the engineering organization

Job Requirements

  • 5+ years of experience in Site Reliability Engineering, Infrastructure Engineering, or similar roles supporting production systems at scale
  • Deep expertise with Infrastructure as Code tools (Terraform, Pulumi, CloudFormation, etc.)
  • Strong experience with observability platforms (e.g., Datadog, Sentry, Prometheus, Grafana) and incident response tooling (PagerDuty, Incident.io, or similar)
  • Proven proficiency with major cloud platforms (GCP, AWS, or Azure) and modern distributed systems
  • Strong programming and scripting skills (e.g., TypeScript and Python) for automation and tooling
  • A track record of diagnosing complex system problems and implementing robust, long-term solutions
  • Solid understanding of CI/CD, Kubernetes, containerization, networking, databases, and cloud security principles
  • Excellent problem-solving skills, attention to detail, and a strong commitment to operational excellence
  • Bonus Points: Experience supporting AI/ML workloads or data-intensive systems
  • Prior SRE experience in a high-growth startup or globally distributed infrastructure environment
  • Familiarity with zero-downtime migrations, multi-region architectures, or compliance frameworks

Benefits

  • Aggressive equity compensation package
  • Remote working (applicants must be located within ±3 hours of the Berlin (CET) time)

Related Categories

Related Job Pages

More DevOps Engineer Jobs

DevOps Engineer105 days ago
OtherRemoteTeam 51-200Since 2019H1B No Sponsor

• Optimize release deployments and maintain secure cloud infrastructure • Handle day-to-day operations and problem-solving • Ingest new solutions and products from the Build/Automation organization • Use monitoring and logging tools to solve issues • Conduct post-mortem analysis and identify potential issues for improvement • Setup, monitor, and maintain DevOps cloud-based SAAS products and solutions • Maintain security and data privacy and ensure compliance • Work with architects on deployment architecture, security, and CI/CD implementations • Setup and maintain Kubernetes clusters on cloud environments • Analyze and solve operational issues, and respond to incidents • Conduct root cause analysis and implement continuous improvements • Evaluate new technology options and vendor products

United States
Job Closed
DroneUp logo

SRE – Platform Engineer

DroneUp

DroneUp is a leader in drone flight services that transforms organizations using drone technology and delivery solutions. The company develops SaaS platforms that have mobile app t

DevOps Engineer105 days ago

• Broad domain architect for the internal developer platform and all cloud engineering • Drive architecture for tooling or in-house software • Mentor other platform engineers to drive strong engineering practices • Enablement of platform engineering technical capabilities in our internal client teams in software engineering • Peer with the senior architects and engineers in software engineering • Architecture and engineering focused on GCP environment • Architect and oversee GKE cluster operations and workload management • Provide feedback to others and participate in peer reviews / pair programming • Drive the broad adoption of Test Driven Development through designing, development, and debugging unit and integration tests for new and existing infrastructure and code • Continuous curiosity of existing implementations and new technologies and sharing with the team • Practice continuous improvement across all job areas and personally / professionally • Clearly communicate with platform engineering teams and other stakeholders and provide technical direction while doing so • Stay current with platform changes and third-party libraries. • Proactively investigate better solutions for current solutions • An understanding of Open Telemetry and true observability and the difference between it and monitoring and logging • Grow the engineering culture towards a high-performing team • Practice the arts of self-service, least privilege and security by default in all solutions • Define and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets • Lead incident response, including on-call rotations, root cause analysis, and post-mortem reviews • Implement and optimize monitoring, alerting, and observability systems for system reliability • Collaborate on capacity planning and performance optimization to ensure high availability • Other duties as assigned

United States
$125K - $150K / year
OtherRemoteTeam 1,001-5,000H1B No Sponsor

• Design, implement, and maintain CI/CD pipelines to support automated build, test, and deployment workflows • Partner with engineering teams to streamline release processes and improve deployment reliability • Implement and manage monitoring, logging, and alerting solutions to ensure system health and performance • Define and maintain cost monitoring and alerting strategies to optimize cloud spend and prevent unexpected usage • Automate infrastructure provisioning and configuration using Infrastructure as Code (IaC) • Troubleshoot production issues and lead root cause analysis efforts • Establish DevOps best practices around reliability, security, and operational excellence • Continuously evaluate tools and processes to improve scalability, availability, and efficiency • Mentor junior engineers and contribute to a strong DevOps culture

United States
Job Closed
OtherRemoteTeam 1,001-5,000H1B No Sponsor

• Lead and manage SRE operations supporting 24/7/365 availability • Own uptime, SLA compliance, SLIs, SLOs, error budgets, MTTR, and incident trends • Oversee incident management, on-call rotations, and post-incident reviews • Lead FinOps practices across hybrid environments • Drive right-sizing, optimization, and elimination of infrastructure waste • Establish cost visibility, allocation, and reporting • Define and maintain observability standards across hybrid environments, such as AWS, Azure and Vsphere • Utilize platforms such as Coralogix, Open Telemetry, and FireHydrant • Champion GitOps practices and pull request governance • Lead Terraform-based infrastructure automation initiatives • Partner across Product, Engineering, Infrastructure, Finance, and Support teams • Lead, mentor, and develop a high-performing SRE team

United States
Job Closed