Your Single Backup and Data Management Platform for Cloud, Virtual and Physical
Senior Platform Engineer – Cloud Workloads
Location
California
Posted
3 days ago
Salary
$172.8K - $320.9K / year
Seniority
Senior
Job Description
Veeam Software
- Design, build, and maintain observability pipelines using the Elastic Stack (Elasticsearch, Kibana, Fleet) across Azure and AWS workloads
- Develop and own SLO/SLI dashboards and error budget reporting for BaaS platform services
- Respond to and lead incident response for distributed, multi-tenant cloud workloads; own runbook creation, maintenance, and continuous improvement
- Build and refine proactive support tooling, including pattern analysis, tenant correlation dashboards, and baseline deviation alerting, to reduce reactive support burden
- Manage and maintain Elastic Fleet agent policies, enrollment health, and log streaming pipelines across Azure and AWS worker fleets
- Partner with SRE, R&D, and Proactive Support teams to close observability gaps, including tenant identification workflows and admin portal integrations
Job Requirements
- 5+ years of experience in cloud platform engineering, SRE, or infrastructure roles supporting commercial SaaS products
- Deep hands-on experience with the Elastic Stack: building dashboards, writing KQL/Query DSL, and managing Fleet
- Proven experience operating and troubleshooting distributed, multi-tenant workloads on Azure and/or AWS
- Strong understanding of Azure cloud services: AKS, Entra ID, Key Vault, Service Bus, Cosmos DB, Private Endpoints, etc.
- Experience with incident response in production cloud environments, including runbook development and post-incident review
- Experience with IaC tools (Azure Bicep, Terraform) and CI/CD pipelines (Azure DevOps, GitHub Actions)
- Strong scripting skills in Bash, Python, or PowerShell
- Ability to work cross-functionally with SRE, product, and customer-facing support teams
Benefits
- Unlimited paid time off, 12 paid holidays including 4 global VeeaMe Days for self-care and 24 paid volunteer hours annually through Veeam Cares
- Paid parental leave: 8 weeks for all parents, 16 weeks for birthing parents
- Medical, dental, and vision coverage starting on your first day
- Mental health support, therapy sessions, and digital wellness tools via our Employee Assistance Program
- 401(k) retirement plan with company matching contributions
- Fertility, adoption, and surrogacy support through Maven, plus paid volunteer time
- AirVet: 24/7 virtual veterinary care at no cost
- Legal services, identity protection, and supplemental health insurance options
- Tax-advantaged spending accounts for healthcare, dependent care, and commuting
- Opportunities to learn and grow through on-demand libraries (LinkedIn Learning, O’Reilly), mentoring, workshops, and learning events like our annual Global Day of Learning
More Platform Engineer Jobs
- You'll own the systems every other team at Phaidra depends on for ingestion, storage, serving, and orchestration.
- Design and build scalable components for the data platform that enable high-throughput data ingestion and processing.
- Design and develop systems to store and serve batch data for analytics.
- Contribute to the design and implementation of API services and scalable event-driven applications that power the product backend.
- Design clear, extensible software interfaces for internal consumers and maintain a high release-quality bar.
- Design and optimize data storage and retrieval mechanisms for high throughput, security, and ease of access.
- Own and operate your services in production, including releases, deployments, and on-call rotations, meeting Phaidra's high bar for operational excellence.
- Lead cross-functional initiatives collaborating with engineers, product managers, and TPMs across teams.
- Mentor your peers and be a technical role model on the team.
Platform Performance Engineer (m/f/x) with an AI-first mindset
Dynatrace
Dynatrace is a global application performance management software firm and a former member of Compuware. As an employer, the company is in support of helping it
Your role at Dynatrace
Join the Quality, Security, and Privacy Team at Dynatrace. Our team plays a critical role in delivering a high-quality product experience while ensuring our customers' data is safer with us than anywhere else. We achieve this by embedding performance best practices and analysis workflows throughout the software development lifecycle. We’re looking for Platform Performance Engineers with an AI-first mindset: engineers who don’t just analyze systems, but continuously look for ways to automate, augment, and scale their work using AI and intelligent agents.
What you'll do
- Partner with engineering teams to monitor, analyze, and optimize performance across complex distributed systems.
- Leverage AI tools and agent-based workflows to accelerate root cause analysis, identify patterns, and reduce manual toil.
- Drive consistency and excellence in performance practices by coaching teams and promoting scalable, automated approaches.
- Agentify performance workflows, from regression detection to anomaly analysis, to enable continuous, self-improving systems.
- Integrate automated performance testing and regression detection into CI/CD pipelines.
- Dogfood the Dynatrace platform to uncover performance issues and provide data-driven, actionable insights.
- Contribute to internal tooling, playbooks, and knowledge sharing to help teams resolve issues faster and more autonomously.
What will help you succeed
- Passion for performance optimization, distributed systems, and scalability challenges
- An AI-first mindset: you actively explore how AI can accelerate engineering workflows and reduce manual effort
- Curiosity and drive to automate and rethink how work gets done
- Strong collaboration skills and the ability to influence engineering teams
- Experience with Java / Spring Boot / Go is a nice to have
- Familiarity with Kubernetes and cloud-native architectures is a plus
- Experience building or extending automation, tooling, or agents (e.g., scripts, pipelines, AI-assisted workflows) is also a plus
Why you will love being a Dynatracer
- Dynatrace is a leader in unified observability and security.
- We provide a culture of excellence with competitive compensation packages designed to recognize and reward performance.
- Our employees work with the largest cloud providers, including AWS, Microsoft, and Google Cloud, and other leading partners worldwide to create strategic alliances.
- You'll get to work at the forefront of innovation with Dynatrace Intelligence, the industry's first agentic operations system. Bringing together deterministic and agentic AI, it helps teams understand what's happening, why it matters, and what to do next, automatically.
- Over 50% of the Fortune 100 companies are current customers of Dynatrace.
- For this position we offer exclusive relocation support to our R&D Headquarters in Linz, ranked among the top seven most sustainable cities in the world, where cutting-edge innovation intersects with an exceptional quality of life and affordable living costs!
Compensation and Rewards
- Due to legal reasons, we are obliged to list a salary range for this position, which is €74,000 up to €92,000 gross per year based on full-time employment (38.5 h/week).
- We’ve listed the salary range for transparency, but if your experience and skills bring unique value, we’d still love to hear from you. Please apply even if you’re outside the range.
Equal Employment Opportunity
Dynatrace provides equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, veteran status, or any other protected characteristic. We actively foster an inclusive workplace that celebrates differences and promotes accessibility, collaboration, and growth for all.
Platform Engineer
Poland and Eastern Europe
Xebia is a global tech company with a journey in CEE that started with two Polish companies, PGS Software and GetInData. We are a team of 1,000+ experts delivering top-notch work across cloud, data, and software. We work on impactful projects across various sectors including fintech, e-commerce, aviation, logistics, media, and fashion, helping clients build scalable platforms and cutting-edge applications. Our clients include notable names like McLaren, Aviva, Deloitte, Spotify, Disney, ING, UPS, Tesco, Truecaller, AllSaints, Volotea, Schmitz Cargobull, Allegro, and InPost.
Role Description
You will be:
- evolving and operating a multi-cloud platform ecosystem across AWS and Azure,
- enabling engineering teams through self-service capabilities, reusable platform components and an improved developer experience,
- defining and governing engineering standards, golden paths and best practices for infrastructure delivery and CI/CD,
- driving automation initiatives to increase reliability, scalability and operational efficiency across the platform landscape,
- partnering with product and engineering teams to solve complex technical challenges and accelerate delivery,
- leading strategic initiatives in areas such as cloud optimisation, security and adoption of emerging technologies, including AI-enabled capabilities.
Qualifications
- 6+ years of experience in Platform Engineering and/or DevOps Engineering within complex multi-cloud environments,
- strong expertise in Terraform, including module development, Terratest, versioning strategies and large-scale state management,
- hands-on experience administering Harness, including delegates, governance, templates, RBAC, Harness IDP and Harness CCM (Cloud Cost Management) in production environments across AWS and Azure,
- practical experience with Backstage, including plugin development, catalog configuration, TechDocs and GitHub integration,
- proficiency in Python and/or Node.js for platform tooling and automation, operational Lambdas,
- solid understanding of multi-cloud networking concepts, including VPC/VNet design, Transit Gateway, hub-and-spoke architectures and split-horizon DNS,
- experience implementing security best practices, including IRSA, Workload Identity, least-privilege IAM and policy-as-code using OPA/Conftest,
- knowledge of FinOps practices, including tagging strategies, budgeting, rightsizing, waste identification and reporting,
- experience designing CI/CD solutions with golden paths, governance guardrails, exception handling and artifact management,
- practical experience using AI-powered assistants (e.g. Claude Code, GitHub Copilot, Cursor) to improve productivity, quality, or decision-making in software delivery, with familiarity with MCP and LLM tooling, including agent workflows, prompt governance and auditability,
- strong communication skills and English proficiency (at least B2 level).
Requirements
- Work from the European Union region and a work permit are required.
Nice to have:
- experience with AWS Control Tower, AWS Organizations and account vending machine patterns,
- experience with Azure Management Groups, Azure Policy and Azure Blueprints,
- experience with Kubernetes platform engineering, including EKS managed add-ons, cluster upgrade automation and Velero,
- familiarity with Crossplane or Cluster API,
- experience with GitOps practices using ArgoCD and/or FluxCD,
- experience provisioning Kubernetes platforms using EKS Blueprints or AWS CDK,
- exposure to GCP platform engineering environments,
- understanding of Java and/or .NET build ecosystems from an operational perspective,
- AWS, Azure and/or GCP certifications at Solutions Architect level or higher.
Recruitment Process
- CV review
- HR call
- Interview
- Team / Client Interview
- Decision
Quality Platform Lead
Branch International
Branch International is a FinTech startup committed to delivering a range of financial services to mobile-enabled customers in emerging markets around the world.
Title: Quality Platform Lead
Location: India
Job Description: Full Time Experienced
Branch Overview
Branch is a leading AI-powered lending fintech with 50M+ downloads across India and Africa. We use alternative data and machine learning to expand financial access for millions of people traditionally excluded from the formal financial system. Founded by the former CEO of Kiva.org and backed by leading investors including Andreessen Horowitz, Visa, and the IFC, Branch combines mission-driven impact with world-class technology and scale.
In India, Branch operates as a regulated digital lending institution and Middle Layer NBFC, building trusted and accessible financial products for millions of customers across the country. Our 250+ member India team is growing rapidly and works across technology, data science, risk, product, and operations to solve high-impact problems at scale. Certified as a Great Place to Work in 2025, Branch offers the opportunity to build meaningful careers while shaping the future of inclusive fintech in one of the world’s fastest-growing digital economies.
About Role:
At Branch, quality has always been the responsibility of the engineers, product managers, and business owners who build and ship the product. We deliberately do not have a traditional QA team gating releases; we believe the accountability for what goes to production must sit with the people building it, not with a separate function checking their work.
As the Quality Platform Lead, you will build and lead a small, high-leverage team that audits production against our stated expectations (functional, behavioral, regulatory, and experiential) and reports honestly on what they find. You will not own release gates. You will not sign off on builds. Engineers and Product retain full ownership of what they ship and when. Your team's mandate is to be the independent, automation-first lens on whether what's running in production actually matches what we said we were building, and to make any drift visible to the people who can fix it.
You will architect and champion a continuous, automated quality-auditing ecosystem. Instead of acting as a release gatekeeper, you will build smart automation and AI-driven agents to continuously evaluate production against company standards, providing the high-fidelity feedback loops our engineering, product, and business teams need to maintain ultimate accountability.
This is a builder role. You will define the operating model from scratch, hire a small team of strong engineers, and lean heavily on automation, agents, and continuous production verification rather than manual test cycles. Success looks like a team that catches more, manually does less, and grows the organization's confidence in its own output without becoming a bottleneck to it.
Responsibilities
- Establish charter, operating principles, success metrics, and the team itself. Reinforce in every process that this is an audit function, not a release gate.
- Design and operate an automation-first stack: AI agents, synthetic monitoring, production probes, behavioral diffing, and continuous sampling.
- Design and execute a continuous non-intrusive production auditing strategy to catch subpar experiences, regressions, and drift. Establish measurable metrics across user experience and system behavior. Aim for continuous automated coverage that scales with the product instead of a pile of manual test cases.
- Translate audit findings into actionable insights for Engineering and Product. Partner on root-causing systemic issues without taking ownership of their bug fixes, and build dashboards that give the organization real-time visibility into production quality.
- Recruit, grow, and empower a team of strong builders first and quality specialists second. Hold a high bar on technical depth.
- Be a credible technical voice on incidents, regressions, and quality patterns. Stay hands-on enough to contribute to the team's stack.
- Own your team's full delivery cycle and engineering standards: code review, testing discipline, observability, and responsible AI use.
- Represent the team's findings through clear written reports, metrics, and well-run forums. Resist pressure to convert audit findings into release blocks.
Qualifications
- 8+ years building production software, with at least 2 years leading engineers directly. Some of that experience should be in environments where you owned quality, reliability, or production verification at scale.
- Track record of building automation-first quality or verification systems (test infrastructure, synthetic monitoring, production probing, chaos or behavioral testing) rather than managing manual QA.
- Hands-on experience with LLMs, AI agents, or sophisticated testing tools to automate non-deterministic flows, and a sharp view on where modern AI changes the economics of test and audit work.
- Deep familiarity with modern automation frameworks (Playwright, Cypress), CI/CD, and observability tools (Datadog, New Relic). You think in terms of sampling, blast radius, and error budgets rather than green/red test suites.
- Strong foundation in backend or full-stack systems, distributed systems, and data modeling. Mobile or AI systems experience a plus.
- Strong conviction in an engineering-owned quality model and a resistance to letting a quality function become a scapegoat for bugs.
- Experience hiring engineers and calibrating a team's bar, including hard calls on performance.
- Exceptional, highly structured communication skills, with the ability to influence teams using data and operate effectively across asynchronous environments.
- Background working in fast-paced, high-scale environments; experience in fintech, payments, lending, AI, or mobile is a strong plus.
Benefits of Joining
- Competitive salary and equity package
- Fast-paced, collaborative, and high-autonomy work culture
- Hybrid work setup designed for flexibility and work-life balance
- Fully paid group medical insurance and personal accident insurance
- Generous paid time off, plus company-declared public holidays
- Fully paid parental leave for fathers and non-birthing parents (12 weeks), in addition to 26 weeks of statutory maternity leave
- Monthly WFH stipend, along with a one-time home office setup budget
- $500 annual professional development budget
- Quarterly social meet-ups and sponsored monthly team lunches
We’re looking for more than just qualifications. If you’re unsure that you meet the criteria but identify with our vision of providing equal opportunity to everyone to access financial services, please do not hesitate to apply!
Branch International is an Equal Opportunity Employer. The company does not and will not discriminate in employment on any basis prohibited by applicable law.



