Revolutionizing the Transportation of Goods
Senior Data Platform Engineer
Location
Pennsylvania
Posted
3 days ago
Salary
0
Seniority
Senior
Job Description
Senior Data Platform Engineer
Stack AV
• Design and operate distributed storage systems for scheduling and executing large-scale batch workloads. • Build and maintain an open source, modern data platform. • Optimize utilization of storage resourcesImprove reliability and fault tolerance of large-scale storage systems and data platform components. • Collaborate with teams across the company to understand workload requirements and improve platform capabilities. • Contribute to platform tooling, automation, and CI/CD workflows.
Job Requirements
- 7+ years of experience building and operating distributed storage systems or modern data platforms.
- Experience operating streaming platforms such as Kafka or Pulsar.
- Fluent in Python, and SQL, with experience writing and maintaining highly available data applications using Trino and Apache Spark.
- Knowledge of table formats (Iceberg, Delta Lake, Hudi, Xtable).
- Experience operating and optimizing at least one RDBMS (Postgres, MySQL).
- Strong debugging and problem-solving skills in complex distributed systems.
- Ability to collaborate across teams and communicate technical concepts clearly.
Benefits
- Equal opportunity workplace
- Diverse teams produce the best ideas and outcomes
- Culture of inclusion, entrepreneurship, and innovation
Related Guides
Related Categories
Related Job Pages
More Platform Engineer Jobs
Platform Engineer
Platform9 SystemsPlatform9 is the leader in simplifying enterprise Private Clouds. Founded by a team of VMware cloud pioneers, we are dedicated to transforming IT operations. Our flagship product, Private Cloud Director, turns your existing hardware into a full-featured, future-ready private cloud. Over 30,000 nodes in production at some of the world’s largest enterprises. Inclusive, globally distributed company backed by prominent investors. Values: innovation, customer obsession, ownership, radical candor, and excellence.
Role Description We are seeking a highly motivated and experienced Platform Engineer to join our growing team. In this role, you will be responsible for the design, implementation, and maintenance of our cloud infrastructure, ensuring high availability, scalability, and security. You will be working closely with our engineering team to automate deployments, manage infrastructure as code, and troubleshoot production issues. This is a unique opportunity to work on cutting-edge technologies and contribute to the success of a rapidly growing company. We offer a fast-paced and collaborative work environment where you will have the opportunity to learn and grow your skills. Responsibilities - Design, implement, and maintain our cloud infrastructure on AWS and OCI, including Kubernetes clusters, OpenStack environments, and supporting services. - Automate infrastructure provisioning, configuration management, and application deployments using tools like Terragrunt. - Implement and manage monitoring and logging solutions using Prometheus, Grafana, and other relevant tools. - Develop and maintain internal tooling and scripts to improve operational efficiency. - Troubleshoot and resolve production issues related to infrastructure, applications, and performance. - Collaborate with engineering teams to implement and maintain CI/CD pipelines. - Participate in on-call rotation to ensure 24/7 availability of critical services. - Stay up-to-date on the latest technologies and trends in cloud computing and platform engineering. Qualifications - 5-7 years of experience in a DevOps or Platform engineering role, with a strong understanding of cloud infrastructure and operations. - Extensive experience with Kubernetes, including cluster administration, deployment strategies, and troubleshooting. - Experience with OpenStack is highly desirable, but not required. - Proficiency in infrastructure-as-code tools like Terragrunt or Ansible. - Strong scripting skills in Python or similar languages. - Strong programming skills in Golang or similar languages. - Strong configuration management skills with Salt, Chef or similar languages. - Experience with Observability tools like Groundcover, Prometheus, Cortex, Grafana, and Loki. - Experience with CI/CD tools and best practices. - Experience with administrating and debugging on Linux-based operating systems. - Excellent problem-solving and troubleshooting skills. - Strong communication and collaboration skills. - Strong incident management experience. Benefits - Competitive Compensation and Equity - Medical Healthcare for you and your family - Wellness Benefits - Professional Development/ Global certifications - Reward and Recognition Programs - Team Building Activities - Hackathon - Company Wide Programs
Role Description As the Senior Director of Platform Engineering, you will own the foundation that the entire Altana engineering organization builds on: the cloud infrastructure that runs our products and the developer experience that determines how fast and how safely we ship. You will lead a multidisciplinary organization spanning Cloud Engineering, Developer Experience, Site Reliability, and DevSecOps / Release Engineering — with end-to-end ownership of our build, test, and release pipelines. This is a moment of real transformation in how software is built. We are reorganizing our engineering practice around an AI-first, agent-first model, where AI coding harnesses and autonomous agents are first-class participants in the software development lifecycle. You will set the technical strategy, build and develop the team, and personally raise the bar on engineering excellence. You will be measured by: - The reliability and cost-efficiency of our platforms - The velocity and happiness of every engineer who depends on them - How effectively Altana adopts AI across the development lifecycle Key areas under your stewardship - Cloud Engineering: Design, manage, and secure the multi-cloud infrastructure that powers our products. - Developer Experience: Own the infrastructure and tooling the engineering organization relies on every day. - Site Reliability: Ensure our systems are reliable, available, performant, and efficient. - DevOps / Release Engineering: Bridge development and operations, automating and optimizing the software delivery lifecycle. Qualifications - Experience building, leading, and scaling best-in-class platform and/or developer experience organizations. - 10+ years of real-world experience developing or managing complex engineering systems. - A proven track record of driving developer experience and developer productivity. - Direct, hands-on experience with AI-assisted and agentic development. - Deep expertise with CI/CD systems. - Strong software engineering fundamentals. - Production experience with cloud platforms. - A track record tackling large-scale infrastructure in support of fast-paced ML/AI-enabled products. - Strong sense of shared ownership and excellent communication skills. Requirements - Define and own a clear, multi-year technical strategy for Platform Engineering. - Set the reference architecture and standards for cloud infrastructure, CI/CD, and developer tooling. - Lead build-vs-buy and platform investment decisions. - Own the Developer Acceleration program. - Drive an AI-first, agent-first transformation of the software development lifecycle. - Establish how AI-assisted and agent-generated code is reviewed, tested, secured, and governed. - Gather requirements from application teams and translate them into well-scoped platform work. - Establish and continuously improve best practices for monitoring and incident response. - Drive a culture of ownership, resilience, and accountability. - Partner across product, engineering, security, data science, and commercial teams. - Lead, coach, and inspire a multidisciplinary organization of engineers and managers. Benefits - Flexible Time Off: Agency over your own time off for work-life balance. - Parental Leave: 14 weeks for non-birthing parents and up to 26 weeks for birthing parents, all paid at 100% of base salary. - Health Benefits: Full suite of medical, vision, and dental benefits with generous employer contributions. - Supplemental Benefits: Life, short- and long-term disability, and AD&D insurance coverage at no cost. - 401(k) Savings: Guideline 401(k) retirement savings program. - Commuter Benefits: Pre-tax funds for public transit or parking. - Wellness: Free premium subscription to Calm for mental and emotional health. - Pet Insurance: Coverage for pets with Wishbone insurance and/or Total Pet vet service. - Employee Assistance Program: Free access to confidential personal support. - Dependent Care FSA: Access to a Dependent Care FSA for childcare expenses.
Staff Platform Engineer, AI Enablement
ZocdocZocdoc is the beginning of a better healthcare experience for millions of patients every month.
Role Description At Zocdoc, we're on a mission to give power to the patient -- and that starts with enabling our engineering teams to build and ship software safely, efficiently, and with confidence. As a Staff Platform Engineer, you will: - Design, build, and improve foundational systems, tooling, and paved roads that create leverage across our engineering organization. - Collaborate across the business to turn platform strategy into practical, adoptable solutions. - Emphasize hands-on technical leadership with meaningful input into platform direction. - Reduce friction across the development lifecycle and enable teams to focus on improving healthcare access. - Play a key role in how developers at Zocdoc build, test, and deploy software. - Abstract operational complexity, improve testability and safety, and streamline the developer journey across frontend, backend, and mobile surfaces. Qualifications - 7+ years of experience building or evolving internal developer platforms, productivity tooling, or CI/CD systems used by multiple teams. - Experience designing and operating distributed systems in AWS or similar cloud environments. - Strong judgment around platform abstractions and APIs; knowing when to standardize and when to stay flexible. - Ability to take ambiguous problems, break them into deliverable pieces, and drive them to measurable outcomes. - Clear communication of technical trade-offs and building consensus with peers and stakeholders. - Motivated by enabling other engineers to build reliable, scalable, and secure systems with confidence. - Value learning, curiosity, and operational excellence, and model those qualities for the team. Requirements - Enjoy building platforms that enable productivity, consistency, and autonomy. - Motivated by making other engineers more effective, not just shipping your own features. - Like designing cohesive developer experiences that teams actually want to adopt. - Care deeply about developer empathy, usability, and reducing cognitive load. - Prefer solving real problems through thoughtful design, iteration, and collaboration. - Comfortable influencing peers and adjacent teams through clear communication and working code. - Find satisfaction in building systems that scale predictably and help other engineers succeed. - Enjoy learning and teaching others about leveraging AI-based technologies to achieve real-world results. Benefits - Flexible work environment for those based in NYC (Role is open to remote). - Unlimited Vacation. - 100% paid employee health benefit options (including medical, dental, and vision). - 401(k) with employer funded match. - Corporate wellness program with Wellhub. - Sabbatical leave (for employees with 5+ years of service). - Competitive paid parental leave and fertility/family planning reimbursement. - Cell phone reimbursement. - Employee Resource Groups and ZocClubs to promote shared community and belonging. - Great Place to Work Certified.
• Own our data ingestion layer end-to-end, including completing our migration to open-source ingestion tooling (dlt) and maintaining reliability as the stack evolves • Manage dbt models, tests, documentation, and the semantic layer - the definitions that determine what every metric means across the business • Own Dagster orchestration: scheduling, retries, alerting, and failure handling across all pipeline runs • Keep Lightdash metadata, dimension/measure definitions, and access controls accurate and current • Accelerate data refresh cycles to support near-real-time operational use across the business • Build monitoring, failure alerting, and anomaly detection into the stack so issues surface proactively • Chase data through systems when things go wrong: trace why records drop or transform unexpectedly between source and dashboard, and resolve the root cause rather than the symptom • Establish and document data quality standards and lineage practices across the warehouse • Partner with the Director of Strategy & Analytics — and the Technology Lead once that role is filled — on platform infrastructure, system integrations, and technical initiatives where data is a core component • Build and maintain reverse ETL pipelines to push warehouse data back into operational tools • Contribute to A/B testing infrastructure and the systems that support consistent metric definitions across the org • Own separation of dev and production environments: deployment pipelines, change management, access controls, and release practices • Maintain infrastructure documentation and ensure the platform is operable beyond any single person



