Instacart invites the world to share love through food. This is how homemade is made.
Senior Staff Software Engineer, Data Infrastructure
Location
United States
Posted
57 days ago
Salary
$254K - $321K / year
Seniority
Lead
Job Description
Senior Staff Software Engineer, Data Infrastructure
Instacart
We're transforming the grocery industry At Instacart, we invite the world to share love through food because we believe everyone should have access to the food they love and more time to enjoy it together. Where others see a simple need for grocery delivery, we see exciting complexity and endless opportunity to serve the varied needs of our community. We work to deliver an essential service that customers rely on to get their groceries and household goods, while also offering safe and flexible earnings opportunities to Instacart Personal Shoppers. Instacart has become a lifeline for millions of people, and we’re building the team to help push our shopping cart forward. If you’re ready to do the best work of your life, come join our table. Instacart is a Flex First team There’s no one-size fits all approach to how we do our best work. Our employees have the flexibility to choose where they do their best work—whether it’s from home, an office, or your favorite coffee shop—while staying connected and building community through regular in-person events. Learn more about our flexible approach to where we work. About the Role Instacart's backend systems serve millions of customers every year, processing petabytes of data daily to power everything from personalized grocery recommendations to real-time fraud detection and advertiser measurement. As Senior Staff Software Engineer on the Data Infrastructure team, you will set the technical direction for the data platform that underpins the entire company’s data strategy — from the storage and compute layer to streaming infrastructure, analytics tooling, and governance systems. This is a role for a systems thinker who thrives at the intersection of deep technical depth and company-level strategic impact. You will define the multi-year architecture roadmap for our core data platform, drive major platform investment decisions, and operate as a thought leader both internally across engineering leadership and externally within the data infrastructure community. Your work will directly influence how Instacart makes decisions at scale and shape the platform economics of one of the most data-intensive technology companies in grocery commerce. About the Team The Data Infrastructure team builds and operates the systems that power Instacart’s data ecosystem — a modern data lakehouse built on Apache Iceberg, a multi-engine compute platform spanning stream processing and analytical query engines, and self-serve tooling that enables Product, Data Science, ML, Ads, Finance, and other engineering teams to move fast with data. We balance two mandates: maintaining a highly reliable production data platform serving the business today, and architecting the infrastructure that will serve it for the next three to five years. We operate in a world of significant scale and material infrastructure spend, where architectural decisions carry both technical and financial consequences. Some of the core technologies in our stack include: Apache Iceberg, Apache Flink, Trino, ClickHouse, Apache Kafka, Apache Spark, Snowflake, Databricks, Confluent, Airflow, dbt, Delta Lake, Scala, Python, Postgres, and AWS. The team operates with a high degree of ownership and autonomy. You will work closely with engineering leadership, data science, ML platform, ads infrastructure, finance engineering, and senior stakeholders across the organization. ABOUT THE JOB What You’ll Do - Define and Drive Data Infrastructure Vision: Own the multi-year technical vision and roadmap for Instacart’s core data platform (storage, compute, streaming, orchestration, analytical serving). Translate company data strategy (monetization, federated access, real-time) into a coherent, actionable architecture plan. Align with leadership and proactively evolve the architecture for scale, maturity, and cost. - Lead Platform Strategy (Build, Buy, Ownership): Architect the ownership strategy for the data platform, determining build vs. buy (including managed services vs. open-source self-hosting). Lead technical/business case evaluations, full cost-benefit modeling, and risk analysis for major investments. Design phased migrations to ensure reliability while achieving long-term independence and cost efficiency. - Own the Data Lakehouse Foundation: Drive the architecture and delivery of the open lakehouse, including unified table format, compute engine portfolio, and storage governance. Expand multi-engine compute (interactive, batch, stream processing). Define standards for data storage, access, governance, and sharing to enable compute portability and prevent lock-in. Ensure reliable scaling without proportional cost increase. - Drive Real-Time and Streaming Infrastructure: Own the architecture for streaming data, event-driven pipelines, stream processing, and real-time serving for critical use cases (Ads, Fraud, ML). Make principled decisions on deployment models balancing cost, availability, and operational maturity. - Pioneer AI-native Data Infrastructure Engineering: Lead the adoption, application, and cultural integration of AI/LLM tools across the data platform lifecycle, setting a high standard for AI-augmented workflows, driving high-leverage opportunities from automation to cost optimization, and partnering with other teams to embed AI-powered capabilities into the platform itself. - Elevate Engineering Excellence: Serve as the senior technical voice, setting standards for system design and reliability. Lead architecture reviews. Mentor staff/senior engineers, fostering ownership and execution. Be a visible engineering leader, contributing to hiring and cross-org alignment. - Partner Deeply with Stakeholders: Collaborate with Data Science, ML Platform, Ads Infra, Product Eng, Finance Eng, and Security to translate needs into reliable, self-serve infrastructure. Represent Data Infra in architectural forums, ensuring decisions support business priorities (monetization, compliance, AI). Clearly communicate complex trade-offs to technical and executive audiences. Minimum Qualifications - 10+ years of software engineering, focused on data infrastructure or distributed systems at scale.Sets technical direction for large-scale data platforms, defining multi-year architecture roadmaps and influencing strategy. Experience in high-growth, data-intensive environments with significant infrastructure scale and spend. - Expertise in modern data lakehouse architectures, open table formats (Iceberg, Delta Lake, Hudi), and compute/storage trade-offs, in distributed query/compute systems (Trino, Spark, ClickHouse, etc.) for performance tuning and production reliability and event-driven infrastructure (Kafka, Flink, etc.) - Proven track record owning and executing major infrastructure platform transitions, including build vs. buy, migration design, and risk management. - Experience building compelling business cases for infrastructure investments, including cost-benefit analysis and TCO modeling. - Exceptional technical communication for clear architecture documents, strategy memos, and proposals to drive leadership alignment. Strong ownership, comfort with ambiguity, and organizational influence to drive large, multi-team initiatives from concept to production. Preferred Qualifications - Familiarity with data governance, compliance frameworks (SOX, CPRA, GDPR), and designing governance controls into the platform architecture. - Experience with FinOps and data platform cost optimization, including managing multi-million dollar infrastructure budgets and negotiating vendor contracts. - Deep knowledge of SQL and strong proficiency in Python or Scala for systems-level work. - Experience with orchestration systems (e.g., Apache Airflow) and data transformation pipelines (e.g., dbt) in large-scale production environments. - Track record of building and growing high-performing data infrastructure teams. - Bachelor’s, Master’s, or PhD in Computer Science, Computer Engineering, Electrical Engineering, or equivalent practical experience. #LI-Remote Instacart provides highly market-competitive compensation and benefits in each location where our employees work. This role is remote and the base pay range for a successful candidate is dependent on their permanent work location. Please review our Flex First remote work policy here. Offers may vary based on many factors, such as candidate experience and skills required for the role. Additionally, this role is eligible for a new hire equity grant as well as annual refresh grants. Please read more about our benefits offerings here. For US based candidates, the base pay ranges for a successful candidate are listed below. CA, NY, CT, NJ $304,000—$321,000 USD WA $292,000—$308,000 USD OR, DE, ME, MA, MD, NH, RI, VT, DC, PA, VA, CO, TX, IL, HI $279,000—$294,500 USD All other states $254,000—$268,000 USD
Related Guides
Related Job Pages
More Full-stack Engineer Jobs
Senior Software Engineer II, Page Builder – Retailer Platform
InstacartInstacart invites the world to share love through food. This is how homemade is made.
• Lead the CMD service extraction: architect and drive the migration of our Content Management Domain from a Ruby monolith into a dedicated Go service using a strangler-pattern approach; design proto-first API contracts (e.g., v2/GetPlacements), implement concurrent visibility condition evaluation via goroutines, and establish formal SLOs (99.9% availability, under 30ms P90 placement fetch) for a system handling 7M daily requests across 15+ consumer surfaces. • Design composable extensibility: replace 125+ hardcoded placement format types with a single composable type built on React components and Liquid templates to eliminate weeks of full-stack engineering per new format and unlock enterprise retailer customization at scale. • Shape the AI-native content platform: expose Page Builder capabilities as MCP-compatible endpoints, enabling AI agents to create, preview, QA, and publish pages end-to-end; define how LLM-powered content creation, AI carousels, and agentic page management integrate with CMD. • Drive cross-team architecture: own the technical relationship with Shopping/URSA, Feeds, Growth, Loyalty, and Ads; lead API contract design, coordinate migration sequencing, and ensure CMD evolves as a reliable, well-documented platform that other teams can confidently build on. • Mentor and multiply the team: raise the engineering bar across Page Builder by mentoring engineers, establishing robust design patterns, and contributing to a culture where AI-assisted development is the default.
• Build a full stack application from frontend to backend services with multiple databases • Contribute in building highly impactful technology used by large companies • Collaborate with team on project development and problem-solving
• Innovation is hardcoded in Entefy’s DNA and we’re about to do something that’s never been done before. • Entefy seeks to move the dial on what is considered technically possible, and in doing so, to make life better for everyone. • If you’d like to change the world, this is your chance to make a career of it.
• Join the Entefy team, building a full stack application from frontend to backend services • Help build the next-generation platform to transform digital interaction for people and thinking machines • Contribute in building highly impactful technology used by large companies

