IDEA Public Schools is an education management-focused nonprofit organization that believes every single child who wants to can participate in higher education.
Data Platform Engineer (26-27)
Location
United States
Posted
44 days ago
Salary
$89.6K - $105K / year
Seniority
Mid Level
Job Description
Data Platform Engineer (26-27)
IDEA Public Schools
Data Platform Engineer

This is a vacancy for the 26-27 school year with a target start date of July 1, 2026.

Mission: The Data Platform Engineer builds and operates IDEA’s data infrastructure on Snowflake, enabling reliable, scalable access to data that supports analytics, reporting, and research across multiple states. This role designs automated ingestion pipelines, optimizes platform performance and cost, and ensures the data platform functions as a production-grade system for downstream teams. Reporting to the Manager of Data Platform Engineering, this engineer works hands-on with ELT pipelines, infrastructure-as-code, and Snowflake administration while contributing to IDEA’s transition from legacy ETL systems to a modern lakehouse architecture.

Supervisory Responsibilities: Individual contributor role with no direct reports. Senior engineers may mentor peers and lead technical initiatives.

Location: This is a full-time remote position based in Texas, with preference given to candidates who live in Austin, El Paso, Houston, Permian Basin (Midland/Odessa), Rio Grande Valley, San Antonio, and Tarrant County (Fort Worth), or who are willing to relocate.

Travel Expectations: Minimal travel (5–10% annually) for collaboration, training, or critical implementation milestones.

What You’ll Do – Accountabilities

Essential Duties:
- Design, build, and maintain automated ELT pipelines ingesting data from diverse source systems into Snowflake.
- Configure and manage cloud-native ingestion tools and custom Python-based pipelines when needed.
- Build and maintain Bronze-layer tables with schema evolution handling, audit metadata, and lineage.
- Implement ingestion-level validation and monitoring to catch issues early.
- Document source configurations, refresh schedules, and troubleshooting procedures.
- Partner with Analytics Engineering to ensure ingestion patterns support downstream transformation needs.
- Administer Snowflake environments, including databases, schemas, warehouses, access controls, and security settings.

Additional Duties and Responsibilities:
- Optimize performance and cost through warehouse sizing, clustering, query analysis, and resource monitoring.
- Manage Snowflake objects using infrastructure-as-code patterns.
- Implement security best practices including RBAC, encryption, auditing, and network policies.
- Evaluate and adopt new Snowflake capabilities as appropriate.
- Own Terraform-based infrastructure definitions for Snowflake and related platform components.
- Automate recurring operational tasks such as provisioning, access grants, and environment setup.
- Build CI/CD pipelines for infrastructure changes with testing and safe deployment practices.
- Develop reusable templates and modules to accelerate onboarding of new sources and domains.
- Maintain clear documentation and runbooks for platform operations.
- Implement monitoring and alerting for pipelines, platform health, and performance.
- Troubleshoot pipeline failures and platform issues using systematic root-cause analysis.
- Embed observability (logging, metrics, alerts) into all production pipelines.
- Collaborate closely with Analytics Engineering, DataOps, and Data Governance partners.
- Participate in code reviews and design discussions.
- Share platform knowledge through documentation, mentoring, and team forums.
- Contribute to retrospectives and continuous improvement efforts.

Knowledge and Skills – Competencies
- Make Strategic Decisions: This team member uses data, feedback, and insights to inform thoughtful decision-making, while considering the impact on their direct reports and team. They communicate decisions with clear rationale and begin to connect their choices to broader team objectives.
- Manage Work and Teams: This team member sets clear, measurable goals and regularly reflects on progress, adjusting actions as needed. They prioritize work aligned with their goals using a task management system and consistently meet deadlines through effective time management.
- Grow Self and Others: This team member regularly offers affirming and adjusting feedback, maintaining a positive balance that reinforces growth and motivation. They provide transparent, candid performance insights and offer consistent coaching and development aligned with individual goals, supporting both direct reports and cross-functional partners.
- Build a Culture of Trust: This team member proactively builds strong personal and professional relationships with individual stakeholders and regularly seeks feedback to improve their work experience. They create a supportive environment where others feel safe to take risks and learn from mistakes without fear of retribution.
- Communicate Deliberately: This team member communicates thoughtfully by anticipating potential misunderstandings and providing necessary context to ensure clarity. They leverage structured communication channels to address challenges, ask meaningful questions, and guide conversations toward solutions, while actively listening to the concerns of others.

Additional Skills:

Required
- Hands-on experience administering Snowflake or similar cloud data platforms.
- Strong SQL skills for data extraction, validation, and performance tuning.
- Experience building and operating automated data pipelines.
- Proficiency with Python and scripting for automation and operational tooling.
- Experience operating production systems with monitoring and incident response.
- Familiarity with infrastructure-as-code and CI/CD concepts.

Preferred
- Experience with cloud-native ingestion tools (e.g., Fivetran, Airbyte).
- Experience with Terraform or similar IaC tooling.
- Familiarity with dbt and analytics engineering workflows.
- Exposure to orchestration, data quality, or observability tools.
- Experience with education, public sector, or regulated data environments.

Required Education and Experience:
- Bachelor’s degree in a technical field or equivalent practical experience.
- 3+ years of experience in data engineering, platform engineering, or related roles.
- Demonstrated experience building and operating production data systems.
- Hands-on experience with Snowflake or comparable cloud data warehouses.

Preferred Education and Experience:
- Snowflake or cloud platform certifications.
- Experience supporting multi-team data platforms at scale.
- Strong Python proficiency beyond basic scripting.

Physical Requirements:
- Prolonged periods working on a computer and in virtual meetings
- Ability to travel domestically via car and air travel to campuses and state office
- Flexibility for occasional evening meetings with distributed stakeholders across time zones

What We Offer:

Compensation & Benefits: Salaries for people entering this role typically fall between $89,600 and $105,300, commensurate with relevant experience and qualifications and in alignment with internal equity. This role is also eligible for performance pay based on organizational performance and goal attainment. Additionally, we offer medical, dental, and vision plans, disability, life insurance, parenting benefits, flexible spending account options, generous vacation time, referral bonuses, professional development, and a 403(b) plan. You can find more information about our benefits at https://ideapublicschools.org/careers/benefits/.

* IDEA may offer a relocation stipend to defray the cost of moving for this role, if applicable.

Application process: Submit your application online through Jobvite. Please note that applications will be reviewed on an ongoing basis until the position is filled. Applicants are encouraged to apply as early as possible.

Learn more about IDEA

At IDEA the Staff Experience Team uses our Core Values to promote human connection and a culture of integrity, respect, and belonging for all Team and Family members. Learn more about our Commitment to Core Values here: https://ideapublicschools.org/our-story/#core-values
More Data Engineer Jobs
We Impact Lives Through Purpose-Driven Work in A People First Culture

Ascend Learning, a leading healthcare and learning technology company, is the connection between a powerful portfolio of brands serving students, educators, and employers with outcomes-based, data-driven solutions across the lifecycle of learning. From testing to certification, Ascend Learning products are used by physicians, emergency medical professionals, nurses, allied health professionals, certified personal trainers, financial advisors, skilled trades professionals and insurance brokers. Headquartered in Burlington, MA, with additional office locations and hybrid and remote workers in cities across the U.S., Ascend Learning was recognized by Newsweek and Plant-A Insights Group as one of America’s 2025 Greatest Workplaces as well as America’s Best Places to work for Mental Well-Being for 2025. We're always looking for talented, passionate professionals to join us in our mission to help change lives. If this sounds like an environment where you'd thrive, read on to learn more.

WHAT YOU'LL DO

As a Senior Data Engineer, you will be responsible for designing, building, and maintaining scalable and robust data pipelines and architectures. You will play a critical role in enabling data-driven decision-making by ensuring the availability, integrity, and performance of our data systems. You will work closely with the business intelligence team, data scientists, and other stakeholders to support advanced analytics initiatives and contribute to our overall data strategy.

WHERE YOU’LL WORK

This position has the flexibility of remote work in the United States.

HOW YOU’LL SPEND YOUR TIME
- Design and optimize complex data models, schemas, and storage structures across SQL Server, Snowflake, and other platforms to maximize performance, scalability, and data accessibility.
- Lead the implementation and continuous improvement of enterprise-wide data warehouse strategies, ensuring data integrity, consistency, and alignment with organizational standards.
- Design, develop, and maintain robust, scalable ETL/ELT pipelines leveraging SQL Server, Kafka, Azure Data Factory, Snowflake, and complementary technologies to support high-volume, real-time, and batch data processing.
- Partner closely with business intelligence teams, data scientists, analysts, and key stakeholders to gather detailed requirements, translate them into scalable technical solutions, and deliver actionable data assets that drive business outcomes.
- Collaborate with the data architecture team to define and implement scalable, high-performance data infrastructure and solutions aligned with enterprise data strategy.
- Proactively monitor, troubleshoot, and optimize data pipelines and database systems to ensure high availability, reliability, and minimal latency, swiftly addressing any bottlenecks or failures.
- Define, enforce, and champion data governance frameworks, ensuring compliance with data privacy regulations, security policies, and organizational standards.
- Drive enhanced data observability by implementing comprehensive monitoring, logging, and alerting mechanisms to enable proactive identification and resolution of data quality and pipeline issues.
- Stay abreast of emerging data engineering technologies, tools, and industry best practices to continuously innovate and improve data platform capabilities.
- Provide technical leadership and mentorship to junior data engineers, fostering a culture of excellence, knowledge sharing, and adherence to best practices in data engineering and management.

WHAT YOU'LL NEED
- High school diploma or GED required. Bachelor’s degree in computer science, software engineering, information technology, or a related field or relevant work experience preferred.
- 6+ years of experience in a data engineering role with a strong focus on data warehousing and ETL processes.
- Skilled in SQL Server, MySQL, and Snowflake. Design data models, write SQL queries, and optimize database performance.
- Experienced in building ETL pipelines using Azure Data Factory, SSIS, and Kafka for data integration and transformation.
- Strong background in real-time data processing with Kafka and streaming technologies.
- Knowledgeable in data warehouse design, data modeling, and database best practices.
- Proficient in Python and Java for scripting and automating data workflows.
- Hands-on experience with Azure Cloud Services like Azure Data Factory, Azure Databricks, and Azure SQL Database. Deploy and manage scalable data solutions on Azure.
- Familiar with cloud infrastructure, automation, and security best practices to ensure reliable and secure data systems.
- Understand data governance, data security, and compliance standards such as GDPR and HIPAA.
- Strong problem-solving skills to quickly identify and fix data engineering issues.
- Good communication skills to explain technical details clearly to both technical teams and business users.
- Experience with RESTful APIs for integrating data systems.
- Comfortable with additional programming languages like Scala and advanced Python.
- Familiar with data visualization and BI tools to help deliver clear insights.

BENEFITS
- Flexible and generous paid time off
- Competitive medical, dental, vision and life insurance
- 401(k) employer matching program
- Parental leave
- Wellness resources
- Charitable matching program
- On-site workout facilities (Leawood, Gilbert, Burlington)
- Community outreach groups
- Tuition reimbursement

Fostering A Sense of Belonging

Our values-driven culture unifies our teams and inspires a mindset of action, innovation, and collaboration, with a relentless focus on customers. We seek out and celebrate all people and perspectives and cultivate an inclusive culture where everyone can thrive, feel valued, and be authentic. Our culture is firmly rooted in the belief that by embracing our differences and drawing on diverse perspectives, we are a stronger, more innovative, and more successful organization where employees experience a sense of belonging.

Ascend Learning, LLC is proud to be an equal opportunity employer (M/F/Vets/Disabled). No agency or search firm submissions will be accepted. Applicants for U.S.-based positions with Ascend Learning, LLC must be legally authorized to work in the United States, and verification of employment eligibility will be required at the time of hire.
Thinkific is a learning commerce platform that helps learning businesses turn knowledge into impact. By bringing together community, courses, and content with commerce, we power transformative learning experiences that help businesses grow their revenue and reach millions of learners around the world.

We’re a team of 300+ Thinkers building products that matter. Every role at Thinkific contributes to raising the bar for online learning, supporting learning businesses, and creating real-world impact. You’ll work alongside curious, collaborative teammates who care deeply about what they build and who they build it for. We’re committed to a fair, inclusive, and human hiring experience. Our team is here to guide you every step of the way, so you always know what to expect!

Are you a data engineer who loves writing code, or a full-stack developer with a knack for data? Either way, we'd love to talk. We're looking for an Intermediate Data Engineer to join us at Thinkific.

As an Intermediate Data Engineer on our Data team, you'll be closely collaborating with engineering teams, designers, and stakeholders across the company to solve data problems that span the full stack, from pipelines and infrastructure to customer-facing data products. You'll be part of a small, dynamic team; the problems you'll work on won't always come with a neat brief, and we need someone who takes ownership and runs with it. Reporting to a Principal Data Engineer, your work will directly impact both internal decision-making and customer success. Your goal will be to help shape our data products and infrastructure to have a top-line impact on our company goals.

Here's how you'll accomplish this:
- Take ownership of undefined problems: dig in, scope the work, and ship solutions, whether that's a pipeline, a dashboard, a tracking implementation, or something we haven't thought of yet
- Ship customer-facing analytics dashboards and features; this is production code, not just SQL and charts
- Work directly with engineering teams to implement tracking services across our products
- Interface with teams across the company who need data solutions, translating fuzzy requirements into concrete deliverables
- Raise the bar on technical quality across the data team through architecture decisions, reducing tech debt, code reviews, and bringing software development best practices to a high-performing data team
- Work with your team to conduct new technology research; bring fresh ideas and concepts to bear on how we integrate AI into our data workflows

The person we have in mind likely:
- Has 3–5 years of experience working across data engineering, full-stack development, analytics, or some combination; we care more about range than a specific title
- Has solid data engineering experience: you've built and maintained pipelines, worked with warehouses, and understand data modeling and quality
- Has full-stack development experience: you've shipped production code that users interact with, not just internal tooling
- Has hands-on experience with dbt and BigQuery or similar stacks (Snowflake, Databricks, etc.)
- Is self-motivated and resourceful: you ship things, communicate clearly across technical and non-technical teams, and are always looking to level up
- Is comfortable working across the stack and across disciplines; you don't need to be an expert everywhere, but you're not afraid to jump in
- For reference, our current stack is primarily dbt (SQL) with Python for data work, Looker for our Analytics layer, and React, TypeScript (with some Ruby), and embedded Looker on the product side, but we're more interested in your engineering fundamentals than specific language experience.
- Loves to learn and grow. They’ve found (and keep looking for) ways to level up their skills in this field, whether that’s through formal education, gaining professional experience, or maybe even building their own business

These things would also be nice, but we think you could learn them on the job:
- Experience with LLM/agent development: building AI-powered data products, automated workflows, or integrating LLMs into data pipelines and analytics
- Experience with GCP and Terraform
- Any exposure to ML
- Experience white-labeling a BI tool or embedding analytics into a customer-facing product
- Experience building or maintaining event tracking systems
- Familiarity with modern data stack tools beyond dbt/BigQuery (Airflow, Dagster, etc.)
- Experience working in startup-style environments or autonomous teams within larger orgs
- Experience working in a SaaS environment

We’re committed to fair and transparent pay that reflects both where you are and where you can grow to. This role has a base salary range of $88,200 - $110,200 - $132,200 (minimum, midpoint, maximum), designed to capture the full journey from developing skills to excelling in the position. Most new hires start between the minimum and midpoint, which aligns with being fully capable in the role. Salaries above the midpoint are typically reserved for team members who have demonstrated strong, consistent performance, deep expertise, and a significant positive impact within the role. For high-demand or hard-to-fill positions like this one, we may hire above midpoint for candidates who bring exceptional experience, skills, or impact potential.

Diversity, Equity, Inclusion and Belonging & Accessibility

This is just our initial idea of who we’re looking for! At Thinkific, we know that people have unique career journeys. If your experience is close to what we’ve described but you feel that you might be missing a few of the requirements, please still apply! We believe in equal opportunity and are committed to diversity, equity, inclusion, and belonging across every facet of our business. We’re also committed to providing a comfortable and accessible interview experience for every candidate. If there are any accommodations our team can make throughout our hiring process (big or small), please let us know.

What you can expect if you join Thinkific:

👏 An amazing team of talented, passionate, and kind Thinkers. Together, we’ve built an amazing, award-winning culture: we’re a Certified Great Place to Work and one of Canada’s Most Admired Corporate Cultures by Waterstone!

🚀 The chance to build, improve, and innovate on a platform that’s driving positive impact for thousands of businesses and millions of students around the world.

💸 A competitive compensation package including base salary, equity, team-wide bonuses, and an Employee Share Purchase Plan.

🌴 Flexible Paid Time Off to maintain mental and physical health. Our team is encouraged to take a minimum 4 weeks of vacation, plus Thinker Holidays (extended long weekends in the summer) and time off for the December holiday season.

🩺 Health Benefits and Wellness: Comprehensive benefits starting on Day 1 include health, vision, and dental coverage for you and your family, $3,000 for mental health care, a short-term health plan, and an additional $900/year health or personal spending account. Plus, family friendly benefits include generous parental leave top-ups for up to 32 weeks, as well as fertility coverage and personalized return to work options.

🤝 3 Paid Volunteer Days each year to give back to your community and support the causes you care about.

💻 Flexible Work. Choose to work from home from anywhere in Canada, at our Vancouver HQ, a co-working space, or anywhere there’s wifi for a change of scenery.

⬆️ Learning & Growth. An annual $2,000 CAD Learn and Grow fund for conferences, seminars, or courses, plus training, mentorship, coaching, and internal promotion opportunities.

🏡 A home office setup so you’re ready to succeed with a company-owned Macbook Pro and a budget to order a desk, chair, or any accessories to help you work comfortably and productively.

💙 A place where you can bring your whole self to work. We know that different perspectives lead to amazing ideas, more innovation, and, ultimately, our success as a company.

We welcome applicants of all backgrounds, experiences, beliefs, identities, and statuses. Whoever you are, we can’t wait to meet you!

The Thinkific Vancouver office operates on the traditional, ancestral, and unceded territories of the xʷməθkʷəy̓əm (Musqueam), Sḵwx̱wú7mesh (Squamish), and Sel̓íl̓witulh (Tsleil-Waututh) Nations of the Coast Salish People. We encourage everyone to learn more about the original caretakers of the land that you currently occupy.
• You'll focus on solving problems and creating value for the business by building solutions that are reliable and scalable enough to work with the size and scope of the company.
• You will be tasked with creating a custom-built pipeline on the GCP stack.
• You will be part of teams that implement vendor-sourced enterprise software, configuring that software, customizing it, and integrating it with other internal systems.
Vālenz® Health is the platform to simplify healthcare – the destination for employers, payers, providers and members to reduce costs, improve quality, and elevate the healthcare experience. The Valenz mindset and culture of innovation combine to create a distinctly different approach to an inefficient, uninspired health system. With fully integrated solutions, Valenz engages early and often to execute across the entire patient journey – from care navigation and management to payment integrity, plan performance and provider verification. With a 99% client retention rate, we elevate expectations to a new level of efficiency, effectiveness and transparency where smarter, better, faster healthcare is possible.

About This Opportunity:

As a Data Engineer I, you’ll play a hands-on role in building and supporting scalable data pipelines within our cloud-based Lakehouse environment (Azure Databricks, Delta Lake), leveraging tools like Spark and PySpark. You’ll help bring in healthcare data from a variety of sources, ensuring it’s accurate, reliable, and ready to support analytics and reporting needs across the organization. You’ll also partner closely with the broader Analytics team to make sure data is delivered in a way that’s clear and actionable. Over time, you’ll build expertise in managing large, complex datasets and contribute to evolving our data architecture to support new and emerging data sources as the business grows.

Things You’ll Do Here:
- Create and maintain processes to acquire, validate, and enrich data from various sources.
- Support the migration of on-premise data systems (SQL Server) to a cloud-based lakehouse architecture (Azure Databricks, Delta Lake), including data transformation and pipeline re-architecture.
- Develop and optimize ETL/ELT pipelines using PySpark and Spark SQL.
- Implement Lakehouse + Delta architecture best practices to ensure a standardized and scalable way of storing and processing our data, including schema enforcement, ACID transactions, and data versioning.
- Orchestrate data pipelines using Databricks Workflows (Jobs) or similar tools.
- Implement data quality frameworks, validation checks, and monitoring for pipeline reliability.
- Optimize performance and cost of data pipelines.
- Collaborate on CI/CD practices for data pipelines, including testing, deployment, and versioning.
- Partner with data analysts, data scientists, and business stakeholders to identify new sources of data and estimate feasibility of acquiring specific data sources.
- Design and implement data models to support analytics, reporting, and data warehousing use cases.
- Take an active role in agile processes.
- Perform other duties as assigned.

Reasonable accommodation may be made to enable individuals with disabilities to perform essential duties.

What You’ll Bring to the Team:
- 1+ years of work experience in a data engineering role.
- Bachelor’s degree or greater in a quantitative field such as statistics, mathematics, engineering, computer science, finance, or economics, or equivalent practical experience.
- Hands-on experience with Databricks (Spark, PySpark, Delta Lake) and/or migrating RDBMS systems to a data lakehouse.
- Experience working with the most common types of healthcare data (medical claims, eligibility, provider network rosters, Rx claims, etc.) from a variety of sources.
- Strong organizational skills and time management capacity to balance multiple projects with limited supervision.
- Ability to build (and re-evaluate) a process from the ground up.
- Strong investigative skills with ability to search beyond the initial results.
- High attention to detail, with overwhelming desire to test and double-check your own results.
- Comfortable working with messy data and ambiguous results.
- Hands-on experience with SQL and Python (including PySpark) for distributed data processing.
- Experience building and optimizing large-scale distributed data pipelines for both batch and streaming ingestion.

Our data platform is undergoing a transformation from a traditional on-premise architecture to a modern cloud-based lakehouse on Azure.

Technologies used:
- Cloud & Modern Data Platform:
  - Azure (Blob Storage / Data Lake Storage, Synapse Analytics)
  - Databricks (Spark, PySpark, Delta Lake, Databricks Workflows)
  - Delta Lake architecture (ACID transactions, schema enforcement, time travel)
- Data Engineering & Development:
  - Python (including PySpark), SQL
  - Data pipeline orchestration and workflow management
  - Version control (Git, Azure DevOps)
- Legacy / Transitional Systems:
  - SQL Server (on-premise RDBMS)
  - .NET / C#-based data processing applications
  - Migration from traditional ETL and relational systems to cloud-based lakehouse architecture

Where You’ll Work:

This is a fully remote position, and we’ll provide all the necessary equipment!
- Work Environment: You’ll need a quiet workspace that is free from distractions.
- Technology: Reliable internet connection; if you can use streaming services, you’re good to go!
- Security: Adherence to company security protocols, including the use of VPNs, secure passwords, and company-approved devices/software.
- Location: You must be US based, in a location where you can work effectively and comply with company policies such as HIPAA.

Why You'll Love Working Here

Valenz is proud to be recognized by Inc. 5000 as one of America’s fastest-growing private companies. Our team is committed to delivering on our promise to engage early and often for smarter, better, faster healthcare. With this commitment, you’ll find an engaged culture – one that stands strong, vigorous, and healthy in all we do.

Benefits
- Generously subsidized company-sponsored Medical, Dental, and Vision insurance, with access to services through our own products, Healthcare Blue Book and KISx Card.
- Spending account options: HSA, FSA, and DCFSA
- 401K with company match and immediate vesting
- Flexible working environment
- Generous Paid Time Off to include vacation, sick leave, and paid holidays
- Employee Assistance Program that includes professional counseling, referrals, and additional services
- Paid maternity and paternity leave
- Pet insurance
- Employee discounts on phone plans, car rentals and computers
- Community giveback opportunities, including paid time off for philanthropic endeavors

At Valenz, we celebrate, support, and thrive on inclusion, for the benefit of our associates, our partners, and our products. Valenz is committed to the principle of equal employment opportunity for all associates and to providing associates with a work environment free of discrimination and harassment. All employment decisions at Valenz are based on business needs, job requirements, and individual qualifications, without regard to race, color, religion or belief, national, social, or ethnic origin, sex (including pregnancy), age, physical, mental or sensory disability, HIV Status, sexual orientation, gender identity and/or expression, marital, civil union or domestic partnership status, past or present military service, family medical history or genetic information, family or parental status, or any other status protected by the laws or regulations in the locations where we operate. We will not tolerate discrimination or harassment based on any of these characteristics.


