Job Closed
This listing is no longer active.
We help people care for their home from top to bottom — and empower small businesses nationwide to grow.
Senior Data Engineer
Location
California + 5 moreAll locations: California | New Jersey | New York | Massachusetts | Texas | Washington
Posted
128 days ago
Salary
$179.4K - $232.1K / year
Seniority
Senior
Job Description
Senior Data Engineer
Thumbtack
• Collaboratively refine and evangelize a comprehensive framework for integrating data-thinking into the software development lifecycle for product teams • Design, architect, and maintain core marketplace datasets, data marts, and feature stores that support a blend of mature products and features with a rapidly evolving product line, in partnership with analytics, data science, and machine learning • Integrate with teams consisting of product engineers, analysts, data scientists, machine learning engineers throughout Thumbtack to understand their data needs, and help design datasets with the same engineering rigor as any other software we design • Drive data quality and best practices across different business areas • Help build the next generation data products at Thumbtack, based on real-time data products on top of Apache Kafka
Job Requirements
- 4 or more years of experience designing and building data sets and warehouses
- Excellent ability to understand the needs of and collaborate with stakeholders in other functions, especially Analytics, and identify opportunities for process improvements across teams
- Expertise in SQL for analytics/reporting/business intelligence and also for building SQL- and Python-based transforms inside an ETL pipeline, or similar
- Experience designing, architecting, and maintaining a data warehouse and data marts that seamlessly stitches together data from production databases, clickstream data, and external APIs to serve multiple stakeholders
- Expertise building the above with a modern data stack based on a cloud-native data warehouse, in our case we use BigQuery, dbt, and Apache Airflow, but a similar stack is fine
- Experience or strong interest in applying AI-enabled workflows to accelerate development velocity and improve data engineering practices
- Strong sense of ownership and pride in your work, from ideation and requirements-gathering to project completion and maintenance
Benefits
- Thumbtack embraces diversity
- Equal opportunity workplace
- No discrimination based on various characteristics
- Reasonable accommodation for disabilities
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Maintain and update static master data directly in Salesforce using Data Loader, Data Import Wizard, and other native tools. • Work with business data owners and SMEs to identify all required source data across various systems. • Analyse source structures and map fields, relationships, and reference data to Salesforce objects. • Lead data cleansing with stakeholders to remove duplicates, standardize values, populate missing mandatory fields, and correct inconsistent or deprecated values. • Transform and prepare data based on agreed mapping logic and cleansing rules. • Execute or support test data loads in SIT and UAT. • Monitor data quality through dashboards, reports, and exception logs; drive continuous improvement. • Handle routine BAU data requests, updates, extracts, and reconciliations.
• Lead the technical conversion of SAS datasets, programs, and logic to SQL Server • Analyze existing SAS workflows and translate them into efficient SQL Server–based solutions • Design, implement, and optimize SQL Server schemas, stored procedures, views, and queries • Own SQL Server administration, including: Performance tuning and indexing, Security and access controls, and b ackup, recovery, and reliability practices • Validate data accuracy and performance against existing SAS outputs • Collaborate with technical and business stakeholders to ensure a successful migration • Document architecture, migration patterns, and operational procedures
• Design and deploy scalable Google Cloud services in GCP. • Implement IAM access controls to ensure secure and compliant data environments. • Develop and implement robust data ingestion pipelines from diverse sources. • Develop and enforce data validation processes to maintain accuracy and reliability. • Enhance data quality and efficiency through continuous optimization. • Analyze raw data to uncover patterns, trends, and opportunities. • Produce clear design documentation and solution roadmaps. • Lead projects independently while collaborating effectively within larger teams. • Mentor and cross‑train junior and senior Data Engineers, sharing expertise on complex assignments. • Advance your skills through hands‑on experience and formal learning opportunities. • Support pre‑sales activities by providing accurate work estimates and technical input. • Partner closely with Project Management to deliver projects on time and within budget. • Travel occasionally as required for project or team needs.
• As a Senior Data Engineer at Orijin, you will be a technical leader responsible for building, scaling, and modernizing the company’s data platform. • Your primary focus will be on data modeling, pipelines, architecture, reliability, and performance, ensuring that data is trusted, timely, and production-ready. • As a member of the Enterprise Services team, you will partner closely with data analysts, engineers and product managers to shape how data is modeled and used. • You will bring an analytical mindset to pipeline design and enable high-quality insights across the organization. • You will ensure company-wide confidence in data quality and enable data-enabled differentiating products and services. • Design and evolve Orijin’s data architecture to support scalability, reliability, and near–real-time use cases. • Define standards for data modeling, orchestration, versioning, and deployment. • Own the design, build, and maintenance of production-grade data pipelines across batch and streaming workloads. • Architect and maintain data systems using tools such as AWS (S3, RDS, Redshift, Lambda, DMS, Glue etc.).



