Database & Infrastructure Engineer (Full Stack)

Data EngineerData EngineerOther Remote

Location

United States

Posted

127 days ago

Salary

$150K - $250K / year

PostgreSQL Python TypeScript Amazon S3 Deno ETL CI/CD

Job Description

ABOUT KLED Kled is building the largest opt-in human data network in the world. We are not a labeling firm. We are not a task marketplace. We are a consumer application where people upload their real photos, videos, and documents and get paid continuously. We then filter, standardize, and license that data to frontier AI labs and enterprises that need fresh, rights-aware training data. Since launching our mobile app in 2026, we have: • Reached #1 on the App Store (Finance) with 0 paid marketing • Scaled to 200,000+ active data contributors • Processed 1.5–3M uploads per day • Raised $5M+ from investors behind SpaceX, Airbnb, Coinbase, xAI, OpenAI, Anthropic, Spotify, Lyft, Uber, and more Our mission is to let anyone download the app and earn a real living wage from uploading their data. ABOUT THE ROLE Database & Infrastructure Engineer (Full-Stack Systems) We process millions of files per day and store hundreds of millions of media records. Your job is to make our data layer world-class. You will: • Optimize and scale our PostgreSQL (Supabase) infrastructure • Design indexing, partitioning, and query strategies for large-scale media datasets • Improve performance across ingestion, enrichment, and retrieval pipelines • Build internal tools for querying and auditing large datasets • Create customer-ready dataset sample packs • Design and automate dataset exports and delivery pipelines (S3, secure transfers, custom formats) • Work across backend, ML, and product teams to support new features This is not just DBA work. You’ll help design the systems that move and package the data powering frontier AI labs. WE’RE LOOKING FOR • Strong PostgreSQL expertise (indexing, partitioning, performance tuning) • Experience working with large datasets (100M+ records preferred) • Deep understanding of storage systems (S3 or similar object storage) • Strong backend experience (TypeScript, Python, or similar) • Comfort building internal tooling and automation scripts • Ability to move between database, backend, and infrastructure work Bonus: • Experience with data pipelines (ETL, transformation layers) • Experience with vector databases (pgvector, FAISS, Pinecone) • Experience delivering structured datasets to enterprise customers • DevOps experience (CI/CD, infra automation) • Experience working with media-heavy systems CURRENT STACK Backend • PostgreSQL (Supabase) — 188M+ media files • S3 storage • Deno / TypeScript edge functions • Python ML pipelines Frontend • SwiftUI (migrating to Flutter) COMPENSATION • Base salary: $150,000 - $250,000 • $150,000 – $350,000 equity • Benefits • Relocation support • SF HQ (SOMA) or remote We move fast and work hard (9–9 culture). If you're excited to build the world’s largest consumer app, let’s talk! GROWTH OPPORTUNITY You’ll join a team operating at the frontier of applied AI data infrastructure. We move fast and work 7 days a week. In this role, you’ll have the opportunity to: • Own core systems that power one of the largest human data networks in the world • Design infrastructure that directly influences what data trains next-generation AI models • Build at real scale - millions of uploads per day, adversarial environments, global contributors • Ship alongside a team that has built marketplaces, AI systems, and products used by millions If you’re excited to move fast, build systems that matter, and help define how human data powers frontier AI, let’s talk.

Related Categories

Data Engineer

Related Job Pages

Remote Python Jobs (US)More Remote Jobs

More Data Engineer Jobs

Staff Data Engineer

tvScientific

Performance TV Advertising Platform

Data Engineer127 days ago

Other RemoteTeam 51-200Since 2020H1B No Sponsor

Company Site LinkedIn

About tvScientific tvScientific is the first and only CTV advertising platform purpose-built for performance marketers. We leverage massive data and cutting-edge science to automate and optimize TV advertising to drive business outcomes. Our solution combines media buying, optimization, measurement, and attribution in one, efficient platform. Our platform is built by industry leaders with a long history in programmatic advertising, digital media, and ad verification who have now purpose-built a CTV performance platform advertisers can trust to grow their business. We are seeking a Staff Data Engineer to lead the design, implementation, and evolution of our identity services and data governance platform. This role is critical to ensuring trusted, privacy-safe, and well-governed data across the organization. You will work at the intersection of data engineering, identity resolution, privacy, and platform reliability. This is an individual contributor role, where you will work to define and implement a strategic vision for data engineering within the organization. What you'll do: - Identity Services: - Design and maintain a scalable identity resolution platform - Build pipelines and services to ingest, normalize, link, and version identity data across multiple sources - Ensure deterministic and probabilistic matching logic that is transparent, auditable, and measurable - Partner with product and analytics teams to expose identity data through reliable, well-documented APIs and datasets - Build and operate batch and streaming pipelines using modern data stack tools - Create clear documentation, standards, and runbooks for identity and governance systems - Data Governance & Trust - Own data governance foundations including data lineage, quality checks, schema enforcement, and access controls - Implement privacy-by-design principles (PII handling, consent enforcement, retention policies) - Collaborate with legal, privacy, and security teams to operationalize regulatory requirements (e.g., GDPR, CCPA) - Establish monitoring and alerting for data quality, freshness, and integrity What we're looking for: - Data engineering experience with proven track record building data infrastructure using Spark with Scala - Proven experience building data infrastructure using Spark with Scala for at least 5 years - Experience in delivering significant technical initiatives and building reliable, large scale services - Experience in delivering APIs backed by relationship-heavy datasets - Experience implementing data governance practices, including data quality, metadata management, and access controls - Strong understanding of privacy-by-design principles and handling of sensitive or regulated data - Familiarity with data lakes, cloud warehouses, and storage formats - Strong proficiency in AWS services - Successful design and implementation of scalable and efficient data infrastructure - High attention to detail in implementation of automated data quality checks - Effective collaboration with cross-functional teams - Excellent written and verbal communication skills - Bachelor's degree in Computer Science or a related field In-Office Requirement Statement: - We recognize that the ideal environment for work is situational and may differ across departments. What this looks like day-to-day can vary based on the needs of each organization or role. Relocation Statement: - This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model. #LI-SM4 #LI-REMOTE At Pinterest we believe the workplace should be equitable, inclusive, and inspiring for every employee. In an effort to provide greater transparency, we are sharing the base salary range for this position. The position is also eligible for equity. Final salary is based on a number of factors including location, travel, relevant prior experience, or particular skills and expertise. Information regarding the culture at Pinterest and benefits available for this position can be found here. US based applicants only $155,584—$320,320 USD

Scala Apache Spark AWS

View details: Staff Data Engineer

United States

$155K - $320K / year

Apply

Job Closed

Staff Data Engineer

tvScientific

Performance TV Advertising Platform

Data Engineer127 days ago

Other RemoteTeam 51-200Since 2020H1B No Sponsor

Company Site LinkedIn

About tvScientific tvScientific is the first and only CTV advertising platform purpose-built for performance marketers. We leverage massive data and cutting-edge science to automate and optimize TV advertising to drive business outcomes. Our solution combines media buying, optimization, measurement, and attribution in one, efficient platform. Our platform is built by industry leaders with a long history in programmatic advertising, digital media, and ad verification who have now purpose-built a CTV performance platform advertisers can trust to grow their business. As a Staff Data Engineer at tvScientific, you will be a key player in implementing the robust data infrastructure to power our data-heavy company. You will collaborate with our cross-functional teams to evolve our core data pipelines, design for efficiency as we scale, and store data in optimal engines and formats. This is an individual contributor role, where you will work to define and implement a strategic vision for data engineering within the organization. What you'll do: - Design and implement robust data infrastructure in AWS, using Spark with Scala - Evolve our core data pipelines to efficiently scale for our massive growth - Store data in optimal engines and formats, matching your designs to our performance needs and cost factors - Collaborate with our cross-functional teams to design data solutions that meet business needs - Design and implement knowledge graphs, exposing their functionality both via Batch Processing and APIs - Leverage and optimize AWS resources while designing for scale - Collaborate closely with our Data Science and Product teams - How we'll define success: - Successful design and implementation of scalable and efficient data infrastructure - Timely delivery and optimization of data assets and APIs - High attention to detail in implementation of automated data quality checks - Effective collaboration with cross-functional teams What we're looking for: - Production data engineering experience - Proficiency in Spark and Scala, with proven experience building data infrastructure in Spark using Scala - Experience in delivering significant technical initiatives and building reliable, large scale services - Experience in delivering APIs backed by relationship-heavy datasets - Familiarity with data lakes, cloud warehouses, and storage formats - Strong proficiency in AWS services - Expertise in SQL for data manipulation and extraction - Excellent written and verbal communication skills - Bachelor's degree in Computer Science or a related field - Nice-to-haves: - Experience in adtech - Experience implementing data governance practices, including data quality, metadata management, and access controls - Strong understanding of privacy-by-design principles and handling of sensitive or regulated data - Familiarity with data table formats like Apache Iceberg, Delta - Previous experience building out a Data Engineering function - Proven experience working closely with Data Science teams on machine learning pipelines In-Office Requirement Statement: - We recognize that the ideal environment for work is situational and may differ across departments. What this looks like day-to-day can vary based on the needs of each organization or role. Relocation Statement: - This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model. #LI-SM4 #LI-REMOTE At Pinterest we believe the workplace should be equitable, inclusive, and inspiring for every employee. In an effort to provide greater transparency, we are sharing the base salary range for this position. The position is also eligible for equity. Final salary is based on a number of factors including location, travel, relevant prior experience, or particular skills and expertise. Information regarding the culture at Pinterest and benefits available for this position can be found here. US based applicants only $155,584—$320,320 USD

Apache Spark Scala AWS SQL

View details: Staff Data Engineer

United States

$155K - $320K / year

Apply

Sr. Data Engineer

tvScientific

Performance TV Advertising Platform

Data Engineer127 days ago

Other RemoteTeam 51-200Since 2020H1B No Sponsor

Company Site LinkedIn

About tvScientific tvScientific is the first and only CTV advertising platform purpose-built for performance marketers. We leverage massive data and cutting-edge science to automate and optimize TV advertising to drive business outcomes. Our solution combines media buying, optimization, measurement, and attribution in one, efficient platform. Our platform is built by industry leaders with a long history in programmatic advertising, digital media, and ad verification who have now purpose-built a CTV performance platform advertisers can trust to grow their business. As a Senior Data Engineer at tvScientific, you will be a key player in implementing the robust data infrastructure to power our data-heavy company. You will collaborate with our cross-functional teams to evolve our core data pipelines, design for efficiency as we scale, and store data in optimal engines and formats. This is an individual contributor role, where you will work to define and implement a strategic vision for data engineering within the organization. What you'll do: - Implement robust data infrastructure in AWS, using Spark with Scala - Evolve our core data pipelines to efficiently scale for our massive growth - Store data in optimal engines and formats - Collaborate with our cross-functional teams to design data solutions that meet business needs - Built out fault-tolerant batch and streaming pipelines - Leverage and optimize AWS resources while designing for scale - Collaborate closely with our Data Science and Product teams - How we'll define success: - Successful implementation of scalable and efficient data infrastructure - Timely delivery and optimization of data assets and APIs - High attention to detail in implementation of automated data quality checks - Effective collaboration with cross-functional teams What we're looking for: - Production data engineering experience - Proficiency in Spark and Scala, with proven experience building data infrastructure in Spark using Scala - Familiarity with data lakes, cloud warehouses, and storage formats - Strong proficiency in AWS services - Expertise in SQL for data manipulation and extraction - Excellent written and verbal communication skills - Bachelor's degree in Computer Science or a related field - Nice-to-Haves - Experience in adtech - Experience implementing data governance practices, including data quality, metadata management, and access controls - Strong understanding of privacy-by-design principles and handling of sensitive or regulated data - Familiarity with data table formats like Apache Iceberg, Delta In-Office Requirement Statement: - We recognize that the ideal environment for work is situational and may differ across departments. What this looks like day-to-day can vary based on the needs of each organization or role. Relocation Statement: - This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model. #LI-SM4 #LI-REMOTE At Pinterest we believe the workplace should be equitable, inclusive, and inspiring for every employee. In an effort to provide greater transparency, we are sharing the base salary range for this position. The position is also eligible for equity. Final salary is based on a number of factors including location, travel, relevant prior experience, or particular skills and expertise. Information regarding the culture at Pinterest and benefits available for this position can be found here. US based applicants only $123,696—$254,667 USD

Apache Spark Scala AWS SQL

View details: Sr. Data Engineer

United States

$123K - $254K / year

Apply

Senior Staff Data Engineer

SonderMind

Your all-in-one wellness solution that connects you to affordable, quality mental healthcare

Data Engineer127 days ago

Other RemoteTeam 201-500Since 2014H1B No Sponsor

Company Site LinkedIn

About SonderMind At SonderMind, we believe everyone deserves one personalized, connected, and effective mental health destination to take care of their mental health and well-being at any stage of life. SonderMind care encompasses everything from therapy and medication management to meditation and mindfulness exercises. Our clinicians leverage our digital tools and research to deliver increasingly high-quality care and to develop thriving practices. Combining technology and human connection, SonderMind drives better outcomes through our comprehensive approach. Learn more about SonderMind at sondermind.com or download the mobile app, available on iOS and Android. To follow the latest SonderMind news, get to know our clients, and learn about what it’s like to work at SonderMind, you can follow us on Instagram, Linkedin, and Twitter. Additionally, we expect all team members to effectively leverage modern AI technologies as part of their everyday workflow, and to continuously adapt as new tools emerge. Familiarity with job-relevant AI platforms such as Gemini, ChatGPT, Claude, GitHub Copilot or other industry-standard AI productivity tools is expected and considered essential for success at this company. About the Role We are hiring a Senior Staff Data Engineer to play a senior technical leadership role on SonderMind’s Data Platform team. This role focuses on designing, building, and evolving the data systems that support analytics, experimentation, machine learning, and clinical outcomes across the business. As a Senior Staff Data Engineer you will operate with a high degree of ownership and autonomy. You will be responsible not only for delivering reliable data infrastructure, but also for shaping technical direction, establishing best practices, and enabling other teams to move faster with high-quality, trusted data—while operating within the constraints of a regulated healthcare environment. What you’ll do - Design, build, and maintain core data pipelines and infrastructure that power analytics, experimentation, machine learning and AI fine tuning and agentic use cases across product, clinical, and operations teams. - Own data architecture decisions related to ingestion, transformation, storage, and serving layers, with a focus on scalability, reliability, cost efficiency, and maintainability. - Establish and enforce data quality, observability, and reliability standards, including SLAs, monitoring, alerting, and incident response practices for critical datasets. - Partner closely with analytics, data science, and product engineering teams to understand data needs and translate them into well-designed, reusable data models and pipelines. - Lead technical initiatives and drive best practices across the data engineering team, including code quality, testing, documentation, and data contracts. - Ensure data systems meet privacy, security, and compliance requirements, with a strong understanding of handling sensitive mental health and healthcare data. - Mentor and support other data engineers, providing technical guidance, design feedback, and helping raise the overall engineering bar. - Support other responsibilities and ad-hoc projects as needed based on evolving business and platform needs. What does success look like? First 3–6 Months: - Develop a strong understanding of SonderMind’s data landscape, key business use cases, and regulatory constraints. - Take ownership of critical pipelines or platform components and begin making meaningful improvements to reliability and clarity. - Build trust with analytics, data science, and engineering partners through consistent delivery and thoughtful technical decisions. Continued Success in the Role: - Data pipelines are reliable, observable, and well understood, with fewer surprises and faster issue resolution. - Teams across the company are able to move faster because data is accessible, well modeled, and trusted. - Technical decisions scale well over time and reduce long-term maintenance and operational burden. - You influence platform direction beyond your own code by raising standards and helping others make better technical choices. How Performance Is Measured: - Reliability and quality of core data systems, including uptime, freshness, and accuracy. - Impact on team velocity and downstream consumers, including analytics, ML, AI, and product. - Technical leadership and influence across the data organization. - Alignment with SonderMind’s career competencies, including ownership, collaboration, and thoughtful problem-solving. Who You Are Qualifications: - 8+ years of experience in data engineering, platform engineering, or backend engineering roles. - Strong proficiency in SQL and at least one general-purpose programming language (Python strongly preferred). - Hands-on experience building and maintaining production data pipelines at scale. - Experience working with modern data platforms, including cloud data warehouses, orchestration tools, and transformation frameworks. - Strong understanding of data modeling, pipeline reliability, and system design trade-offs. - Proven ability to work cross-functionally and translate business needs into effective technical solutions. - Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience. Preferred (Not Required): - Experience in healthcare, mental health, or other regulated environments. - Familiarity with streaming or event-driven data systems. - Experience supporting machine learning or experimentation workflows. - Exposure to data governance, privacy, or compliance frameworks. Our Benefits The anticipated salary range for this role is $180,000 - $200,000. Final compensation will be determined based on a variety of factors, including relevant experience, skills, education, and past performance. As leaders in redesigning behavioral health, we walk the walk with our employees' benefits. We want the experience of working at SonderMind to accelerate people’s careers and enrich their lives, so we focus on meeting SonderMinders wherever they are and supporting them in all facets of their lives and work. Our benefits include: - A generous PTO policy, with a minimum of three weeks off per year - A holiday schedule that follows standard U.S. holidays - Free therapy coverage benefits to ensure employees have access to the care they need (must be enrolled in a qualifying medical plan to participate) - Competitive Medical, Dental, and Vision coverage, with plans to meet every need — including HSA (with $1,100 company contribution) and FSA options - Employer-paid short-term disability, long-term disability, life & AD&D, plus coverage of the salary difference for up to seven weeks of short-term disability leave (after the required waiting period) - Eight weeks of paid Parental Leave; if the parent also qualifies for STD, this benefit is in addition, allowing for 8–16 weeks of paid leave - 401(k) retirement plan with 100% match on up to 4% of base salary, immediately vested - Join teammates from across the country at our annual company gathering - Company shutdown between Christmas and New Year’s - Supplemental life insurance, pet insurance, commuter benefits, and more Application Deadline This position will be an ongoing recruitment process and will be open until filled. Equal Opportunity SonderMind does not discriminate in employment opportunities or practices based on race, color, creed, sex, gender, gender identity or expression, pregnancy, childbirth or related medical conditions, religion, veteran and military status, marital status, registered domestic partner status, age, national origin or ancestry, physical or mental disability, medical condition (including genetic information or characteristics), sexual orientation, or any other characteristic protected by applicable federal, state, or local laws.

SQL Python Observability / Monitoring ETL

View details: Senior Staff Data Engineer

United States

$180K - $200K / year

Apply