Job Closed
This listing is no longer active.
Leading Digital Infrastructure Services
Principal AI & Data Engineer
Location
United States
Posted
45 days ago
Salary
$150K - $200K / year
Seniority
Lead
Job Description
Principal AI & Data Engineer
US Signal
• Assess and rationalize the enterprise data warehouse; design and implement a governed semantic layer using dbt • Design and deploy production LLM applications on the org's data fabric — prompt engineering, model integration, shipped systems • Build text-to-SQL and natural language interfaces that let business users query operational data through conversational AI • Architect RAG pipelines end-to-end: vector store design, chunking strategy, embedding model selection, internal and customer-facing deployment • Engineer and orchestrate multi-agent systems for enterprise workflow automation, including framework selection and API integration
Job Requirements
- 8+ Years Experience applying machine learning and AI in production environments, with a demonstrated track record of shipping scalable, reliable systems beyond the prototype stage.
- Deep proficiency with LLM frameworks (e.g., LangChain, LlamaIndex), agentic architectures, and RAG pipeline design, including vector databases, embedding models, and retrieval optimization.
- Strong Python fluency with practical experience across the modern data stack, including data warehousing platforms, orchestration tooling, and semantic layer frameworks such as dbt.
- Proven ability to own and deliver end-to-end production systems independently, with evidence of technical leadership and accountability for architecture decisions in fast-paced environments.
Benefits
- Generous paid time off policy, including vacation and 10 paid holidays
- Competitive and comprehensive medical, dental, and vision benefits plans with Flexible Spending benefits including medical/dental expenses and dependent care
- 401(k) retirement plan with a generous contribution
- Group Term Life Insurance covered 100% by employer
- Wellness Incentive to promote overall employee well-being
- Paid volunteer time
- Business casual dress code
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Engenheiro de Dados Pl/Sr
GFT TechnologiesAs a pioneer for digital transformation GFT develops sustainable solutions across new technologies.
• Profissionais especializados em organizar, coletar e processar (ferramentas ETL) grandes volumes de dados, traduzindo os objetivos de negócios do cliente em uma estratégia de gerenciamento de informações e inteligência de negócios. • Também será responsável por manter os pipelines de dados e garantir a segurança no acesso as informações. • Sua atuação também pode abranger a criação e exposição de métricas (painéis) sobre os dados obtidos, garantindo assim qualidade, e integridade a estas informações.
Role Description The Data Architect / Data Engineering Lead provides technical leadership for data architecture, data engineering, database modernization, and AI/ML enablement across the NRCS IT ecosystem. This role is responsible for guiding the transformation of legacy data platforms into scalable, cloud-native architectures on AWS. The position works in close coordination with the Enterprise Lead Architect, Government Program Managers, and cross-functional delivery teams to execute data management, modernization, and operational sustainment activities under the OMNI contract. Qualifications - 7 years of progressive experience in data architecture, data engineering, and database administration across enterprise environments. - 5+ years of hands-on experience designing and deploying data solutions on AWS, including direct experience with S3, Glue, EMR/Spark, Lambda, Step Functions, DMS, RDS (PostgreSQL, Aurora), DynamoDB, OpenSearch, and Lake Formation. - Deep expertise in Microsoft SQL Server and PostgreSQL/PostGIS. - Proven experience building production data pipelines for batch, streaming, and geospatial workloads. - Strong proficiency in SQL/T-SQL, Python, and PySpark. - Demonstrated ability to design and implement enterprise data architectures including data warehouses, data lakes, lakehouses (Delta Lake), and service-layer integration patterns. - 3+ years of experience supporting federal IT programs, with familiarity with FISMA, NIST RMF, ATO processes, and federal change management requirements. - Experience with CI/CD pipelines, Git-based version control, Terraform or CloudFormation, Liquibase, and automated quality/security gates. - Experience working within SAFe Agile or equivalent iterative delivery frameworks, including backlog management in Jira. Requirements - Direct experience with USDA systems. - Experience with FPAC IT governance, the Technical Guidance Framework (TGF), and FPAC CI/CD pipeline standards. - Hands-on experience with AWS Bedrock, SageMaker, and Generative AI patterns (RAG, embeddings, natural-language-to-SQL, LangChain). - Experience with geospatial data engineering, including PostGIS, GeoPackage, ArcGIS WFS/WMS services, and spatial data pipelines. - Experience with AI-enabled legacy modernization platforms (e.g., Rhino.ai or equivalent). - Azure experience (Synapse, ADF, ADLS, Azure ML Studio, Databricks on Azure) as a complement to primary AWS focus. - Relevant certifications: AWS Solutions Architect, AWS Data Analytics Specialty, Azure Data Engineer Associate (DP–203), or equivalent. - Master’s degree in Computer Science, Data Science, or related field. Responsibilities - Define and maintain data architecture standards, patterns, and governance practices across all NRCS systems. - Lead conceptual and logical decomposition of monolithic database structures into domain-aligned, modular schemas. - Architect service-layer data access patterns to replace direct cross-database queries. - Design and maintain data models for enterprise soil data systems. - Align supported systems with USDA’s cloud-native Lakehouse Data Strategy. - Register and maintain schemas, interfaces, and metadata in AWS DataZone. - Design, build, and maintain end-to-end data engineering pipelines using AWS-native services. - Modernize legacy SSIS-based ETL/ELT pipelines to cloud-native equivalents. - Build and operate AWS DMS full-load and CDC pipelines. - Implement Delta Lake standards and performance tuning across ingestion frameworks. - Develop serverless orchestration workflows using AWS services. - Implement data quality controls and maintain audit-ready evidence of data management activities. - Provide senior-level DBA support for SQL Server clusters and PostgreSQL/PostGIS. - Lead database schema versioning and deployment automation. - Execute database modernization activities including re-platforming from on-premises SQL Server to AWS RDS/Aurora. - Design and implement AI/ML and Generative AI solutions using AWS services. - Support AWS migration from DISC data centers. - Maintain audit-ready documentation for all data architecture decisions and configurations. - Conduct architecture reviews and deliver knowledge transfer sessions. Benefits - Competitive compensation and benefits packages including paid vacation, medical, dental, vision, matching 401K plan, tuition/training reimbursement, and Long & Short-Term Disability.
• Deliver a scalable platform with governed data to fuel analytics, AI, and data-driven insights. • Collaborate closely with Analytics Engineers to provide data & data models. • Build & maintain python & SQL based platform automation process. • Help drive technical & architectural decisions on the data platform.
Staff Data Engineer, tvScientific
Anne Arundel DermatologyLeading dermatology provider in the Mid-Atlantic and Southeastern regions.
About Pinterest: Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we’re on a mission to bring everyone the inspiration to create a life they love, and that starts with the people behind the product. Discover a career where you ignite innovation for millions, transform passion into growth opportunities, celebrate each other’s unique experiences and embrace the flexibility to do your best work. Creating a career you love? It’s Possible. At Pinterest, AI isn't just a feature, it's a powerful partner that augments our creativity and amplifies our impact, and we’re looking for candidates who are excited to be a part of that. To get a complete picture of your experience and abilities, we’ll explore your foundational skills and how you collaborate with AI. Through our interview process, what matters most is that you can always explain your approach, showing us not just what you know, but how you think. You can read more about our AI interview philosophy and how we use AI in our recruiting process here. About tvScientific tvScientific is the first and only CTV advertising platform purpose-built for performance marketers. We leverage massive data and cutting-edge science to automate and optimize TV advertising to drive business outcomes. Our solution combines media buying, optimization, measurement, and attribution in one, efficient platform. Our platform is built by industry leaders with a long history in programmatic advertising, digital media, and ad verification who have now purpose-built a CTV performance platform advertisers can trust to grow their business. We are seeking a Staff Data Engineer to lead the design, implementation, and evolution of our identity services and data governance platform. This role is critical to ensuring trusted, privacy-safe, and well-governed data across the organization. You will work at the intersection of data engineering, identity resolution, privacy, and platform reliability. This is an individual contributor role, where you will work to define and implement a strategic vision for data engineering within the organization. What you'll do: - Design and maintain a scalable identity resolution platform - Build pipelines and services to ingest, normalize, link, and version identity data across multiple sources - Ensure deterministic and probabilistic matching logic that is transparent, auditable, and measurable - Partner with product and analytics teams to expose identity data through reliable, well-documented APIs and datasets - Build and operate batch and streaming pipelines using modern data stack tools - Create clear documentation, standards, and runbooks for identity and governance systems - Own data governance foundations including data lineage, quality checks, schema enforcement, and access controls - Implement privacy-by-design principles (PII handling, consent enforcement, retention policies) - Collaborate with legal, privacy, and security teams to operationalize regulatory requirements (e.g., GDPR, CCPA) - Establish monitoring and alerting for data quality, freshness, and integrity What we're looking for: - Production data engineering experience - Bachelor’s degree in computer science, related field or equivalent experience - Proficiency in Spark and Scala, with proven experience building data infrastructure in Spark using Scala - Experience in delivering significant technical initiatives and building reliable, large scale services - Experience in delivering APIs backed by relationship-heavy datasets - Experience implementing data governance practices, including data quality, metadata management, and access controls - Strong understanding of privacy-by-design principles and handling of sensitive or regulated data - Familiarity with data lakes, cloud warehouses, and storage formats - Strong proficiency in AWS services - Excellent written and verbal communication skills - Successful design and implementation of scalable and efficient data infrastructure - High attention to detail in implementation of automated data quality checks - Effective collaboration with cross-functional teams - Demonstrated ability to use AI to improve speed and quality in your day-to-day workflow for relevant outputs - Strong track record of critical evaluation and verification of AI-assisted work (e.g., testing, source-checking, data validation, peer review) - High integrity and ownership: you protect sensitive data, avoid over-reliance on AI, and remain accountable for final decisions and deliverables In-Office Requirement Statement: - We recognize that the ideal environment for work is situational and may differ across departments. What this looks like day-to-day can vary based on the needs of each organization or role. Relocation Statement: - This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model. #LI-SM4 #LI-REMOTE At Pinterest we believe the workplace should be equitable, inclusive, and inspiring for every employee. In an effort to provide greater transparency, we are sharing the base salary range for this position. The position is also eligible for equity. Final salary is based on a number of factors including location, travel, relevant prior experience, or particular skills and expertise. Information regarding the culture at Pinterest and benefits available for this position can be found here. US based applicants only $177,185—$364,795 USD Our Commitment to Inclusion: Pinterest is an equal opportunity employer and makes employment decisions on the basis of merit. We want to have the best qualified people in every job. All qualified applicants will receive consideration for employment without regard to race, color, ancestry, national origin, religion or religious creed, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, age, marital status, status as a protected veteran, physical or mental disability, medical condition, genetic information or characteristics (or those of a family member) or any other consideration made unlawful by applicable federal, state or local laws. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you require a medical or religious accommodation during the job application process, please complete this form for support.



