Job Closed

This listing is no longer active.

Kin logo
Kin

Social change through culture.

Senior Data Engineer

Location

United States

Posted

63 days ago

Salary

$125K - $153K / year

Seniority

Senior

Job Description

Senior Data Engineer

Kin

• Design and build scalable, production-grade data pipelines and data models to power downstream analytics and enterprise reporting • Lead the migration from a Redshift/DBT warehouse architecture to a modern lakehouse architecture (e.g., S3, Glue, Databricks, Unity Catalog) • Implement and enforce data validation standards, QA standards, and effective data lifecycle management • Optimize pipeline performance, cost, and data quality in a large-scale cloud data environment • Ensure compliance with data security and privacy regulations (e.g., GDPR, CCPA, GLBA) through robust pipeline design, access controls, and monitoring • Collaborate cross-functionally with App Engineering, BI, and business teams to translate ambiguous requirements into scalable, well-modeled datasets • Mentor data engineers and promote best practices in software engineering, documentation, and distributed data processing • Leverage AI-assisted development tools where appropriate to improve engineering efficiency, code quality, and observability

Job Requirements

  • 4+ years of hands-on data engineering experience building and maintaining production ETL/ELT pipelines
  • Experience designing data architectures in cloud environments (AWS strongly preferred; Azure or GCP acceptable)
  • Expertise in distributed data processing frameworks (e.g., Apache Spark, Kafka, Hadoop, or similar)
  • Proficiency in Python (Pandas, NumPy, etc.) and advanced SQL for large-scale data transformation and querying
  • Experience with lake and lakehouse patterns, including open table formats (e.g., Iceberg, Hudi, Delta)
  • Experience tuning ETL performance and optimizing infrastructure costs in large-scale environments handling terabytes of data
  • Proven ability to model raw data into well-structured, analytics-ready datasets
  • Clear written and verbal communication skills, with the ability to explain complex technical concepts to non-technical stakeholders

Benefits

  • Competitive salary and company equity through Restricted Stock Units (RSUs), granted as part of our standard compensation package and based on role and level
  • 401(k) with company match up to 4% of eligible earnings
  • Multiple medical plan options, plus dental and vision coverage
  • Company-funded HSA contributions (based on medical plan selection)
  • Company-paid life insurance and short-term disability
  • A variety of supplemental benefit options, including long-term disability, critical illness, accident, legal, and pet insurance
  • Access to mental health support and confidential counseling resources
  • Flexible PTO for exempt employees (most employees take 15–20 days per year), plus 8 company-observed holidays
  • Paid parental leave, including up to 14 weeks at 100% pay for birthing parents and 8 weeks at 100% pay for non-birthing parents
  • Career mobility and internal growth opportunities across the organization
  • Professional development budgets for certifications, conferences, and learning available, subject to management approval

Related Categories

Related Job Pages

More Data Engineer Jobs

Full TimeRemoteTeam 1,001-5,000Since 1946H1B Sponsor

AIR’s Employment & Economic Opportunity Program is seeking a Education and Workforce Data Strategist to integrate applied research, data science, and capacity-building for public sector change management. The position will work with internal teams and external partners to support the effective use of education and workforce administrative data for decision-making, continuous improvement, and policy development within and across states. The ideal candidate brings experience working with complex datasets and is equally comfortable conducting analyses and guiding others in how to interpret and use data in applied settings. This position has the flexibility to work remotely within the United States (U.S.) (not including U.S. territories) or from one of AIR’s U.S. office locations but requires availability to participate in meetings across all continental U.S. time zones. About AIR: Founded in 1946 and headquartered in Arlington, Virginia, the American Institutes for Research (AIR) is a nonpartisan, not-for-profit organization that conducts behavioral and social science research and delivers technical assistance to address some of the most pressing challenges in the United States and globally. We generate evidence and apply data-driven solutions that expand opportunities and improve lives for all. Responsibilities: - Conduct quantitative analyses using administrative and survey data to support research, evaluation, and technical assistance efforts. - Contribute to the design of analytic approaches, including descriptive, longitudinal, and quasi-experimental methods. - Provide coaching and technical assistance to state and local clients and partners on the use of data for decision making. - Support clients in refining research questions, interpreting findings, and applying results to practice. - Design and facilitate workshops, trainings, or office hours to strengthen data use capacity among government and community-based organizations. - Prepare, clean, validate, and document complex administrative datasets. - Support data integration and linkage efforts across education and workforce systems within and across states. - Translate analytic findings into clear, actionable insights for non-technical audiences. - Contribute to written deliverables, including reports, briefs, presentations, and data visualizations. - Work collaboratively with multidisciplinary teams, including researchers, data scientists, and project staff. - Support multiple projects simultaneously, contributing to deliverables and timelines. - Apply and model best practices in reproducible research, including code documentation and version control. - Ensure data quality and integrity across all phases of analytic work. Qualifications: Education, Knowledge, and Experience - PhD in related subject area (Economics, Statistics, Public Policy, Public Health, Human Development, Political Science, Psychology, Sociology, or related fields), or a Master’s degree with a minimum of 4 years of relevant applied research experience. - Experience working on research or evaluation projects for federal, state, or local agencies or other public or nonprofit entities. - Demonstrated experience with research design, data analysis, and reporting. - Experience with data collection activities from the field using a variety of methodologies and data collection techniques such as interviews, focus groups, observations, and/or survey research. - Familiarity with federal contracting and compliance standards is preferred, but not required. Skills - Demonstrated ability to conduct quantitative analyses and support others in analytic thinking and data use. - Proficiency in one or more statistical/programming languages (e.g., R, Python). - Experience working with administrative or operational datasets, including data cleaning, validation, and documentation and experience linking or integrating datasets across systems or agencies. - Experience using AI tools to synthesize information, draft content, analyze data, or improve efficiency in day-to-day work and ability to apply sound judgment when interpreting AI-generated outputs. - Demonstrated ability to integrate emerging and nontraditional data sources and clear understanding of the methodological limitations and analytic opportunities associated with high-frequency data. - Strong understanding of research design and analytic methods (e.g., descriptive analysis, longitudinal analysis, or causal inference approaches). - Ability to communicate complex technical information clearly to non-technical audiences. - Experience working collaboratively in team-based environments and managing multiple tasks. - Experience training, mentoring, or coaching others in data use, analysis, or interpretation. - Familiarity with data systems and infrastructure, including SQL, databases, or cloud-based environments. - Knowledge of privacy, confidentiality, and restricted-use data environments. - Experience with data visualization tools (e.g., Tableau, Power BI, Shiny). - Background in policy-relevant domains of education and workforce. Disclosures: Applicants must be currently authorized to work in the U.S. on a full-time basis. Employment-based visa sponsorship (including H-1B sponsorship) is not available for this position. Depending on project work, qualified candidates may need to meet certain residency requirements. American Institutes for Research is an equal employment opportunity/affirmative action employer. All qualified applicants will receive consideration for employment without discrimination on the basis of age, race, color, religion, sex, gender, gender identity/expression, sexual orientation, national origin, protected veteran status, or disability. AIR adheres to strict child safeguarding principles. All selected candidates will be expected to adhere to these standards and principles and will therefore undergo reference and background checks. AIR maintains a drug-free work environment. ACCESSIBILITY NOTICE: If you need a reasonable accommodation for any part of the employment process due to a physical or mental disability, please send an email to Taliba Boone at tboone@air.orgor call 202.403.5000. Fraudulent Job Scams Warning & Disclaimer: AIR is aware of individuals falsely presenting themselves as AIR representatives. Fraudulent job scams seek to extract sensitive information or money from victims. To protect yourself, please be aware that AIR recruitment will only email you from an “@air.org” domain. Please take extra caution while examining the email address, for example jdoe@air.org is correct and jdoe@aircareers.org is not a legitimate AIR email address. If you are unsure of the legitimacy of a communication you have received, please reach out torecruitment@air.org. If you see a job scam, or lose money to one, report it to the Federal Trade Commission (FTC) atReportFraud.ftc.gov. You can also report it to your state attorney general. Find out more about how to avoid scams atftc.gov/scams. #LI-MP1 #LI-Remote AIR’s Total Rewards Program, is designed to reward our staff competitively and motivate them to achieve our critical mission. This position offers the anticipated annual salary as listed. Salary offers are made based on internal equity within the institution and external equity with competitive markets. Please note this is the annual salary range for candidates that are based in the United States. Anticipated Annual Salary Range $92,700—$123,600 USD

United States
$92.7K - $123K / year
Novartis logo

Senior Expert Data Science

Novartis

Novartis is a leading global pharmaceutical and healthcare research and solutions company dedicated to improving patient lives by uncovering solutions to curren

Data Engineer63 days ago

Title: Expert/Senior Expert Data Science Location: East Hanover, NJ, Cambridge, MA, or Durham, NC Job Description: LI# Hybrid This position will be located at the East Hanover, NJ, Durham NC, or Cambridge, MA site and will not have the ability to be located remotely. We are seeking a talented and motivated Data Scientist to join the Data and Statistical Sciences team within the Novartis Cell and Gene Therapy Scientific Office. The Data and Statistical Sciences team advances Novartis’s goal of bringing safe and efficacious cell and gene therapies to market through supporting development scientists with data infrastructure, analytics, and a vision for the future of digital-enabled therapeutic development. The successful candidate will play a key role in establishing data flows and connections from laboratory systems to data science platforms, advancing our understanding of bioprocesses, and contributing to the optimization and improvement of our development processes. Responsibilities: - Lead efforts to establish data flows and connections from laboratory systems to our data science platform. - Drive cross-functional collaboration across various business domains, including IT, engineering, laboratory teams, to identify data sources and develop data extraction methods that ensure data integrity and security. - Quickly learn the use of tools, data sources, and analytical techniques needed to answer a wide range of critical business and scientific questions. - Design and implement data pipelines to ingest, clean, and transform raw data for analysis. - Leverage statistical and machine learning techniques to analyze integrated data sets, both structured and unstructured, extracting valuable insights into biopharmaceutical manufacturing processes. - Communicate analytical findings to business users through compelling business presentations, interactive visualization tools, and contextual storytelling techniques, ensuring a clear and impactful understanding of the derived insights from data analysis. - Develop and maintain data models and schemas to support ongoing data analysis and reporting needs. - Be adaptable in providing support for a wide array of assessments, including process comparability, when necessary. - Stay abreast of advancements in data technologies and best practices, recommending and implementing improvements to data infrastructure, models, and techniques as needed. - Actively participate in data projects, defining scope, timelines, and resource allocation, ensuring successful execution. Collaborate proactively with team members to plan, anticipate change, manage stakeholders, identify risks, and resolve issues. Essential Requirements: This is a dual posting. The final level & title of the offer role would be determined by the hiring team based on the skills, experience & capabilities required to perform the role at the level the role has been offered (Expert Data Science OR Senior Expert Data Science): Expert Level: A minimum of a Bachelor's Degree with 4 years / Masters Degree with 2 years / Recent PhD. A combination of relevant industry experience and education will also be considered. Training and experience demonstrating proficiency in data management will be valued. Senior Expert Level: A minimum of Bachelor's Degree with 7 years / Masters Degree with 5 years/ PhD with 2 years of experience or dedicated postdoc / graduate training in the field of bioprocess or data science. A combination of relevant industry experience and education will also be considered. Training and experience demonstrating proficiency in data management will be valued. - Proficiency in programming languages such as Python or R. - Strong understanding of modeling and statistical concepts. - Excellent teamwork and communication skills, with a collaborative and proactive approach to problem-solving. - Strong analytical and problem-solving skills, with the ability to interpret and communicate findings effectively to both technical and non-technical stakeholders. Desired Requirements: - Knowledge of bioprocess engineering and biopharmaceutical manufacturing, with optional experience in cell or gene therapy (lentiviral or adenoviral technology). - Experience with app development platforms like Red Hat OpenShift or Posit Connect is preferred. - Familiarity with web development technologies including JSON, CSS, and HTML is preferred. - Experience with tools and platforms commonly used in the biopharmaceutical industry (e.g., JMP/JSL/SAS, SIMCA, AVEVA PI) is strongly preferred. The salary for this position is expected to range as follows: Expert Data Science: $103,600 and $192,400 per year Senior Expert Data Science: $119,700 and $222,300 per year The final salary offered is determined based on factors like, but not limited to, relevant skills and experience, and upon joining Novartis will be reviewed periodically. Novartis may change the published salary range based on company and market factors. Your compensation will include a performance-based cash incentive and, depending on the level of the role, eligibility to be considered for annual equity awards. US-based eligible employees will receive a comprehensive benefits package that includes health, life and disability benefits, a 401(k) with company contribution and match, and a variety of other benefits. In addition, employees are eligible for a generous time off package including vacation, personal days, holidays and other leaves. EEO Statement: The Novartis Group of Companies are Equal Opportunity Employers. We do not discriminate in recruitment, hiring, training, promotion or other employment practices for reasons of race, color, religion, sex, national origin, age, sexual orientation, gender identity or expression, marital or veteran status, disability, or any other legally protected status. Accessibility and reasonable accommodations The Novartis Group of Companies are committed to working with and providing reasonable accommodation to individuals with disabilities. If, because of a medical condition or disability, you need a reasonable accommodation for any part of the application process, or to perform the essential functions of a position, please send an e-mail to us.reasonableaccommodations@novartis.com or call +1(877)395-2339 and let us know the nature of your request and your contact information. Please include the job requisition number in your message. Salary Range $103,600.00 - $192,400.00 Skills Desired Clinical Trials, Computer Programming, Data Analysis, Programming Languages, Reporting, Statistical Analysis

New Jersey + 2 moreAll locations: New Jersey | Massachusetts | North Carolina
$103.6K - $192.4K / year
Full TimeRemoteTeam 501-1,000H1B No Sponsor

Role Description Aretum is seeking a skilled and highly motivated Data Engineer. As a Data Engineer, you will build and manage all data ingestion, transformation, reconciliation, and analytics pipelines. Due to the nature of our work as a federal consulting organization, employees may be expected to handle Controlled Unclassified Information (CUI) and must adhere to applicable safeguarding and compliance requirements. Responsibilities - Ingest data from FHIR APIs, CDW, and other VA sources - Normalize and reconcile medication and patient data - Build transformation pipelines for risk scoring inputs - Support batch and near-real-time processing - Ensure data quality, consistency, and traceability - Programming: Python (primary), SQL (advanced), optional Scala - Data Processing Frameworks: Apache Spark, AWS EMR, Databricks (preferred) - ETL/ELT Design: Pipeline orchestration, incremental vs full loads, data validation - API Integration: REST APIs, JSON parsing, pagination, authentication (OAuth2) - FHIR Data Handling: Patient, MedicationRequest, Observation, etc. - Data Modeling: Relational and semi-structured schema design - Data Quality & Validation: Deduplication, reconciliation logic, anomaly detection - Streaming vs Batch Processing: Understanding tradeoffs and implementation patterns - Storage Technologies: S3, relational DBs, NoSQL basics - Performance Optimization: Partitioning, parallelization, query tuning - Versioning & Lineage: Data version control, reproducibility of datasets Travel Requirements This is a remote position; however, occasional travel may be required based on project needs, client meetings, team collaboration events, or training sessions. Travel is expected to be less than 10% and will be communicated in advance whenever possible. Requirements - Due to federal contract requirements, only U.S. citizens are eligible for this position. - This position supports a federal government contract and requires the ability to obtain and maintain a Public Trust or Suitability Determination, depending on the agency’s background investigation requirements. Benefits - Health Care Plan (Medical, Dental & Vision) - Retirement Plan (401k) - Life Insurance (Basic, Voluntary & AD&D) - Paid Time Off - Family Leave (Maternity, Paternity) - Short Term & Long-Term Disability - Training & Development Company Description Aretum is a mission-driven organization committed to delivering innovative, technology-enabled solutions to our customers across defense, civilian, and homeland security sectors. Our teams work at the intersection of strategy, technology, and transformation, helping agencies solve their most critical challenges. We believe in investing in our people and creating a culture where collaboration, inclusion, and professional growth are at the forefront.

United States
Trend Health Partners logo

Data Engineer I (US Remote)

Trend Health Partners

An independent, tech-enabled payment integrity company.

Data Engineer63 days ago
Full TimeRemoteTeam 201-500Since 2018H1B No Sponsor

TREND Health Partners is a tech-enabled payment integrity company. Our mission is to facilitate collaboration between payers and providers for mutual benefit and waste reduction, ultimately improving access to healthcare. We achieve this by aligning the common goals of payers and providers and fostering collaboration through a shared technology platform and seamless workflows. Joining TREND Health Partners means becoming part of a dynamic, growing organization that promotes a collaborative and innovative work environment. Our comprehensive compensation package includes competitive salaries, highly valued health insurance, a 401(k) plan with employer match, paid parental leave, and more. The primary responsibility of the Data Engineer I is supporting the design, development, and maintenance of data pipelines and workflows. This position assists in data integration and transformation processes using tools like Python, T-SQL, SSMS, and Databricks. They collaborate with other Data Engineers to understand technical aspects and architecture design, work with large and complex healthcare data sets, help in translating business requirements into technical specifications, participate in code reviews, and contribute to team knowledge sharing. They also assist in developing and maintaining documentation for data engineering processes and systems. Role and Responsibilities - Support the design, development, and maintenance of data pipelines and workflows - Assist in the data integration and transformation processes using Python, T-SQL, SSMS, and Databricks - Collaborate with other Data Engineers to understand the technical aspects of the stack and the architecture design - Work with large and complex healthcare data sets to identify business needs and provide data-driven solutions - Help in translating business requirements into technical specifications - Participate in code reviews and contribute to team knowledge sharing - Assist in the development and maintenance of documentation for data engineering processes and systems - Create ad-hoc reports requested by internal and external partners - Work as a team to support and troubleshoot errors for 100+ data pipelines - Work closely with the application development team to update data and data structures - Provide analysis, recommendations, and feedback to business process owners, leadership team, and the Information Technology department - Propose automated solutions to repeated development tasks Qualifications - Bachelor’s degree with major coursework in Computer Science or Business Administration/related field required. Equivalent work experience in a similar position may be substituted for educational requirements - Knowledge of Python, T-SQL, schema design and relational databases - Familiarity with data integration, transformation, and data warehousing concepts - Proficiency in compiling data, creating reports, and presenting information - Ability to effectively prioritize tasks while working on multiple concurrent projects - Strong written, verbal, and customer service skills - Attention to detail - Active learning mindset, understanding the implications of new information for both current and future problem-solving and decision-making - Skilled in critical thinking, using logic and reasoning to identify the strengths and weaknesses of alternative solutions, conclusions, or approaches to problems - The ability to come up with unusual or clever ideas about a given topic or situation, or to develop creative ways to solve a problem Preferred Skills - Some experience working with data in a professional setting - Understanding of data privacy and security regulations - Strong problem-solving skills and attention to detail - Experience with healthcare data - Experience using Agile framework - Experience with software development life cycle tools such as Red-Gate, Bitbucket, or JIRA Mental and physical demands - This position will be exposed mainly to an indoor office environment and will be expected to work in or around computers and printers - The nature of the work is sedentary, and the employee will be sitting most of the time - Essential physical functions of the job include typing and the repetitive motion to utilize computer software and hardware continuously throughout the day - Essential mental functions of this position include concentrating on analytical tasks, reading information, and verbal/written communication to others continuously throughout the day Related duties as assigned - This job description documents the general nature and level of work but is not intended to be a comprehensive list of activities, duties, or responsibilities required for this position. Consequently, employees may be asked to perform other duties as required - Employees may also be asked to complete certain compliance requirements set forth by our Business Partners in the performance of their jobs including but not limited to requests for background and drug screenings and disclosures of personal health information or personally identifiable information - Exemptions as provided under the ADA and TITLE VII of the Civil Rights Act will be observed and followed - Reasonable accommodations may be made to enable individuals with disabilities to perform the functions outlined above $90,000 - $110,000 a year

United States
$90K - $110K / year
Job Closed