Data Engineer Remote Jobs in Vermont (US)
This page tracks remote data engineer openings that are location-eligible for Vermont.
This page tracks remote data engineer openings that are location-eligible for Vermont.
Open jobs
2,686
Hiring companies this week
8
Salary sample
$26 - $148,750
Jobs added last hour
0
2686 Jobs
1651 Companies
The Prostate Cancer Clinical Trials Consortium (PCCTC) was initiated in 2005 by the Prostate Cancer Foundation (PCF) and the U.S. Department of Defense (DOD) Prostate Cancer Research Program (PCRP) in response to critically unmet needs in prostate cancer clinical research. Established as an independent entity in 2014, the PCCTC, LLC is now the nation’s premier multicenter clinical research organization specializing in cutting-edge prostate cancer research.
Role Description Join the Prostate Cancer Clinical Trials Consortium as a Data Engineer! The Prostate Cancer Clinical Trials Consortium (PCCTC) is seeking a Data Engineer to join a collaborative team of clinical investigators and PCCTC staff working together on a single mission: to design, implement, and complete clinical trials and observational studies in prostate cancer. - Implement and maintain relational database structures for clinical trial data storage in AWS S3, using tools such as DuckDB and/or DuckLake. - Build and maintain ETL pipelines that ingest data from clinical trial data systems (e.g. EDCs), transforming raw clinical data into organized, versioned, analysis-ready datasets. - Develop access layers (database connectors, internal R packages or utilities) that enable our R-focused Data Science Team to query and retrieve data efficiently. - Implement and maintain access management and permissioning structures across data systems, including SharePoint and Airtable. - Maintain data governance standards, including naming conventions, versioning, and documentation, across our active trial portfolio. - Collaborate with Clinical Operations and Data Management teams to understand data flows from sites and ensure upstream processes align with downstream analytic needs. - Use GitHub Enterprise for version control and contribute to CI/CD workflows for pipeline automation where infrastructure allows. Additionally, we wanted to share a few other tools and concepts that we work with as a team: - CDISC data standards (SDTM, ADaM) and how clinical trial data is structured. - DuckDB, DuckLake, or similar analytical database technologies. - R (you won't need to be an R programmer, but understanding how R users consume data will make you more effective). - Airtable structure and maintenance. - SharePoint administration and file system permissioning at scale. - CI/CD for orchestrating automated data pipelines. - Clinical trial data lifecycle, from EDC capture through analysis-ready datasets. Qualifications - An undergraduate degree, preferably in computer science, data engineering, information systems, or a related field. - 2–4 years of experience building or maintaining data pipelines, ETL processes, or database systems. - Working knowledge of SQL and relational database concepts. - Familiarity with cloud storage (AWS S3 preferred) and infrastructure-as-code principles. - Experience with access management, permissioning, or user administration across collaborative platforms. - Exposure to version control (git/GitHub). - Passion for data and creating reliable systems that empower cancer care and clinical research. Requirements - Strong problem-solving and analytical thinking skills with the ability to troubleshoot complex data and system issues. - Excellent collaboration and communication skills, with the ability to work effectively across technical and non-technical teams. - Highly organized with strong attention to detail and a commitment to data accuracy, quality, and documentation. - Ability to manage multiple priorities in a fast-paced environment while meeting deadlines. - Proactive, adaptable, and eager to learn new technologies and contribute to continuous process improvement. - Self-motivated and able to work independently in a fully remote environment while remaining an engaged team member. Benefits - Location: Remote - Reporting to the Director, Data Science - Pay Range: $92,700.00 - $148,400.00 - FSLA Status: Exempt Company Description The Prostate Cancer Clinical Trials Consortium (PCCTC) was initiated in 2005 by the Prostate Cancer Foundation (PCF) and the U.S. Department of Defense (DOD) Prostate Cancer Research Program (PCRP) in response to critically unmet needs in prostate cancer clinical research. Established as an independent entity in 2014, the PCCTC, LLC is now the nation’s premier multicenter clinical research organization specializing in cutting-edge prostate cancer research.
We help mission-focused heroes solve the world’s biggest software challenges.
• Own the design, delivery, and production quality of platform capabilities end to end — from initial architecture through deployment and observability • Build and maintain open source infrastructure packages for airgap and cloud-native environments • Write comprehensive tests at every level — unit, integration, and end-to-end — and hold the rest of the team to the same standard • Work directly with product and customers to translate mission problems into platform capabilities • Develop technical documentation including design specifications, ADRs, and runbooks • Contribute to relevant open source communities and represent Defense Unicorns as a technical peer • Provide technical mentorship and elevate engineering standards across the team • Operate effectively in an asynchronous, fully remote environment
Our higher education SIS, ERP, cloud, and analytics solutions drive digital transformation and enable student success.
• Design, build, and maintain scalable ETL/ELT pipelines using Databricks and Spark • Design and maintain processes to integrate and transform source data into curated, analytics-ready data layers • Collaborate with product, analytics, and engineering teams to translate business needs into data solutions • Maintain data documentation including data dictionaries and lineage • Work with SQL Server, including querying and managing data • Leverage AI-enabled tools and analytics to enhance decision-making, improve efficiency, and drive measurable business outcomes when appropriate. • Apply sound judgment and ethical standards when using AI, ensuring accuracy, data privacy, and responsible human-in-the-loop oversight. • Other duties as assigned
Our higher education SIS, ERP, cloud, and analytics solutions drive digital transformation and enable student success.
Role Description The Data Engineer V leads the design, development, and optimization of scalable data pipelines and analytics platforms. This role partners closely with product, analytics, and engineering teams to deliver high-quality, reliable data solutions that drive business insights. - Design, build, and maintain scalable ETL/ELT pipelines using Databricks and Spark - Design and maintain processes to integrate and transform source data into curated, analytics-ready data layers - Collaborate with product, analytics, and engineering teams to translate business needs into data solutions - Maintain data documentation including data dictionaries and lineage - Work with SQL Server, including querying and managing data - Leverage AI-enabled tools and analytics to enhance decision-making, improve efficiency, and drive measurable business outcomes when appropriate - Apply sound judgment and ethical standards when using AI, ensuring accuracy, data privacy, and responsible human-in-the-loop oversight - Other duties as assigned Qualifications - Proficient programming / scripting in Python, SQL - Proficiency in cloud data platforms (Azure preferred) - Experience building data pipelines that ingest, transform, and load source data into curated data models - Experience with data pipeline orchestration and workflow tools - Strong working knowledge of Git-based version control workflows - Strong proficiency in SQL Server including query optimization and performance tuning - Demonstrated ability to scope and deliver projects within timelines - Ability to collaborate effectively with cross-functional teams to deliver data-driven solutions - Strong problem-solving skills and operates with high autonomy and ownership - 8 years of relevant experience Requirements - Experience in programming / scripting in PySpark, Databricks SQL, DAX - Experience with data modeling techniques (star schema, dimensional modeling) - Experience in creating PowerBI reports - Experience delivering projects within Agile/Scrum environments - Experience in communicating complex technical concepts to non-technical stakeholders - Experience following existing coding standards - Bachelor’s degree in Computer Science or relevant field Benefits - Medical Insurance - Life Insurance - Dental Insurance - Vision Insurance - PTO - Paid Parental Leave - Paid Holidays - Short Term Disability - Long Term Disability - 401K - Educational Assistance
We help mission-focused heroes solve the world’s biggest software challenges.
• Write and maintain test suites — integration, end-to-end, and regression • Write automated tests and test documentation that any engineer can reproduce and act on • Package and deploy open source software components in Kubernetes environments • Build and maintain seed data tooling and scripts that make the platform demonstrable from scratch • Contribute to internal tooling, dashboards, and documentation • Write clear bug reports and reproduction steps • Participate actively in code review, seeking feedback, and growing your technical judgment
Role Description We are currently looking for a new Business Data Migration Consultant – Finance (SAP) to support our growing team and ongoing transformation initiatives. - Plan and execute business-side data migration activities for Finance data objects in line with global, deployment, and country-specific requirements and timelines. - Perform and coordinate data cleansing activities, ensuring completion within agreed schedules and quality standards. - Complete data collection activities for manual and construction data objects. - Collaborate with IT teams to prepare, validate, and maintain value mappings between source and target systems. - Create and maintain master data lists for assigned Finance data objects where applicable. - Provide business insights and detailed information to technical teams to support data extraction and conversion from legacy systems. - Work closely with IT teams and Business Data Owners to identify and confirm country-specific data objects in scope. - Ensure data readiness and data quality throughout the entire migration lifecycle for assigned Finance data objects. - Verify that data is fit for purpose and aligned with business requirements and stakeholder expectations. - Review and formally approve upload files before and after data loads. - Perform manual (type-in) data loads into target systems where required. - Execute dual maintenance activities to ensure data consistency across systems. - Execute and approve data verification scripts and validation activities. - Act as the functional Single Point of Contact (SPoC) for assigned Finance data objects during migration and Hypercare phases. - Support defect management activities, ensuring timely resolution of data related issues. Qualifications - 5+ years of experience in country-level and global roles, preferably within large-scale ERP transformation programs. - Proven SAP project implementation experience. - Strong SAP Finance process knowledge, including Finance master data, Fixed Assets, General Ledger (GL), Profit & Loss (P&L) reporting, WBS, Internal Orders, Accounts Payable (AP), and Accounts Receivable (AR). - Hands-on experience with business-side data migration, including data cleansing, mapping, validation, reconciliation, and sign-off activities. - Strong understanding of Finance data structures and dependencies within SAP environments. - Excellent stakeholder management, communication, and negotiation skills. - Ability to work both strategically and hands-on, including facilitating meetings, tracking progress, and managing issues and risks. - Advanced Microsoft Excel skills for data analysis, cleansing, and reconciliation. - Fluent English and Spanish. - Bachelor’s or Master’s degree preferred. - Candidates must declare criminal record extract not older than three months. Benefits - Broad range of activities, tasks, and projects. - Flexible working conditions. - Minimum 5 weeks of vacation. - Paid sick days. - Meal vouchers. - Vouchers (B-day voucher, wedding, and new born surprise). - Contributions to wellness programs (multisport card). - Fishing for Friends program – our referral program. - Refreshments in the D-ploy office. - Further development and professional advancement. - Friendly and international working environment. - Company-sponsored events. - Competitive salary and various benefits.
Global energy think tank that uses data-driven insights to shift the world to clean electricity.
Role Description Use your coding and data skills to work on the most important challenge of our time: shifting the world to clean energy. Ember publishes the best-in-class dataset on global electricity generation and the data team is growing as we expand our datasets to new areas such as installed renewable capacity, price, battery storage, grids and flexibility, EV and heat pump deployment. We are looking for someone to use their data and coding skills to ensure we can quickly gather and curate data from new sources, as well as helping to run, maintain and document existing data pipelines. This role offers an exciting opportunity to learn from a knowledgeable and passionate team of engineers and analysts while applying your development skills towards the clean energy transition. Key Responsibilities - Develop new ETL scripts using Python to gather and validate data from a variety of sources e.g. APIs, web scraping. - Work with our data engineering team to deploy ETL scripts within our orchestrated data platform based on Dagster and BigQuery. - Help run, maintain, and improve existing data pipelines. - Help ensure that our pipelines are written using best coding and data practices. - Help ensure Ember’s data and output are of the highest standard. Qualifications - At least one year experience developing and deploying Python code. - Experience working with SQL databases. - Numerate and data literate, with excellent data extraction and transformation skills. - A thoughtful and selective approach to the use of AI coding tools and the ability to critically evaluate their outputs. - Fluent in spoken and written English. - Passionate about clean energy. - Driven and keen to learn. - Systematic with careful attention to detail. - Ability to work as part of a remote international team. Requirements - Experience developing data pipelines on an orchestration platform such as Dagster (preferred), Airflow, dbt or Prefect. - Experience with version control software such as Git. - Experience working on cloud platforms, such as GCP (preferred), AWS or Azure. - Experience working with business users to turn research questions into specific data requirements, and developing to those requirements. - Other language skills. - Previous experience within the power sector or clean energy sector. Benefits - We operate a nine-day fortnight meaning our full-time staff are given every other Friday off work with no reduction in pay. - 25 days holiday, plus UK bank holidays (unless local statutory minimums are higher) and for each year that you're part of the team at Ember you'll receive an additional day of holiday, up to a maximum of 5 additional days. - Generous paid maternity and paternity leave. - Flexible working conditions, including the opportunity for part-time work and home working. - Access to a local working space can also be arranged for employees. - Free annual eye tests. - Access to a counselling service. - Funding and allocated time for your training and development. - Paid volunteer day. - Four paid days off to enable low carbon travel. - Time off to donate blood.
Role Description Join phData, a dynamic and innovative leader in the modern data stack. We partner with major cloud data platforms like Snowflake, AWS, Azure, GCP, Fivetran, Pinecone, Glean, and dbt to deliver cutting-edge services and solutions. We're committed to helping global enterprises overcome their toughest data challenges. phData is a remote-first global company with employees based in the United States, Latin America, and India. We celebrate the culture of each of our team members and foster a community of technological curiosity, ownership, and trust. Even though we're growing extremely fast, we maintain a casual, exciting work environment. We hire top performers and allow you the autonomy to deliver results. Qualifications - 8+ years as a hands-on Data Engineer designing and implementing data solutions - Team lead, and/or mentorship of other engineers - Ability to develop end-to-end technical solutions into production — and to help ensure performance, security, scalability, and robust data integration - Programming expertise in Java, Python and/or Scala - Core cloud data platforms including Snowflake, AWS, Azure, Databricks and GCP - SQL and the ability to write, debug, and optimize SQL queries - Client-facing written and verbal communication skills and experience - Create and deliver detailed presentations - Detailed solution documentation (e.g. including POCs and roadmaps, sequence diagrams, class hierarchies, logical system views, etc.) - 4-year Bachelor's degree in Computer Science or a related field Requirements - Production experience in core data platforms: Snowflake, AWS, Azure, GCP, Hadoop, Databricks - Cloud and Distributed Data Storage: S3, ADLS, HDFS, GCS, Kudu, ElasticSearch/Solr, Cassandra or other NoSQL storage systems - Data integration technologies: Spark, Kafka, event/streaming, Streamsets, Matillion, Fivetran, NiFi, AWS Data Migration Services, Azure DataFactory, Informatica Intelligent Cloud Services (IICS), Google DataProc or other data integration technologies - Multiple data sources (e.g. queues, relational databases, files, search, API) - Complete software development lifecycle experience including design, documentation, implementation, testing, and deployment - Automated data transformation and data curation: dbt, Spark, Spark streaming, automated pipelines - Workflow Management and Orchestration: Airflow, AWS Managed Airflow, Luigi, NiFi Benefits - Remote-First Workplace - Medical Insurance for Self & Family - Medical Insurance for Parents - Term Life & Personal Accident - Wellness Allowance - Broadband Reimbursement - Continuous learning and growth opportunities to enhance your skills and expertise - Other benefits include paid certifications, professional development allowance, and bonuses for creating company-approved content
Ascension Health is the largest nonprofit organization that specializes in providing Catholic faith-based, comprehensive healthcare services in the United State
Title: Oncology Data Specialist Location Remote Health Information Management Job ID: 452835 Full Time Remote Day Job Description: Your future role at a glance Location: Remote Department/Specialty: Clinical Quality Registry Services Schedule: Full time Salary: $26.49 - $36.93 per hour #ADSI #internalops #LI-Remote Life at Ascension: Where purpose meets opportunity Ascension is a leading nonprofit Catholic health system with a culture and associate experience grounded in service, growth, care and connection. We empower our 97,000+ associates to bring their skills and expertise every day to reimagining healthcare, together. Recognized as one of the Best 150+ Places to Work in Healthcare and a Military-Friendly Gold Employer, you’ll find an inclusive and supportive environment where your contributions truly matter. Benefits that help you thrive - Comprehensive health coverage:medical, dental, vision, prescription coverage and HSA/FSA options - Financial security & retirement: employer-matched 403(b), planning and hardship resources, disability and life insurance - Time to recharge: pro-rated paid time off (PTO) and holidays - Career growth: Ascension-paid tuition (Vocare), reimbursement, ongoing professional development and online learning - Emotional well-being: Employee Assistance Program, counseling and peer support, spiritual care and stress management resources - Family support:parental leave, adoption assistance and family benefits - Other benefits: optional legal and pet insurance, transportation savings and more How you’ll make an impact in this role Utilize specialty databases to extract patient data from electronic medical records for the analysis of cancer incidence data under direct supervision. Responsibilities: - Screen disease indices, pathology reports, radiology reports, and other clinical documents to identify reportable cancer cases for case finding. - Perform primary data abstraction duties for oncology measures and registries while ensuring high levels of abstraction accuracy and alignment with organizational quality standards in compliance with all oncology regulatory standards. - Maintain the long-term follow up process of data collection and input for all cancer registry patients. - Collect data for National Cancer Database special studies as required. - Adhere to Oncology Abstraction Training Program requirements. What minimum requirements you’ll need Licensure / Certification / Registration: - Oncology Data Specialist credentialed from the National Cancer Registrars Association (NCRA) obtained within 36 Months (3 years) of hire date or job transfer date required. - Registrar specializing in Tumors preferred. Education: - High School diploma equivalency with 2 years of cumulative experience Currently enrolled or successful completion of a NCRA-Accredited Certificate Program OR Associate's degree/Bachelor's degree OR 4 years of applicable cumulative job specific experience required. Equal employment opportunity employer Ascension provides Equal Employment Opportunities (EEO) to all associates and applicants for employment without regard to race, color, religion, sex/gender, sexual orientation, gender identity or expression, pregnancy, childbirth, and related medical conditions, lactation, breastfeeding, national origin, citizenship, age, disability, genetic information, veteran status, marital status, all as defined by applicable law, and any other legally protected status or characteristic in accordance with applicable federal, state and local laws. Fraud prevention notice Prospective applicants should be vigilant against fraudulent job offers and interview requests. Scammers may use sophisticated tactics to impersonate Ascension employees. To ensure your safety, please remember: Ascension will never ask for payment or to provide banking or financial information as part of the job application or hiring process. Our legitimate email communications will always come from an @ascension.org email address; do not trust other domains, and an official offer will only be extended to candidates who have completed a job application through our authorized applicant tracking system.
Bayer is a global pharmaceutical and scientific research company dedicated to providing products that improve quality of life for people around the world. Founded in Germany in 186
Role Description Principal Research Data Engineer for St. Louis, MO to oversee the development & implementation of research data pipelines for producing data layers and storing research data. - Implement & maintain scalable data-intensive processing pipelines that apply geospatial to ML/DL models. - Architect, build & launch new data models to provide intuitive analytics to business users. - Develop infrastructure to inform on key metrics, recommend changes & predict future results. - Develop POCs for new pipelines for integration into science data pipeline through collaboration with diverse research partners. Qualifications - Master’s in Information Science, C.S., Data Science, Data Analytics, or closely related field. - 5 years of experience designing, developing, testing, and implementing scalable geospatial data integration pipelines. - Experience with statistical yield analysis and interactive report and visualization generation. - Experience working with raster & vector geospatial datasets applied to machine learning model generation and deployment in big data environment. - Experience packaging & deploying models and data pipelines using CI/CD practices, including production readiness and performance tuning activities using Python and/or Conda, Docker, Airflow, and Git CI/CD. - Experience using Google Cloud Platform, Google Cloud Functions, Google Big Query, and Data Proc to process data at scale and deliver robust data pipelines. - Familiarity with Avro, Parquet, CSVs, Geotiff and GeoJSON file formats. - Programming in SQL; conducting query optimization & Online Analytical Processing on RDBMS and No-SQL databases. - Experience using QGIS, ArcGIS & Postgis to ingest and process geospatial data in Avro, CSVs, and GeoJSON. Requirements - Telecommuting permitted from anywhere in the U.S. - Salary Range: Employees can expect to be paid a salary between $142,000.00 to $185,000.00. - Additional compensation may include a bonus or commission (if relevant). - The offered salary may vary within this range based on an applicant’s location, market data/ranges, an applicant’s skills and prior relevant experience, certain degrees and certifications, and other relevant factors. Benefits - Health care - Vision - Dental - Retirement - PTO - Sick leave Company Description Bayer is an Equal Opportunity Employer/Disabled/Veterans. Bayer is committed to providing access and reasonable accommodations in its application process for individuals with disabilities and encourages applicants with disabilities to request any needed accommodation(s) using the contact information below. - Bayer is an E-Verify Employer. - Location: United States : Missouri : St. Louis - Division: Enabling Functions - Reference Code: 864075
2,676more opportunities are still waiting for you.Log in now and take your next shot before someone else does.
Python, SQL, ETL, Azure, Git, Airflow