Lead Data Engineer
Location
Worldwide
Posted
1 day ago
Seniority
Lead
Job Description
Lead Data Engineer
phData
Role Description
Join phData, a dynamic and innovative leader in the modern data stack. We partner with major cloud data platforms like Snowflake, AWS, Azure, GCP, Fivetran, Pinecone, Glean, and dbt to deliver cutting-edge services and solutions. We're committed to helping global enterprises overcome their toughest data challenges.

phData is a remote-first global company with employees based in the United States, Latin America, and India. We celebrate the culture of each of our team members and foster a community of technological curiosity, ownership, and trust. Even though we're growing extremely fast, we maintain a casual, exciting work environment. We hire top performers and allow you the autonomy to deliver results.

Qualifications
- 8+ years as a hands-on Data Engineer designing and implementing data solutions
- Team lead, and/or mentorship of other engineers
- Ability to develop end-to-end technical solutions into production, and to help ensure performance, security, scalability, and robust data integration
- Programming expertise in Java, Python and/or Scala
- Core cloud data platforms including Snowflake, AWS, Azure, Databricks and GCP
- SQL and the ability to write, debug, and optimize SQL queries
- Client-facing written and verbal communication skills and experience
- Create and deliver detailed presentations
- Detailed solution documentation (e.g. including POCs and roadmaps, sequence diagrams, class hierarchies, logical system views, etc.)
- 4-year Bachelor's degree in Computer Science or a related field

Requirements
- Production experience in core data platforms: Snowflake, AWS, Azure, GCP, Hadoop, Databricks
- Cloud and Distributed Data Storage: S3, ADLS, HDFS, GCS, Kudu, ElasticSearch/Solr, Cassandra or other NoSQL storage systems
- Data integration technologies: Spark, Kafka, event/streaming, Streamsets, Matillion, Fivetran, NiFi, AWS Data Migration Services, Azure DataFactory, Informatica Intelligent Cloud Services (IICS), Google DataProc or other data integration technologies
- Multiple data sources (e.g. queues, relational databases, files, search, API)
- Complete software development lifecycle experience including design, documentation, implementation, testing, and deployment
- Automated data transformation and data curation: dbt, Spark, Spark streaming, automated pipelines
- Workflow Management and Orchestration: Airflow, AWS Managed Airflow, Luigi, NiFi

Benefits
- Remote-First Workplace
- Medical Insurance for Self & Family
- Medical Insurance for Parents
- Term Life & Personal Accident
- Wellness Allowance
- Broadband Reimbursement
- Continuous learning and growth opportunities to enhance your skills and expertise
- Other benefits include paid certifications, professional development allowance, and bonuses for creating company-approved content
More Data Engineer Jobs
Oncology Data Specialist
Ascension Health
Ascension Health is the largest nonprofit organization that specializes in providing Catholic faith-based, comprehensive healthcare services in the United States
Title: Oncology Data Specialist
Location: Remote
Health Information Management
Job ID: 452835
Full Time, Remote, Day Job

Description
Your future role at a glance
Location: Remote
Department/Specialty: Clinical Quality Registry Services
Schedule: Full time
Salary: $26.49 - $36.93 per hour

Life at Ascension: Where purpose meets opportunity
Ascension is a leading nonprofit Catholic health system with a culture and associate experience grounded in service, growth, care and connection. We empower our 97,000+ associates to bring their skills and expertise every day to reimagining healthcare, together. Recognized as one of the Best 150+ Places to Work in Healthcare and a Military-Friendly Gold Employer, you’ll find an inclusive and supportive environment where your contributions truly matter.

Benefits that help you thrive
- Comprehensive health coverage: medical, dental, vision, prescription coverage and HSA/FSA options
- Financial security & retirement: employer-matched 403(b), planning and hardship resources, disability and life insurance
- Time to recharge: pro-rated paid time off (PTO) and holidays
- Career growth: Ascension-paid tuition (Vocare), reimbursement, ongoing professional development and online learning
- Emotional well-being: Employee Assistance Program, counseling and peer support, spiritual care and stress management resources
- Family support: parental leave, adoption assistance and family benefits
- Other benefits: optional legal and pet insurance, transportation savings and more

How you’ll make an impact in this role
Utilize specialty databases to extract patient data from electronic medical records for the analysis of cancer incidence data under direct supervision.

Responsibilities:
- Screen disease indices, pathology reports, radiology reports, and other clinical documents to identify reportable cancer cases for case finding.
- Perform primary data abstraction duties for oncology measures and registries while ensuring high levels of abstraction accuracy and alignment with organizational quality standards in compliance with all oncology regulatory standards.
- Maintain the long-term follow up process of data collection and input for all cancer registry patients.
- Collect data for National Cancer Database special studies as required.
- Adhere to Oncology Abstraction Training Program requirements.

What minimum requirements you’ll need
Licensure / Certification / Registration:
- Oncology Data Specialist credentialed from the National Cancer Registrars Association (NCRA) obtained within 36 Months (3 years) of hire date or job transfer date required.
- Registrar specializing in Tumors preferred.
Education:
- High School diploma equivalency with 2 years of cumulative experience Currently enrolled or successful completion of a NCRA-Accredited Certificate Program OR Associate's degree/Bachelor's degree OR 4 years of applicable cumulative job specific experience required.

Equal employment opportunity employer
Ascension provides Equal Employment Opportunities (EEO) to all associates and applicants for employment without regard to race, color, religion, sex/gender, sexual orientation, gender identity or expression, pregnancy, childbirth, and related medical conditions, lactation, breastfeeding, national origin, citizenship, age, disability, genetic information, veteran status, marital status, all as defined by applicable law, and any other legally protected status or characteristic in accordance with applicable federal, state and local laws.

Fraud prevention notice
Prospective applicants should be vigilant against fraudulent job offers and interview requests. Scammers may use sophisticated tactics to impersonate Ascension employees. To ensure your safety, please remember: Ascension will never ask for payment or to provide banking or financial information as part of the job application or hiring process. Our legitimate email communications will always come from an @ascension.org email address; do not trust other domains, and an official offer will only be extended to candidates who have completed a job application through our authorized applicant tracking system.
Principal Research Data Engineer
Bayer
Bayer is a global pharmaceutical and scientific research company dedicated to providing products that improve quality of life for people around the world. Founded in Germany in 186
Role Description
Principal Research Data Engineer for St. Louis, MO to oversee the development & implementation of research data pipelines for producing data layers and storing research data.
- Implement & maintain scalable data-intensive processing pipelines that apply geospatial to ML/DL models.
- Architect, build & launch new data models to provide intuitive analytics to business users.
- Develop infrastructure to inform on key metrics, recommend changes & predict future results.
- Develop POCs for new pipelines for integration into science data pipeline through collaboration with diverse research partners.

Qualifications
- Master’s in Information Science, C.S., Data Science, Data Analytics, or closely related field.
- 5 years of experience designing, developing, testing, and implementing scalable geospatial data integration pipelines.
- Experience with statistical yield analysis and interactive report and visualization generation.
- Experience working with raster & vector geospatial datasets applied to machine learning model generation and deployment in big data environment.
- Experience packaging & deploying models and data pipelines using CI/CD practices, including production readiness and performance tuning activities using Python and/or Conda, Docker, Airflow, and Git CI/CD.
- Experience using Google Cloud Platform, Google Cloud Functions, Google Big Query, and Data Proc to process data at scale and deliver robust data pipelines.
- Familiarity with Avro, Parquet, CSVs, Geotiff and GeoJSON file formats.
- Programming in SQL; conducting query optimization & Online Analytical Processing on RDBMS and No-SQL databases.
- Experience using QGIS, ArcGIS & Postgis to ingest and process geospatial data in Avro, CSVs, and GeoJSON.

Requirements
- Telecommuting permitted from anywhere in the U.S.
- Salary Range: Employees can expect to be paid a salary between $142,000.00 to $185,000.00.
- Additional compensation may include a bonus or commission (if relevant).
- The offered salary may vary within this range based on an applicant’s location, market data/ranges, an applicant’s skills and prior relevant experience, certain degrees and certifications, and other relevant factors.

Benefits
- Health care
- Vision
- Dental
- Retirement
- PTO
- Sick leave

Company Description
Bayer is an Equal Opportunity Employer/Disabled/Veterans. Bayer is committed to providing access and reasonable accommodations in its application process for individuals with disabilities and encourages applicants with disabilities to request any needed accommodation(s) using the contact information below.
- Bayer is an E-Verify Employer.
- Location: United States : Missouri : St. Louis
- Division: Enabling Functions
- Reference Code: 864075
Data Engineer/Data Project Lead
Koniag Government Services, LLC
Koniag Government Services (KGS) is an Alaska Native Owned corporation supporting the values and traditions of our native communities through an agile employee and corporate culture that delivers Enterprise Solutions, Professional Services and Operational Management to Federal Government Agencies.
Role Description
Koniag Professional Services LLC, a Koniag Government Services company, is seeking a Data Engineer/Data Project Lead to support KPS and our government customer. The position is remote and requires the candidate to be able to obtain a Public Trust. This position is for a Future New Business Opportunity.

The Data Engineer/Data Project Lead HQ will provide strategic leadership and technical expertise in managing complex data infrastructure, systems integration, and data project execution supporting federal discretionary grants administration. This senior-level position is responsible for overseeing and leading large and complex projects within an organization, managing project portfolios, defining project strategies, leading project teams, ensuring project quality, and managing project risks. The role requires a high level of autonomy, leadership, and decision-making in driving project success.

Principal responsibilities will include but are not limited to:
- Lead all data projects using Agile methodologies while supervising Data Analysts, Data Visualization Specialists, Data Management Specialists, and Data Entry Support Specialists.
- Manage project portfolios, define project strategies, ensure project quality, and manage project risks and dependencies.
- Design, develop, and maintain data infrastructure including pipelines, ETL processes, APIs, data warehousing solutions, and cloud-based architectures (AWS, Azure) using Python, SQL, serverless technologies, and automated testing frameworks.
- Develop workflow-tracking solutions that integrate data from multiple federal systems (HSES, AMS, GrantSolutions, TTA Hub).
- Create solutions to generate standardized documentation, such as templates and funding guidance letters.
- Communicate technical concepts to diverse audiences and translate business requirements into technical solutions.
- Create comprehensive documentation, including technical specifications, data dictionaries, system architecture diagrams, user guides, and runbooks.
- Identify automation opportunities, recommend system improvements, and evaluate emerging technologies.
- Maintain version control with GitHub, use Docker for containerization, implement CI/CD pipelines, and ensure 99.9% system uptime and high availability.
- Travel within and outside the Region to conduct site visits, attend meetings, conferences, and training.

Qualifications
- Bachelor's degree in Computer Science, Information Systems, Data Science, Data Engineering, or related field from an accredited college or university.
- At least 10 years of relevant and progressive post-baccalaureate professional experience directly related to database design, ETL processes, data warehousing, and developing, implementing, and managing/improving complex, high-profile, multi-faceted data projects.
- Minimum 3 years of progressive supervisory experience leading technical teams.
- Demonstrated proficiency/experience in successfully developing, implementing, and managing/improving complex, high-profile, multi-faceted data projects.
- Proven track record delivering complex data projects on time and within budget.
- U.S. citizenship or legal authorization to work in the United States.

Requirements
- Demonstrated ability to coach team members effectively.
- Demonstrated proficiency to communicate clearly, both orally and in writing.
- Demonstrated proficiency in database design, ETL processes, and data warehousing.
- SQL and programming language proficiency, with expert knowledge particularly in Python.
- API development and integration expertise.
- Experience integrating data from various federal systems using efficient processes.
- Data governance and security knowledge, with the ability to ensure compliance with federal security standards.
- Proven ability to address team member and client concerns and resolve problems effectively.
- Proficient with Microsoft Office Suite and web-based collaboration tools.
- Ability to work both independently and collaboratively in a team environment.
- Ability to maintain confidentiality and handle sensitive information appropriately.
- Ability to pass required background investigations for access to federal facilities and systems.
- Willingness to complete mandatory training requirements.
- Availability to travel within and outside the Region as needed.

Benefits
- Competitive compensation.
- Extraordinary benefits package including health, dental, and vision insurance.
- 401K with company matching.
- Flexible spending accounts.
- Paid holidays.
- Three weeks paid time off.
- And more.
Data Warehouse Task Lead
Koniag Government Services, LLC
Koniag Government Services (KGS) is an Alaska Native Owned corporation supporting the values and traditions of our native communities through an agile employee and corporate culture that delivers Enterprise Solutions, Professional Services and Operational Management to Federal Government Agencies.
Role Description
Koniag Professional Services is seeking an experienced Data Warehouse Task Lead to support our client, the Department of Health and Human Services (HHS), Office of the Chief Information Officer (OCIO), in migrating to a secure, scalable, and integrated Human Capital Management (HCM) solution. The ideal candidate combines strong data warehousing technical skills with an understanding of federal government systems and requirements. The Task Lead will primarily be working remotely but may need to travel onsite to support customer meetings, whiteboarding sessions, presentations to customers, and any in-person meetings as requested by the customer.
- Manage schedule, task assignments, requirements, and deliverables for the Data Warehouse workstream to ensure on-time delivery of migration milestones and adherence to project timelines.
- Provide guidance to the team on Requirements Gathering techniques, Change Management, Risk Analysis, and Metrics Reporting.
- Lead the data analysis, migration planning, and consolidation of approximately 20 disparate HRIT data management systems and five (5) business intelligence systems into a unified Data Warehousing, BI, and ETL platform environment, in support of the Oracle HCM platform.
- Develop and document data architecture, ETL workflows, and business intelligence requirements to support HR analytics and reporting across the enterprise.
- Assist with testing functionality of and interactions between enterprise systems from an end-user perspective.
- Collaborate with fellow task leads and stakeholders in the migration and sustainment of unified HCM capabilities.

Qualifications
- 4-year college degree
- 7+ years overseeing technical and non-technical teams
- Experience with Oracle-based Data Warehouse, Business Intelligence, and ETL technologies, including familiarity with Oracle data and analytics tooling
- Strong programming language proficiency and experience with data modeling techniques
- Experience developing data migration strategies, data mapping documentation, and data governance/data quality frameworks
- Experience with enterprise ETL tools such as Informatica
- Knowledge of Scrum processes, familiarity with JIRA
- Excellent verbal, written and interpersonal communication
- Exceptional client relationship skills
- Microsoft Office suite skills to include Visio, MS Project, PowerPoint

Requirements
- Ability to maintain Public Trust Clearance

Benefits
- Competitive compensation
- Extraordinary benefits package including health, dental and vision insurance
- 401K with company matching
- Flexible spending accounts
- Paid holidays
- Three weeks paid time off
- And more


