Data Engineer
Location
India
Posted
12 days ago
Salary
0
Seniority
Mid Level
Job Description
Data Engineer
phData
Role Description We are looking for a Data Engineer to join our Data Engineering team. In this role, you will design, build, and productionize modern cloud data platforms and pipelines that enable reliable analytics and AI solutions for our clients. You will collaborate closely with client stakeholders, architects, and engineering teams to deliver secure, scalable, high-performing data solutions and advance phData’s delivery excellence. Key Responsibilities - Client Delivery: - Own and drive end-to-end data pipeline and platform implementation for cloud-based analytics and data platform projects across enterprise clients. - Translate business requirements into technical data engineering solutions that align with phData methodologies, standards, and best practices. - Ensure engagements are delivered on time, within scope, and with measurable business value for clients. - Collaboration & Leadership: - Collaborate with data engineers, analytics engineers, data scientists, solution architects, and project managers to deliver successful client engagements. - Provide technical leadership during architecture and design workshops, discovery sessions, and project delivery. - Ensure high quality in deliverables through code reviews, documentation, testing, governance, and production-ready operational practices. - Partner with practice and account leaders to identify opportunities to expand data engineering engagements, improve delivery quality, and standardize reusable patterns and frameworks. - Practice & Firm Contribution: - Contribute to internal initiatives such as IP development, accelerators, reference architectures, playbooks, and internal training and mentoring. - Represent phData with professionalism in all interactions, communicating clearly with both technical and non-technical stakeholders. Qualifications - 3–4 years of experience as a Data Engineer designing, building, and deploying data solutions to production. - Strong programming skills in Python and/or Scala for data engineering workloads. - Hands-on experience with core cloud data platforms such as Snowflake, AWS, Azure, and Databricks. - Advanced SQL skills, including the ability to write, debug, and optimize complex queries. - Experience developing end-to-end technical solutions with a focus on performance, security, scalability, and robust data integration. - Experience creating detailed technical documentation, including solution designs, POCs, roadmaps, sequence diagrams, logical system views, and related artifacts. - Experience delivering projects for external or internal clients in a professional services or consulting environment. - Ability to break down complex problems into structured, actionable steps and drive them through to completion. - Strong written and verbal communication skills in English. - Ability to create and deliver clear, client-facing presentations and solution walkthroughs for both technical and non-technical audiences. - Demonstrated ability to work effectively with distributed and cross-functional teams. - Proven track record of taking ownership, managing multiple priorities, and delivering high-quality work with minimal supervision. Requirements - 4-year Bachelor’s degree in Computer Science or a related field. Preferred Qualifications - Production experience with core data and cloud platforms such as Snowflake, AWS, Azure, GCP, Hadoop, Databricks, and/or Informatica Intelligent Cloud Services (IICS). - Experience with cloud and distributed data storage technologies such as S3, ADLS, HDFS, GCS, Kudu, Elasticsearch/Solr, Cassandra, or other NoSQL storage systems. - Hands-on experience with data integration and streaming technologies such as Spark, Kafka, event/streaming platforms, Streamsets, Matillion, Fivetran, NiFi, AWS Data Migration Services, Azure Data Factory, Google Dataproc, or similar tools. - Experience integrating data from multiple sources (e.g., queues, relational databases, files, search indexes, APIs). - Experience across the complete software development lifecycle including design, documentation, implementation, testing, and deployment of data solutions. - Experience with automated data transformation and data curation using tools such as dbt, Spark, Spark Streaming, and automated data pipelines. - Experience with workflow management and orchestration tools such as Airflow, AWS Managed Airflow, Luigi, or NiFi. - Prior experience working in global or remote teams and partnering across multiple regions is a plus. - Contributions to open-source projects, technical communities, speaking, writing, or creation of internal accelerators are a plus. Location & Time Zone Expectations - This role is based in India and operates primarily in India Standard Time (IST). - We are a remote-first company, and you should be comfortable working with a distributed global team. - Some flexibility may be required to collaborate across time zones with colleagues and clients. - Client needs may occasionally require flexibility in working hours to support key milestones or workshops. Benefits - Remote-First Workplace - Medical Insurance for Self & Family - Medical Insurance for Parents - Term Life & Personal Accident - Wellness Allowance - Broadband Reimbursement - Continuous learning and growth opportunities to enhance your skills and expertise - Other benefits include paid certifications, professional development allowance, and bonuses for creating company-approved content
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Senior Data Engineer
D2BAn Australian home building company specializing in the construction of quality residential properties. The team focuses on delivering well-designed homes with efficient project management, accurate costing, and strong collaboration with suppliers and contractors.
Role Description This isn't a role where you'll be handed a ticket queue. You'll own meaningful pieces of work, shape how we build, and have a direct hand in what the company becomes over the next few years. - Design and deliver modern data platforms using tools like Snowflake, Microsoft Fabric, Azure, Fivetran, Coalesce, dbt, and Power BI - Lead projects end-to-end - from architecture through to client handover - Champion data governance practices that make platforms trustworthy and scalable - Support clients through the change that comes with adopting new ways of working with data - Help build internal standards and frameworks that shape how the whole team works - Lift capability across the team - by doing good work and being someone others learn from Qualifications - 5+ years of experience in data engineering or modern data platform delivery - Strong SQL and Python, plus hands-on experience with Snowflake, Microsoft Fabric, Fivetran, Coalesce, or dbt - Solid understanding of medallion architecture and dimensional modelling - Familiarity with data governance and metadata tools like Microsoft Purview and Coalesce Catalog - Comfortable using AI tools to accelerate development, whether that's Claude, Snowflake Cortex, or similar - Experience building data pipelines and data products that support AI and ML use cases, including an understanding of what clean, well-structured data looks like for model consumption - A track record of owning projects and delivering results, not just contributing to them - The communication skills to work directly with clients and earn their trust - Previous consulting experience is a plus, but a client-first mindset matters more
Senior Data Engineer
Precision AQPrecision AQ, the top payer marketing agency, supports global pharmaceutical and life sciences clients in the achievement of commercial excellence. We excel at demonstrating the economic, clinical, and societal value of creative medical treatments to payers, providers, patients, and policymakers. As leaders in the generation, analysis, and communication of that evidence, we are improving market access to support our clients in their mission of improving care for patients around the world.
Role Description Precision AQ is building a centralized Data Hub to consolidate fragmented data infrastructure, establish enterprise-wide data governance, and enable AI-ready analytics across our life sciences portfolio. This is a foundational initiative, not a maintenance role. We are hiring a Senior Data Engineer to own the development of our Bronze and Silver data layers, the ingestion, transformation, and enrichment pipelines that turn raw healthcare data into governed, reliable, and reusable data assets ready for business use. This role exists because we need engineers who can build at the intersection of healthcare data complexity and modern data engineering best practices. You will work directly with claims, formulary, and other critical data sources, building the pipelines that feed every downstream analytics use case, product, and client deliverable. You will report to the Engineering Director and work closely with the Enterprise Data Architect to translate architecture patterns into production-grade pipelines. This is a hands-on technical role with significant autonomy. If you want to shape how a growing healthcare and life sciences company builds its data foundation, this is the opportunity. Qualifications - Bachelor’s degree in computer science, information systems, engineering, or a related technical field. Advanced degree is a plus but not required. - 5+ years of professional data engineering experience, with at least 3 years working in modern cloud data environments. - 2+ years hands-on experience with Snowflake, including performance tuning, Snowpipe, and Snowflake-specific optimization patterns. - 2+ years working with dbt (Core or Cloud) in production environments, including incremental models, testing, documentation, and package management. - Demonstrated experience with healthcare data, understanding ICD-10, NPI, NDC codes, and the messy realities of claims data quality. - Production experience with orchestration tools, such as Airflow, Dagster, or similar. - Strong SQL skills, with the ability to write efficient, readable, well-structured SQL and debug complex query performance issues. - Python proficiency for data processing, scripting, and pipeline automation. Requirements - Experience with major claims data aggregators (e.g., McKesson, Symphony Health, IQVIA, Komodo). - Familiarity with formulary and payer data. - Experience implementing medallion architecture or similar layered data patterns. - Background in life sciences, pharma services, or healthcare analytics organizations. - Experience with data observability tools. - Exposure to Snowpipe, Matillion, Fivetran, or other managed ingestion solutions. - Familiarity with Git-based workflows, CI/CD for data pipelines, and infrastructure-as-code. Benefits - This is a remote-friendly role with flexibility for EU-based or India-based candidates. - Occasional travel for stakeholder meetings and strategic planning sessions. Who This Role Is For - You want to build foundational data infrastructure in a growing healthcare and life sciences company, not maintain legacy systems. - You’re energized by healthcare data complexity; claims data is messy, and you find that interesting rather than frustrating. - You thrive with autonomy and accountability; you want ownership of outcomes, not just tasks. - You care about craft; clean code, clear documentation, and maintainable systems matter to you. - You want to see the impact of your work; the pipelines you build will power teams across the organization. This role is not for you if - You’re looking for a pure greenfield, no constraints environment; we have an architecture, standards, and patterns. - You prefer highly specialized, narrow scope; this role spans ingestion, transformation, quality, and collaboration. - You want to work in isolation; this is a collaborative team, and you’ll interact regularly within the engineering team and cross-functionally. - You seek a management track; this is a hands-on IC role.
Role Description Precision AQ is building a centralized Data Hub to consolidate fragmented data infrastructure, establish enterprise-wide data governance, and enable AI-ready analytics across our life sciences portfolio. This is a foundational initiative, not a maintenance role. We are hiring a Senior Data Engineer to own the development of our Bronze and Silver data layers, the ingestion, transformation, and enrichment pipelines that turn raw healthcare data into governed, reliable, and reusable data assets ready for business use. This role exists because we need engineers who can build at the intersection of healthcare data complexity and modern data engineering best practices. You will work directly with claims, formulary, and other critical data sources, building the pipelines that feed every downstream analytics use case, product, and client deliverable. You will report to the Engineering Director and work closely with the Enterprise Data Architect to translate architecture patterns into production-grade pipelines. This is a hands-on technical role with significant autonomy. If you want to shape how a growing healthcare and life sciences company builds its data foundation, this is the opportunity. Qualifications - Bachelor’s degree in computer science, information systems, engineering, or a related technical field. Advanced degree is a plus but not required. - 5+ years of professional data engineering experience, with at least 3 years working in modern cloud data environments. - 2+ years hands-on experience with Snowflake, including performance tuning, Snowpipe, and Snowflake-specific optimization patterns. - 2+ years working with dbt (Core or Cloud) in production environments, including incremental models, testing, documentation, and package management. - Demonstrated experience with healthcare data, understanding ICD-10, NPI, NDC codes, and the realities of claims data quality. - Production experience with orchestration tools, such as Airflow, Dagster, or similar. - Strong SQL skills, able to write efficient, readable, well-structured SQL and debug complex query performance issues. - Python proficiency for data processing, scripting, and pipeline automation. Requirements - Experience with major claims data aggregators (e.g., McKesson, Symphony Health, IQVIA, Komodo). - Familiarity with formulary and payer data. - Experience implementing medallion architecture or similar layered data patterns. - Background in life sciences, pharma services, or healthcare analytics organizations. - Experience with data observability tools. - Exposure to Snowpipe, Matillion, Fivetran, or other managed ingestion solutions. - Familiarity with Git-based workflows, CI/CD for data pipelines, and infrastructure-as-code. Benefits - This is a remote-friendly role with flexibility for EU-based or India-based candidates. - Occasional travel for stakeholder meetings and strategic planning sessions. Who This Role Is For - You want to build foundational data infrastructure in a growing healthcare and life sciences company, not maintain legacy systems. - You’re energized by healthcare data complexity; claims data is messy, and you find that interesting rather than frustrating. - You thrive with autonomy and accountability, wanting ownership of outcomes, not just tasks. - You care about craft; clean code, clear documentation, and maintainable systems matter to you. - You want to see the impact of your work; the pipelines you build will power teams across the organization. This role is not for you if - You’re looking for a pure greenfield, no constraints environment; we have an architecture, standards, and patterns. - You prefer highly specialized, narrow scope; this role spans ingestion, transformation, quality, and collaboration. - You want to work in isolation; this is a collaborative team, and you’ll interact regularly within the engineering team and cross-functionally. - You seek a management track; this is a hands-on IC role.
Data Engineer
Highmark HealthCreating remarkable health experiences, freeing people to be their best.
• Design, develop, and maintain robust data processes and solutions to ensure the efficient movement and transformation of data across multiple systems • Develop and maintain data models, databases, and data warehouses to support business intelligence and analytics needs • Collaborate with stakeholders across IT, product, analytics, and business teams to gather requirements and provide data solutions that meet organizational needs • Monitor work against the production schedule, provide progress updates, and report any issues or technical difficulties to lead developers regularly • Implement and manage data governance practices, ensuring data quality, integrity, and compliance with relevant regulations



