Role Description
We are looking for a Data Engineer to join our Data Engineering team. In this role, you will design, build, and productionize modern cloud data platforms and pipelines that enable reliable analytics and AI solutions for our clients. You will collaborate closely with client stakeholders, architects, and engineering teams to deliver secure, scalable, high-performing data solutions and advance phData’s delivery excellence.
Key Responsibilities
-
Client Delivery:
-
Own and drive end-to-end data pipeline and platform implementation for cloud-based analytics and data platform projects across enterprise clients.
-
Translate business requirements into technical data engineering solutions that align with phData methodologies, standards, and best practices.
-
Ensure engagements are delivered on time, within scope, and with measurable business value for clients.
-
Collaboration & Leadership:
-
Collaborate with data engineers, analytics engineers, data scientists, solution architects, and project managers to deliver successful client engagements.
-
Provide technical leadership during architecture and design workshops, discovery sessions, and project delivery.
-
Ensure high quality in deliverables through code reviews, documentation, testing, governance, and production-ready operational practices.
-
Partner with practice and account leaders to identify opportunities to expand data engineering engagements, improve delivery quality, and standardize reusable patterns and frameworks.
-
Practice & Firm Contribution:
-
Contribute to internal initiatives such as IP development, accelerators, reference architectures, playbooks, and internal training and mentoring.
-
Represent phData with professionalism in all interactions, communicating clearly with both technical and non-technical stakeholders.
Qualifications
-
3–4 years of experience as a Data Engineer designing, building, and deploying data solutions to production.
-
Strong programming skills in Python and/or Scala for data engineering workloads.
-
Hands-on experience with core cloud data platforms such as Snowflake, AWS, Azure, and Databricks.
-
Advanced SQL skills, including the ability to write, debug, and optimize complex queries.
-
Experience developing end-to-end technical solutions with a focus on performance, security, scalability, and robust data integration.
-
Experience creating detailed technical documentation, including solution designs, POCs, roadmaps, sequence diagrams, logical system views, and related artifacts.
-
Experience delivering projects for external or internal clients in a professional services or consulting environment.
-
Ability to break down complex problems into structured, actionable steps and drive them through to completion.
-
Strong written and verbal communication skills in English.
-
Ability to create and deliver clear, client-facing presentations and solution walkthroughs for both technical and non-technical audiences.
-
Demonstrated ability to work effectively with distributed and cross-functional teams.
-
Proven track record of taking ownership, managing multiple priorities, and delivering high-quality work with minimal supervision.
Requirements
-
4-year Bachelor’s degree in Computer Science or a related field.
Preferred Qualifications
-
Production experience with core data and cloud platforms such as Snowflake, AWS, Azure, GCP, Hadoop, Databricks, and/or Informatica Intelligent Cloud Services (IICS).
-
Experience with cloud and distributed data storage technologies such as S3, ADLS, HDFS, GCS, Kudu, Elasticsearch/Solr, Cassandra, or other NoSQL storage systems.
-
Hands-on experience with data integration and streaming technologies such as Spark, Kafka, event/streaming platforms, Streamsets, Matillion, Fivetran, NiFi, AWS Data Migration Services, Azure Data Factory, Google Dataproc, or similar tools.
-
Experience integrating data from multiple sources (e.g., queues, relational databases, files, search indexes, APIs).
-
Experience across the complete software development lifecycle including design, documentation, implementation, testing, and deployment of data solutions.
-
Experience with automated data transformation and data curation using tools such as dbt, Spark, Spark Streaming, and automated data pipelines.
-
Experience with workflow management and orchestration tools such as Airflow, AWS Managed Airflow, Luigi, or NiFi.
-
Prior experience working in global or remote teams and partnering across multiple regions is a plus.
-
Contributions to open-source projects, technical communities, speaking, writing, or creation of internal accelerators are a plus.
Location & Time Zone Expectations
-
This role is based in India and operates primarily in India Standard Time (IST).
-
We are a remote-first company, and you should be comfortable working with a distributed global team.
-
Some flexibility may be required to collaborate across time zones with colleagues and clients.
-
Client needs may occasionally require flexibility in working hours to support key milestones or workshops.
Benefits
-
Remote-First Workplace
-
Medical Insurance for Self & Family
-
Medical Insurance for Parents
-
Term Life & Personal Accident
-
Wellness Allowance
-
Broadband Reimbursement
-
Continuous learning and growth opportunities to enhance your skills and expertise
-
Other benefits include paid certifications, professional development allowance, and bonuses for creating company-approved content