Capco, a Wipro company, is a management & technology consultancy dedicated to the financial services & energy industries

Data Quality & Metadata Specialist (She/ He/ They)

Data EngineerData EngineerFull Time Remote Mid LevelTeam 1,001-5,000Since 1998H1B SponsorCompany Site LinkedIn

Location

Poland

Posted

69 days ago

Salary

Seniority

Mid Level

Bachelor Degree5 yrs expEnglishAtaccama Azure Purview Business Glossary Collibra Data Quality Frameworks Informatica Metadata Structures Sap Mdg

Job Description

CAPCO POLAND *We're looking for Poland-based candidates. Capco is a global management and technology consultancy dedicated to the financial services industry. We partner with leading banks, insurers, and fintechs to deliver data-driven transformation, operational excellence, and innovative digital solutions. Role Overview: We are looking for a Data Quality & Metadata Specialist to join a strategic client project focused on strengthening enterprise Data Governance foundations. In this consulting role, you will support the design and operationalisation of the Data Quality Management model, establish business glossary and metadata governance processes, recommend appropriate tooling, and provide input into the feasibility of business KPIs from a data readiness perspective. This role requires a strong ability to translate governance principles into practical, implementable solutions while working closely with both business and IT stakeholders. What You’ll Do: Data Quality Management Model - Co-design and document the enterprise Data Quality Management framework - Define Data Quality dimensions, rules, controls, and monitoring mechanisms - Define responsibilities for Data Owners and Data Stewards - Design DQ reporting structures, issue management workflows, and escalation paths - Support definition of DQ KPIs and performance dashboards Glossary & Metadata Governance - Develop and structure the Business Glossary framework - Define governance processes for glossary term creation, approval, and maintenance - Establish metadata governance standards (business and technical metadata) - Define metadata lifecycle processes and ownership model - Align glossary and metadata governance with broader Data Governance strategy Tooling Recommendations - Assess current tooling landscape and identify capability gaps - Define functional requirements for data catalog and DQ tools - Recommend suitable tooling aligned with the target operating model - Support preparation of implementation roadmap and adoption approach KPI Feasibility & Data Readiness Input - Assess data availability and quality to support defined business KPIs - Identify data gaps and risks impacting KPI measurement - Provide recommendations to improve measurement reliability - Support alignment between business reporting ambition and data maturity What We’re Looking For: - Experience working on Data Governance or Data Management projects - Hands-on experience in Data Quality frameworks or controls definition - Experience designing or maintaining Business Glossaries and metadata structures - Understanding of data lineage, metadata concepts, and DQ monitoring approaches - Experience with data catalog or DQ tools (e.g., Collibra, Informatica, SAP MDG, Azure Purview, Ataccama, etc.) - Strong stakeholder communication skills in cross-functional environments - Structured, analytical mindset with strong documentation skills Ideal Profile: - Consulting or transformation project experience - Ability to work independently within a defined project scope - Experience facilitating workshops with business stakeholders - Knowledge of DMBOK, DCAM or similar frameworks is a plus We offer a flexible collaboration model based on a B2B contract, with the opportunity to work on diverse projects. Recruitment Process: - HR Interview - Hiring Manager Technical Interview - Client Interview - Feedback and offer We have been informed of several recruitment scams targeting the public. We strongly advise you to verify identities before engaging in recruitment related communication. All official Capco communication will be conducted via a Capco recruiter.

Benefits

401(K), 401(K) matching, Adoption Assistance, Childcare benefits, Commuter benefits, Dedicated diversity and inclusion staff, Dental insurance, Disability insurance, Volunteer in local community, Fitness stipend, Flexible Spending Account (FSA), Generous parental leave, Generous PTO, Health insurance, Job training & conferences, Life insurance, Charitable contribution matching, Mentorship program, Open office floor plan, Paid holidays, Paid sick days, Partners with nonprofits, Performance bonus, Pet insurance, Promote from within, Lunch and learns, OKR operational model, Tuition reimbursement, Vision insurance, Wellness programs, Mental health benefits, Diversity employee resource groups, Hiring practices that promote diversity, Fertility benefits, Hybrid work model, Pay transparency, Bereavement leave benefits

Related Categories

Data Engineer

Related Job Pages

Remote Full-time Jobs (US)More Remote Jobs

More Data Engineer Jobs

Data Engineer

Empassion Health

We manage unique care programs, and the people that power them, to deliver more good days.

Data Engineer69 days ago

Full Time RemoteTeam 11-50H1B No Sponsor

Company Site LinkedIn

• Partner with teams across the business and external partners to understand data needs and deliver reliable pipelines and models that solve real problems. • Build and maintain scalable ingestion and egress pipelines in Airflow and dbt Cloud, ensuring high quality, automated data flows across cloud environments. • Implement unit tests and monitoring to guarantee data integrity and reproducibility. • Model and transform and structure healthcare datasets into usable formats that power data science models, and other reporting marts. • Enhance and scale data models with SQL and dbt, ensuring precision and adaptability for new partnerships. • Write Python code for Apache Airflow DAGs, components and utilities that orchestrate and monitor data workflows. Build complex pipelines that enable flexible scheduling, conditional logic, and smooth integration across multiple data sources.

Airflow Apache AWS Azure BigQuery Cloud Google Cloud Platform Python SQL

View details: Data Engineer

United States

$120K - $160K / year

Apply

Job Closed

Senior Data Engineer

Sanctuary Computer

We’re a small team of design-oriented software developers, with a strong understanding of user experience and product. We work in design, branding and engineering roles, with clients like Nike, General Electric, The Nobel Prize, Herman Miller, Adobe, Dig Inn and many others. That same thoughtfulness is applied inward, too. We understand that technologists & engineers have a strong sense of art and integrity, and are happier being a part of a team that promotes those values. We partner with our design team (XXIX) to build compelling digital products for industry leaders, early-to-mid stage startups, hardware companies, healthcare services, restaurant groups, and lifestyle brands.

Data Engineer69 days ago

Full Time RemoteTeam 45Since 2015

Company Site

We are recruiting a Sr. Data Engineer for a client in the AI / health & wellness space Original Job Post link https://www.notion.so/garden3d/Senior-Data-Engineer-2e7131fea2c7800094e4d9340a5df499?source=copy_link About garden3d We are worker owned creative collective, innovating on everything from brands and IRL communities to IoT devices and cross platform apps. We share profit, open source everything, spin out new businesses, and invest in exciting ideas through financial and/or in-kind contributions. Our client roster includes Google, Stripe, Figma, Hinge, Black Socialists in America, ACLU, Pratt, Parsons, Mozilla, The Nobel Prize, MIT, Gnosis, Etsy & Gagosian. We’re the software team behind innovative products like The Light Phone & Mill, and a global, decentralized community space collective called Index Space. We think of our garden3d as collective for creative people, prioritizing a happy, talented, and diverse studio culture. We work on projects that bring value to our world, and we balance deep care for the work we do with a genuine curiosity about life outside of our jobs. About the client Our client is an early-stage AI startup based in NYC (but open to remote team members). The founders have experience building and scaling successful ventures including a 9-figure exit. Who we’re looking for: We’re looking for a Senior Data Engineer with deep expertise in designing and owning data pipelines, workflow orchestration, and complex data integrations. You’ll play a key role in evolving our data ingestion architecture, from an existing in-house, code-defined workflow system backed by queues, to a more scalable and observable orchestration layer using Prefect. In this role, you’ll lead the development and optimization of pipelines handling both structured and unstructured data from a wide range of sources, including web crawls and scrapers. You’ll be expected to make architectural decisions, ensure reliability and scalability, and establish best practices for workflow design, monitoring, and performance as our data platform grows. In this role, you’ll work across a variety of initiatives to find cost-effective, high-quality, pragmatic solutions to complex problems. Responsibilities will include: - Monitoring and maintaining data pipelines, troubleshooting new errors, and addressing format drift - Extracting and enriching additional data elements from diverse sources - Reprocessing and validating large datasets in batch workflows - Designing and integrating new data sources into existing pipelines - Aligning and integrating extracted data with the core application data model to ensure consistency and usability - Participating in code reviews, providing constructive feedback to teammates and ensuring adherence to best practices - Contributing to project success by keeping a close eye on team velocity, project scope, budget, and timeline - Negotiating with clients to align project scope with budget and timeline, if needed Who you are The person we’re looking for is happy, relaxed and easy to get along with. They’re flexible on anything except conceits that will lower their usually outstanding work quality. They work “smart”, by carefully managing their workflow and staggering features that have dependencies intelligently — they prefer deep work but are OK coming up to the surface now and then for top level / strategic conversations. We believe people with backgrounds or interests in design, art, music, food or fashion tend to have a well rounded sense of design & quality — so a variety of hobbies or side projects is a big nice to have! Must Have Competencies: - Senior-level Python expertise - Experience with data/workflow orchestration tools (e.g., Prefect, Airflow, Dagster) - A thorough understanding ETL & data transformation for the ingestion of industry standard LLMs (OpenAI, Claude, etc) - Familiarity with Large Language Models (LLMs) - Skilled in interfacing with APIs (OpenAI, Google Gemini/Vertex, etc.) using wrapper libraries such as Instructor, LiteLLM, etc. - Practical experience in prompt engineering - Ability to work with structured outputs and potentially tool calling - 5+ years general experience in backend (Ruby on Rails, Elixir Phoenix, Python Django, or Node Express) and/or native app development (React Native, Flutter, Android, AOSP, Kotlin/Java). Nice to Have Competencies: We’re always pitching for new and exciting technology niches. Some of the areas below are relevant to us! - Experience with Google Cloud Platform (GCP), particularly Cloud Run and Cloud Tasks - Knowledge of search technologies, including embeddings and vector databases for semantic search, as well as keyword-based search (BM25) - Familiarity with PySpark for batch data processing - Experience working with LLMs, Vector Databases, and other generalist AI-enabled application patterns - Client-facing experience: working directly with customers to gather requirements and provide technical solutions - Product management experience: defining product roadmaps and collaborating closely with stakeholders - Engineering management experience: leading teams, setting technical direction, and mentoring developers - Recent experience working in a startup environment - NYC-based preferred for collaboration, but not a strict requirement. Compensation The pay scale ranges from $125 p/hr to $175 p/hr, $150-200k/year based on experience. In addition to cash compensation, equity may be offered for candidates with the right level of experience, commitment, and long-term alignment. How we interview: Our interview process starts with a call where you get to meet a few members of our team. From there we’ll ask appropriate candidates to take part in a technical exercise which helps illustrate skill level and comfort. Direct application link here: https://garden3d.notion.site/1f1131fea2c78095922ec7e09bd96101 (Tell us a bit about your interest in the role and share your information by filling out the questions.) Quick tip! Adding a Loom recording to your profile in our form to showcase your skillset can really make your application stand out!

Airflow Android Aosp Claude Dagster Elixir Phoenix Flutter Google Cloud Platform Google Gemini Java Kotlin Node Express Openai Prefect Pyspark Python Python Django React Native Ruby On Rails

View details: Senior Data Engineer

New York

$150K - $200K / year

Apply

Azure Data Engineer

Enable Data

A leading provider of advanced data, application, and cloud engineering services.

Data Engineer69 days ago

Full Time RemoteTeam 51-200H1B Sponsor

Company Site LinkedIn

• Design, develop, and implement scalable and reliable data solutions on the Microsoft Azure platform. • Collaborate with cross-functional teams to gather and analyze data requirements. • Design and implement data ingestion pipelines to collect data from various sources, ensuring data integrity and reliability. • Perform data integration and transformation activities, ensuring data quality and consistency. • Implement data storage and retrieval mechanisms, utilizing Azure services such as Azure SQL Database, Azure Data Lake, and Azure Blob Storage. • Monitor data pipelines and troubleshoot issues to ensure smooth data flow and availability. • Implement data quality measures and data governance practices to ensure data accuracy, consistency, and privacy. • Collaborate with data scientists and analysts to support their data needs and enable data-driven insights.

Apache Azure Cloud ETL Hadoop Python Scala Spark SQL

View details: Azure Data Engineer

India

Apply

Job Closed

Data Engineer

Thaloz

We help companies achieve their goals and expand their business through technology.

Data Engineer69 days ago

Full Time RemoteTeam 51-200H1B No Sponsor

Company Site LinkedIn

• Architect, develop, and maintain scalable data pipelines and ETL workflows that support the ingestion, transformation, and storage of large datasets from diverse sources. • Implement automated data quality checks and validation processes to ensure the accuracy, consistency, and reliability of data across systems. • Work closely with product managers, data analysts, and business stakeholders to gather and understand data requirements, translating them into technical specifications and actionable engineering tasks. • Continuously monitor and optimize data systems for performance, scalability, and cost-efficiency, ensuring that data infrastructure meets evolving business needs. • Diagnose and resolve data-related issues promptly, providing root cause analysis and implementing preventive measures. • Maintain comprehensive documentation of data pipelines, ETL processes, and system architecture. Participate in design and code reviews to uphold high engineering standards. • Stay abreast of emerging data engineering technologies, tools, and best practices to drive innovation and continuous improvement within the team. • Provide guidance and mentorship to junior data engineers, fostering a culture of knowledge sharing and technical excellence.

ETL Linux MySQL Oracle Pandas PySpark Python RDBMS Shell Scripting Spark SQL Unix

View details: Data Engineer

Brazil

Apply

Job Closed

Data Quality & Metadata Specialist (She/ He/ They)

Job Description

Benefits

Related Guides

Related Categories

Related Job Pages

More Data Engineer Jobs

Data Engineer

Senior Data Engineer

Azure Data Engineer

Data Engineer