The power of one verification platform
Senior Data Engineer
Location
Worldwide
Posted
1 day ago
Salary
0
Seniority
Senior
Job Description
Senior Data Engineer
Sumsub
• Designing and developing scalable and efficient data pipelines, ETL processes, and data integration solutions. • Ensuring data quality and reliability by implementing data validation, cleansing, and quality monitoring processes. • Optimising database performance by tuning queries, implementing indexing strategies, and analyzing performance metrics. • Collaborating with cross-functional teams to gather requirements, understand data needs, and develop solutions. • Staying up-to-date with emerging technologies and industry trends in data engineering. • Establishing and enforcing best practices and standards for data engineering.
Job Requirements
- Strong technical proficiency in data engineering technologies, such as Apache Airflow, ClickHouse, ETL tools, and SQL databases.
- Deep understanding of data modeling, ETL processes, data integration, and data warehousing concepts.
- Proficiency in programming languages commonly used in data engineering, such as Python, Java, or Scala.
- Knowledge of AWS is a plus.
- Strong analytical and problem-solving skills.
- Solid project management and organizational skills, with the ability to prioritize and manage multiple data engineering projects concurrently.
Benefits
- Remote-first, trust-based culture.
- True flexibility with work hours.
- Extra time off: birthday holiday and 10 personal days.
- Fair and transparent pay.
- Opportunities for personal development and learning.
- Team offsites covered by the company.
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Senior Data Engineer
eSimplicityAn engineering firm that delivers high-quality Healthcare IT, Cybersecurity, and Telecommunication solutions.
• Designing, developing, and maintaining scalable data pipelines that ingest, reconcile, and validate Medicaid program data from MESH and connected CMS systems including T-MSIS, MBES, MacFin, SPoTT, and CMCS DataConnect • Building and operating data ingestion workflows that support mixed-modality content — spreadsheets (Excel, CSV), documents (PDF, Word), API feeds, and enterprise data services — with full lineage and audit traceability • Implementing the data layer that powers the Master Detailed Budget Table (MDBT) modernization, including parsing legacy and modern MDBT formats, normalizing line items (e.g., MMIS 2A/B, 4A/B, 5A/B/C), and validating total computable, state share, and federal share calculations • Developing API services to facilitate bidirectional data integration between MESH (Salesforce) and external systems, including standardized integrations with T-MSIS for compliance and Data Quality Assessment status • Building data processing workflows on AWS (S3, Glue, EMR, Athena, RDS/Redshift) and, where applicable, Databricks, Spark, and Hive for big-data processing • Creating data products that feed Power BI dashboards and Salesforce reports, ensuring consistent definitions, refresh schedules, snapshot management, and data-quality indicators • Implementing automated data quality, validation, and reconciliation jobs that flag funding decreases, mismatched FFP rates, missing submissions, and other anomalies • Supporting AI/ML pipelines for the MESH platform — including features for AI-assisted submission analysis, predictive analytics, and anomaly detection — by providing curated, governed training and inference data • Writing unit and integration tests for all data processing code and partnering with DevOps engineers on CI, CD, and IaC (GitHub Actions, Terraform) so pipelines deploy reliably • Performing code reviews, contributing to data-engineering standards, and mentoring less experienced engineers on pipeline design, performance optimization, and observability • Maintaining data security and privacy controls for data at rest and in transit (FIPS 140 validated encryption, KMS-managed keys, least-privilege access) consistent with CMS ARS 5.1 and FedRAMP Moderate baselines • Documenting data dictionaries, lineage, calculation logic, and pipeline runbooks in CMS-approved tools (Confluence, GitHub) to support transparency, audit, and contract transition activities
• Liderar y ejecutar el desarrollo de pipelines de datos en proyectos de analytics, con una visión integral de arquitectura de datos end to end. • Colaborar estrechamente con el líder de equipo y el equipo de implementación, entregando visibilidad técnica del avance de las iniciativas. • Apoyar en la definición, estimación y planificación de tareas para productos analíticos bajo metodologías ágiles. • Liderar la gobernanza técnica de la solución junto al cliente, participando en decisiones clave. • Diseñar e implementar arquitecturas de soluciones de datos en la nube de forma colaborativa. • Participar en el modelado de datos y desarrollo de modelos analíticos corporativos. • Investigar y profundizar en tecnologías necesarias para alcanzar los objetivos del proyecto.
Data Catalog Specialist, S4
Mondelēz InternationalWe’re a house of incredible brands providing people with the right snack, for the right moment, made the right way.
Job Description Are You Ready to Make It Happen at Mondelēz International? Join our Mission to Lead the Future of Snacking. Make It Uniquely Yours. You will be a senior testing leader within the Mondelēz International Digital & Technology organization, responsible for leading end-to-end quality assurance and testing activities across large-scale transformation programs, including the o9 Enterprise Operations Planning (EOP) platform rollout. You will own the testing strategy, manage testing cycles, and drive quality outcomes across multiple workstreams and regional stakeholders in the CPG industry The Data Catalog Specialist is responsible for the design, implementation, and maintenance of the data catalog within the S/4HANA landscape. This role ensures that data assets are well-defined, easily discoverable, and properly governed, enabling data-driven decision-making across the organization. Key Responsibilities: • Data Catalog Implementation & Management: o Document data elements in scope in pre-defined format o Support design and implement the data catalog solution, integrating it with the S/4HANA environment in line with MDLZ strategy. o Support configuration and customization of the data catalog tool to meet organizational needs. • Metadata Management: o Organize and document metadata in line with MDLZ standards and policies. o Collaborate with data owners and stewards to capture and curate metadata for S/4HANA data assets. o Support implementation of metadata extraction and enrichment processes. o Ensure metadata quality and consistency across the data catalog. • Data Discovery & Search: o Train users to easily discover and understand data assets through the data catalog. o Support search functionality optimization and indexing for efficient data discovery. o Support development of a user-friendly interface for browsing and exploring the data catalog. • Data Governance & Compliance: o Support data governance initiatives by maintaining of a central repository for data definitions and lineage. o Document quality and business rules and standards for data and support implementation of data quality solution. o Ensure compliance with data privacy regulations (e.g., GDPR, CCPA) by documenting data sensitivity and access controls. • S/4HANA Integration: o Work closely with S/4HANA Finance teams to understand their data requirements and collaborate with other functional teams on requirements for data cataloguing. o Support data cataloguing tool integration with S/4HANA data sources, including tables, views, and APIs.• Collaboration & Communication: o Collaborate with data owners, data stewards, data architects, and business users to ensure the data catalog is aligned with functional requirements o Provide training and support to data users on how to use the data catalog effectively. More about this role Qualifications: - Education: Bachelor's degree in Computer Science or Information Management, or a related field. - Experience: - 5+ years of experience in data governance, metadata management, or data catalog implementation. - Strong understanding of data governance principles and practices. - Hands-on experience with data catalog tools (e.g., Ataccama, Collibra, Informatica Enterprise Data Catalog). - Experience with S/4HANA data structures and data flows. - Familiarity with data modeling and data warehousing concepts. - Experience and/or understanding of Finance function is an added advantage - Skills: - Proficiency in SQL and data manipulation languages - preferred - Excellent communication and interpersonal skills. - Strong analytical and problem-solving abilities. - Ability to work independently and as part of a team. Preferred Qualifications: - Experience with SAP Information Steward or other SAP data governance tools. - Knowledge of SAP Master Data Governance (MDG) - Experience with cloud-based data catalog solutions - Certifications in data governance or metadata management Responsibilities Specific to S/4HANA: - Understanding of S/4HANA Data: Possess a strong understanding of the S/4HANA data model, including key tables, business objects, and data relationships. - Understanding of SAP Fiori Apps: Knowledge of how data is presented and consumed through SAP Fiori apps. Key Performance Indicators (KPIs): - Data catalog adoption rate. - Number of data assets documented in the data catalog. - Data quality scores for key S/4HANA data elements. - Reduction in time spent searching for data. - Increased data literacy among business users. No Relocation support available Business Unit Summary Headquartered in Singapore, Mondelēz International's Asia, Middle East and Africa (AMEA) region is comprised of six business units, has more than 21,000 employees and operates in more than 27 countries including Australia, China, Indonesia, Ghana, India, Japan, Malaysia, New Zealand, Nigeria, Philippines, Saudi Arabia, South Africa, Thailand, United Arab Emirates and Vietnam. Seventy-six nationalities work across a network of more than 35 manufacturing plants, three global research and development technical centers and in offices stretching from Auckland, New Zealand to Casablanca, Morocco. Mondelēz International in the AMEA region is the proud maker of global and local iconic brands such as Oreo and belVita biscuits, Kinh Do mooncakes, Cadbury, Cadbury Dairy Milk and Milka chocolate, Halls candy, Stride gum, Tang powdered beverage and Philadelphia cheese. We are also proud to be named a Top Employer in many of our markets. Mondelēz International is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation or preference, gender identity, national origin, disability status, protected veteran status, or any other characteristic protected by law. Job Type Regular Project and Program Management Business Capability
Data Science Intern
RubrikRubrik is a computer software company delivering instant application availability for cloud, development, search, and recovery. Founded in 2014, Rubrik was buil
About The JobWe’re looking for a Data Scientist Intern to join our Data Research team. As a data scientist intern you will be in charge of researching and developing ML/AI models for features of our security products. Specifically, you will be involved in advanced classification algorithms, anomaly detection, NLP, and more. This is a summer internship for 3 months (full time). What you’ll be doing: - Research and develop ML/AI models in the domains of NLP, anomaly detection, advanced classification, and more - Work closely with data analysts on research, pattern analysis, etc. - Work closely with developers on implementation of models as part of features in the product - Work with global teams to continuously push our ML infrastructure forward Qualifications:Required: - Bachelor’s degree in Mathematics, Computer Science, or other related field - Currently pursuing a Master’s degree in Mathematics, Computer Science, or other related field - Proficiency in programming languages, especially Python - Experience with common data science toolkits, such as Jupyter Notebook, Pandas, NumPy, Matplotlib, etc. - Strong problem-solving skills and analytic capability to develop insights and recommendations - Excellent communication skills in Hebrew and English Preferred: - Demonstrated experience in implementing ML algorithms in production environments - Familiarity with cloud platforms such as AWS, Google Cloud, or Azure Join Us in Securing and Accelerating the World's AI TransformationRubrik (RBRK), the Security and AI Operations Company, leads at the intersection of data protection, cyber resilience, and enterprise AI acceleration. Rubrik Security Cloud delivers complete cyber resilience by securing, monitoring, and recovering data, identities, and workloads across clouds. Rubrik Agent Cloud accelerates trusted AI agent deployments at scale by monitoring and auditing agentic actions, enforcing real-time guardrails, fine-tuning for accuracy and undoing agentic mistakes. Linkedin | X (formerly Twitter) | Instagram | Rubrik.com Inclusion @ RubrikAt Rubrik, we are dedicated to fostering a culture where people from all backgrounds are valued, feel they belong, and believe they can succeed. Our commitment to inclusion is at the heart of our mission to secure the world’s data. Our goal is to hire and promote the best talent, regardless of background. We continually review our hiring practices to ensure fairness and strive to create an environment where every employee has equal access to opportunities for growth and excellence. We believe in empowering everyone to bring their authentic selves to work and achieve their fullest potential. Our inclusion strategy focuses on three core areas of our business and culture: - Our Company: We are committed to building a merit-based organization that offers equal access to growth and success for all employees globally. Your potential is limitless here. - Our Culture: We strive to create an inclusive atmosphere where individuals from all backgrounds feel a strong sense of belonging, can thrive, and do their best work. Your contributions help us innovate and break boundaries. - Our Communities: We are dedicated to expanding our engagement with the communities we operate in, creating opportunities for underrepresented talent and driving greater innovation for our clients. Your impact extends beyond Rubrik, contributing to safer and stronger communities. Equal Opportunity Employer/Veterans/DisabledRubrik is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability. Rubrik provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability or genetics. In addition to federal law requirements, Rubrik complies with applicable state and local laws governing nondiscrimination in employment in every location in which the company has facilities. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training. Federal law requires employers to provide reasonable accommodation to qualified individuals with disabilities. Please contact us at hr@rubrik.com if you require a reasonable accommodation to apply for a job or to perform your job. Examples of reasonable accommodation include making a change to the application process or work procedures, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment. EEO IS THE LAW NOTIFICATION OF EMPLOYEE RIGHTS UNDER FEDERAL LABOR LAWS




