Data Engineer – Mid/Senior
Location
Brazil
Posted
17 hours ago
Salary
0
Seniority
Senior
Job Description
Data Engineer – Mid/Senior
Kognit - Committed to Transforming
• Model and implement the data warehouse and a data lake on S3. • Implement an offline Feature Store (SQL Server, point-in-time correctness) and an online Feature Store (Redis). • Develop ETL/ELT pipelines with versioning, data lineage, and data quality tests. • Evolve existing pipelines and analytical dashboards as the starting point for the data lake. • Integrate event-driven architecture (Azure Service Bus / Kafka) for real-time features. • Support MLOps pipelines (Azure ML, MLflow) in collaboration with ML Engineers.
Job Requirements
- Bachelor's degree in Data Science, Computer Science, Statistics, Engineering, Mathematics, Economics, or related fields.
- English or Spanish (international client).
- Advanced SQL, preferably SQL Server.
- Python for data pipelines.
- AWS — S3, RDS; familiarity with ECS/EC2 and Application Load Balancer.
- Experience building production ETL/ELT pipelines with data quality testing and versioning.
- Good practices in data lineage, point-in-time correctness, and partitioning.
Benefits
- Contractor (PJ) — we are looking for long-term partners.
- Remote work.
- Workload: 8 hours per day (168 hours/month).
- Time bank with quarterly settlement (payment for positive balance hours).
- Variable compensation after 4 months of contract.
- Flexible card: meal allowance of BRL 700.00 per month (Flash card).
- Paid time off: 7 consecutive days after 1 year of contract; 15 days after 2 years; 21 days between years 5 and 7; and 30 days after 8 years of contract.
- Birthday day off (one day off during your birthday month).
- Partnership with Wellhub.
- Partnership with Open English.
- Partnership with FIAP.
- Partnerships with clinics and psychological services.
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Work on an SAP Commerce migration project, supporting the adaptation of the new SAP Commerce platform to the current data flows; • Perform maintenance and adjustments on existing pipelines, following directions and priorities defined by the client; • Contribute to the continuity of data migration and integration initiatives related to the project; • Support requests that fall outside the project's initial scope, ensuring the evolution and stability of data flows; • Participate in the analysis, correction, and enhancement of existing data processes; • Collaborate with the team on the maintenance and adaptation of solutions during the technological transition period.
• Diseñarás, desarrollarás y optimizarás pipelines y soluciones de datos escalables, fiables y mantenibles en entornos modernos de Data Warehouse, trabajando principalmente sobre Snowflake • Participarás en el diseño e implementación de modelos y soluciones de datos eficientes, aplicando buenas prácticas de modelado, transformación ELT, calidad, gobierno y rendimiento, con foco en la escalabilidad, mantenibilidad y evolución futura de las soluciones • Identificarás oportunidades de mejora continua en procesos, consultas, cargas, modelos, costes y flujos de integración, contribuyendo a la evolución de la plataforma de datos • Trabajarás de forma cercana con equipos de negocio y otras áreas técnicas para entender requisitos, traducirlos en soluciones robustas y asegurar la entrega de datos fiables, trazables y preparados para el consumo
• You design and own the architecture of our end-to-end data platform — from data integration and transformation to data delivery • You lead the development of robust data pipelines using Fivetran, Airbyte and Python, setting technical standards for the team • You drive outstanding data modeling in Databricks and apply advanced SQL, dbt and data-quality frameworks such as elementary • You define and establish standards for data quality and ensure reliable data marts that serve as the basis for business-critical decisions • You provide technical leadership and work closely with the Product Owner and the Business Insights team on complex data challenges • You are responsible for the ongoing development of our cloud infrastructure on AWS and advance Infrastructure as Code (IaC) using Terraform
• You design and own the architecture of our end-to-end data platform, including ingestion, transformation, and serving layers • You lead the development of robust data pipelines using Fivetran, Airbyte, and Python – setting engineering standards for the team • You drive data modeling excellence in Databricks using advanced SQL, dbt, and data quality frameworks like elementary • You define and enforce data quality standards, ensuring reliable data marts that power business-critical decisions • You act as technical lead, collaborating closely with the Product Owner and Business Insights team on complex data challenges • You own and evolve our cloud infrastructure on AWS, driving infrastructure-as-code with Terraform (IaC)




