GFT Technologies logo
GFT Technologies

As a pioneer for digital transformation GFT develops sustainable solutions across new technologies.

Engenheiro de Dados Sênior

Data EngineerData EngineerFull TimeRemoteSeniorTeam 10,001+Since 1987H1B No SponsorCompany SiteLinkedIn

Location

Brazil

Posted

5 days ago

Salary

0

Seniority

Senior

Job Description

Engenheiro de Dados Sênior

GFT Technologies

• Atuar na manutenção e evolução de pipelines ETL multiestágio em diferentes domínios de dados; • Implementar transformações de dados, como conversões, filtragem de outliers, preenchimento de lacunas, suavização e interpolação; • Diagnosticar e corrigir problemas de qualidade de dados em pipelines produtivos; • Projetar e manter configurações de mapeamento de campos baseadas em YAML para novas fontes de dados; • Consultar e carregar dados utilizando Cloud SQL e BigQuery; • Construir e manter endpoints utilizando FastAPI seguindo princípios de arquitetura limpa; • Desenvolver testes unitários e de integração utilizando pytest; • Colaborar em revisões de código e manutenção de pipelines de CI/CD no Azure DevOps; • Trabalhar diretamente com dados brutos e pipelines produtivos, garantindo eficiência e confiabilidade; • Traduzir regras de negócio em transformações eficientes utilizando pandas;

Job Requirements

  • Inglês avançado para comunicação diária com times internacionais;
  • Experiência sólida com Python (versão 3.10 ou superior);
  • Domínio de dataclasses, type hints e abstract base classes;
  • Experiência com pandas e NumPy para processamento de dados em larga escala;
  • Experiência com scipy para otimização e ajuste de curvas;
  • Experiência com SQLAlchemy (Core e ORM);
  • Vivência na construção de APIs assíncronas utilizando FastAPI e asyncio;
  • Experiência com Cloud SQL e autenticação via IAM no GCP;
  • Experiência com BigQuery para consulta e carga de dados;
  • Experiência com Cloud Storage para manipulação de arquivos;
  • Conhecimento em service accounts e controle de acesso (IAM);
  • Experiência no design e debugging de pipelines de dados multiestágio;
  • Forte entendimento de processamento de dados sequenciais ou séries temporais;
  • Experiência com validação de dados, deduplicação e imputação;
  • Experiência com pipelines orientados a configuração (YAML);
  • Experiência com Docker e Docker Compose;
  • Experiência com pipelines de CI/CD (preferencialmente Azure DevOps);
  • Experiência com gerenciamento de dependências utilizando Poetry;
  • Conhecimento básico de Redis como camada de cache;
  • Capacidade de comunicação clara de decisões técnicas e atuação colaborativa com times multidisciplinares;

Benefits

  • Cartão multi-benefícios – você escolhe como e onde utilizar.
  • Bolsas de Estudos para cursos de Graduação, Pós, MBA e Idiomas.
  • Programas de incentivo à Certificações.
  • Horário de trabalho flexível.
  • Salários competitivos.
  • Avaliação de desempenho anual com plano de carreira estruturado.
  • Possibilidade de carreira internacional.
  • Wellhub e TotalPass.
  • Previdência Privada.
  • Auxílio-Creche.
  • Assistência Médica.
  • Assistência Odontológica.
  • Seguro de Vida.

Related Categories

Related Job Pages

More Data Engineer Jobs

ContractRemoteTeam 201-500Since 2014H1B No Sponsor

• Impulsar tu desarrollo profesional trabajando en diversos proyectos dentro de distintas industrias. • Contribuir al diseño, implementación y optimización de infraestructuras en la nube. • Participar en la construcción, automatización y mejora de procesos ETL dentro del ecosistema Azure. • Resolver retos relacionados con integración de datos, automatización operativa y procesamiento eficiente de información.

Mexico
$1.8K / month
Full TimeRemoteTeam 10,001+H1B Sponsor

• Develop and maintain batch and streaming data pipelines in the Google Cloud Platform (GCP) environment; • Build ingestion and transformation workflows using Dataflow and Apache Beam; • Integrate data from multiple sources, including Oracle, SQL Server, MySQL, on-premises environments, and external buckets; • Work with BigQuery for data processing, transformation, and optimization; • Organize and structure data in Cloud Storage following a Data Lake architecture (Raw, Bronze, Silver, and Gold); • Orchestrate workflows and pipelines using Cloud Composer (Airflow); • Develop ETL/ELT processes and analytical transformations using SQL; • Build automations and data integrations using Python; • Ensure the quality and reliability of processed data; • Support initiatives to migrate reports and analytical assets to a new Data Visualization platform; • Collaborate with Engineering, Analytics, Data Science teams, and business stakeholders.

Brazil
Beyondsoft Consulting logo

Data Engineer

Beyondsoft Consulting

Beyondsoft is a leading mid-sized business IT and consulting company that combines modern technologies and proven methodologies to tailor solutions that move your business forward. Our global head office is based in Singapore, and our team is made up of a diversely talented team of experts who thrive on innovation and pushing the bounds of technology to solve our customers’ most pressing challenges. We believe that collaboration, transparency, and accountability are the values that guide our business, our delivery, and our brand. Everyone has something to bring to the table, and we believe in working together with our peers and clients to leverage the best of one another in everything we do. Our ability to achieve our mission and live out our values depends upon a diverse, equitable, and inclusive culture. So, we strive to foster a workplace where people have the respect, support, and voice they deserve, where innovative ideas flourish, and where people can unleash their brilliance.

Data Engineer5 days ago
OtherRemoteTeam 501-1,000

Role Description In this fully remote role you will support scalable data operations through the development of ETL processes, SQL-based integrations, Power Platform solutions, Azure services, and Power BI reporting capabilities. The ideal candidate combines strong technical engineering expertise with operational problem-solving skills to support complex business processes, data integrity, reporting accuracy, and automation reliability. This role requires hands-on experience with Azure Data Factory, Logic Apps, Power BI, SQL development, Power Platform administration, and Microsoft Fabric, along with the ability to troubleshoot and optimize end-to-end data workflows in a fast-paced enterprise environment. What You Will Be Doing: - Design, build, maintain, monitor, and troubleshoot data-processing automations, including data preparation, publishing, transformations, error handling, retry mechanisms, and conditional workflows. - Develop and maintain data ingestion pipelines from external sources into SQL databases. Manage datasets, linked services, copy activities, and pipeline scheduling. - Create and support automated flows to trigger Logic Apps and handle lightweight processes such as report refreshes, dataflow updates, notifications, and approval workflows. - Develop, enhance, and maintain canvas apps and standalone internal applications to support business needs. - Perform full-stack BI development, including data modeling, DAX development, Power Query (M), semantic models, row-level security (RLS), workspace management, and report publishing. Leverage Microsoft Fabric as the unified access platform for portal activities. - Manage and administer Azure resources including Logic Apps, Data Factory, storage accounts, monitoring solutions, RBAC, Application Insights, and Log Analytics. Ensure alignment with S360 security and compliance requirements. - Design and maintain standardized Excel templates, support Excel-to-database ingestion workflows, and validate raw source data. - Troubleshoot and resolve issues across complex ETL processes and data pipelines. - Monitor and manage alerting for failures in Logic Apps, ADF pipelines, Power Automate flows, and Power BI refreshes. - Govern Power Platform environments, including environment management, licensing, and compliance. - Implement and maintain security controls (Azure RBAC, Microsoft Entra ID, S360 alignment, Power BI RLS). - Manage change control, versioning, and documentation for automations, SQL artifacts, and reporting assets. - Optimize performance and scalability across data pipelines and reporting solutions as volumes grow. - Conduct root-cause analysis to reconcile discrepancies between Excel, database systems, and reporting outputs. - Collaborate with vendors and development teams to quickly identify and resolve technical issues. - Read, modify, and extend existing hard-coded queries and publishing logic as needed. - Perform basic scripting for bulk operations, diagnostics, or automation support. - Perform additional duties as assigned. Qualifications - 5+ years of experience in data engineering, BI development, automation engineering, or related technical roles. - Strong hands-on experience with Azure Data Factory (ADF), including pipeline development, datasets, linked services, scheduling, monitoring, and troubleshooting. - Experience developing and maintaining Logic Apps and Power Automate workflows for operational automation and integrations. - Advanced SQL development skills including query optimization, stored procedures, ETL development, and troubleshooting complex data transformations. - Strong experience with Power BI development including: - Data modeling - DAX - Power Query (M) - Semantic models - Row-Level Security (RLS) - Workspace administration and publishing - Experience with Microsoft Fabric and enterprise BI/reporting platforms. - Experience administering Azure resources including RBAC, Application Insights, Log Analytics, Storage Accounts, and monitoring solutions. - Experience supporting Excel-to-database ingestion processes and validating source data integrity. - Ability to troubleshoot and resolve issues across ETL pipelines, reporting systems, and automation workflows. - Experience implementing security and governance controls across Azure and Power Platform environments. - Strong analytical and problem-solving skills, particularly in reconciling data across multiple systems. - Ability to maintain thorough documentation, including runbooks and process workflows. - Attention to detail and a proactive approach to monitoring, troubleshooting, and continuous improvement. - Occasional infrequent in person activity may be required. Requirements - Beyondsoft provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type with regards to race, color, religion, age, sex, national origin, disability status, genetics, veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. - This policy applies to all terms and conditions of employment, including recruiting, hiring, and the full employee lifecycle up through and including termination.

Worldwide
Stefanini LATAM logo

Ingeniero de Datos, Databricks

Stefanini LATAM

Co-creating solutions for a better future

Data Engineer5 days ago
Full TimeRemoteTeam 10,001+Since 1987H1B No Sponsor

• Diseñar, desarrollar y mantener pipelines de datos robustos y escalables. • Implementar procesos ETL/ELT para la integración y transformación de datos provenientes de múltiples fuentes. • Desarrollar soluciones utilizando Databricks y Apache Spark. • Construir y optimizar arquitecturas Lakehouse y Data Lake. • Garantizar la calidad, consistencia y disponibilidad de los datos. • Trabajar en conjunto con equipos de Data Analytics, BI, Data Science y negocio. • Participar en iniciativas de optimización de performance, monitoreo y gobernanza de datos. • Colaborar en el diseño de soluciones cloud alineadas con las mejores prácticas de ingeniería de datos. • Documentar desarrollos, procesos y soluciones implementadas.

Argentina