Derevo

Release the Power of Data

Data Engineer – MS Fabric

Data Engineer | Full Time | Remote | Senior | Team 51-200 | Since 2013 | H1B No Sponsor

Location

Mexico

Posted

93 days ago

Salary

Not disclosed

Seniority

Senior

Bachelor Degree | Spanish | Azure | ETL | PySpark | Python | Spark | SQL

Job Description

  • You will be key to creating and implementing modern, high-quality data architectures, driving analytics solutions built on Big Data technologies.
  • You will design, maintain, and optimize parallel multiprocessing systems, applying best practices for storage and management in data warehouses, data lakes, and lakehouses.
  • You will be the enthusiast who collects, processes, cleans, and orchestrates large volumes of data, understanding structured and semi-structured models, in order to integrate and transform multiple sources effectively.
  • You will define the optimal strategy based on business objectives and technical requirements, turning complex problems into achievable solutions that help our clients make data-driven decisions.
  • You will join the project and its sprints and carry out development activities, always applying data best practices and the technologies we implement.
  • You will identify requirements and define scope, taking part in sprint planning and engineering sessions with a consultant's mindset that adds extra value.
  • You will collaborate proactively in workshops and meetings with the internal team and the client.
  • You will classify and estimate activities under agile methodologies (epics, features, technical/user stories) and follow up daily to keep the sprint on pace.
  • You will meet committed delivery dates and manage risks by communicating deviations in time.

Job Requirements

  • Advanced English
  • Technical skills: query and programming languages
  • T-SQL / Spark SQL: DDL and DML, intermediate and advanced queries (subqueries, CTEs, multi-table joins with business rules), grouping and aggregation (GROUP BY, window functions, business metrics), stored procedures for ETL/ELT, and optimization of indexes, statistics, and execution plans for large-scale processes.
  • Python (PySpark): object-oriented programming (classes, modules), handling of data structures and types (variables, lists, tuples, dictionaries), control flow with conditionals and loops, ingestion of structured and semi-structured data, development of DataFrames and UDFs, time windows and partitioning for optimization, and good coding practices (PEP8, modularity).
  • JSON / REST APIs: orchestration of pipelines and CI/CD deployments through calls to the Fabric REST APIs, dynamic parameterization of runs, and artifact management.
  • Microsoft Fabric Lakehouse (OneLake + Delta Lake): data modeling with ACID Delta tables, partitioning and optimizations (OPTIMIZE, Z-ORDER) to improve performance; use of time travel for auditing and recovery.
  • Warehouses (Synapse Analytics): configuration of provisioned and serverless SQL clusters; star/snowflake schema design; execution of transactional T-SQL with isolation and automatic resource scaling.
  • CI/CD & Lifecycle Management: definition of pipelines in Azure DevOps or GitHub Actions with dev–test–prod environments; unit tests for datasets, schema validations, and automated deployment of artifacts.
  • Monitor Hub & Activator: creation of custom dashboards for ingestion and transformation metrics (latency, throughput, errors); proactive alerts and automated runbooks based on defined conditions.
  • Eventstreams & Eventhouse: configuration of no-code, real-time event ingestion; definition of processing windows, incremental aggregations, and storage optimized for time-based analysis.
  • Security and Data Governance: granular administration of roles (Admin, Member, Contributor, Viewer) and permissions per workspace/item; row-level security, column-level security, and dynamic data masking policies; auditing of access and changes for regulatory compliance.
  • Nice to have: general knowledge of Azure Data Factory.

Benefits

  • WELLNESS: We will support your overall well-being through personal, professional, and financial balance. Our statutory and additional benefits will help you achieve it.
  • LET'S RELEASE YOUR POWER: You will have the opportunity to specialize comprehensively in different areas and technologies, achieving interdisciplinary development. We will encourage you to set new challenges and surpass yourself.
  • WE CREATE NEW THINGS: We like to think outside the box. You will have the space, trust, and freedom to create, along with whatever training is needed to do so.
  • WE GROW TOGETHER: You will take part in cutting-edge, multinational technology projects with teams abroad.

More Data Engineer Jobs

Senior Data Engineer

Reply

Reply designs and implements innovative solutions in the areas of Digital Services, Technology, and Consulting.

Data Engineer | 93 days ago
Full Time | Remote | Team 10,001+ | Since 1996 | H1B Sponsor

  • Build and manage reliable data pipelines covering ingestion/collection, processing, integration, storage, and delivery of data across the organization.
  • Work within a distributed systems architecture for massively parallel processing (MPP) of large-scale data, combining diverse heterogeneous data sources and collaborating with analytics and data science teams to build data-driven solutions and generate value.

Brazil
Job Closed
Other | Remote | Team 10,001+ | Since 2020 | H1B No Sponsor

Date Posted: 2026-03-04
Country: United States of America
Location: US-AZ-REMOTE
Position Role Type: Remote
U.S. Citizen, U.S. Person, or Immigration Status Requirements: The ability to obtain and maintain a U.S. government issued security clearance is required. U.S. citizenship is required, as only U.S. citizens are eligible for a security clearance.
Security Clearance Type: DoD Clearance: Secret
Security Clearance Status: Active and existing security clearance required after day 1

At Raytheon, the foundation of everything we do is rooted in our values and a higher calling – to help our nation and allies defend freedoms and deter aggression. We bring the strength of more than 100 years of experience and renowned engineering expertise to meet the needs of today’s mission and stay ahead of tomorrow’s threat. Our team solves tough, meaningful problems that create a safer, more secure world.

We are seeking a highly skilled and motivated Data Engineer to join our dynamic team. As a Data Engineer, you will be responsible for designing and developing data solutions including data ingest and consumption workflows, transformation, modelling, and resulting analytics. You will work closely with data scientists, analysts, and other stakeholders to ensure the value, reliability, and trustworthiness of our data solutions. Your expertise as a data engineer will help us leverage data to drive business impact decision-making and enhance our aerospace and defense capabilities.

What You Will Do

  • Design, develop, and maintain scalable and efficient data pipelines to process data from various sources using tools such as Matillion and Databricks.
  • Collaborate with data scientists, analysts, architects and other stakeholders to understand data requirements and translate them into technical solutions.
  • Develop and optimize ETL (Extract, Transform, Load) processes to ensure data quality and integrity.
  • Utilize Snowflake for data warehousing, ensuring efficient data storage and retrieval.
  • Perform data modeling to design and structure databases that support business requirements.
  • Write and optimize SQL queries for data transformation.
  • Ensure data security and compliance with industry standards and regulations.
  • Adhere to and advocate established best practices and design patterns.
  • Document data processes, workflows, and technical specifications.
  • Conduct data analysis to support business insights and decision-making.

Qualifications You Must Have

  • Typically requires a University Degree or equivalent experience and minimum 8 years prior relevant experience, or an Advanced Degree in a related field and minimum 5 years' experience
  • Experience developing and optimizing ETL processes
  • Experience with Python for data transformation
  • Proficiency in API integration (REST) and associated exchange security patterns
  • Experience with cloud platforms, services and tools
  • The ability to obtain and maintain a U.S. government issued security clearance is required. U.S. citizenship is required, as only U.S. citizens are eligible for a security clearance.

Qualifications We Prefer

  • Experience in the aerospace and defense industry
  • Experience using data integration tools such as Matillion
  • Experience with Snowflake for data warehousing
  • Data modeling and data architecture experience
  • Strong proficiency in SQL and experience with relational databases
  • Familiarity with data governance and data security practices
  • Experience with data analysis and generating data requirements

What We Offer

Our values drive our actions, behaviors, and performance with a vision for a safer, more connected world. At RTX we value: Safety, Trust, Respect, Accountability, Collaboration, and Innovation.

Please consider the following role type definition as you apply for this role.

  • Remote: This position currently is designated as remote. Employees who are working in remote roles will work primarily offsite (from home) but may be expected to travel to the site location as needed. The successful candidate for this role will be required to reside and work from one of the 50 U.S. states (excluding U.S. territories).

As part of our commitment to maintaining a secure hiring process, candidates may be asked to attend select steps of the interview process in-person at one of our office locations, regardless of whether the role is designated as on-site, hybrid or remote.

The salary range for this role is 107,500 USD - 204,500 USD. The salary range provided is a good faith estimate representative of all experience levels. RTX considers several factors when extending an offer, including but not limited to, the role, function and associated responsibilities, a candidate’s work experience, location, education/training, and key skills.

Hired applicants may be eligible for benefits, including but not limited to, medical, dental, vision, life insurance, short-term disability, long-term disability, 401(k) match, flexible spending accounts, flexible work schedules, employee assistance program, Employee Scholar Program, parental leave, paid time off, and holidays. Specific benefits are dependent upon the specific business unit as well as whether or not the position is covered by a collective-bargaining agreement.

Hired applicants may be eligible for annual short-term and/or long-term incentive compensation programs depending on the level of the position and whether or not it is covered by a collective-bargaining agreement. Payments under these annual programs are not guaranteed and are dependent upon a variety of factors including, but not limited to, individual performance, business unit performance, and/or the company’s performance.

This role is a U.S.-based role. If the successful candidate resides in a U.S. territory, the appropriate pay structure and benefits will apply.

RTX anticipates the application window closing approximately 40 days from the date the notice was posted. However, factors such as candidate flow and business necessity may require RTX to shorten or extend the application window.

RTX is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability or veteran status, or any other applicable state or federal protected class. RTX provides affirmative action in employment for qualified Individuals with a Disability and Protected Veterans in compliance with Section 503 of the Rehabilitation Act and the Vietnam Era Veterans’ Readjustment Assistance Act.

United States
$107K - $204K / year
Job Closed
Other | Remote | Team 51-200

Sr. Data Platform Engineer

Job purpose summary

Real Time’s suite of Interventional Analytics solutions enables post-acute facilities to seamlessly collaborate with their partner hospitals, health systems, ACOs, and managed care plans to improve care, reduce costs, and achieve a level of interoperability.

Real Time is seeking a Senior Data Platform Engineer with deep hands-on experience across AWS, Terraform, Python, PySpark, SQL Server, PostgreSQL, and C#. This role will focus on designing and optimizing cloud-native data solutions, driving performance and cost efficiency, and enabling large-scale migrations from legacy SQL Server systems to modern PostgreSQL-based architectures. The ideal candidate thrives in independent, dynamic environments, is comfortable shifting between multiple projects, and can balance long-term architecture with short-term deliverables.

Duties and responsibilities

  • Design, build, and optimize data pipelines (batch and streaming) using PySpark, Python, and SQL
  • Lead and support SQL Server → PostgreSQL migration efforts, including schema translation, performance tuning, and data validation
  • Implement and maintain Terraform infrastructure for AWS services (S3, Glue, Lambda, EMR, Redshift, DMS, etc.)
  • Conduct performance and cost optimization reviews of AWS workloads, recommending and implementing improvements
  • Develop frameworks for data quality, monitoring, and reconciliation
  • Collaborate with business stakeholders and other engineers to translate requirements into scalable solutions
  • Apply software engineering best practices (version control, CI/CD, testing, code reviews) to data engineering workflows
  • Contribute to standards, best practices, and technical mentorship across the team
  • Participate in a rotating on-call schedule to provide production support, ensuring reliability of data platforms with minimal disruption to business operations

Additional Duties and Responsibilities

  • Maintain regular and punctual attendance.
  • Any other assigned tasks
  • Comply with all company policies and procedures.

Qualifications

  • 5+ years of professional experience in data engineering or software engineering roles
  • Advanced skills with Python and PySpark for large-scale data processing
  • Strong SQL development experience, including query tuning and ETL/ELT design
  • Hands-on experience with AWS data services (S3, Glue, EMR, Lambda, DDB, DMS, CloudWatch, IAM, VPC)
  • Experience with Terraform/Terragrunt (or equivalent IaC) for AWS infrastructure automation
  • Familiarity with SQL Server and PostgreSQL, including database optimization and migration patterns
  • Working knowledge of C# for application/data integrations
  • Strong grasp of data modeling, architecture, and governance best practices
  • Excellent problem-solving skills, ability to self-manage, and comfort working in environments with changing priorities
  • Bachelor’s degree required
  • Excellent verbal, written and presentation skills.
  • Ability to thrive in a fast paced, deadline driven environment.
  • AWS certification (Solutions Architect, Data Analytics, or DevOps) a plus
  • Data visualization experience with Tableau, PowerBI, MicroStrategy, etc. also a plus
  • Experience working in the healthcare industry a plus
  • You must love what you do! You will be working with other talented and passionate individuals who enjoy working and solving problems together!

Additional Qualifications

  • Independent contributor who can also collaborate effectively in a remote environment
  • Strong communication skills, both technical and business-oriented
  • Comfortable balancing multiple priorities without losing quality
  • Reliability and responsiveness when participating in the on-call rotation, with a proactive and calm approach to production issues
  • Positive, solution-oriented mindset, able to tackle tasks with optimism

Working conditions

Position requires minimal amount of travel. Remote/Work from anywhere work environment within the United States.

Physical requirements

Remaining in a stationary position, often sitting for prolonged periods.

Direct reports

None

Why you'll love it here:

  • You’ll be part of a team that helps save lives every day, and improves the lives of 100,000s of people!
  • Competitive base salary
  • Ongoing training and coaching for career development
  • Comprehensive medical, dental and vision coverage with significant company contribution
  • Disability insurance
  • 401(k) retirement savings program

United States

Data Engineer

Together For Talent

We are a faith-based nonprofit organization devoted to helping people grow through meaningful service, community, and care. Our team combines professional excellence with a shared mission to make a lasting impact.

Data Engineer | 93 days ago

This description is a summary of our understanding of the job description.

Role Description

We are looking for a highly technical, detail-oriented Data Engineer to serve as the technical engine and quality-control specialist for our data operations. Reporting directly to the Director of Data (who also serves as the Lead Architect), you will execute the end-to-end development of our ETL/ELT pipelines under their strategic direction.

  • Execute the full data lifecycle (Extract, Transform, Load), utilizing industry-standard tools like Fivetran to automate ingestion.
  • Help manage and scale complex data workflows using tools like Apache Airflow, ensuring task dependencies are met and pipelines run with 100% reliability.
  • Perform rigorous data cleansing and transformation within the SQL layer to turn raw, messy data into "Gold-standard" KPI-ready logic.
  • Function as the primary monitor for the data environment. Execute automated monitoring protocols to identify schema drift or data anomalies before they reach a stakeholder.
  • Assist the Director of Data in the maintenance and optimization of cloud data warehouses, specifically BigQuery, ensuring performance and cost-efficiency.
  • Partner with Marketing and Content teams to normalize data from complex stacks, including GA4, PostHog, Facebook Ads, Reddit, and others, as well as custom website tracking.

Qualifications

  • SQL Mastery: You are a "black belt" in SQL. You write complex joins, window functions, and CTEs that are both performant and easily maintainable.
  • Pipeline Expertise: Proven experience building and managing data pipelines at scale using Fivetran or similar automated ingestion tools.
  • Orchestration Power User: Experience with Apache Airflow (or similar tools like Prefect/Dagster) for managing complex pipeline dependencies and scheduling.
  • Warehouse & Lake Experience: Strong technical proficiency in BigQuery and Redshift, including an understanding of partitioning, clustering, and cost-optimization.
  • The "Data Instinct": You are a natural "whiz" at spotting when data is incorrect. You instantly see the anomalies that indicate a tracking break, a sync error, or an API failure.

Requirements

  • Modern Marketing Stack Knowledge: A deep understanding of the nuances of GA4 and/or PostHog, web tracking schemas, and media pixel implementation.
  • Content Management Data: Experience sourcing and modeling data from Content Management Systems (CMS) to track content performance and attribution.
  • Marketing Analytics: Familiarity with how Marketing teams utilize data for campaign optimization and customer journey mapping.

Company Description

We are a faith-based nonprofit organization devoted to helping people grow through meaningful service, community, and care. Our team combines professional excellence with a shared mission to make a lasting impact.

United States
Job Closed