Construindo soluções com base em Inteligência Artificial para diferentes setores, formatos de negócio e cadeias de valor
Mid/Senior Data Engineer – Consulting
Location
Brazil
Posted
151 days ago
Salary
0
Seniority
Senior
Job Description
Mid/Senior Data Engineer – Consulting
Datarisk
• Act as a consultant on strategic business projects focused on data engineering using the Databricks platform; • Identify and implement best practices for high-volume data ingestion, ensuring performance and data availability; • Work on query optimization and data modeling to improve the efficiency of analytical queries; • Develop and maintain data pipelines (ETL/ELT) to integrate multiple sources, ensuring data quality and governance; • Design and implement Data Lakehouse architectures, applying the Medallion pattern (Bronze, Silver, Gold); • Support data modeling for modern Data Warehouses (Star Schema, Snowflake); • Explore best practices in Big Data environments, especially in the context of Databricks and Delta Lake; • Use open formats such as Delta Lake, Iceberg, Parquet and Avro for building and governing modern Lakehouse architectures; • Ensure data governance, including access control and versioning.
Job Requirements
- Solid experience with Databricks and Delta Lakehouse architecture;
- Strong SQL skills and experience optimizing queries for distributed processing;
- Experience with ETL/ELT and scalable data architectures;
- Knowledge of incremental data pipelines (UPSERTS, MERGE, CDC);
- Experience in analytical data modeling (Star Schema, Snowflake);
- Experience with Databricks workloads, including pipeline refactoring and performance tuning;
- Advantage: Familiarity with Unity Catalog for governance and access control in Databricks;
- Knowledge of modern Data Warehouses (Databricks, Snowflake, BigQuery, Redshift);
- Experience with AWS architectures, with strong knowledge of services such as S3, IAM, Glue and EMR;
- Experience provisioning and configuring Databricks clusters from scratch, including infrastructure, networking, permissions and integration with data catalogs.
Benefits
- Flexibility! Our team is remote — work from anywhere;
- Bradesco health and dental plan;
- Conexa Saúde & Psicologia Viva;
- Wellhub (formerly Gympass);
- Partnership with Open English;
- Caju: Home Office allowance;
- Birthday day off;
- 22 working days/year of paid leave;
- Hiring model: PJ (fixed monthly fee).
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Data Engineer
LUKA GLOBALTraining, Mentorship and Recruitment for the innovation space. Europe-wide HealthTech, Life Sciences and DeepTech.
• Develop scalable data management and data processing architectures. • Manage data acquisition from API, batch, event or streaming sources. • Develop processes for data aggregation. • Design and develop data pre- and post-processing stages. • Plan and design for data governance, security, provenance and the over-all data lifecycle. • Leverage best-in-class cloud technologies to cater for OLTP and OLAP business needs. • Integrate ML models and Analytic components into the workflows (including MLOps). • Work closely with Data Science and Application Development teams in an agile development process.
• Support and maintain Census syncs between Snowflake and downstream tools. • Help design and document Census models and sync logic to ensure data reliability and consistency. • Monitor and troubleshoot data issues related to Census, Fivetran, and Segment pipelines. • Partner with Analytics to ensure upstream data supports activation and reporting needs. • Write and maintain SQL queries in Snowflake to support reporting and analysis. • Assist with ad hoc data requests and exploratory analysis for internal stakeholders. • Support the creation, maintenance, and updates of Tableau dashboards and published data sources. • Help validate data accuracy and investigate discrepancies surfaced through reporting. • Work with Analytics team members to refine data requirements and improve metric clarity. • Contribute SQL and documentation to shared repositories and knowledge bases. • Use tools like Notion, Git, and Jira to track work and communicate progress. • Support ongoing efforts to standardize definitions and improve data transparency.
Data Implementation Engineer 2
Med A/RxMeduit is one of the nation’s leading revenue cycle management solutions companies, partnering with hospitals and physician practices in 48 states to provide excellent, compassionate patient engagement. We focus our talents on addressing patient questions after their visit so our clients can focus on their treatment. Our core values that we live daily are Integrity, Teamwork, Continuous Improvement, Client-Focused, and being Results-Oriented. Meduit is an Equal Opportunity Employer and does not discriminate against any employee or applicant for employment because of race, color, religion, sex, age, national origin, disability, military status, genetic information, sexual orientation, marital status, domestic violence victim status or status as a protected veteran or any other federal, state, or local protected class.
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description The Data Implementation Engineer is primarily responsible for the creation of processes that implement client data from hospital systems into company systems (data warehouse, data mapping tools, payment portals, collections platforms, etc.). This role also works closely with Operations, Workflow, Technical Support, Account Management, and other internal teams to ensure data integrity for clients post-implementation. Qualifications - BS in Mathematics, MIS, Computer Science, Statistics or comparable other technical background or combination of experience and/or education - 3-5 years of practical experience performing ETL between complex data sets and relational databases - Experience with MySQL, MSSQL, or other SQL and based platform - Experience with Talend Data Integration Requirements - Implement client data, implement new features on existing clients, and assist in data integrity for all clients - Implement client Revenue Cycle data into applicable applications based off project requirements including Benchmark and systems applications - Create and configure ETL processes from multiple data sources in complex formats - Log, track, and document solutions to complex data implementation tasks in Jira - Build custom data transformation processes using commercial ETL tools - Develop new logical data model schemas and tables for the creation and use of physical data model schemas and tables - Attend client project virtual meetings focused on gathering requirements and communicating project progress and timelines - Mentor other Team Members on data implementation solutions Benefits - Comprehensive paid training - Medical, dental, and vision insurance - HSA and FSA available - 401(k) with company match - Paid Wellness Time and Holidays - Employer paid life insurance and long-term disability - Internal growth opportunities
• Designs and builds scalable data extracts, integrations, transformations, and data models. • Ensures successful deployment and provisioning of data solutions across required environments. • Designs and implements data architectures and applications that enable speed, quality, and operational efficiency. • Interacts with cross-functional stakeholders to gather and define requirements and translate them into technical designs. • Develops deep familiarity with enterprise datasets, builds domain knowledge, and advances data quality. • Reviews requirements, identifies gaps, and drives resolution with stakeholders. • Identifies and recommends continuous improvement opportunities, ensuring integrations are automated, governed, and observable. • Serves as a key team member in designing and deploying a ground-up cloud data platform and pipeline. • Partners with data scientists to design, build, and maintain reproducible machine-learning pipelines. • Implements CI/CD for data and ML workflows. • Builds and maintains production-grade ML infrastructure such as feature stores, model registries, and data versioning. • Ensures ML models follow best-practice governance, including automated model performance monitoring and alerting. • Designs scalable data pipelines optimized for ML workloads. • Establishes MLOps standards, coding practices, and automation patterns that scale across teams.



