Extractta

EXTRACTTA | Information that generates Solutions

Senior Data Engineer

Data Engineer | Full Time | Remote | Senior | Team 201-500 | Since 2005 | H1B No Sponsor

Location

Brazil

Posted

25 days ago

Salary

0

Seniority

Senior

Job Description

  • Design, develop, and support robust data infrastructure solutions with a focus on scalability, security, and efficiency.
  • Lead the implementation and maintenance of ecosystems such as Trino, AWS Glue, Redshift, Airflow, Kafka, and Spark on Kubernetes (K8s).
  • Collaborate with engineering, data science, and product teams to ensure seamless integration between systems.
  • Diagnose and resolve incidents related to performance, scalability, and reliability.
  • Apply and promote best practices in architecture, data governance, and monitoring.

Job Requirements

  • Strong experience with data infrastructure ecosystems: Trino, AWS Glue, Terraform, CI/CD, Apache Airflow, Kafka, and Spark on Kubernetes.
  • Advanced proficiency in Python (including PySpark) and object-oriented programming.
  • Proven experience with distributed systems, data warehouses, and large-scale ETL/ELT pipelines.
  • Practical experience or knowledge of Databricks (workspaces, jobs, notebooks, and Spark integration).
  • Hands-on knowledge of data governance, security, and storage (e.g., S3, Redshift).
  • Analytical skills and problem-solving with a focus on operational efficiency.
  • Good communication skills and experience working collaboratively.
  • Certifications such as AWS Certified Solutions Architect or similar (preferred).
  • Familiarity with Trino for ad-hoc analytics (preferred).
  • Experience with monitoring tools like Datadog or New Relic (preferred).
  • Knowledge of AI applied to data, especially RAG (Retrieval-Augmented Generation), for analytics and intelligent insights projects (preferred).

Benefits

  • Competitive compensation based on experience;
  • Opportunities for growth within the company and participation in strategic projects;
  • Dynamic and challenging work environment;
  • Opportunity to work at a rapidly growing company in the market.

More Data Engineer Jobs

Full Time | Remote | Team 10,001+ | H1B No Sponsor

  • Collaborate with business users to identify, analyze, and document business requirements and key performance indicators (KPIs)
  • Design, develop, and maintain data integration and automation using Power BI, Azure Databricks, Power Automate, and SmartSheets
  • Collect and transform data from multiple sources into a structured format for analysis
  • Develop and maintain data models and data sets for efficient and accurate reporting
  • Create Power BI dataflows and shared semantic models to consolidate data model definitions
  • Ensure data accuracy, integrity, and consistency within the BI solutions
  • Provide training and support to Power BI developers to optimize data models in Power BI for their development needs
  • Collaborate with IT teams to ensure data security, integrity, and compliance with data governance policies
  • Stay up to date with the latest trends in business intelligence and analytics tools, techniques, and best practices

All locations: Arizona | North Carolina | South Carolina
$95K - $105K / year
Job Closed

Data Engineer

CVS Health

Bringing our heart to every moment of your health.

Data Engineer | 25 days ago
Full Time | Remote | Team 10,001+ | Since 1963 | H1B No Sponsor

Role Description

We're seeking a Data Engineer to design and implement data pipelines that power analytical capabilities. This hands-on role requires an understanding of data engineering best practices and the ability to translate business requirements into technical solutions. You will be part of a dedicated team creating datasets for analytic and data science workloads. You will work with other Data Engineers and Sr. Data Engineers on designing, implementing, and testing data pipelines.

  • Data Pipeline Development: Design and build ETL/ELT data pipelines to ingest, process, and transform datasets from multiple sources.
  • Performance Optimization: Implement best practices for performance tuning, partitioning, and clustering to optimize data queries.
  • Data Quality & Governance: Follow data quality standards, data governance frameworks, and security policies for data storage and access.
  • Data Modeling & Architecture: Develop and optimize data models and schemas to support analytics, reporting, and machine learning requirements.
  • Data Integration & Transformation: Collaborate with data scientists and analysts to design data solutions that integrate with BI tools and machine learning models.
  • Documentation & Knowledge Sharing: Create comprehensive documentation for data pipelines, workflows, and processes.

Qualifications

  • 2+ years of applicable work experience
  • Proficiency in Python, specifically with ETL pipelines
  • Strong proficiency in SQL and experience in developing complex queries
  • Experience deploying data pipelines in a cloud environment (any of Azure, AWS, GCP)
  • Excellent communication and interpersonal skills, with the ability to collaborate effectively with data scientists and analysts.

Requirements

  • Experience working with healthcare data, especially Epic.
  • Experience using GCP’s BigQuery
  • Knowledge of data governance best practices in a cloud environment.
  • Experience working with Machine Learning processes.

Benefits

  • Comprehensive benefits package designed to support the physical, emotional, and financial well-being of colleagues and their families.
  • Medical, dental, and vision coverage
  • Paid time off
  • Retirement savings options
  • Wellness programs and other resources, based on eligibility.

United States
$64.9K - $173.0K / year
Job Closed

Data / Analytics Engineer

G2i Inc.

G2i is a hiring platform run by engineers that matches you with pre-vetted React and React Native engineers.

Data Engineer | 25 days ago
Full Time | Remote | Team 11-50 | H1B No Sponsor

Role Description

Pacific Star Excavating is looking for a hands-on Data / Analytics Engineer to support a short-term modernization initiative involving Power Apps workflows, database design, and data automation. The current workflow relies heavily on SharePoint Lists and Power Apps. The client needs an engineer who can redesign the architecture into a more scalable, database-backed solution while ensuring the existing Power App remains synchronized and operational. This is a fast-paced engagement for someone who can quickly assess the current setup, recommend improvements, and execute efficiently.

Responsibilities

  • Replace SharePoint Lists with a scalable database-backed workflow
  • Design and implement ETL/data ingestion processes
  • Import/export and process CSV-based project estimate data
  • Clean, parse, and prepare operational datasets
  • Maintain synchronization between the database and Power Apps
  • Recommend technical solutions and explain tradeoffs clearly to stakeholders
  • Build dashboards and data visualizations when needed
  • Collaborate directly with the client in a consultative capacity

Qualifications

  • Strong experience with Power Apps
  • Power Automate
  • SharePoint Lists
  • Database design and data modeling
  • ETL/data pipeline development
  • Experience handling CSV ingestion and transformation workflows
  • Strong communication and consultative/problem-solving mindset
  • Ability to work independently with high ownership

Requirements

  • Experience building dashboards and data visualizations (Nice to Have)
  • UI/UX exposure or frontend sensibility (Nice to Have)
  • Previous experience modernizing legacy internal workflows (Nice to Have)

Benefits

  • Contract duration: Approximately 2 weeks
  • Location: Fully remote
  • Client based in: Washington State (USA)
  • Compensation: Approximately USD 2.5K–3K for the contractor
  • Start: ASAP

Ideal Candidate

Someone proactive, technically strong, and execution-focused, capable of not only building solutions, but also helping the client understand why a specific approach is the best fit.

United States
$2.5K - $3K / contract (approximately 2 weeks)
Part Time | Remote | Team 10,001+ | Since 2004 | H1B Sponsor

Role Description

The Data Science Intern will work on specific data-driven projects aimed at improving patient outcomes, optimizing hospital operations, and enhancing healthcare delivery. During the internship, you will collaborate with a diverse group of stakeholders to analyze healthcare data, develop predictive models, and present insights to key stakeholders.

  • Work on targeted data science projects, such as predictive analytics, patient care optimization, and operational efficiency.
  • Clean, process, and analyze large healthcare datasets.
  • Develop and implement machine learning models to solve healthcare challenges.
  • Present findings and actionable insights to healthcare teams.

Qualifications

  • Current enrollment in a relevant field (e.g., Data Science, Statistics, Computer Science).
  • Experience with Python, R, SQL, or related tools.
  • Strong analytical and problem-solving skills.
  • Passion for improving healthcare outcomes through data.
  • Junior or senior level student currently working towards a degree with a preference in business, information systems, health information management, healthcare management or other related field.

Requirements

  • FTE: 1
  • Possible Remote/Hybrid Option: Remote
  • Shift Rotation: Day Rotation (United States of America)
  • Shift Start Time: 8:00 AM
  • Shift End Time: 4:30 PM
  • Weekends: No
  • Holidays: No
  • Call Obligation: No
  • Union: Union Posting Deadline:
  • Compensation Range: $20.00 - $20.00

Benefits

  • Comprehensive benefits include medical, dental, vision, life, and disability insurance, along with supplemental options to fit your needs.
  • 401(k) plan with employer contributions to help you plan for the future.
  • Investment in professional development through training, tuition reimbursement, and educational programs.
  • Flexible scheduling and generous time off.
  • Wellness resources focused on physical, mental, and emotional health.
  • Benefit eligibility may vary.

United States
$20 / hour