IT Consultant & Managed Service Provider
Data Engineer
Location
Argentina
Posted
99 days ago
Salary
0
Seniority
Mid Level
Job Description
Data Engineer
Netrix Global
• Identifying, creating, preparing data required for modern Data solutions and Data for AI. • Designing ETLs and ELTs for data transformations. • Designing, building and management of Datalakes architectures. • Working with Apache Hadoop projects (Spark, Hive, Pig, Oozie, Airflow, etc). • Integrating and testing Data solutions. • Creating and documenting the tests to meet requirements. • Working with Bigdata Environments. • Managing Data Cloud services, data access, security and data governance. • Analyzing and prepare data for Machine Learning workloads. • Managing monitoring and logs of applications and services.
Job Requirements
- Experience in Data Engineer/Architect role and tasks as ETL, data prepare & ingest, data streaming.
- Experience working in Cloud environments (AWS, and Azure) 2 years at least.
- Knowledge of Python, R or other data process tools.
- Intermediate/ Upper English Level.
- Experience with Apache Spark.
- Experience using AWS or Microsoft Data services: S3, AWS or Microsoft Glue, Athena, Sagemaker, etc.
- Experience using Devops tools (Git, pipelines).
- Experience working in an Agile Scrum team.
- Experience working with infrastructure as a code (Terraform, Cloudformation, etc.).
Benefits
- Swiss Medical: SMG-30 (family members included).
- AWS and Azure certifications.
- Happy club: Pedidos Ya Internet and connectivity.
- Competitive salary and benefits.
- English in company.
- Ability to work remotely.
- An awesome learning environment for you to develop.
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Type of Requisition: Regular Clearance Level Must Currently Possess: None Clearance Level Must Be Able to Obtain: None Public Trust/Other Required: NACI (T1) Job Family: Software Engineering Job Qualifications: Skills: Agile Methodology, Apache Airflow, Data Warehousing (DW), ETL Design, Extract Transform Load (ETL)Certifications: NoneExperience: 3 + years of related experienceUS Citizenship Required: No Job Description: Seize your opportunity to make a personal impact as a Cloud ETL Engineer supporting Drug data Processing System (DDPS) Part D Processing for CMS. GDIT is your place to make meaningful contributions to challenging projects and grow a rewarding career. At GDIT, people are our differentiator. As a Cloud ETL Engineer you will help ensure today is safe and tomorrow is smarter. Our work depends on Cloud ETL Engineer joining our team to DDPS Part D processing for CMS, to support IRA legislation Mandated ETL development and testing for CMS Part D Medicare processing ETL programming. How a Cloud ETL Engineer will Make an Impact: - Builds and codes applications and/or models using various computer programming languages. - Designs, develops, deploys, and maintains advanced operating systems and operating system software - Installs enhancements and performs updates to software of existing systems, including middleware and application programs that run on the system - Performs troubleshooting of advanced problems and provides customer support for software systems and application issues - Debugs advanced problems with system software. Provides recommendations for continuous improvement - Performs maintenance tasks to keep systems running smoothly - Writes and updates test procedures and programs - May coach and provide guidance to less-experienced professionals - May serve as a team or task lead What You’ll Need to Succeed: Education: - BA/BS in a Computer Science or related technical discipline or the equivalent combination of education, technical certifications or training, or work experience. Required Experience: - 3+ years of direct related computer programming experience. - 3+ years of IT experience with at least 4 years of SQL development experience developing on multiple relational database platforms like Snowflake. - 2+ years of Cloud ETL development experience using AWS Services / tools, Databricks, Snowflake, and/or similar technologies. - 3+ years of physical data modeling, partitioning, and developing optimization/indexing strategies on the Teradata platform or similar DBMS. - 2+ years of experience with Snowflake and Snowflake ETL for loading data from AWS S3 - 2+ years of experience with UNIX scripting and utilities. - In depth knowledge on Data Warehouse (DW) concepts for ETL Development - 2+ year of experience in Code migration and deployment using AWS resources in the cloud environment. - 3+ years of experience in working with Python and Spark programming. - Candidate must be able to obtain and maintain a Public Trust clearance and must have lived in the United States at least three (3) out of the last five (5) years. Required Technical Skills: - AWS Development using S3, EC2 Lambda functions - Extract Transform Load (ETL) - Python (Programming Language) - Apache Spark programming - Knowledge on Snowflake Data Warehouse with ETL - Knowledge on Databricks and notebook, coding and execution - GitHub Code configuration and Management Required Skills and Abilities: - Attend daily stand-up scrum calls. - Collaborate in a "war-room" setting with business analysts, developers, testers, architect, scrum master, and product owner to assist in grooming, designing, coding, unit testing user stories related to the Program Increment and current iteration. - Exercise positive interpersonal communication skills and works independently and within an agile team during all phase of the software development lifecycle. - Design, develop and implement complex ETL processes of healthcare data to meet a wide range of business and system requirements. - Support the ETL operational processes including but not limited to: automation, job scheduling, dependencies, monitoring, maintenance, patches, upgrades, security, and administration. - Investigate and corrects software defects and analyzes and maintains data quality. - Mentor and provide guidance to junior team members. - Identify process improvements and innovative ways to solve existing or new problems. Preferred Skills: - Prior experience developing healthcare IT solutions strongly preferred. - 2+ years of experience in Github, or similar version control tools. - Prior experience using the Agile development framework, and CI/CD DevOps - Prior working experience with Medicare Part D Data with ETL development Location: Remote Clearance Level: Requires the ability to pass a CMS background check and meet the residency requirement for having resided in the US at least (3) three out of the last (5) five years in order to obtain a Public Trust. Sponsorship will not be provided for this position What GDIT Can Offer You: - Full-flex work week to own your priorities at work and at home, with core work hours Monday – Friday 9:00 AM ET – 3:00 PM ET - 401K with company match - Comprehensive health and wellness packages - Internal mobility team dedicated to helping you own your career - Professional growth opportunities including paid education and certifications - Cutting-edge technology you can learn from - Rest and recharge with paid vacation and holidays - Challenging work that makes a real impact on the world around you - Remote work #GDITFedHealthJobs The likely salary range for this position is $102,000 - $138,000. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range. Scheduled Weekly Hours: 40 Travel Required: None Telecommuting Options: Remote Work Location: Any Location / Remote Additional Work Locations: Total Rewards at GDIT: Our benefits package for all US-based employees includes a variety of medical plan options, some with Health Savings Accounts, dental plan options, a vision plan, and a 401(k) plan offering the ability to contribute both pre and post-tax dollars up to the IRS annual limits and receive a company match. To encourage work/life balance, GDIT offers employees full flex work weeks where possible and a variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave. GDIT typically provides new employees with 15 days of paid leave per calendar year to be used for vacations, personal business, and illness and an additional 10 paid holidays per year. Paid leave and paid holidays are prorated based on the employee’s date of hire. The GDIT Paid Family Leave program provides a total of up to 160 hours of paid leave in a rolling 12 month period for eligible employees. To ensure our employees are able to protect their income, other offerings such as short and long-term disability benefits, life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance are provided or available. We regularly review our Total Rewards package to ensure our offerings are competitive and reflect what our employees have told us they value most. We are GDIT. A global technology and professional services company that delivers consulting, technology and mission services to every major agency across the U.S. government, defense and intelligence community. Our 30,000 experts extract the power of technology to create immediate value and deliver solutions at the edge of innovation. We operate across 50 countries worldwide, offering leading capabilities in digital modernization, AI/ML, Cloud, Cyber and application development. Together with our clients, we strive to create a safer, smarter world by harnessing the power of deep expertise and advanced technology. Join our Talent Community to stay up to date on our career opportunities and events atgdit.com/tc. Equal Opportunity Employer / Individuals with Disabilities / Protected Veterans
Role Description As a Lead Data Engineer (Consultant) at Kainos you will be responsible for designing and developing data processing and data persistence software components for solutions which handle data at scale. Working in agile teams, Lead Data Engineers provide strong development leadership for team members and take responsibility for the quality of the codebase as well as the match to user needs. - Taking responsibility for the development of whole components or subsystems within a team. Development incorporates design, code, test and defect resolution. - Focusing on hands-on design and development, using open source and commercial platforms. - Defining and enforcing development best practice and coaching team members to ensure consistency. - Working with project architects, taking responsibility for non-functional needs of ETL/ELT data processing pipelines such as robustness and performance. - Taking responsibility for standards and execution of unit and integration testing done within the team. - Taking responsibility for software product due diligence and integration. - Leading troubleshooting and tuning of activities. - Working with Operations teams to ensure the application software is operationally ready. - Working with Security Architects and accreditors to ensure compliance with relevant legal and security requirements. - Advising customers and managers and other team members of the estimated effort and technical implications of user stories and user journeys. - Contributing to technical proposals as part of the sales process. - Managing, coaching and developing a small number of staff, with a focus on managing employee performance and assisting in their career development. Qualifications - Experience of leading a team of engineers in the implementation of data-intensive system components. - Experience of applying standards for design (patterns), development (style guides) and operational readiness (automation, deployment). - Proficient software development experience in one of Java, Scala, or Python. - Software development experience with data-processing platforms from vendors including Informatica, Azure Databricks or any relevant ETL tools. - Expert in SQL or SQL extensions for analytical use case. - Expert understanding of distributed data stores and data processing frameworks. - Ability to simply and clearly communicate technical design both written and verbally. - Proficient in designing analytical and operational data models. - A keen interest in AI technologies. Requirements - Comfortable with Data Warehouse methods and techniques. - Actively shares their thoughts and views on data practices. - AWS/Azure/GCP Certified in Data Services. - Expertise in continuous improvement and sharing input on data best practice. - Participation in development and/or technology communities. - Practical experience with AI technologies, tools, processes and delivery. Who you are Our vision is to enable outstanding people to create digital solutions that have a positive impact on people’s lives. Our values aren't abstract; they are the behaviours we expect from each other every day, and underpin everything that we do. We expect everyone to display our values by being: - Determined in how obstacles are overcome; - Honest when dealing with others; - Respectful of how you treat others; - Creative to find solutions to complex problems; - Cooperative by sharing information, knowledge and experience. Company Description Kainos is a high-growth IT services company providing digital technology solutions and agile software development to enterprise customers. Across our 30-year history, we have worked on transformational projects across government, NHS and a myriad of private sector clients. At Kainos, we believe in the power of diversity, equity and inclusion. We are committed to building a team that is as diverse as the world we live in, where everyone is valued, respected, and given an equal chance to thrive.
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description This role involves designing, building, and maintaining our data architecture using various technologies. - Design, build, and maintain data architecture using Snowflake, Azure, and Databricks - Develop and maintain ELT/ETL workflows using Python, SQL, and Spark - Collaborate with cross-functional teams to gather and analyze business requirements - Ensure compliance with data privacy regulations and best practices - Implement and manage DevOps processes to automate deployment and testing of data pipelines - Stay current with the latest data technologies, trends, and best practices - Work in an Agile environment to deliver high-quality solutions on time and within budget - May work from home Qualifications - Master’s degree (or foreign equivalent) in Computer Science, Mathematics, Statistics or related field - 2 years of experience in the offered position or closely related occupation Requirements - 2 years of cloud platform experience - 2 years of experience with Snowflake or Databricks, Azure or AWS, and visualization tools - Experience creating pipelines in Snowflake or Databricks - Knowledge of the MarTech ecosystem or related - Experience manipulating and analyzing complex, high-volume, high-dimensionality data from varying sources - Fluency with at least one scripting language: Python, R - Strong knowledge of DevOps workflow, cloud-native platforms (containers, Kubernetes, serverless), version tools (Git, bitbucket), and infrastructure as code (IaC) tools (CloudFormation or Terraform) - Experience with Data Integration tools Spark, Databricks or equivalent - Experience in Linux and Unix shell scripting Benefits - 40 hours/week - Salary range: $123,448 per year – $157,000 per year - If offered employment must have legal right to work in U.S. - EOE Contact Staffing Dept (HR) – Callaway Golf Sales Company – 2180 Rutherford Rd, Carlsbad, CA 92008 (Must reference Job Code RV0615) DE&I and EEOC As a purpose-led, performance driven company, we strive to foster a culture of belonging based on respect, connection, openness and authenticity. We are committed to building and maintaining a workplace that celebrates the diversity of our associates, supporting them to bring their authentic selves to work every day. If your experience is close to what we’re looking for, please consider applying. Experience comes in many forms, skills are transferable, and passion goes a long way. We know that diverse backgrounds and experiences make for the best problem-solving and creative thinking, which is why we’re dedicated to adding new perspectives to the team and encourage everyone to apply. We look forward to learning more about you. ARE YOU READY TO MAKE THE TURN? APPLY TODAY!
Principal Data/AI Engineer
FUJIFILMFUJIFILM is a publicly traded, multinational photography and imaging company with global headquarters in Tokyo, Japan and regional headquarters in Valhalla, New York. Established i
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description The Principal Data/AI Engineer helps drive the technical strategy and architecture of enterprise-scale data and AI platforms that power mission-critical data products, analytics, and AI-driven solutions. In this role, you will operate as a technical expert in planning, designing, developing, and debugging new and existing data pipelines. You will advocate for data and AI engineering best practices, including: - Idempotent modular pipeline design - Version control - Automated testing - CI/CD - IaaS - Data quality checks and observability You will help mentor junior engineers through design guidance, code reviews, pairing, and enabling Agile frameworks to promote iterative delivery and continuous improvement. You will work closely with a cross-functional team of business and IT peers and are expected to lead by example, balancing delivery speed of new features with long-term platform health and technical excellence. What you'll do: - Architect, build, and maintain highly scalable batch and streaming pipelines on the Snowflake Data Platform (Snowpipe, Tasks, Streams, Dynamic Tables, Snowpark, Iceberg). - Architect and deliver ML/GenAI solutions using managed cloud services (AWS, Azure, Snowflake Cortex). - Implement modern data modeling and architecture patterns; establish and enforce standards for data quality (tests, expectations, SLAs/SLOs), observability (metrics, logs, traces), and lineage. - Ensure integration of biotech systems (MES, LIMS, SCADA, ERP, QMS) into centralized data platform. - Collaborate with product managers, product engineers, platform architects, and business stakeholders to align data and AI engineering solutions with business requirements. - Enable modern AI use cases - feature stores, vector search/RAG, model serving, safety/guardrails, and continuous monitoring for drift, bias, and performance. - Optimize storage tiers, compute clusters/warehouses, caching, and workload orchestration for latency and throughput. - Partner with cybersecurity and compliance teams to ensure adherence to GxP, FDA 21 CFR Part 11, and data privacy regulations. - Lead design reviews, incident postmortems, and cross-team architecture forums. - Stay current with emerging technologies (data mesh, real-time streaming, digital twins, generative AI platforms) and introduce relevant innovations. - And other job duties that may be assigned from time to time. Qualifications - Bachelor’s degree in Computer Science, Data Engineering, AI/ML Engineering, or related field. - 12+ years of professional experience in data/software engineering, AI/ML engineering, or cloud platform engineering. - Proven experience using Python and SQL. - Extensive experience building and maintaining data pipelines using modern frameworks (e.g. Airflow, dbt). - Proven experience with data modelling for analytics and AI use cases. - Strong experience with cloud platforms (AWS, Azure). - Proven experience delivering production-grade data solutions. - Familiarity with biotech or life sciences systems and regulatory compliance frameworks (GxP, FDA, EMA). Preferred Experience and Education - Advanced degree (MS/PhD) preferred. - Relevant industry certifications (e.g., Snowflake, AWS, Azure) preferred. Knowledge, Skills and Abilities - Design and implementation of scalable batch and streaming data pipelines. - Strong proficiency in Python and SQL/dbt for data processing, automation, and analytics. - Extensive experience in Airflow or similar orchestration tool. - Expertise in designing and developing data solutions on Snowflake, including data modelling, performance optimization, and cost-efficient usage. - Experience with modern AI technologies, including LLMs, embeddings, and vector databases. - Proven track of delivering cloud-based solutions (AWS, Azure). - Containerization and deployment of data and AI workloads using Docker. - Orchestration and operation of containerized workloads using Kubernetes. - Data quality management, observability, lineage, and governance. - Knowledge of biotech IT/OT systems (MES, LIMS, SCADA), and compliance frameworks (GxP, FDA, data privacy). - Strong problem-solving, optimization, and troubleshooting skills for large-scale data systems. - Effective communication with both technical and non-technical stakeholders, influencing at senior levels. - Passion for emerging technologies, continuous improvement, and building innovative engineering cultures. Salary and Benefits The US salary range for this position is $146,000 to $241,000. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience. - Robust benefits package including medical, dental, vision and prescription drug coverage with the option of a Health Savings Account with company contributions. - Industry leading 401(k) savings plan. - Insurance coverage. - Employee assistance programs and various wellness incentives. - Paid vacation time, sick time, and company holidays.




