InStride Health logo
InStride Health

Evidence-based anxiety and OCD treatment for kids, teens, and young adults (ages 7-22).

Data Engineer II

Data EngineerData EngineerFull TimeRemoteSeniorTeam 51-200H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

4 days ago

Salary

$110K - $125K / year

Seniority

Senior

Job Description

Data Engineer II

InStride Health

• Design, develop, and maintain robust, scalable ETL/ELT data pipelines using Python, SQL, and data processing frameworks including dbt, Matillion, and AWS services. • Implement data quality checks, monitoring, and alerting across all data pipelines to ensure data integrity and reliability. • Optimize existing pipelines for performance, cost-efficiency, and error handling. • Contribute to the design and maintenance of InStride’s data warehouse and data lake solutions, including schema design, data modeling, and indexing strategies in Amazon Redshift. • Ensure data security, HIPAA compliance, and proper handling of protected health information (PHI) within all data infrastructure. • Work closely with data analysts, data scientists, and business intelligence engineers to understand their data requirements and deliver reliable, high-quality data access. • Troubleshoot and resolve complex production data issues with urgency and root-cause rigor. • Develop and maintain clear documentation for data models, pipelines, and data sources to enable self-service analytics. • Participate in code reviews and contribute to technical discussions, bringing a constructive and detail-oriented perspective. • Stay current on emerging data technologies and tools, bringing relevant insights to the team.

Job Requirements

  • 3+ years of experience designing, developing, and deploying data pipelines and data warehouse solutions in production environments.
  • Strong proficiency in SQL and Python for data engineering and transformation work.
  • Hands-on experience with cloud data warehouses (Amazon Redshift preferred; Snowflake or BigQuery also valued) and familiarity with ETL/ELT tools such as dbt, Matillion, or similar.
  • Working knowledge of AWS services relevant to data engineering (e.g., S3, Glue, Lambda, Redshift).
  • Demonstrated understanding of HIPAA compliance requirements and experience working with sensitive or regulated data.
  • Ability to design and build data systems that are scalable, observable, and built to handle growth in data volume and complexity.
  • Strong understanding of data integration patterns, including APIs, webhooks, and batch ingestion techniques.
  • Experience giving and receiving structured feedback through pull request reviews and technical discussions.
  • Strong communication skills, with the ability to translate complex technical concepts for both technical and non-technical stakeholders.
  • Comfortable operating in a fast-paced startup environment, balancing competing priorities and making sound tradeoffs.
  • Experience working with healthcare data is a plus.

Benefits

  • Generous benefits package (401k with match)
  • Flexible PTO
  • Paid holidays
  • Paid service days
  • 4 week paid sabbatical
  • 12 week paid parental leave
  • Health benefits starting on your first day
  • And more

Related Categories

Related Job Pages

More Data Engineer Jobs

Talent Connect logo

Data Engineer Azure Data Factory

Talent Connect

100% remoto desde cualquier país de Latam. Trabajo con equipos internacionales (principalmente USA). Horario full-time.

Data Engineer4 days ago

Role Description Integrarte a un equipo de datos para diseñar, desarrollar y operar pipelines ETL/ELT en Azure, garantizando la ingesta, transformación y disponibilidad de datos para consumidores de negocio y modelos analíticos. El rol implica responsabilidad técnica en la implementación de soluciones con Azure Data Factory, Azure Databricks y ecosistemas de almacenamiento (Blob Storage, Data Lake) aplicando buenas prácticas de calidad, rendimiento y gobernanza de datos. Responsibilities - Diseñar y desarrollar pipelines de ingestión y transformación de datos utilizando Azure Data Factory y Azure Databricks. - Implementar procesos ETL/ELT con Apache Spark y PySpark para procesamiento por lotes y en tiempo casi real. - Modelar y optimizar consultas SQL y estructuras de datos para consumo analítico y reporting. - Gestionar y organizar datos en Azure Blob Storage y Azure Data Lake (Gen2), definiendo particionamiento, formatos y políticas de retención. - Construir pipelines reproducibles, parametrizables y observables; instrumentar logging, métricas y alertas. - Colaborar con equipos de ciencia de datos, BI y producto para definir contratos de datos, SLAs y requisitos de calidad. - Automatizar despliegues y pruebas de pipelines; aplicar control de versiones y prácticas CI/CD para artefactos de datos. - Garantizar cumplimiento de políticas de seguridad y privacidad en el manejo de datos sensibles. - Documentar arquitecturas, pipelines y runbooks; transferir conocimiento al equipo y participar en on-call rotativo cuando aplique. Qualifications - Experiencia comprobable como Data Engineer (mín. 3 años) trabajando con Azure Data Factory y Azure Databricks. - Dominio de Apache Spark y PySpark para procesamiento distribuido y optimización de jobs. - Sólidos conocimientos de SQL: modelado, tuning y optimización de consultas. - Experiencia en diseño e implementación de pipelines ETL/ELT y en integración de datos. - Conocimiento práctico de Azure Blob Storage y Azure Data Lake (Gen2), incluyendo formatos como Parquet/Delta Lake. - Familiaridad con herramientas de orquestación, monitorización y CI/CD aplicadas a datos. - Inglés técnico: nivel intermedio (lectura de documentación y comunicación con equipos técnicos). - Residir en Argentina. Bonus Points - Experiencia con Delta Lake, optimizaciones de storage y compaction. - Conocimientos en seguridad y gobernanza de datos: RBAC, Data Catalog, políticas de enmascaramiento. - Familiaridad con Python avanzado y librerías de ingeniería de datos (pandas, numpy, airflow o equivalentes). - Experiencia con performance tuning de Spark y troubleshooting de jobs en Databricks. - Certificaciones Azure relacionadas con datos (DP-203, Azure Data Engineer Associate) o Databricks. - Experiencia en entornos cloud a gran escala y en trabajo ágil con equipos multidisciplinarios. Selection Process - Recruiting screen - Entrevista técnica - Entrevista con el cliente - Oferta Benefits - Envía tu solicitud y únete a nosotros para impulsar soluciones de datos robustas y escalables. - Para postular: - Envía tu CV a través de nuestro portal. - Completa tu perfil en talentconnect.ai. - Participa en una entrevista inicial utilizando inteligencia artificial. - Al completar estos pasos estarás más cerca de sumarte a nuestro equipo.

Argentina
Blend360 logo

Data Engineer Analyst

Blend360

Optimizing business performance through people, data, tech & analytics

Data Engineer4 days ago
Full TimeRemoteTeam 501-1,000H1B Sponsor

Role Description We are looking for a Data Engineer Analyst to contribute to the design and implementation of scalable, domain-oriented data solutions for a high-impact enterprise initiative. You will work within a specific data domain — such as Ingestion, Customer Data, Enrichment, or Marketing Signals — ensuring high-quality, reliable data pipelines that power analytics and business-critical workflows. You may work 100% remotely from anywhere in Mexico! - Contribute to the design and development of scalable data pipelines on AWS (Glue, S3, Athena, Step Functions). - Build and optimize batch and streaming ingestion pipelines, including CDC-based architectures. - Support the design and implementation of data models aligned with business and analytics needs. - Ensure data quality, reliability, and performance across pipelines and datasets. - Collaborate with cross-functional teams to align technical solutions with business requirements. - Apply best practices in data engineering, including modular design, reusability, and governance. - Leverage AI-assisted development tools to improve productivity and code quality. - Proactively identify risks, bottlenecks, and optimization opportunities across data workflows. Qualifications - Bachelor's degree in Computer Science, Data Engineering, Software Engineering, or equivalent field. - Hands-on experience with the AWS ecosystem, including Glue, S3, Athena, and Step Functions (required). - Solid proficiency in Python and PySpark for data pipeline development (required). - Experience with Snowflake or similar cloud data warehouse platforms (plus). - Understanding of data pipeline architecture and design patterns. - Exposure to CDC (Change Data Capture) patterns and streaming ingestion. - Experience with data modeling for analytics and reporting use cases. - Familiarity with AI-assisted development tools (plus). - Strong problem-solving skills and ability to work effectively in collaborative, agile environments. - English: Advanced (required for effective communication with global teams). Requirements - 3+ years of experience in Data Engineering or related disciplines, with hands-on work building and maintaining data pipelines on cloud platforms such as AWS. Benefits - Learning Opportunities: - Certifications in AWS (we are AWS Partners), Databricks, and Snowflake. - Access to AI learning paths to stay up to date with the latest technologies. - Study plans, courses, and additional certifications tailored to your role. - Access to Udemy Business, offering thousands of courses to boost your technical and soft skills. - English lessons to support your professional communication. - Travel opportunities to attend industry conferences and meet clients. - Mentoring and Development: - Career development plans and mentorship programs to help shape your path. - Celebrations & Support: - Special day rewards to celebrate birthdays, work anniversaries, and other personal milestones. - Company-provided equipment. - Flexible working options to help you strike the right balance. - Other benefits may vary according to your location in LATAM. For detailed information regarding the benefits applicable to your specific location, please consult with one of our recruiters.

Mexico
Nearform logo

Senior Data Engineer

Nearform

Building enduring impact

Data Engineer4 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor

Role Description As a Senior Data Engineer at Nearform your main task will be designing, building, and maintaining scalable data platforms, pipelines, and warehouses using SQL, Python, Spark, and other relevant technologies. You will play a key role in building efficient data solutions, optimizing performance, and ensuring seamless data integration. However as you’ll likely work on a variety of projects your responsibilities may also include: - Designing, building, and optimizing data pipelines and ETL processes for large-scale data ingestion and transformation. - Developing and maintaining data platforms and warehousing solutions to support analytics and business intelligence needs. - Implementing scalable data storage solutions and ensuring high availability and performance. - Enhancing performance in data processing workflows for maximum speed and scalability. - Collaborating with data scientists, analysts, and stakeholders to understand business needs and translate them into data engineering solutions. - Ensuring data quality, governance, and compliance across all systems. - Supporting CI/CD processes for data pipeline deployments and implementing best practices in DevOps for data workflows. - Conducting code reviews and enforcing best practices in version control, automation, and testing. - Assisting with defining structured source code management and deployment processes. - Working with and supporting Technical Leaders in project execution and timely delivery. - Collaborating with client teams. Qualifications - Significant experience delivering at a Senior Engineer level, specialising in Data engineering. Our Seniors typically have at least 6 years’ commercial experience. - Practical experience of delivering in an agile environment. - Practical experience and knowledge of developing real-world solutions. - Extensive hands-on experience in some of the following technologies: Databricks, SQL, Python, and Spark for large-scale data processing. - Strong expertise in data warehousing technologies (e.g., Snowflake, Redshift, BigQuery, or similar). - Experience with distributed data processing frameworks (Spark, Hadoop, or similar). - Deep understanding of ETL processes, data modeling, and schema design. - Strong experience with cloud-based data solutions (AWS, Azure, GCP) and data lake architectures. - Experience with automation, CI/CD for data pipelines, and Infrastructure as Code (IaC). - Deep understanding of versioning control tools - e.g. Git. - Experience building software collaboratively using pull requests and code reviews. - Familiarity with data governance, security, and compliance best practices. - Excellent communication and collaboration skills. - Professional proficiency in English. Benefits - Annual Company Bonus - We all help Nearform to hit company goals so we all receive a share of the profits on an annual basis in line with company performance. - Work Remotely and Flexibly - We have a genuine dedication to work/life balance. Our flexible working culture allows you to work around what matters - school run, no problem! - Paid Time Off Package - We offer an annual leave of 24 days plus public holidays. We also offer sick leave, marriage leave and many more. - Remote Working Allowance - Every 2 years, you will have a budget of up to €1250 to help you set up a comfortable and productive workspace in addition to your essential equipment provided by Nearform when you join. - Training and Development Allowance - We understand the importance of continuously learning so we offer an allowance of up to €1000 you can use to upskill yourself. - Healthcare - It’s important to always take care of your health, so we offer additional private healthcare here at Nearform. If you wish to learn more about the plan offered feel free to reach out to our team. - Pension & Insurances - We offer a pension match of up to 5% and Income Protection and Death in Service for peace of mind. Company Description Nearform is an independent team of data & AI experts, engineers, and designers who build intelligent digital solutions and capability at pace. We create AI-enabled solutions that enhance digital experiences, empower developers, and deliver measurable results. In these ways, we partner with ambitious enterprises to deliver enduring impact. Our deep expertise in solving the world’s most complex digital problems, along with our collaborative, people-first approach, enables enterprises to build breakthrough products and modernise legacy systems by unleashing the power of AI. Today, our team of 500 experts in 20+ countries is trusted by leading enterprises including Lululemon, Puma, Sun Life, Starbucks, Travelex, Virgin Media 02, and Walmart.

Ireland
Helen of Troy logo

Enterprise Data Lead

Helen of Troy

Helen of Troy is a consumer goods company on a mission to elevate life through multiple well-known brands, including OXO®, Hydro Flask®, Vicks®, Braun®, Honeywell®, PUR®, and

Data Engineer4 days ago

Title: Enterprise Data Lead Location: USA - Arlington, Tennessee Job Description: Join our  Global Supply Planning  team at Helen of Troy and make an immediate impact on our trusted brands: OXO, Hydro Flask, Osprey, Honeywell, PUR, Braun, Vicks, Hot Tools, Drybar, Curlsmith, Revlon, and Olive & June. Together, we build innovative and useful products that elevate people's lives everywhere, every day.  Look around your home, and you'll find us everywhere, in your kitchen, living room, bedroom, and bathroom. We are already making your everyday lives better. We are powered by knowledgeable, enthusiastic, and forward-thinking people committed to developing a culture of inclusion. Whether you are just starting your career or in need of a challenge, we recognize, develop, and empower talent!  Position Title: Enterprise Data Lead Department: Global Supply Planning Work Location: Arlington, TN Hybrid Schedule: At Helen of Troy, we embrace a flexible hybrid work model designed to support collaboration and productivity. For roles eligible for hybrid work, our standard schedule includes in-office collaboration from Tuesday through Thursday, with the option to work remotely on Mondays and Fridays. Any updates to this model will be communicated in advance. Please note that hybrid eligibility and schedules may vary based on business needs and manager expectations. What you will be doing: The Enterprise Data Lead is a newly created, high-visibility role established to serve as the business champion for data governance, data quality, data definitions, and process standards across Helen of Troy. This senior leader will address significant legacy systems and data debt by shifting accountability from a technical problem to a process and behavioral solution. The successful candidate will have the business authority to drive behavior change across all brands, regions, and corporate functions, ensuring data accurately reflects commercial and operational realities. Data Governance & Policy Leadership - Serve as the primary Business Champion for data governance, data quality, data definitions, and data processes partnering with IT. - Partner with Domain Owners and IT to align recommendations regarding governance models, data architecture changes, and strategic policy updates; Act as a Domain Owner for Item Data across the company. - Define and enforce standard enterprise data definitions and business rules within the technical constraints of the ERP platform to maximize operational efficiency. - Define enterprise data access guardrails and change control frameworks, while partnering with IT to handle system-level access management. Process Optimization & Quality Assurance - Lead the development, optimization, and modernization of core enterprise data processes (e.g., new item and customer setup workflows) to remove operational bottlenecks. - Partner with IT to develop data quality dashboards, key performance indicators (KPIs), and automated exception reporting. - Identify behavioral root causes of ongoing data issues and establish durable, performance-linked incentive structures to sustain high data accuracy. Escalation & Decision Rights Management - Facilitate decisions crossing the boundary between technical data integrity (IT) and data accuracy (Business Domain Owners). - Own the data governance processes, standards, access guardrails, and standard hierarchy maintenance as specified in the enterprise RACI matrix. - Manage the escalation pathway for cross-domain data conflicts (e.g., item-to-customer discrepancies), routing them effectively to the respective Data Councils. Key Professional Interactions - Data Councils - This role facilitates the Data Councils (stood up at the item, customer, and vendor levels) to bring together stakeholders biweekly or monthly to review quality KPIs, align remediation priorities, and resolve enterprise blockers. - Business Domain Owners - This role partners closely with business data Domain Owners, and together with IT Lead to align on recommendations, supporting designated functional leads (Supply Chain for Item Master; Finance for Customer and Vendor Masters) to fulfill explicit accountability for data outcomes. - Information Technology (IT) Team - This role partners with Director, Supply Chain Systems who owns the system configuration, integrations, and technical control. Minimum Qualifications: - Bachelor’s degree in Data Science, Business Analytics, Supply Chain Management, Management Information Systems (MIS), Finance, or a related quantitative/business discipline; or equivalent practical experience. - 10+ years of progressive professional experience in master data governance, enterprise process design, or large-scale supply chain/finance data operations - 5+ years of operating in a dedicated director-level or senior cross-functional leadership capability within a matrixed corporate environment - Deep background in master data governance, enterprise process design, or corporate supply chain and finance data operations - Proven success aligning processes, core systems, and data governance frameworks across business units with historically disparate regional or brand practices - Working experience navigating ERP platform constraints (Oracle EBS highly preferred) and modern connected planning platforms (such as o9 Solutions) - Demonstrated ability to drive complex cross-functional behavioral modifications and establish accountability metrics without direct line authority. - Demonstrated capability in establishing data quality dashboards, automated exception reports, and system change guardrails. - Ability to work in the United States on a full-time basis. Preferred Qualification: - Master of Business Administration (MBA) or an advanced degree in a related field is an asset. Benefits: Salary + Bonus, Healthcare, Dental, Vision, Paid Holidays, Paid Parental Leave, 401(k) with company match, Basic Life Insurance, Short Term Disability (STD), Long Term Disability (LTD), Paid Time Off (PTO), Paid Charitable (volunteer) Leave, and Educational Assistance.  Wondering if you should apply? Helen of Troy welcomes people as diverse as our brands! Have the confidence to come as who you are because your point of view, skills, and experience will make us stronger. If you're eager to share new ideas and try new things, we want to hear from you.  #li-ab1 #LI-HYBRID Helen of Troy is an Equal Opportunity/Affirmative Action Employer. We are committed to developing a diverse workforce and cultivating an inclusive environment. We value diversity and believe that we are strengthened by the differences in our experiences, thoughts, cultures, and backgrounds. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, pregnancy, genetic information, disability, status as a protected veteran, or any other protected category under applicable federal, state, and local laws. We will provide individuals with disabilities with reasonable accommodations to participate in the job application process. If you would like to request an accommodation, please contact Human Resources at (915) 225-8000. Founded in 1968, Helen of Troy is a prominent player in the global consumer products industry, offering diverse career opportunities across North America, South America, Europe, and Asia. We boast a collection of renowned brands such as OXO, Hydro Flask, Osprey, Honeywell, PUR, Braun, Vicks, Hot Tools, Drybar, Curlsmith, Revlon, and Olive & June – many of which rank #1, #2, or #3 in their respective categories, making the Helen of Troy name synonymous with excellence and ingenuity. At Helen of Troy, our strategy involves acquiring brands that we can integrate and enhance, amplifying their unique attributes to drive growth and profitability. Embracing a culture of collaboration internally and externally, we are committed to providing innovative solutions tailored to consumers, operational excellence, global scalability, and exceptional shared services to support our brand portfolio. This dedication to fostering development and success sets Helen of Troy apart as a pioneer in the industry, propelling our brands to unparalleled heights of success and recognition worldwide The above statements are intended to describe the general nature and level of work performed by people assigned to this classification. They are not intended to be construed as an exhaustive list of all responsibilities and duties required of personnel so classified. Management retains the right to add or to change duties of the position at any time.

Texas