Job Closed

This listing is no longer active.

Blue Coding

Top notch developers, ready to deploy.

Senior Data Engineer – V

Data Engineer, Contract, Remote, Senior, Team 51-200, Since 2014, H1B No Sponsor

Location

Latin America

Posted

168 days ago

Salary

Seniority

Senior

5 yrs exp, English, AWS, Python, Scala, Apache Spark, SQL, Tableau, Terraform

Job Description

• Design, develop, and maintain data pipelines and datastores that support enterprise analytics, data science, and operational workloads.
• Lead and support large-scale database migration initiatives, including on-premises to cloud migrations.
• Monitor, analyze, and optimize the performance and stability of data layer services and platforms.
• Ensure data integrity, quality, and compliance across pipelines and datasets.
• Collaborate closely with peers across engineering, analytics, and technology teams.
• Guide, coach, and mentor data engineers, BI developers, and analysts.
• Design and implement enterprise-scale data solutions with long-term business impact.
• Build and maintain data processing solutions using Python and/or Scala.
• Work with a variety of data ingestion patterns, including SFTP, APIs, streaming, and batch processing.
• Design and support database models optimized for analytical and reporting use cases.
• Implement monitoring, alerting, and observability for data pipelines and infrastructure.
• Maintain clear and comprehensive documentation of data architectures, pipelines, and processes.
• Work within an Agile environment, collaborating through tools such as Jira and Git.

Job Requirements

5+ years of experience working in data engineering or data-centric technical roles.
Strong experience designing and maintaining enterprise-scale data pipelines and platforms.
Solid understanding of data warehousing concepts, modeling strategies, and analytical datasets.
Proven experience with SQL-based data environments and database performance tuning.
Hands-on experience with cloud-based data platforms, particularly on AWS.
Experience working with cloud-native data technologies such as S3, EMR/Spark, Lambda, Kinesis, Firehose, and Glue.
Strong programming skills in Python and/or Scala.
Experience with Infrastructure as Code concepts; Terraform experience is a strong plus.
Experience supporting analytics and data science consumers, including BI tools such as Tableau or Power BI.
Strong communication skills and the ability to collaborate effectively across teams.
Ability to work autonomously with minimal direction while owning complex data initiatives end-to-end.

Benefits

Salary in USD
Flexible schedule (within US Time zones)
100% Remote
High-impact role in a large-scale data migration and modernization initiative
Collaboration with senior data engineering, analytics, and technology teams
Opportunities for technical growth, leadership, and increased ownership as the data platform evolves

More Data Engineer Jobs

Support Data Engineer

KIS Solutions

Keep it Simple

Data Engineer, 168 days ago

Other, Remote, Team 51-200, Since 2012, H1B No Sponsor

• Maintain and optimize cloud infrastructure to ensure reliability and scalability for data systems.
• Extract, transform, and load (ETL) data across various systems to support business operations and decision-making.
• Identify and resolve issues within existing systems to enhance performance, stability, and scalability.
• Troubleshoot and fix bugs in deployed systems, ensuring minimal downtime and disruption.
• Optimize systems and streamline existing processes to improve efficiency and reduce latency.
• Build and optimize system deployment structures to ensure smooth, consistent deployments and system performance.

ETL, Python, SQL

United States

Job Closed

ETL Pipeline Developer, Data Engineer

eSimplicity

An engineering firm that delivers high-quality Healthcare IT, Cybersecurity, and Telecommunication solutions.

Data Engineer, 168 days ago

Other, Remote, Team 51-200, Since 2016, H1B No Sponsor

• Work collaboratively with Product Managers, Designers, and Engineers to set up, develop, and maintain critical back-end integrations for the data and analytics platform.
• Create and maintain new and existing data pipelines, Extract, Transform, and Load (ETL) processes, and ETL features using Azure cloud services.
• Build, expand, and optimize data and data pipeline architectures.
• Optimize data flow and collection for cross-functional teams of database architects, data analysts, and data scientists.
• Operate large-scale data processing pipelines and resolve business and technical issues pertaining to processing and data quality.
• Assemble large, complex sets of data that meet non-functional and functional business requirements.
• Identify, design, and implement internal process improvements, including re-designing data infrastructure for greater scalability, optimizing data delivery, and automating manual processes.
• Develop and document standard operating procedures (SOPs) for new and existing data pipelines.
• Build analytical tools to utilize the data pipeline, providing actionable insight into key business performance metrics including operational efficiency and customer acquisition.
• Write unit and integration tests for all data processing code.
• Read data specifications and translate them into code and design documents.

Apache HTTP Server, Azure, ETL, PySpark, Python, Apache Spark, SQL

United States

$110 / hour

Job Closed

Data/Infrastructure Advocate Engineer - US Remote

Stride, Inc.

Stride, Inc., formerly known as K12 Inc., is a leading provider of personalized online education programs and services, including customized tutoring, online ed

Data Engineer, 168 days ago

Other, Remote

At Hugging Face, we’re on a journey to democratize good AI. We are building the fastest growing platform for AI builders with over 5 million users & 100k organizations who collectively shared over 1M models, 300k datasets & 300k apps. Our open-source libraries have more than 400k stars on GitHub.

About the Role

As our first Data/Infrastructure Advocate Engineer, you’ll bridge the gap between cutting-edge data infrastructure and the global community of data engineers, researchers, and developers. You’ll champion Xet storage on the Hugging Face Hub, empowering users to efficiently store, version, and collaborate on large-scale datasets. This role is for someone who thrives at the intersection of technical depth (storage, Parquet, deduplication) and community advocacy, helping define the future of open data workflows. You’ll collaborate with teams like Datasets, Hub, and Infrastructure to shape how developers interact with data on our platform, and inspire a community to build better, faster, and more scalable data pipelines.

Your Main Missions:

- Grow and nurture the open-source data/infra community: launch initiatives, collaborate with data-focused groups, and organize events or challenges. Engage with communities like Apache Parquet, Open Table Formats, and data engineering forums to promote best practices and Hugging Face tools.
- Promote the Hugging Face Hub as the go-to platform for data storage, versioning, and collaboration: curate and showcase datasets, benchmarks, and tools like Xet.
- Highlight use cases like efficient large dataset updates, Parquet editing, and deduplication to demonstrate the Hub’s value for data workflows.
- Create demos, benchmarks, and tools (e.g., Colab notebooks) to illustrate best practices for data storage and versioning. Experiment with Xet, Parquet, and other data formats to showcase their potential for ML and data engineering.
- Produce high-quality tutorials, blog posts, and videos that make complex topics accessible.
- Share insights on storage optimization, dataset versioning, and deduplication to empower developers.
- Actively participate in online communities (Discord, GitHub, forums) to highlight contributions, answer questions, and foster collaboration.
- Ensure datasets and tools released on the Hub are well-documented, with clear examples, benchmarks, and use cases.

About you

You’re a great fit if you:

- Have strong technical skills in Python, data libraries (e.g., pandas, pyarrow, huggingface/datasets), and storage systems (Parquet, Open Table Formats, S3).
- Are a hands-on builder who loves experimenting with data tools, storage optimization, and dataset versioning.
- Can clearly explain complex topics (e.g., deduplication, compression, Parquet editing) through writing, demos, or talks.
- Are active in developer communities (GitHub, Discord, forums) and passionate about open source and knowledge sharing.
- Thrive in fast-moving environments and enjoy building in public to inspire others.

If you're interested in joining us but don't tick every box above, we still encourage you to apply! We're building a diverse team whose skills, experiences, and backgrounds complement one another. We're happy to consider where you might be able to make the biggest impact.

More about Hugging Face

We are actively working to build a culture that values diversity, equity, and inclusivity. We are intentionally building a workplace where you feel respected and supported, regardless of who you are or where you come from. We believe this is foundational to building a great company and community, as well as the future of machine learning more broadly. Hugging Face is an equal opportunity employer, and we do not discriminate based on race, ethnicity, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or ability status.

We value development. You will work with some of the smartest people in our industry. We are an organization that has a bias for impact and is always challenging ourselves to grow continuously. We provide all employees with reimbursement for relevant conferences, training, and education.

We care about your well-being. We offer flexible working hours and remote options. We offer health, dental, and vision benefits for employees and their dependents. We also offer parental leave and flexible paid time off.

We support our employees wherever they are. While we have office spaces in NYC and Paris, we're very distributed, and all remote employees have the opportunity to visit our offices. If needed, we'll also outfit your workstation to ensure you succeed.

We want our teammates to be shareholders. All employees have company equity as part of their compensation package. If we succeed in becoming a category-defining platform in machine learning and artificial intelligence, everyone enjoys the upside.

New York

Job Closed

Data Engineer, SQL, ADF

Upstart 13

Bringing down borders in technology.

Data Engineer, 168 days ago

Full Time, Remote, Team 51-200, H1B No Sponsor

• Code SQL for both on-premises and Azure SQL environments, including optimizing queries, data manipulation, and ensuring high performance.
• Design and build data pipelines from various data sources (mainly databases) to a target Data Warehouse / Lakehouse, using batch data load strategies, and leveraging Microsoft Azure technologies (Data Factory and Databricks).
• Implement data validation and quality assurance checks to maintain high data integrity standards, ensuring accurate and reliable data for analytical purposes.
• Collaborate with a cross-functional team of business analysts, architects, engineers, data analysts, and key stakeholders to formulate and implement data pipelines.
• Participate in architecture discussions, take ownership and responsibility over new projects or requirements, and perform audits to ensure data quality and integrity.
• Utilize programming languages, databases & cloud Data Warehouses / Lakehouses.
• Document database designs that include data models, metadata, ETL specifications, and process flows for business data project integrations.
• Monitor, maintain, and optimize production systems.
• Investigate and resolve incidents reported by users.
• Identify opportunities to automate, consolidate, and simplify data solutions at scale.
• Define best practices and create coding standards and patterns that can be reused by the rest of the data engineers.

Azure, ETL, NoSQL, Python, SDLC, Apache Spark, SQL

Brazil
