Job Closed

This listing is no longer active.

Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid. Project time expectations: Tasks are estimated to require around 10–20 hours per week during active phases, based on project requirements; This is an estimate, not a guaranteed workload, and applies only while the project is active. Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.

Freelance Web Scraping Specialist (Vibe Coding)

Data EngineerData EngineerPart Time Remote Mid Level Company Site

Location

New York

Posted

118 days ago

Salary

Seniority

Mid Level

EnglishJavaScript

Job Description

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English. Mindrift is looking for highly skilled Vibecode specialists to join the Tendem project (https://tendem.ai/) and drive specialized data scraping workflows within our hybrid AI + human system. In this role, as an AI Pilot – that’s how we refer to this role at Mindrift – you’ll collaborate with Tendem Agents that handle repetitive tasks, while you provide critical thinking, domain expertise, and quality control to deliver accurate and actionable results. This part-time remote opportunity is ideal for technical professionals with hands-on experience in web scraping, data extraction and processing. What we do The Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe. About the Role This is a freelance role for a Tendem project. As a Vibe Code specialist, you'll handle data scraping tasks requiring technical precision for web extraction and processing, utilizing various tools such as our provided Apify and OpenRouter alongside your own resourceful approaches. Key Responsibilities - Own end-to-end data extraction workflows across complex websites, ensuring complete coverage, accuracy, and reliable delivery of structured datasets. - Leverage internal tools (Apify, OpenRouter) alongside custom workflows to accelerate data collection, validation, and task execution while meeting defined requirements. - Ensure reliable extraction from dynamic and interactive web sources, adapting approaches as needed to handle JavaScript-rendered content and changing site behavior. - Enforce data quality standards through validation checks, cross-source consistency controls, adherence to formatting specifications, and systematic verification prior to delivery. - Scale scraping operations for large datasets using efficient batching or parallelization, monitor failures, and maintain stability against minor site structure changes. Compensation On this project, contributors can earn up to $32 per hour equivalent, depending on their level and pace of contribution. Compensation varies across projects depending on scope, complexity, and required expertise. Please note that other projects on the platform may offer different earning levels based on their requirements. How to get started Simply apply to this post, qualify, and get the chance to contribute to projects that match your technical skills, on your own schedule. From coding and automation to fine-tuning AI outputs, you’ll play a key role in advancing AI capabilities and real-world applications.

Job Requirements

At least 1 year of relevant experience in data analysis, AI automation, data engineering, or software development is desirable.
Bachelor's or Master’s Degree in Engineering, Applied Mathematics, Computer Science, or related technical fields is a plus.
Python web scraping: Build reliable scraping scripts using BeautifulSoup, Selenium (or equivalents) for multi-level sites, dynamic JS content (infinite scroll, AJAX), and API endpoints via provided proxy.
Data extraction expertise: Navigate complex hierarchical structures (regions → companies → details), handling archived pages and varied HTML formats.
Data processing: Clean, normalize, and validate scraped data; deliver high-quality datasets in well-structured formats (CSV, JSON, Google Sheets) with clear, consistent presentation.
Hands-on experience with LLMs and AI frameworks to enhance automation and problem-solving.
Strong attention to detail and commitment to data accuracy.
Self-directed work ethic with ability to troubleshoot independently.
English proficiency: Upper-intermediate (B2) or above (required).

Benefits

Why this freelance opportunity might be a great fit for you?
Work fully remote on your own schedule with just a laptop and stable internet connection.
Gain hands-on experience in a unique hybrid environment where human expertise and AI agents collaborate seamlessly — a distinctive skill set in a rapidly growing field.
Participate in performance-based bonus programs that reward high-quality work and consistent delivery.

Related Categories

Data Engineer

Related Job Pages

Data Engineer Jobs in New York More Remote Jobs

More Data Engineer Jobs

Senior Data Engineer

Stefanini Brasil

Co-creating Solutions for a Better Future

Data Engineer118 days ago

Full Time RemoteTeam 10,001+Since 1987H1B No Sponsor

Company Site LinkedIn

• Define and structure the organization's data architecture; • Develop and guide the construction of data pipelines (batch and streaming); • Ensure data quality, integrity and governance; • Support the implementation of solutions focused on artificial intelligence; • Collaborate with both technical and business teams; • Optimize resource usage in cloud environments.

Airflow Apache HTTP Server BigQuery Python Redis Apache Spark SQL

View details: Senior Data Engineer

Brazil

Apply

Job Closed

Senior Data Engineer

Sinch

Powering trusted communications at scale

Data Engineer118 days ago

Full Time RemoteTeam 1,001-5,000H1B Sponsor

Company Site LinkedIn

• Join the Orion Platform team, the engineering group responsible for building and scaling Sinch's new Tier-1, business-critical data platform. • Design and implement the mediation system Orion, using Go (Golang) to process billions of raw usage data events from Kafka into standardized formats for billing and analytics. • Build and maintain our high-performance, near real-time data pipelines, ensuring minimal latency between data ingestion and its availability in our ClickHouse database. • Implement complex data transformation rules to normalize, enrich, and aggregate data from multiple sources, preparing it for accurate billing and reporting. • Automate mediation processes and workflows within our Kubernetes environment, minimizing manual intervention and improving the efficiency of data flows. • Ensure data quality by implementing checks, building robust monitoring and alerting systems with Prometheus and Grafana, and creating error-handling mechanisms to prevent inconsistencies, duplicates, or data loss within our streaming architecture. • Work closely with billing, product, and engineering teams to understand data requirements and ensure that our data pipelines align perfectly with business needs.

Apache HTTP Server AWS Docker GCP Grafana Java Apache Kafka Kubernetes Prometheus Scala SQL

View details: Senior Data Engineer

Sweden

Apply

Job Closed

Lead Data Engineer (Architecture & Data Governance)

ARETUM

Data Engineer118 days ago

Full Time RemoteTeam 501-1,000H1B No Sponsor

Company Site LinkedIn

Secret Clearance Required About Aretum Aretum is a mission-driven organization committed to delivering innovative, technology-enabled solutions to our customers across defense, civilian, and homeland security sectors. Our teams work at the intersection of strategy, technology, and transformation, helping agencies solve their most critical challenges. We believe in investing in our people and creating a culture where collaboration, inclusion, and professional growth are at the forefront. Job Summary Due to the nature of our work as a federal consulting organization, employees may be expected to handle Controlled Unclassified Information (CUI) and must adhere to applicable safeguarding and compliance requirements. Additionally, all team members may be called upon to support proposal efforts as needed. This could include resume formatting, providing skills alignment summaries, participating in meetings, or contributing to solutioning activities based on subject matter expertise or functional experience. Responsibilities - Lead hands-on development of data pipelines and data platform components, including writing and reviewing production-quality code - Own end-to-end delivery for assigned pipelines and services - Build, improve, and maintain ETL/ELT pipelines for large-scale data ingestion and transformation - Implement CI/CD and automated testing for data pipelines (integration tests, data validation, and release workflows) - Establish operational readiness for production data workflows - Optimize performance and cost of pipelines and storage through tuning, partitioning, and workload right-sizing in AWS GovCloud - Contribute to and implement modern data architecture patterns, including lakehouse approaches - Ensure architecture and implement support DoD strategic initiatives, including enterprise data modernization efforts - Drive creation and upkeep of data transformation roadmaps and technical documentation; support data maturity assessments and current-state analysis across agency-wide programs - Translate technical approaches into clear, business-understandable frameworks and recommendations. - Partner with cross-functional teams and program offices to understand data requirements, enable data-driven decision-making, and translate technical approaches into business-understandable frameworks and recommendations. - Support delivery of leadership priority initiatives (e.g., childcare forecasting, Military OneSource analytics, readiness dashboards) and design/implement data solutions that connect outreach efforts to program uptake and readiness outcomes, for users with varying technical proficiency levels

View details: Lead Data Engineer (Architecture & Data Governance)

Virginia

Apply

Job Closed

Data Engineer II - SRC - Music

Spotify

Spotify is a Swedish company founded in 2008 that provides music, comedy, podcast, and streaming services. As an employer, Spotify is interested in candidates w

Data Engineer118 days ago

Full Time Remote

Company Site

The Rights Systems team builds and operates Spotify Rights Center (SRC) — Spotify’s rights management platform that enables Spotify and its partners to identify, manage, and enforce music licensing rights across all the content available on the platform. Comparable in scope to YouTube’s Content ID, SRC is purpose-built for Spotify’s ecosystem and underpins the company’s strategy for video and emerging content types. Over the past year, the team has taken SRC from concept to a fully operational production platform, delivering automated content scanning, policy management, enforcement pipelines, appeals workflows, manual claims, and analytics capabilities. The team has grown from a single squad to three squads (~30 bandmates) and continues to scale as a strategic investment area for Spotify over the next several years. What You'll Do - Build and maintain the data pipelines and analytics infrastructure that power Spotify’s rights management platform - Own batch and streaming pipelines that generate core datasets used by Spotify Rights Center, including processing content segments, joining rights metadata, and producing match data that drives the product - Develop and evolve analytics models that transform pipeline and service data into reporting for system reliability, rightsholder adoption, and business ROI - Partner closely with product managers, backend engineers, and insights teams to define metrics, build dashboards, and generate data that informs strategic decisions around platform expansion - Maintain data export pipelines connecting backend services and the data warehouse to ensure downstream consumers receive timely and accurate data - Implement strong data quality practices, including validation tests, alerts, and monitoring to ensure reliable pipeline outputs - Contribute to technical solutions that support licensing, financial engineering, and content platform stakeholders - Participate in product ideation with engineers, researchers, product managers, and domain experts across the team - Contribute to a collaborative engineering culture and support continuous learning through hack days, reading groups, and internal training Who You Are - You have strong SQL skills and deep experience with data modeling and warehouse design - You have experience building and operating batch data pipelines at scale using tools such as Scio, Apache Beam, Spark, or similar frameworks - You are comfortable working with modern cloud data warehouses such as BigQuery, Snowflake, or similar technologies - You have experience with analytics engineering tools like dbt and understand layered data modeling approaches (staging, transformation, reporting) - You have worked with workflow orchestration platforms such as Flyte, Airflow, or similar systems - You can write production-quality code in languages such as Scala, Python, or Java - You understand data quality practices and have built monitoring and alerting systems for pipeline reliability - You take ownership of solutions end-to-end, from understanding business questions to deploying and validating data pipelines - You communicate clearly with both technical and non-technical stakeholders - You enjoy learning new domains and tackling complex data challenges such as rights management, content matching, and multi-territory licensing Where You'll Be - We offer you the flexibility to work where you work best! For this role, you can be within the North Americas region as long as we have . - This team operates within the Eastern Standard time zone for collaboration. The United States base range for this position is $133,000 – $190,000 USD, plus equity. The benefits available for this position include health insurance, six-month paid parental leave, 401(k) retirement plan, monthly meal allowance, 23 paid days off, paid flexible holidays, and paid sick leave. These ranges may be modified in the future. Spotify is an equal opportunity employer. You are welcome at Spotify for who you are, no matter where you come from, what you look like, or what’s playing in your headphones. Our platform is for everyone, and so is our workplace. The more voices we have represented and amplified in our business, the more we will all thrive, contribute, and be forward-thinking! So bring us your personal experience, your perspectives, and your background. It’s in our differences that we will find the power to keep revolutionizing the way the world listens. At Spotify, we are passionate about inclusivity and making sure our entire recruitment process is accessible to everyone. We have ways to request reasonable accommodations during the interview process and help assist in what you need. If you need accommodations at any stage of the application or interview process, please let us know - we’re here to support you in any way we can. Spotify transformed music listening forever when we launched in 2008. Our mission is to unlock the potential of human creativity by giving a million creative artists the opportunity to live off their art and billions of fans the chance to enjoy and be passionate about these creators. Everything we do is driven by our love for music and podcasting. Today, we are the world’s most popular audio streaming subscription service.

SQL Apache Beam Apache Spark BigQuery Snowflake dbt Airflow Scala Python Java

View details: Data Engineer II - SRC - Music

United States

$133K - $190K / year

Apply

Freelance Web Scraping Specialist (Vibe Coding)

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More Data Engineer Jobs

Senior Data Engineer

Senior Data Engineer

Lead Data Engineer (Architecture & Data Governance)

Data Engineer II - SRC - Music