Spotify

Passionate music fans. Innovative tech pros. Perfect harmony. Join our band.

Data Engineer II – Gen AI, Music

Data Engineer | Full Time | Remote | Senior | Team 5,001-10,000 | Since 2008 | H1B Sponsor

Location

New York

Posted

71 days ago

Salary

$125.3K - $179K / year

Seniority

Senior

Job Description

  • Build and maintain large-scale data pipelines, including ML pipelines, with data processing frameworks like Scio and Python-based tools on Google Cloud Platform.
  • Leverage data engineering best practices in continuous integration and delivery.
  • Help drive optimization, testing and tooling to improve data quality and reliability.
  • Collaborate with engineers, product managers, subject matter experts, and stakeholders while taking on learning and leadership opportunities that arise every day.
  • Work in cross-functional, agile teams to continuously experiment, iterate, and deliver on new product objectives.

Job Requirements

  • You have at least 3 years of professional experience working in a product-driven environment.
  • You have experience working with high-volume, heterogeneous data using distributed systems and big data technologies such as Python, Scala (e.g., Scio), Ray, Apache Spark, or similar frameworks used for distributed data processing.
  • You are proficient in designing and building distributed data pipelines in Python, Scala, or Java, with experience in frameworks like Scio on platforms such as Dataflow.
  • You understand data modeling, data access, and data storage techniques, and can apply them to both batch and analytical processing (e.g., using BigQuery for analysis).
  • You value iterative software processes, data-driven development, reliability, and responsible experimentation, with attention to cost efficiency and best practices in data engineering.
  • You thrive in collaborative environments and enjoy working with cross-functional teams.
  • You are a creative problem solver who is passionate about building outstanding products that add real value to millions of people.
  • You are enthusiastic about learning more about turning research ideas into products operating at scale.

Benefits

  • health insurance
  • six months of paid parental leave
  • 401(k) retirement plan
  • monthly meal allowance
  • 23 paid days off
  • 13 paid flexible holidays

More Data Engineer Jobs

Staff Data Engineer

Foley

Driving Your Business Forward

Data Engineer | 71 days ago
Full Time | Remote | Team 201-500 | Since 1992 | H1B Sponsor

  • Lead the design and evolution of our enterprise data architecture
  • Define standards and best practices for data modeling, storage, and access
  • Drive architectural decisions that balance scalability, performance, and maintainability
  • Design, build, and maintain scalable data pipelines
  • Ensure high data quality, reliability, and observability
  • Own data warehouse schema design and implement data governance best practices
  • Act as a technical expert to Engineering, Product, Analytics, and Business teams
  • Identify opportunities to improve work through automation, tooling, and AI

Locations: Arizona | California | Colorado | Connecticut | Florida | Illinois | Kansas | Nebraska | New Hampshire | New Jersey | New York | North Carolina | Maryland | Massachusetts | Pennsylvania | Rhode Island | South Carolina | Tennessee | Texas | Virginia | Wisconsin
$160K - $210K / year

Oncology Data Specialist - Certified (Part-Time) (remote)

Guidehouse

Solving big problems, building trust in society, and empowering our clients to shape the future.

Data Engineer | 71 days ago
Full Time | Remote | Team 10,001+ | Since 2018 | H1B Sponsor

Job Family: Cancer Tumor Registrar
Travel Required: None
Clearance Required: None

This position is available as a part-time opportunity and is fully remote.

What You Will Do:

The remote Oncology Data Specialist will review clinical documentation as appropriate to extract data and apply ICD-O codes; code, stage, and abstract cases of cancer and reportable benign tumors diagnosed and/or treated; and participate in research, education, and monitoring for quality improvement activities to ensure data integrity and compliance with the American College of Surgeons Commission on Cancer guidelines.

  • Translate medical terminology into standardized codes to capture patient diagnosis and treatment information.
  • Code, stage, and enter data into the registry database using ICD-O, ICD-10-CM, AJCC TNM (Tumor, Nodes, Metastasis), Site-Specific Data Items (SSDI), STORE (Standards for Oncology Registry Entry), and SEER (Surveillance, Epidemiology, and End Results) guidelines.
  • Comply with state and federal mandates that require reporting all diagnosed and/or treated malignancies and reportable benign tumors.
  • Perform data quality control activities on registry data.
  • Retrieve and compile data for preparation of annual reporting; may collaborate in generating reports for special studies.
  • Prepare minutes for the multidisciplinary Tumor Conference every week.
  • Contact patient providers and state registries per the database to gather follow-up information and verify or correct patient information.
  • Maintain the highest degree of confidentiality of all information encountered, including verbal, written, and computerized. Report to the manager any failure by anyone to protect confidential information.
  • Perform other duties as assigned.

What You Will Need:

  • Minimum degree required is an associate degree.
  • Currently certified as an Oncology Data Specialist (ODS), formerly Certified Tumor Registrar (CTR), by the National Cancer Registrars Association (NCRA).
  • 5 years of experience working in a tumor registry as an ODS (CTR).
  • Experience working for a CoC-accredited hospital.
  • Ability to work a minimum of 20 hours per week; flexible hours available.

What Would Be Nice to Have:

  • NCI experience.
  • Strong conceptual, as well as quantitative and qualitative, analytical skills.
  • Basic knowledge of Microsoft applications.
  • Excellent written and verbal communication skills.

The annual salary range for this position is $68,000.00-$113,000.00. Compensation decisions depend on a wide range of factors, including but not limited to skill sets, experience and training, security clearances, licensure and certifications, and other business and organizational needs.

What We Offer:

Guidehouse offers a comprehensive, total rewards package that includes competitive compensation and a flexible benefits package that reflects our commitment to creating a diverse and supportive workplace.

Benefits include:

  • Medical, Rx, Dental & Vision Insurance
  • Personal and Family Sick Time & Company Paid Holidays
  • Position may be eligible for a discretionary variable incentive bonus
  • Parental Leave
  • 401(k) Retirement Plan
  • Basic Life & Supplemental Life
  • Health Savings Account, Dental/Vision & Dependent Care Flexible Spending Accounts
  • Short-Term & Long-Term Disability
  • Tuition Reimbursement, Personal Development & Learning Opportunities
  • Skills Development & Certifications
  • Employee Referral Program
  • Corporate Sponsored Events & Community Outreach
  • Emergency Back-Up Childcare Program

About Guidehouse

Guidehouse is an Equal Opportunity Employer–Protected Veterans, Individuals with Disabilities or any other basis protected by law, ordinance, or regulation.

Guidehouse will consider for employment qualified applicants with criminal histories in a manner consistent with the requirements of applicable law or ordinance including the Fair Chance Ordinance of Los Angeles and San Francisco.

If you have visited our website for information about employment opportunities, or to apply for a position, and you require an accommodation, please contact Guidehouse Recruiting at 1-571-633-1711 or via email at RecruitingAccommodation@guidehouse.com. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodation.

All communication regarding recruitment for a Guidehouse position will be sent from Guidehouse email domains including @guidehouse.com or guidehouse@myworkday.com. Correspondence received by an applicant from any other domain should be considered unauthorized and will not be honored by Guidehouse. Note that Guidehouse will never charge a fee or require a money transfer at any stage of the recruitment process and does not collect fees from educational institutions for participation in a recruitment event. Never provide your banking information to a third party purporting to need that information to proceed in the hiring process.

If any person or organization demands money related to a job opportunity with Guidehouse, please report the matter to Guidehouse’s Ethics Hotline. If you want to check the validity of correspondence you have received, please contact recruiting@guidehouse.com. Guidehouse is not responsible for losses incurred (monetary or otherwise) from an applicant’s dealings with unauthorized third parties.

Guidehouse does not accept unsolicited resumes through or from search firms or staffing agencies. All unsolicited resumes will be considered the property of Guidehouse and Guidehouse will not be obligated to pay a placement fee.

United States
$68K - $113K / year
Job Closed

Azure Data Engineer

Sequoia Connect

Our core expertise lies in connecting Top Technologists with Top Companies through unparalleled IT headhunting solutions

Data Engineer | 71 days ago
Full Time | Remote | Team 11-50 | H1B No Sponsor

Our client, founded in Poland in 2005, is a leading consulting and technology company specializing in data analysis, business intelligence (BI), and Big Data solutions. The company is dedicated to transforming data into valuable information that drives strategic decision-making, enabling organizations to optimize their operations and gain a competitive edge in the market.

With a solid reputation built on an innovative approach and the high quality of its services, our client has collaborated with some of the most recognized brands worldwide. Their team of experts employs cutting-edge technologies, including artificial intelligence and machine learning, to solve complex problems and improve operational efficiency for their clients. This ability to handle large volumes of data and extract actionable insights positions our client as a strategic and valuable partner across various sectors.

In addition to their technical expertise, our client is distinguished by a corporate culture focused on innovation and continuous development. The company invests significantly in staff training and the research and development of new technologies, ensuring they are always at the forefront of emerging technological trends. This proactive approach allows our client to anticipate market needs and offer solutions that address current challenges and prepare their clients for the future.

We are currently searching for an Azure Data Engineer:

Responsibilities

  • Design, build, and maintain highly scalable data pipelines utilizing Object-Oriented (OO) Python, shifting away from traditional notebook-centric development.
  • Optimize Apache Spark data processing by deeply analyzing execution plans, identifying bottlenecks, and optimizing code (partitions, shuffle, caching) prior to scaling infrastructure.
  • Design and optimize distributed data storage solutions, including data lakes, data warehouses, and distributed file systems within the Azure ecosystem.
  • Enforce strong software engineering practices within data operations, including the creation of unit tests, CI/CD pipelines, deployment automation, and writing clean, maintainable code.
  • Operate with a high degree of autonomy and ownership, working directly with client stakeholders to drive projects forward with minimal supervision.
  • Act as a technical mentor to peers and communicate complex data architecture concepts clearly to both technical and business stakeholders.

Requirements

  • 4 to 6 years of professional experience in a Data Engineering or similar role.
  • Strong proficiency (Level 4/5) in Databricks and Apache Spark, with a deep understanding of distributed processing mechanics.
  • Advanced programming skills in Python (Level 4/5), specifically applied to Object-Oriented software development for Data Engineering.
  • Solid hands-on experience (Level 3/5) with Azure Services (e.g., Data Lake, Data Factory).
  • Proven track record implementing CI/CD automation and Agile methodologies utilizing tools like JIRA and Azure DevOps.
  • High-Performance Mindset: Resilience, emotional intelligence, and a focus on agile delivery.
  • Technologist DNA: A deep understanding of the difference between "coding" and "engineering."

Desired

  • Proven background in Data Processing Architecture.
  • Familiarity with Generative AI applied to Data Engineering tasks.
  • Experience with other cloud technologies, data governance, and business analysis.
  • Familiarity with cloud-native foundations or AI coding assistants.

Languages

  • Advanced Oral English.
  • Native Spanish.

Note

  • Fully remote.

If you meet these qualifications and are pursuing new challenges, start your application to join an award-winning employer.

Explore all our job openings on the Sequoia Careers page: https://www.sequoia-connect.com/careers/

Mexico

Data Architect

Gugu Robotics

The Future is Now; Beyond Boundaries, Beyond Imagination

Data Engineer | 71 days ago
Full Time | Remote | Team 51-200 | Since 2016 | H1B No Sponsor

  • Design, develop, and deploy predictive and prescriptive models across a variety of domains (e.g., customer behavior, operational efficiency, personalization).
  • Apply machine learning, deep learning, and statistical techniques to solve real-world business challenges.
  • Drive experimentation (A/B testing, multi-variate testing) and causal inference to validate hypotheses and measure impact.
  • Analyze large, complex datasets to extract key insights and translate them into strategic recommendations.
  • Communicate findings clearly and effectively to both technical and non-technical audiences, using compelling data knowledge and visualization.
  • Collaborate with product managers and business stakeholders to identify opportunities and frame data science solutions.
  • Work closely with data engineers, analysts, and software developers to build scalable, data-powered applications.
  • Mentor junior data scientists, supporting technical development and scientific rigor.
  • Contribute to the development of reusable assets, tools, and processes to increase team velocity and impact.

Canada