AWS Data Science – Specialist

Data ScientistData ScientistFull TimeRemoteSeniorTeam 10,001+H1B SponsorCompany SiteLinkedIn

Location

Brazil

Posted

6 days ago

Salary

0

Seniority

Senior

Job Description

AWS Data Science – Specialist

Compass

• Lead the development and evolution of the Feature Store capabilities: data lineage, feature views, feature recommendation, and new query engines; • Design and implement Apache Iceberg tables with a focus on read performance, versioning, and schema evolution; • Architect and optimize the serving layer with Redis for real-time features meeting strict latency SLOs; • Integrate and optimize Amazon EMR as a query and large-scale processing engine; • Define and implement feature selection and transformation pipelines with end-to-end traceability; • Establish quality, versioning, and governance standards for features across the platform; • Serve as the technical reference for data and data science teams that consume the Feature Store.

Job Requirements

  • Expertise in feature engineering on enterprise ML platforms (Feast, Tecton, Hopsworks, or equivalents);
  • Advanced proficiency in Apache Spark/PySpark for large-scale distributed processing;
  • Deep knowledge of Apache Iceberg and lakehouse architectures (including comparisons with Delta Lake and Hudi);
  • Expertise in Redis for low-latency feature serving, including cache invalidation strategies and efficient serialization;
  • Strong production experience with AWS data services (S3, Glue, EMR, Redshift, Athena);
  • Preferred: experience with data lineage and metadata catalogs (DataHub, OpenMetadata, Marquez) in production; experience with Amazon EMR including cluster configuration, cluster optimization, and Spark job tuning; expertise in MLOps practices focused on versioning and traceability of data artifacts; prior work in financial contexts with high-cardinality, high-frequency data and regulatory requirements; familiarity with scalable data quality tools (Great Expectations, Soda, dbt tests).

Related Categories

Related Job Pages

More Data Scientist Jobs

Environment Agency Jobs logo

Senior Evidence Advisor

Environment Agency Jobs

We are fully committed to being an inclusive employer and ensuring equal opportunities for everyone. We welcome flexible working patterns for all our vacancies, including job share.

Data Scientist6 days ago

Role Description This is an exciting opportunity to join the Environment Agency’s Natural Capital Ecosystem Assessment (NCEA) Programme as the lead for Surveillance Networks Workstream. The Defra-led NCEA programme is gathering broad scale and comprehensive data on our ecosystems including freshwater, estuarine and coastal water environments to reveal their benefits for people and the economy - their natural capital. This evidence and insight will ensure that policy and decision makers consider the true value of nature. You will provide leadership and assurance of these NCEA monitoring networks ensuring high-quality technical outputs are delivered. You will lead a team of water monitoring environmental specialists and subject matter experts. The team designs and delivers a suite of surveillance networks covering freshwater surface and ground waters and estuaries and coasts. The data from these networks is used to develop statistically robust and nationally representative evidence on ecosystem condition. You will coordinate and lead the work of the team and support delivery to ensure the workstream meets its objectives and the overall aims of the programme. Your responsibilities will include: - Providing technical leadership and setting the overall strategy and plan for the workstream and the work packages within it; - Co-ordinating delivery of the workstream plan and supporting the work package leads and workstream project manager to resolve technical issues, manage risks/issues and ensure work is delivered to the required technical/scientific assurance standards; - Working with the other workstream leads to address cross-cutting technical issues; - Engaging with stakeholders within and beyond the programme on working groups and thematic boards, in the context of developing and delivering data management, analysis and digital solutions; - Line management duties for some members of the workstream. Company Description We are fully committed to being an inclusive employer and ensuring equal opportunities for everyone. We welcome flexible working patterns for all our vacancies, including job share.

United Kingdom
Nextdoor logo

Data Scientist – Search

Nextdoor

Connecting neighbors to each other - and to everything nearby.

Data Scientist6 days ago
Full TimeRemoteTeam 501-1,000Since 2011H1B Sponsor

• Design and develop algorithms and models to optimize Search relevance, retrieval, and ranking • Experiment design: ensure we run experiments efficiently and accurately • Develop and share key strategic insights through data analysis • Build scalable metrics and dashboards to track Search performance • Partner with cross-functional teams to support product development efforts • Care deeply about data quality

California + 2 moreAll locations: California | New York | Texas
$175K - $234K / year
Fireworks AI logo

Revenue Analytics Lead

Fireworks AI

Run AI faster, more efficiently, and on your own terms

Data Scientist6 days ago
Full TimeRemoteTeam 51-200Since 2022H1B No Sponsor

• Own the source of truth for GTM health: pipeline, bookings, win rates, sales velocity, rep productivity, GPU consumption trends, and segment performance. • Build and maintain the analytical framework across the full funnel — outbound through AE close and expansion. • Build executive dashboards that give leadership real-time visibility into performance, risk, and opportunity. • Execute annual and quarterly planning cycles: territory design, quota-setting, capacity modeling, and headcount planning in partnership with Finance and the Director. • Build and maintain predictive models for revenue pacing and sales attainment. • Own the quantitative inputs to the forecast: conversion rates, coverage ratios, pipeline aging, and ramp curves. • Build scalable, governed GTM data models in Snowflake and dbt. Own definitional standards: pipeline, logo, ARR, GPU consumption. • Maintain data integrity across Salesforce and GTM systems. One source of truth, enforced. • Deploy AI and automation workflows across GTM analytics: reporting automation, pipeline intelligence, and LLM-assisted analysis. • Automate recurring analytical work so capacity goes to interpretation, not production.

California
$182K - $243K / year
Full TimeRemoteTeam 11-50H1B No Sponsor

• Data literacy made easy through practical, hands-on training • Flexible, modern online courses in Data Science, Business Analytics and Artificial Intelligence • Discussion of business processes and optimization of decision-making • Building AI models and automating processes

Germany
€46K - €62K / year