Moonvalley logo
Moonvalley

Building the next generation creative studio, powered by the most capable video and image foundational models.

Member of Technical Staff (Data): World Models

Data EngineerData EngineerFull TimeRemoteLeadTeam 1-10H1B No SponsorCompany SiteLinkedIn

Location

United States + 2 moreAll locations: United States | Canada | United Kingdom

Posted

69 days ago

Salary

0

Seniority

Lead

Job Description

Member of Technical Staff (Data): World Models

Moonvalley

Your Charter - Data at Scale: Own the pipelines and storage systems that feed petabyte-scale multimodal datasets into model training. - Sustainable Platforms: Build tooling and systems that are automated and efficient, enabling processing at scale and handling many small heterogeneous datasets. Required Skillsets - Data Engineering: Knowledge of Python ETL pipelines and supporting infrastructure, data formats, and storage systems at scale. - ML Data Ops: Experience managing datasets, annotations, and data versioning for model training. - Basic ML Knowledge: Solid grasp of ML fundamentals is essential to collaborate effectively with researchers and make sound data platform decisions. - Agentic Engineering: Skilled at writing high-quality specifications for AI agents, while maintaining effective human review of AI-generated work. Responsibilities - Design, automate, maintain, and optimize Python ETL pipelines (Spark/Ray) for large-scale multimodal data. - Build and maintain data cataloging, lineage, quality tooling, integrity verification, access controls, and lifecycle management systems. - Provide guidance, internal tools, and documentation to colleagues on data best practices. - Serve as a custodian of the company’s datasets, ensuring overall data health, quality, and discoverability. Challenges You'll Tackle - Implement high-performance, multimodal data pipelines capable of processing petabyte-scale datasets on 10,000s of CPUs and 100s of GPUs. - Evolve data formats, storage, and processing to keep pace with cutting-edge AI advancements, while maintaining backward compatibility. - Scale data infrastructure to handle the next order of magnitude in growth. - At the same time, ensure the data platform flexible to rapidly handle many small heterogeneous datasets and ad hoc analytics queries. Traits of the Ideal Candidate - High agency and ownership: proactively picks up new work according to priority, manages their own backlog, and escalates early when priorities are unclear or deadlines are at risk. - Takes responsibility for validating inputs end-to-end: spot-checks data, understands upstream preprocessing, and speaks up when something doesn't add up. - Takes responsibility for ensuring outputs are correct and handed over: actively seeks sign-off from downstream consumers, communicates caveats, and ensures relevant stakeholders are aware of changes and breaking impacts. - Cares about continuously improving pipelines, tooling, and processes so that each iteration makes the next one faster, more reliable, and easier for the team. - Comfortable with rapid, pragmatic solutions when needed, but committed to high-quality, long-term solutions. What we offer (compensation & benefits) - Competitive salary and equity - Private health coverage - Pension contribution (UK, Canada, US) - Unlimited paid vacation - Fully-distributed, async-first culture - Hardware setup of your choice - Stipends for phone, internet, and meals In our team, we approach our work with the dedication similar to Olympic athletes. Anticipate occasional late nights and weekends dedicated to our mission. We understand this level of commitment may not suit everyone, and we openly communicate this expectation. If you're motivated by deeply technical problems, a seemingly never-ending uphill battle and the opportunity to build (and own) a generational technology company, we can give you what you're looking for. All business roles at Moonvalley are hybrid positions by default, with some fully remote depending on the job scope. We meet a few times every year, usually in London, UK or North America (LA, Toronto) as a company. If you're excited about the opportunity to work on cutting-edge AI technology and help shape the future of media and entertainment, we encourage you to apply. We look forward to hearing from you! The statements contained in this job description reflect general details as necessary to describe the principal functions of this job, the level of knowledge and skill typically required and the scope of responsibility. It should not be considered an all-inclusive listing of work requirements. Individuals may perform other duties as assigned, including work in other functional areas to cover absences, to equalize peak work periods, or to otherwise balance organizational work Moonvalley AI is proud to be an equal opportunity employer. We are committed to providing accommodations. If you require accommodation, we will work with you to meet your needs. Please be assured we'll treat any information you share with us with the utmost care, only use your information for recruitment purposes and will never sell it to other companies for marketing purposes. Please review our privacy policy and job applicant privacy policy located here for further information.

Related Categories

Related Job Pages

More Data Engineer Jobs

IT - Data Engineer

ArchWell Health

ArchWell Health aims to transform healthcare for seniors and focuses on quality care, empathy, and community outreach. The company fosters a collaborative cultu

Data Engineer69 days ago

ArchWell Health is a new, innovative healthcare provider devoted to improving the lives of our senior members. We deliver best-in-class care at comfortable, accessible neighborhood clinics where seniors can feel at home and become part of a vibrant, wellness-focused community. Our members experience greater continuity of care, as well as the comfort of knowing they will be treated with respect by people who genuinely care about them, their families, and their communities.  Duties/Responsibilities: - Build data integrations from internal and external sources to centralize data into a Data Warehouse environment. - Monitor data integration operations, data quality, troubleshoot, and resolve problems. - Profile data sources and map to target table formats. - Develop and monitor data quality processes and address problems. - Develop, unit test and system test integration components. - Create support documentation describing the functionality of the integrations. - Participating in technical design & requirements gathering meetings. - Participate in planning and implementing data integration and data migration activities. - Perform QA tests to ensure data integrity and quality. - Research data issues between source systems and the data warehouse. Required Skills/Experience: - Bachelor’s degree required; Master's degree (in data science, computer science or MIS, mathematics, engineering, or related field) preferred. - 5+ years of prior experience in Data Management / ETL / ELT / Data Warehousing - Experience in writing Data Quality routines for cleansing of data and capturing confidence score - Experience with master data management - Strong knowledge of Structured Query Language (SQL) and Transact-SQL (T-SQL) - Experience using scripting languages such as JavaScript or Python - Experience Healthcare data models, datasets, and source systems (e.g. EHR, claims, labs, etc.) - Experience with healthcare reference data (ICD, CPT etc.) - Experience with agile delivery methodologies - Data Modeling experience preferred. - Strong organizational, administrative, and analytical skills required. - Experience managing and working in cloud environments such as Amazon Web Services or Azure - Knowledge of HIPAA; ability to implement systems and processes in accordance with regulations - Excellent interpersonal communication skills, both written and verbal ArchWell Health is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to their race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other protected classification.

United States
Full TimeRemoteTeam 1,001-5,000H1B No Sponsor

Overview We are seeking to fill the role of MERS and Data Integrity Support Specialist. The ideal candidate enjoys collaborating with clients, industry partners and internal teams to maximize outcomes for homeowners. Responsibilities • Oversee and assist with the daily processes and operations of all MERS-related functions and responsibilities for organization’s current and future loan portfolio, to include the full reconciliation of the organization’s loan portfolio as it compares to all registered and deactivated Mortgage Identification Numbers (“MIN”) in the MERS OnLine database.• Responsible for leading, monitoring and reviewing daily processes and operations of MERS and Data Integrity areas, to include the full reconciliation of all data elements within the organization’s servicing platform to ensure data quality and correctness.• Assists supervisor with managing department staff to ensure instructions are being followed, standards are being met and operational issues are being addressed in a timely fashion.• Report to senior management regularly using various data models and statistical analysis methodologies, to include the creation of reports and metrics with an emphasis on quality control and root cause reconciliation solutions.• Ensure that the teams meet/exceed established metrics/SLA standards provided to clients.• Assist when necessary with all client interaction to ensure proper notification, tracking and resolution of MERS and data integrity exceptions are reported timely, and action plans are developed to correct any ongoing trending that may impact the quality of MERS or servicing platform data elements.• Assist in the creation of or updating of all departmental policies and procedures and ensure revisions are made timely and in accordance with MERS standards and processes.• Communicate clearly and timely to all applicable departments on matters related to MERS data reconciliations, exceptions and errors and ensure all items are resolved in a timely fashion.• Research process gaps (intra/inter departmental) and close gaps timely, ensuring that proper methods for tracking gaps or trending can be presented to senior management for review and approval.• Train, coach and develop staff to ensure department efficiencies are maintained and consistently improved upon• All other duties as assigned. Qualifications Required Skills and Qualifications• High School Diploma or equivalent required.• Excellent analytical, organization and communication (verbal/written) skills required. • Ability to work collaboratively with peers, departments, and clients in a fast-paced, team-centric environment to attain common goals. Desired Skills and Qualifications• 2 to 3 years of experience in mortgage servicing, financial or banking field preferred.• Proficient use of Microsoft Excel, Access, and Word is preferred. Total Rewards LoanCare’s Total Rewards Package offers a comprehensive blend of health and welfare, financial, lifestyle and learning benefits to support employee well-being and engagement. Highlights include: - Health & Welfare Coverage: Optional medical, dental, vision, life, and disability insurance - Time Off: Paid holidays, vacation, and sick leave - Retirement & Investment: Fidelity National Financial matching 401(k) and employee stock purchase plans - Wellness Programs: Access to mental health resources, including free Calm memberships, and initiatives that promote physical and emotional well-being - Employee Recognition: Programs that celebrate achievements and milestones - Lifestyle & Learning Perks: Enjoy discounts on gym memberships, pet insurance, and employee purchasing programs, plus access to a tuition reimbursement program that supports your continued education and professional growth. Compensation Range: $19.33-$28.89 hourly. Actual compensation may vary within the range provided, depending on a number of factors, including qualifications, skills and experience. Build Your Future with LoanCare® At LoanCare, we don’t just service mortgage loans—we serve people. As a leading full-service mortgage loan subservicer, we deliver excellence to banks, credit unions, independent mortgage companies, investors, and the homeowners they support. Backed by the strength and stability of Fidelity National Financial (NYSE: FNF), a Fortune 500 company, we offer a career foundation built on integrity, innovation, and collaboration. Here, you’ll find: - A culture that helps you thrive, with resources and support to fuel your growth - Flexibility to work remotely, while staying connected through virtual engagement - Opportunities to make a real impact in an industry that touches millions of lives - If you're ready to grow your career in a place that values your contributions and empowers your success, we invite you to join our team. About Remote Employment We provide the necessary equipment; all you need is a quiet, private place in your home and a high-speed internet connection with a minimum network download speed of 25 megabits per second (MBPS) and a minimum network upload speed of 10 MBPS WHO WE AREAbout us …LoanCare is a leading national provider of full service subservicing and interim subservicing to the mortgage industry and has offered its expertise and best practices in providing servicing solutions for others since 1991. At the present time, LoanCare subservices over 1.8 million loans in 50 states. LoanCare has a seasoned loan servicing team with senior managers averaging nearly 30 years of experience in the mortgage and financial services industry.LoanCare, its affiliates and subsidiaries, is an Equal Opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, disability, protected veteran status, national origin, sexual orientation, gender identity or expression (including transgender status), genetic information or any other characteristic protected by applicable law. WORK CONDITIONSWorking conditions are normal for an office environment. Ability to attend work and be productive during normal business hours and to work early, late or weekend hours as needed for successful job performance. Over time required as necessary. Essential functions are the basic job duties that an employee must be able to perform, with or without reasonable accommodation. EQUAL EMPLOYMENT OPPORTUNITY LoanCare, its affiliates and subsidiaries, is an Equal Opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, disability, protected veteran status, national origin, sexual orientation, gender identity or expression (including transgender status), genetic information or any other characteristic protected by applicable law.

United States
$19 - $29 / hour
Full TimeRemoteTeam 51-200Since 2018H1B No Sponsor

• Gather and refine data requirements with business stakeholders and technical teams • Design end-to-end data architectures (ingestion, processing, storage and consumption) • Evaluate technical alternatives considering cost, risk, complexity and time-to-market • Build and evolve the Data Map (domains, sources, pipelines and consumers) • Define data modeling standards and taxonomy • Participate in data classification and governance (including LGPD — Brazilian Data Protection Law) • Produce architecture artifacts (HLD, LLD, ADRs and diagrams) • Lead Architecture Review sessions • Work closely and collaboratively with Engineering, Platform, Security, Analytics and Product teams • Support the evolution of data architecture standards and governance

Brazil
Job Closed
Full TimeRemoteTeam 10,001+H1B Sponsor

• Design, build, and maintain robust data pipelines to ensure reliable data flow across the enterprise. • Maintain data pipeline schedules, orchestrate workflows, and monitor the overall health of data pipelines to ensure continuous data availability. • Create, update, and optimize data connections, datasets, and transformations to align with business needs. • Troubleshoot and resolve data sync issues, ensuring consistent and correct data flow from source systems. • Collaborate with cross-functional teams to uphold data quality standards and ensure accurate data is available for use. • Utilize Palantir Foundry to establish data connections to source applications, extract and load data, and design complex logical data models that meet functional and technical specifications. • Develop and manage data cleansing, consolidation, and integration mechanisms to support big data analytics at scale. • Build visualizations using Palantir Foundry tools and assist business users with testing, troubleshooting, and documentation creation, including data maintenance guides.

Florida
$125K - $232K / year
Job Closed