Here you can create the extraordinary. Join us.
Staff Deep Learning Engineer
Location
New York
Posted
1 day ago
Salary
$220K - $260K / year
Seniority
Lead
Job Description
Staff Deep Learning Engineer
NBCUniversal
• Implement core deep-learning, computer vision, and (inverse-)procedural modeling algorithms in Python • Apply cutting-edge research in machine learning and computer graphics to solve real-world problems • Work closely with our cofounders to understand high-level product vision • Interact with remote machines via a Unix shell to deploy and test code on large-scale geospatial datasets • Use Git to manage source code and modularize complex implementation tasks
Job Requirements
- Master's degree in Computer Science, Engineering, Mathematics, or a related field
- Minimum of 5+ years of relevant industry experience
- Proven experience as a DL Engineer or Applied Research Engineer
- Prior experience in industries with complex multi-disciplinary teams
- Fluency with Python, Git, and the Unix shell
- Proven experience training and debugging artificial neural networks or adjacent experience
- A strong mathematical background covering linear algebra, statistics, probability, and numerical methods
- Preferred prior experience with modern C++
Benefits
- medical, dental and vision insurance
- 401(k)
- paid leave
- tuition reimbursement
- a variety of other discounts and perks
Related Guides
Related Job Pages
More Machine Learning Engineer Jobs
AI / Machine Learning Consultant
Vertical RelevanceYour trusted Financial Services industry partner in Business Transformation, Customer Experience, & AWS Cloud Services.
• Deliver end-to-end ML/AI solutions from problem framing through deployment and monitoring • Design scalable AI/ML architectures using AWS and GCP services • Optimize RAG pipelines including chunking, embeddings, hybrid search, and re-ranking • Develop retrieval evaluation frameworks (Recall@K, Precision@K, MRR, nDCG) • Design and implement knowledge graphs and ontologies • Build data ingestion pipelines for structured and unstructured data • Deploy solutions using AWS tools such as SageMaker, Bedrock, Lambda, and OpenSearch • Implement self-hosted embedding models in secure environments • Ensure compliance with PHI/PII data security requirements • Collaborate with cross-functional teams and act as a technical advisor • Mentor team members and contribute to technical thought leadership
Machine Learning Research Benchmark Consultant
24-MAGThis opportunity is available through a leading AI-driven work platform.
Role Description We are sharing a specialised part-time consulting opportunity for experienced machine learning engineers and researchers with strong practical expertise in ML research workflows, model training, post-training, dataset curation, reinforcement learning, architecture design, evaluation tasks, and sandboxed technical environments. This role supports current and upcoming remote consulting opportunities focused on benchmarking AI agent performance on realistic machine learning research tasks. Selected professionals will complete self-contained ML research tasks under defined time and compute constraints, provide human reference performance for evaluation workflows, and submit structured work outputs that support technical assessment and research-quality analysis. Key Responsibilities - Machine Learning Research Task Execution - Attempt open-ended machine learning research tasks under fixed time and compute constraints. - Work independently in a sandboxed Linux environment using provided compute resources. - Apply practical ML engineering and research judgment to self-contained AI R&D tasks. - Use preferred development workflows and tools, including IDEs, coding assistants, notebooks, or command-line workflows where permitted. - Submit final work products that reflect clear reasoning, technical execution, and reproducible effort. - Benchmarking & Human Reference Evaluation - Serve as a skilled human reference point for evaluating AI agent performance on realistic ML research tasks. - Complete tasks using the same constraints and environment conditions defined for evaluation workflows. - Support benchmark quality by producing reliable, interpretable, and technically meaningful task attempts. - Document decisions, assumptions, implementation choices, and constraints where relevant. - Complete short pre-task and post-task questionnaires as part of the evaluation process. - Technical Workflow, Recording & Quality Control - Work in sandboxed technical environments with Linux, internet access, and provided compute resources. - Record full working sessions when required for evaluation and review purposes. - Follow task-specific confidentiality, NDA, environment, and submission requirements. - Debug issues involving code, packages, model training workflows, data processing, runtime behavior, or environment setup. - Submit required materials, including final outputs, recordings, questionnaires, and supporting notes where applicable. Qualifications - 3+ years of practical machine learning experience, with time spent in a PhD program counting toward this requirement where relevant. - Hands-on experience with at least one major ML framework such as PyTorch, JAX, or TensorFlow. - Strong practical ability to complete open-ended ML research or engineering tasks independently. - Experience working in Linux environments, debugging technical workflows, and managing research-oriented development tasks. - Ability to reason under time and compute constraints while producing clear, high-quality technical work. - Strong written communication skills for explaining methods, decisions, and results. - High attention to detail and comfort following structured evaluation, recording, and submission requirements. - Availability for at least 20 hours per week if selected, with additional availability considered helpful depending on project needs. Educational Background - Strong academic or professional background in machine learning, artificial intelligence, computer science, data science, statistics, engineering, or a related technical field is highly relevant. - PhD experience, advanced research experience, or comparable industry experience in machine learning may be especially valuable. - Candidates with experience from highly selective academic programs, major technology companies, AI research teams, or comparable technical environments may be a strong fit. - Practical research and engineering experience may be considered alongside formal education depending on project requirements. Nice to Have - Deep hands-on expertise in one or more of the following areas: - Pretraining transformer language models from scratch. - Reinforcement learning, PPO, reward shaping, custom gym or gymnasium environments, and throughput tuning. - Full fine-tuning, LoRA, QLoRA, DPO, RLHF, RLAIF, distillation, or post-training workflows. - Large-scale corpus filtering, deduplication, subsampling, and benchmark contamination avoidance. - Architecture design under strict parameter-count or size constraints. - Modifying pretrained architectures, including attention patterns, pooling heads, or training objectives. - Contrastive training for embedding or retrieval models. - Generative vision or video modeling. - Multilingual or low-resource language work. - Image or video data pipelines at scale. - Balancing competing model objectives such as safety and capability. - Prior experience as an ML evaluator, red-teamer, benchmark contributor, research engineer, or technical baseliner. - Experience using AI coding assistants in technical workflows while maintaining strong independent judgment. - Comfort working with confidential project materials and structured technical review processes. Why This Opportunity - Apply advanced ML research and engineering expertise to realistic AI R&D benchmarking tasks. - Serve as a human reference point for evaluating AI agent performance on open-ended technical challenges. - Work with provided compute and sandboxed environments without requiring personal GPU resources. - Use practical ML judgment across training, post-training, dataset curation, architecture, reinforcement learning, or evaluation workflows. - Remote structure with competitive hourly compensation. Contract Details - Independent contractor role. - Fully remote with project-based technical work. - Minimum expected availability of approximately 20 hours per week if selected, with greater availability preferred depending on project needs. - Competitive rates of $60–$75 per hour depending on expertise, ML research depth, task performance, and project scope. - A work-trial-style baseline task may be required before longer-term selection. - Each assigned baseline task may be attempted only once per contractor. - Project work may require screen recording, questionnaires, final work product submission, and adherence to confidentiality or NDA requirements. - Compute and sandboxed technical environments may be provided depending on task scope. - Weekly payments via Stripe or Wise. - Projects may be extended, shortened, or adjusted depending on scope and performance. About the Platform This opportunity is available through 24-MAG LLC. We connect experienced professionals with remote consulting opportunities across technical, evaluation, and project-based workstreams.
• Designing, building, deploying, and evolving the end-to-end AI & ML systems • Initial focus on agentic capabilities, agent-to-agent (A2A) protocol, demand modeling and forecasting, optimization, product relationships • Building out new features on the Science roadmap • Working on full stack technology, from prototyping to production-ready software • Collaborating with product, engineers, and data scientists to translate ideas into new products, services and features • Exposing and evangelizing the Science team capabilities
Role Description The Sr Machine Learning Engineer is responsible for designing, building, integrating, optimizing, and maintaining AI-powered applications that leverage generative models and overall product lifecycle for a product that our users love. Generative AI Engineers are expected to collaborate closely with teammates as they develop and deliver user stories while supporting AI-powered products as they evolve. Activities may include: - Designing and implementing applications using large language models (LLMs) and other generative models. - Embedding intelligent capabilities directly into software products. - Prompt engineering, model integration, building Retrieval-Augmented Generation (RAG) pipelines, and developing scalable AI services. - Interacting with business stakeholders, infrastructure teams, and development teams to ensure business requirements are effectively addressed. - Supporting evaluation, performance optimization, testing, and monitoring of AI systems in production. - Working with domain data, improving prompts and AI workflows, and creating documentation or enablement materials for generative AI solutions. Sr Generative AI Engineers should be able to work independently with minimal guidance, while collaborating with cross-functional teams of varying skill levels to design, deploy, and maintain production AI applications. This role may review submitted code and prompt implementations, providing feedback and improvements based on engineering and responsible AI best practices. Qualifications - 3 - 5 years of relevant work experience. - Experience in Python and modern AI development frameworks. - Experience building Generative AI applications using large language models (LLMs). - Experience with prompt engineering, prompt optimization, and prompt evaluation techniques. - Experience integrating AI models through APIs from platforms such as Google, OpenAI, or Anthropic. - Experience with GenAI frameworks such as Google Agent Development Kit (ADK). - Experience implementing Retrieval-Augmented Generation (RAG) pipelines using vector databases. - Experience working with vector databases such as Google Vertex AI Search. - Experience with building conversational AI systems, or AI assistants. - Experience with responsible AI practices including bias mitigation and safety guardrails. - Experience working with graph databases, knowledge ingestion pipelines, and data mesh architectures. - Experience implementing CI/CD pipelines, monitoring, and automated workflows for reliable AI model deployment and lifecycle management. - Experience with monitoring, evaluation, and optimization of production AI systems. - Experience in Google Cloud Platform and AI/ML related components such as Vertex AI, BigQueryML. - Experience in effective data engineering practices and big data platforms such as BigQuery, Data Store, etc. - Experience in a modern scripting language (preferably Python). - Experience with GPU acceleration (i.e. CUDA and cuDNN). - Experience in a front-end technology and framework such as Node.js, HTML, CSS, JavaScript, ReactJS, D3. - Experience in writing SQL queries against a relational database. - Familiarity with production systems design including High Availability, Disaster Recovery, Performance, Efficiency, and Security. - Familiarity with cloud computing platform and associated automation patterns and machine learning services. - Familiarity with defensive coding practices and patterns for high Availability. - Familiarity with A/B testing and effective REST design for scalable web services architecture. - Familiarity with advanced machine learning techniques such as NLP, convolutional neural networks, autoencoders, and embeddings generation and utilization. Requirements - Must be eighteen years of age or older. - Must be legally permitted to work in the United States. Benefits - Health care benefits. - 401K. - ESPP. - Paid time off. - Success sharing bonus. Company Description For a full list of the various benefits The Home Depot offers, visit The Home Depot Benefits . For California, Colorado, Connecticut, Rhode Island, Nevada, New York City, Ithaca (NY), Westchester County (NY), and Washington residents: The pay range for this position is between $100,000.00 - $180,000.00.



