Headquartered in Philadelphia, Pennsylvania, Comcast was established in 1963 as a single-system cable company. Over the years, Comcast experienced tremendous gr
Machine Learning Engineer
Location
United States
Posted
4 days ago
Salary
$142.7K - $214.0K / year
Seniority
Mid Level
No structured requirement data.
Job Description
Machine Learning Engineer
Comcast
Role Description Multimodal Analysis Framework (MAF) is an end‑to‑end platform designed to process diverse content sources—including video, images, audio, and documents—to generate rich, structured metadata. The platform unifies multiple ML/AI models to extract curated insights at scale, tailored to specific business needs. MAF supports both on‑demand workloads (batch uploads, ad‑hoc analysis) and real‑time streaming workflows, enabling continuous metadata generation for live content streams. Customers can define their metadata requirements—such as entity extraction, scene segmentation, object detection, transcription, summarization, or multimodal correlation—and the framework orchestrates the appropriate models and toolchains to deliver high‑quality outputs. Through flexible APIs and UI‑based workflows, customers and internal teams can visualize metadata, trigger enrichment, monitor processing, and integrate results into downstream applications. The platform emphasizes modularity, scalability, and extensibility to support new ML models, LLM‑based agents, and cross‑modal inference as use cases evolve. We are looking for a mid-level Backend Engineer to join our Machine Learning Platform team. This role focuses on building scalable backend systems that power ML workloads, including video, image, and document processing, and enable LLM-driven applications through agents and MCP servers. You will work primarily in Golang, deploy and operate services on Kubernetes, manage infrastructure with Terraform, and build on AWS. A core part of the role is designing platform capabilities that allow LLMs to safely and reliably interact with tools, data, and services via agent frameworks and MCP servers. Qualifications - 3–6 years of professional software engineering experience. - Strong backend engineering experience with Golang. - Experience building and operating APIs (REST and/or gRPC) in production. - Hands-on experience with Kubernetes in production environments. - Experience using Terraform for infrastructure provisioning and deployment. - Solid working knowledge of AWS cloud services and core architectural concepts. - Experience building or supporting ML processing pipelines (video, image, or document). - Practical experience using LLMs in production systems. - Experience developing agents and/or MCP servers, or equivalent tool-integration platforms. Requirements - Design, build, and maintain high-performance backend services in Golang for ML and AI platform use cases. - Develop REST and gRPC APIs for inference, processing pipelines, orchestration, and platform services. - Implement asynchronous and distributed processing patterns (workers, queues, event-driven systems). - Ensure backend services meet production standards for scalability, reliability, and security. - Build and operate backend systems supporting video processing (frame extraction, metadata generation, embeddings, indexing). - Build and operate backend systems supporting image processing (OCR, classification, detection, embedding generation). - Build and operate backend systems supporting document processing (parsing, layout analysis, chunking, OCR, retrieval pipelines). - Integrate ML inference services into backend workflows with attention to latency, throughput, and cost. - Work closely with ML engineers and data scientists to productionize models and pipelines. - Build LLM-enabled backend services using structured prompting, tool/function calling, and retrieval-augmented generation (RAG). - Design and implement agentic workflows (multi-step reasoning, tool orchestration, retries, guardrails). - Develop and operate MCP servers that expose internal platform capabilities (search, retrieval, processing, data access) to LLM-based applications. - Enforce security, access control, and observability for agent and MCP interactions. - Design and maintain vector-based retrieval systems using Milvus. - Implement embedding ingestion, indexing, and query pipelines at scale. - Optimize retrieval quality, latency, and relevance for downstream LLM applications. - Deploy and operate backend and ML services on Kubernetes (scaling, rollouts, resource management). - Use Terraform for infrastructure provisioning and continuous delivery of cloud resources. - Build and operate primarily on AWS, leveraging services such as compute, networking, and IAM; object storage; managed Kubernetes; logging and monitoring services. - Implement observability using logs, metrics, and traces; define SLOs and alerts. - Write automated tests (unit, integration) and contribute to CI/CD pipelines. - Participate in on-call rotations and incident response; drive post-incident improvements. Benefits - Base pay range: $142,651.46 - $213,977.19, dependent on job-related, non-discriminatory factors such as experience. - Most sales positions are eligible for a Commission under the terms of an applicable plan. - Most non-sales positions are eligible for a Bonus. - Best-in-class benefits to eligible employees, personalized to meet the needs of employees.
Related Guides
Related Job Pages
More Machine Learning Engineer Jobs
Machine Learning Engineer, Artist-First AI Music Lab
SpotifyPassionate music fans. Innovative tech pros. Perfect harmony. Join our band.
• Design, build, evaluate, and improve machine learning training and inference pipelines that power new AI-driven music experiences and help take them to fully scaled production-ready features. • Apply machine learning and prompt engineering knowledge across complex ML pipelines to support rich user experiences involving large language models. • Create evaluation frameworks, including LLM-as-judge pipelines, to measure quality and build fast feedback loops that enable rapid and confident iteration. • Partner with music subject-matter experts to bootstrap training and reference data, including synthetic generation, expert curation, and taxonomy design. • Build scalable systems that balance experimentation velocity with production rigor, ensuring strong performance, reliability, and latency at Spotify scale. • Collaborate closely with Data Science teams to connect evaluation frameworks with real-world usage signals and continuously improve model quality. • Contribute to technical direction and engineering best practices across model deployment, observability, experimentation, and production infrastructure. • Work cross-functionally with engineering, product, design, and music industry partners to shape entirely new listening experiences for artists and fans.
Senior ML Engineer, ML/AI
Lyra HealthTransforming behavioral health through technology with a human touch
• Design, train, fine-tune, and evaluate deep learning, classical ML, and foundational GenAI models to drive core product features. • Build, scale, and maintain robust ML pipelines for continuous training, evaluation, and real-time/batch inference using modern MLOps frameworks. • Collaborate with backend and frontend teams to integrate AI models into microservices, ensuring low latency, high availability, and optimal resource utilization. • Architect scalable data pipelines for preprocessing, vectorizing, and ingestion of massive structured and unstructured datasets. • Implement rigorous evaluation frameworks for model alignment, bias mitigation, guardrailing, and cost/latency optimization (e.g., quantization, distillation). • Provide technical leadership, conduct thorough code reviews, and mentor junior/mid-level engineers on best practices in software craftsmanship and ML engineering.
Senior Machine Learning Engineer
Mitek SystemsThe global leader in mobile capture and digital identity verification.
• Build, train, and optimize computer vision models for image classification, face liveness detection, and presentation attack detection (PAD) / anti-spoofing • Work on real-world identity verification and biometric authentication problems, improving model performance on noisy, adversarial inputs such as spoofed images, replay attacks, deepfakes, and synthetic media • Design and run experiments to improve model accuracy, recall, robustness, and fraud detection performance using techniques such as augmentation, class balancing, architecture tuning, and hard-negative mining • Design, train, and improve deep learning models (e.g., CNNs, Vision Transformers, and foundation models), including loss function design, hyperparameter optimization, and performance tuning on large-scale image datasets • Prepare and curate large, noisy datasets, including data ingestion, validation, cleaning, deduplication, labeling strategies, and dataset QA to improve model reliability and generalization • Develop evaluation protocols and success metrics that balance fraud detection effectiveness, false acceptance rates, false rejection rates, and overall business impact • Develop production-grade training and inference pipelines on AWS with strong reproducibility, monitoring, observability, and cost controls • Productionize models as resilient Python services and libraries; collaborate with platform teams to optimize APIs, latency, scalability, and operational reliability • Contribute to the evolution of our Identity Verification (IDV) platform by modernizing legacy components and improving model performance, maintainability, and modularity • Partner closely with Product, Customer Success, Fraud, and Platform Engineering teams to ensure ML solutions meet privacy, compliance, security, and reliability requirements • Support and mentor other engineers through design reviews, code reviews, experimentation best practices, and knowledge sharing • Research and evaluate emerging techniques in face liveness detection, presentation attack detection (PAD), deepfake detection, biometric authentication, and adversarial machine learning to strengthen our fraud prevention capabilities
• Tune and optimize ML/DL models for production-scale audio deepfake detection. • Investigate failure cases in the client environment, build custom evaluation frameworks, and implement mitigation strategies for model robustness. • Drive model iteration for performance under a variety of real-world environments, e.g. compression artifacts, noise, telephony, and streaming pipelines. • Present technical findings and model performance insights to internal stakeholders. • Interface with Product and Engineering teams to build a deep understanding of the production environment and incorporate relevant evaluations for performance assessment.




