Reddit, Inc. logo
Reddit, Inc.

Dive into anything

Senior Machine Learning Systems Engineer

Systems EngineerSystems EngineerOtherRemoteSeniorTeam 501-1,000Since 2005H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

89 days ago

Salary

$216.7K - $303.4K / year

Seniority

Senior

Job Description

Senior Machine Learning Systems Engineer

Reddit, Inc.

• Design end-to-end model lifecycle patterns (MLOps) to boost velocity of development for ML engineers, including data preparation, model management, experiment tracking, and more • Zero-to-one development and support of a graph ML codebase and platform that abstracts away common patterns and enables greater model scalability and iteration • Collaborate with ML engineers on performance tuning, including improving model training time, efficiency, and GPU training costs in a large, distributed ML training environment • Optimize batch data processing within a data warehouse and with tools such as Apache Beam, Apache Spark, Ray Data, and more • Architect pipelines to build and maintain massive graph data structures on the order of billions of nodes and tens of billions of edges

Job Requirements

  • 5+ years of experience in ML infrastructure, including model training and model deployments
  • Hands-on experience with ML optimization, including memory and GPU profiling
  • Deep experience with cloud-based technologies for supporting an ML platform, including tools like GCP BigQuery, Google Cloud Storage, infrastructure-as-code (Terraform), and more
  • Hands-on experience administering and integrating MLOps tools for experiment tracking, model serving, and model registries (e.g. MLflow or Wandb)
  • Proficiency with the common programming languages and frameworks of ML, such as Python, PyTorch, Tensorflow, etc.
  • Deep experience working with distributed training frameworks, including Ray and Kubernetes
  • Strong focus on scalability, reliability, performance, and ease of use. You are an undying advocate for platform users and have a deep intuition for the machine learning development lifecycle.
  • Strong organizational & communication skills
  • Experience working with graph databases (Neo4j, JanusGraph, TigerGraph) is a big plus
  • Experience working with graph neural networks (GNNs) and associated graph ML frameworks (PyTorch Geometric, Deep Graph Library) is a big plus

Benefits

  • medical, dental, and vision insurance
  • 401(k) program with employer match
  • generous time off for vacation
  • parental leave

Related Categories

Related Job Pages

More Systems Engineer Jobs

Republic Services logo

Senior Principal Control Systems Engineer

Republic Services

As a leader in environmental solutions, recycling & waste, we partner with customers to create a more sustainable world.

Systems Engineer89 days ago
OtherRemoteTeam 10,001+Since 1998H1B No Sponsor

• Develop and implement maintenance and improvement measures for recycling equipment, including conveyors, shredders, optical sorters and material handling systems. • Collaborates closely with cross-functional teams to ensure seamless integration of mechanical and electrical infrastructure. • Troubleshoot and resolve issues with existing control systems and optimize the performance of control systems, ensuring efficient and reliable operations and improving operational outcomes. • Lead cross-functional projects from concept through commissioning, including budgeting, scheduling, and vendor coordination. • Analyzes system performance and implements improvements to maximize uptime, throughput, and efficiency. • Maintain detailed documentation for control systems, and ensure compliance with safety standards, environmental regulation, and industry standards in all control systems designs. • Support commissioning new equipment, retrofits of existing systems, and analysis of acquired equipment. • Champion a strong safety culture by incorporating industrial safety standards, including LO/TO, machine safeguarding, and electrical safety in all control and infrastructure projects. • Good communicator with the ability to translate findings in a strategic and tactical manner through investment opportunities.

United States
Job Closed
OtherRemoteTeam 51-200H1B No Sponsor

• Design and implement SDN control and data-plane software • Develop high-performance networking services in C and Rust • Implement and extend OpenFlow-based control systems • Build software that interacts with switch ASICs, flow tables, and packet pipelines • Develop packet inspection and protocol analysis capabilities • Optimize network performance, latency, and deterministic behavior • Work closely with cybersecurity and OT networking teams to implement deterministic network policy enforcement

Michigan
Job Closed
QinetiQ US logo

Analysis Lead Engineer

QinetiQ US

We are a world-class team of professionals who deliver next generation technology and products in robotic and autonomous platforms, ground, soldier, and maritime systems in 50+ locations world-wide. Much of our work contributes to innovative research in the fields of sensor science, signal processing, data fusion, artificial intelligence (AI), machine learning (ML), and augmented reality (AR). QinetiQ US’s dedicated experts in defense, aerospace, security, and related fields all work together to explore new ways of protecting the American Warfighter, Security Forces, and Allies. Being a part of QinetiQ US means being central to the safety and security of the world around us. Partnering with our customers, we help save lives; reduce risks to society; and maintain the global infrastructure on which we all depend.

Systems Engineer89 days ago
OtherRemoteTeam 1,001-5,000

Role Description Join us in our fast-paced support to the Space Development Agency (SDA). Recognized as the Department of Defense's constructive disruptor for space acquisition, SDA delivers space-based capabilities to the joint warfighter to support terrestrial missions through development, fielding, and operation of the Proliferated Warfighter Space Architecture. SDA capitalizes on a unique business model that values speed and lowers costs by harnessing commercial development to achieve a proliferated architecture and enhance resilience. We are seeking a Analysis Lead Engineer to support SDA’s mission. Responsibilities - Lead performance analysis efforts for the Enterprise using the tool-sets developed by the M&S Lead Engineer and those available through the SDA Cells and relationships with other organizations. - Establish requirements based on warfighter inputs. - Verify Enterprise-level requirements through analysis. - Validate system performance based on the results of Developmental Testing and Operational Testing. - Perform quick-turn analyses in support of external engagements and the Front Office. - Provide inputs and prioritization on needed updates to the M&S tool-set. - Manage a distributed team across multiple organizations. - Translate M&S results to actionable decision briefings to senior management. Qualifications - Experience in modeling one or more of the critical mission areas: Missile Warning/Missile Tracking/Missile Defense, end-to-end kill chains, and/or communications systems and networks. - Experience with AFSIM, Python, and/or STK. - Bachelor's degree in science, technology, engineering or mathematics. - Minimum of 10+ years experience with satellite and ground systems. - Ability to work in fast paced environment with excellent oral and written communication skills. - TS Clearance with SCI Eligibility. Company Description QinetiQ US is a world-class team of professionals who deliver next generation technology and products in robotic and autonomous platforms, ground, soldier, and maritime systems in 50+ locations world-wide. Much of our work contributes to innovative research in the fields of sensor science, signal processing, data fusion, artificial intelligence (AI), machine learning (ML), and augmented reality (AR). - Dedicated experts in defense, aerospace, security, and related fields. - Explore new ways of protecting the American Warfighter, Security Forces, and Allies. - Help save lives; reduce risks to society; and maintain the global infrastructure.

United States
ESA - Electronic Security Association logo

Lead Hyperion EPM Systems Architect, Administrator

ESA - Electronic Security Association

THE voice of the electronic security and life safety industry.

Systems Engineer89 days ago
OtherRemoteTeam 11-50Since 1948H1B No Sponsor

• Shape and lead the strategic direction for Oracle Hyperion EPM solutions, including HFM, FDMEE, Essbase, and Planning, to align with evolving financial and business requirements. • Drive the architectural design, implementation, and optimization of the Hyperion EPM AWS landscape, ensuring scalability, security, and peak performance for all critical financial processes. • Serve as the primary technical authority for complex Hyperion application functionality, guiding the development, modification, and testing of advanced metadata hierarchies, business rules, and scripting (VB, Jython, MaxL, Perl). • Oversee and enhance the global month-end closing and planning/budgeting processes within HFM and FDMEE, providing expert guidance to Controllers and resolving complex data validation issues across diverse financial structures. • Formulate and execute comprehensive compliance strategies for SOX financial system requirements, collaborating directly with internal and external audit teams to ensure adherence and mitigate risks. • Develop and deploy advanced testing methodologies, including test data generation and comprehensive test script execution, to ensure the integrity and reliability of integrated EPM systems. • Cultivate and disseminate expert-level knowledge across the organization through global user training programs, publishing authoritative reference materials, and establishing best practices for Hyperion EPM utilization. • Initiate and lead continuous improvement initiatives for the Hyperion EPM environment, leveraging deep technical expertise and industry trends to enhance system capabilities and user experience.

United States
$132.2K - $197.2K / year
Job Closed