Nebius

Nebius is a European AI infrastructure company based in Amsterdam, North Holland, the Netherlands, specializing in full-stack AI solutions. The company offers l

ML Solutions Architect

Location

United States

Posted

5 days ago

Salary

$102 - $126 / year

Seniority

Mid Level

Job Description

Role Description

We're looking for an ML Solutions Architect (Early Career) to join the team behind Nebius Token Factory's serverless inference and fine-tuning platform for open-source LLMs. Working alongside senior Solutions Architects, you'll take on real technical work – building and testing LLM-based solutions, benchmarking, and inference optimization – and learn how scalable AI applications are built and tuned on our platform, in close collaboration with our backend team.

This is a hands-on learning role with close mentorship from senior SAs. Strong performers will be considered for a full-time Solutions Architect position at the end of the program. This is a paid temporary contract, open to students and recent graduates. You're welcome to work remotely from any timezone.

Your responsibilities:

- Help build and test LLM-based solutions and applications using Token Factory's inference services, including multimodal models (text, vision, audio).
- Assist senior SAs with prompt engineering, model selection, benchmarking, and inference optimization.
- Run performance and quality experiments to support proof-of-concept work.
- Contribute to internal tooling and automation that improves how the SA team delivers.

Qualifications

- Currently pursuing or recently completed a BSc/MSc/PhD in Computer Science, Machine Learning, or a related field.
- Strong Python programming skills.
- Hands-on generative AI experience, including with common ML frameworks (e.g., PyTorch, Transformers).
- Strong communication skills, with a willingness to explain technical concepts to diverse audiences.

Requirements

- Experience deploying/serving LLMs with vLLM, SGLang, or TensorRT-LLM.
- Familiarity with inference optimization techniques such as quantization, batching, caching, and routing.
- Knowledge of model architectures and fine-tuning approaches.
- Contributions to open-source ML/AI projects.

Benefits

- Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families.
- 401(k) plan: Up to 4% company match with immediate vesting.
- Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
- Remote work reimbursement: Up to $85/month for mobile and internet.
- Disability & life insurance: Company-paid short-term, long-term and life insurance coverage.
- Competitive compensation and benefits packages.
- Career growth and learning opportunities.
- Flexibility and ownership.
- Collaborative and innovative culture.
- Opportunity to work on impactful AI projects.
- International environment and talented teams.

More Solutions Engineer Jobs

Principal Network Architect- AI Infrastructure

Nscale

Nscale is the Hyperscaler engineered for AI.

Full Time | Remote | Team 201-500 | Since 2024 | H1B No Sponsor

Role Description

Nscale is seeking a Network Architect Engineer to lead the evolution, reliability, and operational excellence of our global AI networking infrastructure. This role sits at the core of Nscale’s platform, where network performance directly impacts AI training outcomes. You will act as a technical authority across large-scale RDMA / InfiniBand / RoCE fabrics, driving automation, availability improvements, and system-level design across a globally distributed GPU cloud. You will combine deep protocol-level networking expertise with strong software and automation skills to operate and scale one of the most demanding AI networking environments in the industry.

What You’ll Do

Technical Leadership & Strategy
- Own the technical direction and operational lifecycle management of Nscale’s high-performance RDMA network fabrics.
- Define long-term architecture, reliability strategy, and operational standards for AI interconnect networks.
- Lead availability and performance improvement initiatives across globally distributed GPU clusters.
- Act as a technical authority (SME) across networking, influencing platform-wide decisions.

Network Engineering & Operations
- Support the design, build, and evolution of large-scale InfiniBand and RoCE fabrics.
- Drive deep debugging and resolution of complex cross-layer issues (hardware, firmware, kernel, distributed workloads).
- Lead incident response and postmortems, ensuring systemic fixes and long-term improvements.
- Define and enforce standards across:
  - Congestion control and traffic engineering.
  - Routing (BGP, ECMP, fabric-level routing strategies).
  - Firmware lifecycle and change management.
  - Network observability and telemetry.

Automation & Systems Development
- Develop and scale automation frameworks for network provisioning, validation, and operations.
- Build tooling to support high-reliability, low-touch network operations at scale.
- Improve operational efficiency across hundreds of thousands of endpoints and high-throughput links.

Cross-Functional Leadership
- Lead complex technical initiatives across Network, SRE, Compute, and Platform teams.
- Serve as technical lead on critical programs, coordinating engineers and stakeholders.
- Influence product and infrastructure roadmaps based on operational insights and customer needs.
- Mentor senior engineers and raise the bar for technical rigor and execution.

Qualifications

- 10+ years of experience in network engineering in hyperscale, AI, or HPC environments.
- Deep expertise in RDMA, InfiniBand, and/or large-scale RoCE fabrics.
- Strong understanding of:
  - RDMA internals and performance tuning.
  - Congestion control and fabric failure modes.
  - Distributed system communication patterns.

Requirements

- Expert-level knowledge of data center networking protocols (BGP, OSPF, ECMP).
- Proven ability to debug multi-layer issues across network, system, and application layers.
- Strong programming/scripting skills for automation (Python, Go, etc.).
- Experience designing high-scale, highly available network systems.

Leadership & Impact

- Demonstrated ability to lead complex technical programs without direct authority.
- Experience acting as a senior escalation point for critical production issues.
- Strong ability to drive cross-team alignment and execution.
- Systems-level thinking balancing performance, reliability, scalability, and cost.

Nice to Have

- Experience with NVIDIA / Mellanox networking platforms.
- Familiarity with distributed AI training frameworks and GPU communication patterns.
- Experience building network observability systems at scale.
- Background influencing infrastructure strategy in high-growth environments.

Equal Opportunities Statement

We strongly encourage applications from people of colour, the LGBTQ+ community, people with disabilities, neurodivergent people, parents, carers, and people from lower socio-economic backgrounds. If there’s anything we can do to accommodate your specific situation, please let us know.

The responsibilities outlined in this job description are not exhaustive and are intended to provide a general overview of the position. The employee may be required to perform additional duties, tasks, and responsibilities as assigned by management, consistent with the skills and qualifications required for the role.

United States

Senior Mission Integration Engineer

Astrolab

We build rovers for the Moon & Mars.

Full Time | Remote | Team 11-50 | H1B No Sponsor

• The discovery, development, maturation and eventual integration of rover applicable technologies with government and commercial partners.
• Cross discipline coordination and integration of design efforts.
• Supporting business development in proposal responses, customer development and market exploration, leveraging a comprehensive overview of technical capabilities of the rover.
• Supporting the rover payloads customer team from a technical integration standpoint.

California

Senior Solutions Architect

GitLab

Build software faster. The One DevOps Platform enables your entire org to collaborate around your code. We're hiring.

Full Time | Remote | Team 1,001-5,000 | Since 2014 | H1B No Sponsor

• Guide technical discovery, product demonstrations, and validation activities, including proofs of value, to confirm technical fit, accelerate evaluation milestones, and improve technical win rates for GitLab’s AI-powered DevSecOps platform.
• Own the technical evaluation process for complex opportunities, including solution design, workshop facilitation, proof of concept or proof of value execution, and technical materials for tenders, audits, and assessments, with accountability for clear success criteria and documented evaluation outcomes.
• Develop end-to-end technical strategies for assigned accounts that expand platform adoption, reduce delivery risk, and enable multi-team and multi-year transformation milestones.
• Collaborate with Account Executives and regional sales teams in the East territory to shape account and territory plans, support qualification, and align technical strategy to customer priorities, opportunity progression, and business outcomes.
• Advise technical practitioners and business leaders on modern software development, continuous integration, continuous deployment, security, cloud, and platform adoption practices to improve delivery efficiency, strengthen security outcomes, and increase adoption of GitLab workflows.
• Drive competitive analysis and positioning for complex opportunities by using market, industry, and customer context to clarify GitLab’s differentiated approach and improve technical win readiness.
• Represent the voice of the customer by sharing product feedback, use cases, integration needs, and field insights with Product Management, Engineering, Sales, and Marketing to improve roadmap decisions, integration readiness, and field effectiveness.
• Mentor other Solutions Architects, contribute to team learning initiatives, improve technical collateral and documentation, and share subject matter expertise through GitLab’s common collaboration channels to increase team readiness, reuse of technical assets, and consistency across engagements.

New York
$137.4K - $231.2K / year

Partner Solution Architect

EDB

The leading Postgres data and AI company

Full Time | Remote | Team 501-1,000 | Since 2004 | H1B No Sponsor

• Act as the technical voice of the company for our partner network
• Design and deliver high-impact technical training programs
• Provide architectural guidance and hands-on support during complex POCs
• Serve as the trusted liaison between channel partners and internal Product Management

United States