Job Closed

This listing is no longer active.

AI Performance Optimization Engineer

Location

United States

Posted

9 days ago

Salary

$100K - $150K / year

Seniority

Mid Level

No structured requirement data.

Job Description

AI Performance Optimization Engineer

Bright Vision Technologies

Role Description We are seeking an AI Performance Optimization Engineer to focus on extracting maximum throughput, minimizing latency, and reducing cost across training and inference workloads for large neural network systems. The role spans the full stack from low-level kernel optimization to distributed system tuning, requiring deep understanding of GPU architecture, model parallelism, memory management, and compiler-level optimization. The ideal candidate has demonstrated impact on production AI workloads, with strong instrumentation and measurement discipline that enables rigorous, data-driven optimization decisions. In this role you will work closely with cross-functional partners — product, design, engineering, operations, and business stakeholders — to translate ambiguous requirements into well-engineered solutions, and will be expected to raise the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production. Key Responsibilities - Profile and optimize end-to-end AI training and inference pipelines for throughput, latency, and cost. - Identify and eliminate bottlenecks across data loading, model compute, communication, and memory. - Implement and tune quantization, sparsity, and pruning strategies to reduce model footprint and accelerate inference. - Optimize distributed training using tensor parallelism, pipeline parallelism, FSDP, and ZeRO-style sharding. - Tune attention implementations using FlashAttention, paged attention, and related techniques. - Implement KV cache optimization, continuous batching, and speculative decoding for LLM serving. - Drive compiler-level optimizations using Triton, XLA, TorchInductor, or TVM, working with the broader ML framework community to land improvements that translate into measurable end-to-end performance gains. - Optimize data pipelines, sharding strategies, and storage access patterns for high-throughput training. - Build and maintain rigorous benchmark suites and regression frameworks across workloads. - Collaborate with ML and platform engineering teams to embed best practices in standard pipelines. - Drive cost-efficiency improvements through model architecture, hardware selection, and scheduling strategies. - Evaluate new hardware and software offerings, and advise on adoption. - Document performance tuning playbooks and share findings broadly across engineering teams. - Stay current with AI systems research and translate advances into production improvements. Qualifications - Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field. - Six or more years of experience in performance engineering, ML systems, or HPC. - Strong proficiency in Python and C++. - Hands-on experience optimizing deep learning workloads on modern GPUs. - Deep understanding of distributed training and inference techniques. - Experience with profiling tools across CPU, GPU, and distributed systems. - Familiarity with model compression techniques and their accuracy implications. - Strong grasp of memory hierarchies, communication primitives, and parallelism strategies. - Excellent measurement, debugging, and analytical reasoning skills. - Strong communication and collaboration skills. Preferred Qualifications - Experience optimizing LLM inference at production scale. - Contributions to vLLM, TensorRT-LLM, DeepSpeed, or similar projects. - Familiarity with custom kernel authoring in Triton or CUTLASS. - Experience with FinOps for AI workloads. - Publications or talks on AI systems performance. How to Apply Would you like to know more about this opportunity? For immediate consideration, please send your resume to [email protected] . Learn more about Bright Vision Technologies at www.bvteck.com .

Related Categories

Related Job Pages

More Performance Marketing Jobs

Beilmann Marketing logo

Media Buyer / Performance Manager – Google Ads

Beilmann Marketing

Mehr Online Shop Umsatz mit Google Ads | Youtube Ads | Shopping Ads | Search Ads | Content Produktion

Full TimeRemoteTeam 11-50Since 2019H1B No Sponsor

• You manage the budgets for your clients' Google Ads campaigns • Operational campaign work: You implement performance marketing campaigns in the SEA domain • Collaborate with Account Strategists • Google Ads management • Keyword and ad creation • Creative development

Germany
€37K - €58K / year
Full TimeRemoteTeam 51-200Since 2020H1B No Sponsor

• Eigenständige Skalierung von Meta-Kampagnen mit Fokus auf profitabler Neukundenakquise. • Analyse des gesamten Funnels vom ersten Ad-Klick bis zum Checkout. • Entwicklung und Optimierung skalierbarer Direct-Response-Funnels. • Zusammenarbeit mit Direct Response Copy, Creative Strategy und CRO. • Aufbau eines strukturierten Creative-Testing-Frameworks. • Datengetriebene Steuerung anhand von CAC, LTV, und Funnel-Metriken. • Kontinuierliche Markt- und Wettbewerbsforschung.

Germany
Paddle logo

Senior Paid Search Manager

Paddle

We’re the only complete payments infrastructure provider for SaaS companies.

Full TimeRemoteTeam 201-500H1B Sponsor

• Lead paid search strategy and execution: own end-to-end planning, campaign build, and ongoing optimisation across Google Ads and Microsoft Advertising • Scale performance through structured optimisation: manage bid strategies, keyword targeting and audience lists with precision • Run a rigorous experimentation programme: design and execute ongoing A/B tests across ad copy, audience targeting, bidding approaches, and landing pages • Own landing page strategy and testing: partner closely with the Web team to ensure post-click experiences align with ad intent • Work closely with Sales: regularly assess lead quality in partnership with the Sales team • Align with Organic Search: work closely with the Organic team to share keyword intent signals • Own attribution and reporting: measure and communicate the impact of paid search on leads, MQLs, SQLs, pipeline, and revenue • Manage budget with rigour: track and forecast spend, pacing, and efficiency metrics • Stay ahead of the curve: monitor platform developments, algorithm changes, and industry shifts

United Kingdom
Full TimeRemoteTeam 1-10H1B No Sponsor

• Lead Paid Media strategy and execution across channels including Google Ads, Meta (Facebook/Instagram), LinkedIn Ads, Display, YouTube, and Streaming Media • Own campaign performance for assigned accounts, ensuring campaigns align with partner goals such as inquiries, applications, enrollments, and lead generation • Monitor campaign performance and provide clear reporting, insights, and optimization recommendations to partners and internal teams • Identify and execute ongoing testing opportunities across channels, audiences, and creative approaches to improve results • Collaborate with creative, analytics, and account teams to ensure smooth execution and strong campaign performance • Support and mentor junior Paid Media team members while helping maintain high execution standards • Stay informed on Paid Media trends, platform updates, and innovations within higher education marketing

Canada