BJAK logo
BJAK

Bjak is a technology company focused on making financial services easy, fun and more rewarding for everyone

Principal Machine Learning Engineer

Machine Learning EngineerMachine Learning EngineerFull TimeRemoteLeadTeam 51-200H1B No SponsorCompany SiteLinkedIn

Location

China

Posted

120 days ago

Salary

0

Seniority

Lead

EnglishPyTorchRay

Job Description

Principal Machine Learning Engineer

BJAK

• Build and own end-to-end ML pipelines spanning data, training, evaluation, inference, and deployment. • Fine-tune and adapt models using state-of-the-art methods such as LoRA, QLoRA, SFT, DPO, and distillation. • Architect and operate scalable inference systems, balancing latency, cost, and reliability. • Design and maintain data systems for high-quality synthetic and real-world training data. • Implement evaluation pipelines covering performance, robustness, safety, and bias, in partnership with research leadership. • Own production deployment, including GPU optimization, memory efficiency, latency reduction, and scaling policies. • Collaborate closely with application engineering to integrate ML systems cleanly into backend, mobile, and desktop products. • Make pragmatic trade-offs and ship improvements quickly, learning from real usage. • Work under real production constraints: latency, cost, reliability, and safety

Job Requirements

  • Strong background in deep learning and transformer-based architectures.
  • Hands-on experience training, fine-tuning, or deploying large-scale ML models in production.
  • Proficiency with at least one modern ML framework (e.g. PyTorch, JAX), and ability to learn others quickly.
  • Experience with distributed training and inference frameworks (e.g. DeepSpeed, FSDP, Megatron, ZeRO, Ray).
  • Strong software engineering fundamentals – you write robust, maintainable, production-grade systems.
  • Experience with GPU optimization, including memory efficiency, quantization, and mixed precision.
  • Comfort owning ambiguous, zero-to-one ML systems end-to-end.
  • A bias toward shipping, learning fast, and improving systems through iteration.

Benefits

  • Health insurance
  • Retirement plans
  • Paid time off
  • Flexible work arrangements
  • Professional development

Related Job Pages

More Machine Learning Engineer Jobs

Correlation One logo

Lead Instructor – Machine Learning Data Associate

Correlation One

Correlation One is a technology company that is on a mission “to create equal access to data-driven jobs of tomorrow.” As an employer, the company is known for its empowering,

• Deliver live, virtual instruction to large groups of learners (100 to 8,000+) • Conduct large synchronous online lectures on technical content • Prepare and lead virtual classroom sessions • Collaborate with operations personnel to ensure smooth program delivery • Assist in lesson design, development, and improvement • Interact professionally with learners and staff • Maintain a high level of courtesy and respect • Prepare diligently for lectures to ensure high-quality content • Adjust lesson pace and presentation to meet diverse learner needs • Provide thoughtful answers and assistance to learners throughout the lesson

Europe
OtherRemoteTeam 1-10Since 2022H1B No Sponsor

• Build platforms for driving intelligent decisions, interacting with machine learning models, and powering interactions such as conversational interfaces • Collaborate with data scientists, data engineers, data analysts, software engineers, IT specialists, and stakeholders to expand effective use of AI applications • Research and prove out new approaches and algorithms through prototypes • Ensure high-quality system delivery and continually improve workflows and methods

Utah
Job Closed
Full TimeRemoteTeam 201-500Since 1997H1B No Sponsor

• Develop, deploy, and maintain robust AI and machine learning pipelines for internal and client-driven projects. • Deploy, manage, and scale AI models, including pre-trained models (e.g., LLMs) and custom ML models, into production environments. • Work with data scientists to implement model prototypes into scalable, production-ready AI systems. • Optimize and tune model performance, latency, and cost-efficiency on cloud platforms. • Integrate AI/ML solutions with major cloud platforms (AWS, GCP, Azure) and utilize containerization technologies (Docker, Kubernetes) for consistent deployment. • Apply standard MLOps practices, including continuous integration/continuous delivery (CI/CD), model versioning, monitoring, and maintenance systems. • Collaborate with software engineering teams to ensure seamless integration of AI capabilities into applications and user workflows. • Stay up to date with advancements in AI/ML technologies, Generative AI, and MLOps deployment strategies.

United Kingdom
Job Closed
Full TimeRemoteTeam 201-500Since 1997H1B No Sponsor

• Design, develop, and maintain high-performance AI and machine learning pipelines for complex internal and client-driven projects. • Deploy, manage, and scale diverse AI models, including pre-trained models (e.g., LLMs, vision models) and custom ML models, into production environments. • Collaborate with data scientists and researchers to operationalize models, transforming prototypes into resilient, production-grade AI systems that meet stringent business requirements. • Optimize model performance, latency, and cost-efficiency across various cloud and edge deployment targets. • Integrate AI/ML solutions with major cloud platforms (AWS, GCP, Azure) and leverage containerization technologies (Docker, Kubernetes) for consistent deployment. • Implement comprehensive MLOps practices, including automated testing, continuous integration/continuous delivery (CI/CD) for models, model versioning, monitoring, and proactive maintenance systems. • Work closely with software engineering teams to ensure seamless and ethical integration of AI capabilities into larger applications and user workflows. • Stay up to date with the latest advancements in AI/ML research, Generative AI, data engineering, and MLOps deployment strategies.

India
Job Closed