Bjak is a technology company focused on making financial services easy, fun and more rewarding for everyone
Principal Machine Learning Engineer
Location
China
Posted
120 days ago
Salary
0
Seniority
Lead
Job Description
Principal Machine Learning Engineer
BJAK
• Build and own end-to-end ML pipelines spanning data, training, evaluation, inference, and deployment. • Fine-tune and adapt models using state-of-the-art methods such as LoRA, QLoRA, SFT, DPO, and distillation. • Architect and operate scalable inference systems, balancing latency, cost, and reliability. • Design and maintain data systems for high-quality synthetic and real-world training data. • Implement evaluation pipelines covering performance, robustness, safety, and bias, in partnership with research leadership. • Own production deployment, including GPU optimization, memory efficiency, latency reduction, and scaling policies. • Collaborate closely with application engineering to integrate ML systems cleanly into backend, mobile, and desktop products. • Make pragmatic trade-offs and ship improvements quickly, learning from real usage. • Work under real production constraints: latency, cost, reliability, and safety
Job Requirements
- Strong background in deep learning and transformer-based architectures.
- Hands-on experience training, fine-tuning, or deploying large-scale ML models in production.
- Proficiency with at least one modern ML framework (e.g. PyTorch, JAX), and ability to learn others quickly.
- Experience with distributed training and inference frameworks (e.g. DeepSpeed, FSDP, Megatron, ZeRO, Ray).
- Strong software engineering fundamentals – you write robust, maintainable, production-grade systems.
- Experience with GPU optimization, including memory efficiency, quantization, and mixed precision.
- Comfort owning ambiguous, zero-to-one ML systems end-to-end.
- A bias toward shipping, learning fast, and improving systems through iteration.
Benefits
- Health insurance
- Retirement plans
- Paid time off
- Flexible work arrangements
- Professional development
Related Guides
Related Job Pages
More Machine Learning Engineer Jobs
Lead Instructor – Machine Learning Data Associate
Correlation OneCorrelation One is a technology company that is on a mission “to create equal access to data-driven jobs of tomorrow.” As an employer, the company is known for its empowering,
• Deliver live, virtual instruction to large groups of learners (100 to 8,000+) • Conduct large synchronous online lectures on technical content • Prepare and lead virtual classroom sessions • Collaborate with operations personnel to ensure smooth program delivery • Assist in lesson design, development, and improvement • Interact professionally with learners and staff • Maintain a high level of courtesy and respect • Prepare diligently for lectures to ensure high-quality content • Adjust lesson pace and presentation to meet diverse learner needs • Provide thoughtful answers and assistance to learners throughout the lesson
• Build platforms for driving intelligent decisions, interacting with machine learning models, and powering interactions such as conversational interfaces • Collaborate with data scientists, data engineers, data analysts, software engineers, IT specialists, and stakeholders to expand effective use of AI applications • Research and prove out new approaches and algorithms through prototypes • Ensure high-quality system delivery and continually improve workflows and methods
• Develop, deploy, and maintain robust AI and machine learning pipelines for internal and client-driven projects. • Deploy, manage, and scale AI models, including pre-trained models (e.g., LLMs) and custom ML models, into production environments. • Work with data scientists to implement model prototypes into scalable, production-ready AI systems. • Optimize and tune model performance, latency, and cost-efficiency on cloud platforms. • Integrate AI/ML solutions with major cloud platforms (AWS, GCP, Azure) and utilize containerization technologies (Docker, Kubernetes) for consistent deployment. • Apply standard MLOps practices, including continuous integration/continuous delivery (CI/CD), model versioning, monitoring, and maintenance systems. • Collaborate with software engineering teams to ensure seamless integration of AI capabilities into applications and user workflows. • Stay up to date with advancements in AI/ML technologies, Generative AI, and MLOps deployment strategies.
• Design, develop, and maintain high-performance AI and machine learning pipelines for complex internal and client-driven projects. • Deploy, manage, and scale diverse AI models, including pre-trained models (e.g., LLMs, vision models) and custom ML models, into production environments. • Collaborate with data scientists and researchers to operationalize models, transforming prototypes into resilient, production-grade AI systems that meet stringent business requirements. • Optimize model performance, latency, and cost-efficiency across various cloud and edge deployment targets. • Integrate AI/ML solutions with major cloud platforms (AWS, GCP, Azure) and leverage containerization technologies (Docker, Kubernetes) for consistent deployment. • Implement comprehensive MLOps practices, including automated testing, continuous integration/continuous delivery (CI/CD) for models, model versioning, monitoring, and proactive maintenance systems. • Work closely with software engineering teams to ensure seamless and ethical integration of AI capabilities into larger applications and user workflows. • Stay up to date with the latest advancements in AI/ML research, Generative AI, data engineering, and MLOps deployment strategies.



