Job Closed
This listing is no longer active.
Cybersecurity & Managed Services firm providing Technical Advisory support to Federal and Commercial customers.
AI/ML Engineer I
Location
United States
Posted
100 days ago
Salary
0
Seniority
Mid Level
No structured requirement data.
Job Description
AI/ML Engineer I
Quzara LLC
Role Description Quzara is seeking an AI/ML Engineer I to support the development, deployment, and operational support of AI-enabled applications serving federal and defense-adjacent customers. This role combines applied AI engineering with practical data science responsibilities to ensure reliable, secure, and high-performing AI systems in production environments. AI/ML Engineer will collaborate with internal development teams and government customers to troubleshoot issues, resolve defects, optimize model behavior, and improve data quality across AI-driven workflows. This is a hands-on engineering role focused on implementation, support, and continuous improvement not research or supervisory leadership. - Develop, integrate, and maintain AI/ML components within secure, production-grade applications - Work directly with customers and internal engineering teams to troubleshoot system behavior and resolve defects - Perform bug fixes, root cause analysis, and performance optimization across AI-enabled services - Diagnosing and remediate data quality issues affecting model performance and system outputs - Support data preprocessing, transformation, validation, and feature engineering pipelines - Implement and optimize Large Language Model (LLM) integrations in secure cloud environments - Assist in prompt engineering, response evaluation, and model tuning - Support retrieval-augmented generation (RAG) pipelines and embedding workflows - Participate in LLMOps practices including monitoring, logging, versioning, and performance tracking of deployed models - Evaluate model outputs for consistency, hallucination risk, bias, and reliability - Integrate AI services via REST APIs within backend systems - Contribute to observability, telemetry, and audit logging aligned with compliance requirements - Document technical implementations in accordance with secure development standards Qualifications - Bachelor’s degree in Computer Science, Engineering, Data Science, or related technical field - 1–3 years of professional experience in AI/ML engineering, data science, or software development - Strong proficiency in Python - Experience troubleshooting structured and unstructured data issues - Familiarity with data preprocessing, feature engineering, and model validation techniques - Hands-on experience working with OpenAI or similar LLM APIs - Familiarity with foundational models and smaller language models (SLMs) - Experience with transformer-based architecture (e.g., Hugging Face Transformers) - Understanding of embeddings, tokenization, fine-tuning concepts, and inference workflows - Experience in integrating AI services into cloud environments (Azure preferred) - Understanding of LLMOps concepts including model monitoring, logging, prompt iteration, and deployment workflows - Ability to debug model outputs and trace issues across data, prompts, and system integrations - Eligible for U.S. Public Trust clearance. Requirements - Experience implementing RAG architecture and vector-based search systems - Familiarity with vector databases (e.g., FAISS, Pinecone, Azure AI Search) - Exposure to containerization (Docker) and CI/CD practices - Experience supporting AI systems in regulated or government environments - Understanding of model evaluation metrics and performance benchmarking Benefits - This full-time role runs Monday to Friday, 8:30 AM–5:30 PM and requires flexibility to work remotely or on-site (if applicable per client RTO policies). - On occasion additional hours may be necessary. EEO Statement The Company is an Equal Employment Opportunity (EEO) employer and does not discriminate based on race, color, religion, sex, sexual orientation, national origin, age, marital status, disability, veteran's status, or any other basis protected by applicable discrimination laws.
Related Guides
Related Job Pages
More Machine Learning Engineer Jobs
• Develop and refine features for deep learning models, working with large-scale customer and behavioral datasets • Implement model architecture changes informed by recent academic research (e.g., papers from NeurIPS and similar venues) • Collaborate with client teams to understand business context and translate requirements into technical solutions • Optimize model training pipelines for efficiency and scalability • Document approaches, findings, and technical decisions for knowledge sharing across teams • Participate in code reviews and contribute to engineering best practices
ML Engineer – Small Language Models
Fastino LabsBuilding the first foundational model for agent personalization.
• Design, build, and deploy the critical small language models that are foundational to Fastino’s product • Own the full lifecycle of state of the art models, from prototyping and data analysis to deployment, monitoring, and the continuous improvement of models in production • Drive the data strategy to continuously improve model performance by analyzing distribution gaps, contributing to synthetic data pipelines, and creating automated annotation systems • Experiment with novel language model architectures, helping drive and execute Fastino's research roadmap • Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards • Build robust and real-world motivated evaluations • Partner with Fastino engineering team to ship model updates directly to customers • Establish best practices for code health and documentation on the team, to facilitate collaboration and reliable development
• Design, develop, and optimize computer vision models for real-time applications on embedded systems • Implement model compression techniques such as quantization, pruning, and knowledge distillation to improve performance on low-power hardware • Deploy machine learning models on embedded platforms, including ARM, NVIDIA Jetson, Qualcomm, or custom ASICs • Write clean, efficient, and well-documented code in Python and C++, leveraging ML frameworks like TensorFlow, PyTorch, and ONNX • Develop and fine-tune SLAM, object detection, tracking, and feature extraction models for high efficiency • Collaborate with cross-functional teams to integrate ML models into production systems, optimizing for latency, accuracy, and power consumption • Benchmark and profile ML models to identify and implement optimizations for inference on embedded hardware • Research and apply cutting-edge ML techniques to improve real-time performance in resource-constrained environments
ML Engineer
Orcrist Technologies GmbHPioneering Future Technologies with Advanced AI and Data Analytics
• Package and deploy models (ASR, translation, OCR, NER, summarization) using Triton/KServe on Kubernetes. • Build evaluation pipelines (WER, BLEU, F1, latency, cost) and automate release gating. • Operate streaming + batch inference via Kafka, Temporal, and backfill tooling. • Monitor drift/quality with Prometheus, Grafana, Evidently; optimize inference cost and performance. • Collaborate with TypeScript teams on payload schemas, contracts, and human-in-the-loop feedback loops.




