Job Closed

This listing is no longer active.

LEO Technologies, LLC

We help you hear the voices that matter.

AI/LLM Evaluation & Alignment Software Engineer

Full-stack EngineerSoftware EngineerFull Time Remote SeniorTeam 11-50H1B No SponsorCompany Site LinkedIn

Location

Texas

Posted

64 days ago

Salary

$135K - $160K / year

Seniority

Senior

Bachelor Degree3 yrs expEnglishAWS Azure Cloud Kubernetes Python PyTorch Terraform

Job Description

• Build and maintain evaluation frameworks for LLMs and generative AI systems tailored to public safety and intelligence use cases. • Design guardrails and alignment strategies to minimize bias, toxicity, hallucinations, and other ethical risks in production workflows. • Partner with AI engineers and data scientists to define online and offline evaluation metrics (e.g., model drifts, data drifts, factual accuracy, consistency, safety, interpretability). • Implement continuous evaluation pipelines for AI models, integrated into CI/CD and production monitoring systems. • Collaborate with stakeholders to stress test models against edge cases, adversarial prompts, and sensitive data scenarios. • Research and integrate third-party evaluation frameworks and solutions; adapt them to our regulated, high-stakes environment. • Work with product and customer-facing teams to ensure explainability, transparency, and auditability of AI outputs. • Provide technical leadership in responsible AI practices, influencing standards across the organization. • Contribute to DevOps/MLOps workflows for deployment, monitoring, and scaling of AI evaluation and guardrail systems (experience with Kubernetes is a plus). • Document best practices and findings, and share knowledge across teams to foster a culture of responsible AI innovation.

Job Requirements

Bachelor's or Master's in Computer Science, Artificial Intelligence, Data Science, or related field.
3–5+ years of hands-on experience in ML/AI engineering, with at least 2 years working directly on LLM evaluation, QA, or safety.
Strong familiarity with evaluation techniques for generative AI: human-in-the-loop evaluation, automated metrics, adversarial testing, red-teaming.
Experience with bias detection, fairness approaches, and responsible AI design.
Knowledge of LLM observability, monitoring, and guardrail frameworks e.g Langfuse, Langsmith
Proficiency with Python and modern AI/ML/LLM/Agentic AI libraries (LangGraph, Strands Agents, Pydantic AI, LangChain, HuggingFace, PyTorch, LlamaIndex).
Experience integrating evaluations into DevOps/MLOps pipelines, preferably with Kubernetes, Terraform, ArgoCD, or GitHub Actions.
Understanding of cloud AI platforms (AWS, Azure) and deployment best practices.
Strong problem-solving skills, with the ability to design practical evaluation systems for real-world, high-stakes scenarios.
Excellent communication skills to translate technical risks and evaluation results into insights for both technical and non-technical stakeholders.

Benefits

3 weeks of paid vacation – out the gate!!
Competitive Salary.
Generous medical, dental, and vision plans.
Sick, and paid holidays are offered.

Related Categories

Remote Full-stack Engineer Jobs Remote Software Engineer Jobs Remote Backend Engineer Jobs Frontend Engineer Android Engineer iOS Engineer Game Engineer

Related Job Pages

Remote Full-stack Engineer Jobs Full-stack Engineer Jobs in Texas Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More Full-stack Engineer Jobs

Full Stack Developer

Vianai Systems, Inc.

Building a World Full of Life & Intelligence

Full-stack Engineer64 days ago

Full Time RemoteTeam 51-200H1B No Sponsor

Company Site LinkedIn

• Design, plan, and build all aspects of our products frontend, UI, data pipelines, and backend components. • Take leadership and pursue the best, state-of-the-art solutions, within the dynamic requirements and timelines. • Take E2E ownership of all aspects of the development cycle. • Focus on continuous growth and improvement, in every aspect (personal, products, processes, tools, skills, etc.). • Mentor engineers to develop new skills and grow professionally.

JavaScript MySQL Python React Ruby

View details: Full Stack Developer

California

$110K - $160K / year

Apply

Staff AI Product Engineer

Modern Health

Offering global, personalized mental health care designed to help you feel more resilient, productive, and empowered.

Full-stack Engineer64 days ago

Full Time RemoteTeam 201-500Since 2017H1B No Sponsor

Company Site LinkedIn

• Provide technical leadership on a new team prototyping and experimenting with new AI features • Productionize and ship AI integrations into Modern Health’s core product • Collaborate with cross-functional teams to deliver product features on time • Stay up-to-date with the latest AI technologies and trends

PostgreSQL React React Native

View details: Staff AI Product Engineer

Hawaii

$175.1K - $206K / year

Apply

Job Closed

Senior Software Engineer

Maker&Son

Home of the most comfortable furniture in the world. Handmade with love in the UK, US, AUS, NZ from natural materials.

Full-stack Engineer64 days ago

Full Time RemoteTeam 51-200H1B No Sponsor

Company Site LinkedIn

• Execute full software development life cycle, with a DevOps practice adopting CI/CD • Write well-designed, testable code. We currently use NodeJs, TypeScript, Python, and looking to use Golang for new backend services • Integrate software components into a fully functional software system • Troubleshoot, debug and upgrade existing systems • Deploy programs and evaluate user feedback • Comply with project plans and industry standards • Ensure software is updated with latest features

Drupal Java Node.js Python SDLC Spring SQL TypeScript Go

View details: Senior Software Engineer

United Kingdom

Apply

Senior Fullstack Developer

Clearview

Meet Clearview, a remote-first, distributed software company with team members spread across the globe.

Full-stack Engineer64 days ago

Full Time RemoteTeam 11-50Since 2008H1B Sponsor

Company Site LinkedIn

• Actively drive code review and architecture choices • Mentor other engineers • Work closely with management and clients • Make independent decisions and create initiatives

Flutter JavaScript PHP React React Native WordPress

View details: Senior Fullstack Developer

Bosnia And Herzegovina

Apply

AI/LLM Evaluation & Alignment Software Engineer

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More Full-stack Engineer Jobs

Full Stack Developer

Staff AI Product Engineer

Senior Software Engineer

Senior Fullstack Developer