Staff AI Engineer
Location
United States
Posted
4 days ago
Salary
$150K - $165K / year
Seniority
Lead
No structured requirement data.
Job Description
Staff AI Engineer
Onpoint Healthcare Partners Inc
Role Description Onpoint Healthcare Partners builds Iris, a Medical Agent AI platform that takes administrative work off the plates of providers and care teams. Our agents handle charting, coding, care gap closure, and care coordination across the full patient journey, combining AI automation with expert clinical oversight. More than 2,000 providers across 35+ specialties rely on the platform every day, which means the systems behind it have to be accurate, reliable, and auditable without exception. We are seeking a Staff AI Engineer to design, build, and scale the agent systems behind Iris. This role combines strong software engineering fundamentals with deep experience building production AI systems. The work you ship directly determines how much time providers get back for patient care. The ideal candidate has shipped AI solutions to production that deliver measurable business value, and will evolve and be a good steward of the agent platform architecture. Key Responsibilities - Design and develop AI applications and agent workflows. - Build scalable backend services, APIs, and integrations supporting AI solutions. - Evolve and steward the agent platform architecture, including orchestration, runtime safety, and prompt governance. - Treat prompts and tool schemas as versioned code with staged rollouts and rollback. - Develop and maintain evaluation frameworks for AI agents and models, with CI gates that block bad changes before release. - Design retrieval, memory, and context management strategies. - Build observability, monitoring, and debugging capabilities, including full trace and replay for incident resolution. - Build the MLOps foundation the team is currently missing: training and retraining pipelines, model versioning, model registry, and feature stores, so model work stops being reactive. - Stand up A/B testing infrastructure and automated drift detection that triggers retraining, so model quality is monitored and maintained without manual firefighting. - Track token and tool costs per run, workflow, and tenant, and keep costs predictable as usage scales. - Improve reliability, performance, safety, and cost efficiency of production AI systems. - Partner with Product, Clinical, Data Science, and Engineering teams to deliver AI capabilities. - Own AI deployment, monitoring, and operational excellence. Qualifications - 8+ years of software engineering experience. - 3+ years building AI and ML applications. - Strong Python and/or C# development experience. - Experience deploying AI systems into production environments. - Experience with LLMs, RAG architectures, agent frameworks, and AI evaluation. - Experience with AWS and distributed systems. - Hands-on experience building MLOps infrastructure such as training pipelines, model registries, feature stores, A/B testing, and drift detection. - Strong debugging, observability, and operational skills. Preferred Qualifications - Healthcare industry experience. - Experience with HIPAA, PHI, and regulated environments. - Experience with vector databases, embeddings, and knowledge graphs. - Experience building AI systems at large scale. Success Metrics - Reliable, scalable, and cost efficient AI systems running in production. - Incidents get resolved fast because tracing and replay make root cause obvious. - Cost per workflow stays predictable as volume grows. - Model training, versioning, and retraining run as a managed pipeline rather than a reactive, manual effort. - Strong adoption and measurable business impact. - High trust in AI quality, safety, and operational performance. Physical Demands The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. While performing the duties of this job, the employee is regularly required to speak, hear, read, and type. This is largely a sedentary role. This position requires the ability to occasionally lift office products and supplies up to 40 pounds. Work Environment To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed above are representative of the knowledge, skill and/or ability required. Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties, or responsibilities that are required of the employee for this job. Duties, responsibilities, and activities may change at any time, with or without notice.
Related Guides
Related Job Pages
More AI Engineer Jobs
Analista de Engenharia de Software – Plataforma de IA
Vivo (Telefônica Brasil)Com a conexão, queremos que você descubra novos pontos de vista e aproveite tudo o que realmente importa.
• Projetar, desenvolver e manter **APIs, SDKs e serviços de plataforma**. • Implementar integrações e camadas de orquestração para agentes e serviços de IA. • Garantir **segurança, performance e observabilidade** em sistemas distribuídos em cloud. • Colaborar com squads multidisciplinares, apoiando decisões técnicas e evolução da arquitetura. • Contribuir para a melhoria contínua de pipelines CI/CD e práticas de **DevOps/Infra as Code**. • Participar de discussões técnicas, propondo soluções alinhadas a boas práticas de engenharia. • Apoiar e orientar desenvolvedores menos experientes, fortalecendo a cultura de qualidade.
Role Description Serious Development is a boutique healthcare strategy, product, and software engineering firm. We partner with healthcare organizations to solve complex operational challenges through thoughtfully designed custom software. We are actively seeking a full-stack AI/ML Software Engineer to help us build HealthTech apps and systems for our clients by contributing as a member of one of our agile engineering teams. Responsibilities include: - Implement scalable microservices that will work in concert together as applications. - Shape pieces of technical architecture and integrate our applications with ML & third-party technologies. - Enhance existing features and participate in code reviews. - Help ensure that the team is delivering high quality code and optimize existing systems. - Become an SME and owner of some of the services and systems that you helped implement. - Contribute your experience to make your team and the department stronger. Qualifications - Bachelor of Science degree in Computer Science, Computer Engineering, Software Engineering, or similar engineering major. - 4+ years of experience as an AI/ML Software Engineer with same/similar responsibilities (excludes any internship experience). - Experience engineering with Python. - Experience engineering in Linux and/or MacOS environments. - Experience engineering a SaaS product/platform. - Experience accurately logging all design, development, and consulting hours daily to prevent revenue leakage and ensure precise client invoicing. - Excellent written and verbal communication skills (English Level at least C1- Advanced). - Role based in Europe. - Must have valid work authorization and residency in Europe. Requirements - Operate as a self-sufficient, T-shaped, full-stack AI/ML engineer. - Recommend AI/ML-driven solutions and implement them. - Review technical architecture documentation for the project and ensure that the engineering team's deliverables are implemented accordingly. - Develop code that fulfills requirements specified by the business, technical architects, and clients. - Ensure that the code you deliver has an extremely high level of quality and extremely low potential for defects that will surface in the production environments. - Author documentation of your code that is useful for your team members and other members of Engineering. - Participate as a team-player that works together with your fellow team members to deliver commitments on time. - Dive into code as technical challenges arise to perform root-cause analyses and implement resolutions. - Develop proofs of concepts as needed. - Review code written by your team members to help catch issues before they surface after deployment. - Keep current with engineering best practices, design principles, technology, security, and compliance to apply that knowledge to all the responsibilities above. Benefits - Working proficiency of Portuguese, Spanish, or Ukrainian language skills could be helpful.
• Lead a team of Product Managers across AI Platform, Analytics, Portals, and Platform/Integrations • Set strategy and ladder to the 2026 GA calendar and 2028 vision • Own the AI Platform and agent strategy • Partner with engineering on AI architecture decisions • Define outcomes and articulate measurable customer and business outcomes
Role Description We are looking for an Edge AI Engineer to design, optimize, and deploy machine learning models that run efficiently on resource-constrained edge devices, including mobile platforms, embedded systems, and specialized accelerators. The role requires deep expertise in model compression, quantization, and hardware-aware optimization, along with strong systems engineering skills to ship reliable AI capabilities outside the data center. The ideal candidate has shipped edge AI in production environments where compute, memory, energy, and connectivity constraints fundamentally shape the engineering trade-offs. Key Responsibilities - Design and implement edge AI solutions optimized for diverse hardware including mobile SoCs, NPUs, and embedded accelerators. - Apply quantization, pruning, distillation, and architectural optimization to fit models within edge constraints. - Tune model performance for latency, energy efficiency, and memory footprint on target hardware. - Build cross-platform inference runtimes leveraging frameworks such as TensorFlow Lite, ONNX Runtime, and Core ML. - Optimize models for specific accelerator backends including DSPs, NPUs, and mobile GPUs. - Implement on-device model update, versioning, and rollback workflows that allow safe staged rollouts to large device populations and rapid recovery if a model release behaves unexpectedly in the field. - Design hybrid edge-cloud architectures that gracefully degrade based on connectivity and device capability. - Build telemetry pipelines that respect privacy while enabling continuous improvement. - Collaborate with hardware, firmware, and product teams to align AI capabilities with device constraints. - Implement secure execution paths, model protection, and integrity verification on edge devices. - Develop benchmarking suites that characterize accuracy, latency, and energy trade-offs across devices. - Drive responsible AI considerations including on-device privacy and bias evaluation. - Maintain comprehensive, current technical documentation — including architecture diagrams, design decisions, configuration references, runbooks, and operational procedures — so that the system remains supportable, auditable, and easy to onboard new engineers onto over time. - Stay current with edge AI hardware and software developments, regularly review release notes and community discussions, and translate noteworthy advances into concrete recommendations and adoption proposals for the team. Qualifications - Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field. - Six or more years of experience in ML engineering, with significant work on edge or mobile AI. - Strong proficiency in Python and C++. - Hands-on experience with model compression, quantization, and pruning techniques. - Experience with at least one major edge inference framework. - Solid understanding of mobile and embedded hardware architectures. - Experience deploying ML models to production on mobile or embedded platforms. - Strong performance engineering and profiling skills. - Familiarity with on-device privacy and security considerations. - Strong communication and cross-functional collaboration skills. Preferred Qualifications - Experience with custom NPU or DSP toolchains. - Familiarity with federated learning or on-device personalization. - Exposure to safety-critical or industrial edge deployments. - Open-source contributions to edge AI frameworks. - Experience optimizing LLMs for on-device inference. How to Apply Would you like to know more about this opportunity? For immediate consideration, please send your resume to [email protected] or contact us at (908) 505-3899. Learn more about Bright Vision Technologies at www.bvteck.com .

