Job Closed
This listing is no longer active.
AI-Native Physical Security
AI Research Engineer (Agents)
Location
United States
Posted
87 days ago
Salary
$165K - $210K / year
Seniority
Mid Level
Job Description
AI Research Engineer (Agents)
Coram AI
At Coram AI, we’re reimagining video security for the modern world. Our cloud-native platform uses computer vision and AI to help businesses stay safe, make smarter decisions, and move faster; from real-time alerts to seamless clip sharing and multi-site visibility. You’ll be joining a small, fast-moving team that values clarity, craftsmanship, and impact. Every person here has a voice, ships meaningful work, and helps shape how AI can make the world safer and more connected. We are looking for engineers who want to build production-grade AI agents powered by the latest LLMs and Claude Code. This role is focused on turning foundation models into reliable, high-performance systems that operate in real environments. What You’ll Do - Design and build autonomous agents using state-of-the-art LLMs - Implement tool use, retrieval pipelines, memory systems, and multi-step reasoning flows - Engineer prompts and system instructions for robustness, reliability, and speed - Optimize latency, cost, and throughput in production - Build evaluation frameworks to measure agent accuracy, tool correctness, and failure modes - Create high-quality datasets for training, fine-tuning, and benchmarking - Develop introspection tooling to debug reasoning chains, hallucinations, and tool misuse - Run structured experiments to improve agent performance through iterative testing What We’re Looking For - Strong experimental mindset with a scientific approach to evaluation and iteration - Experience working with modern LLMs, RAG pipelines, tool calling, and agent frameworks - Deep understanding of failure modes in LLM systems and how to mitigate them - Experience building production systems in Python, Go, or TypeScript - Familiarity with distributed systems, APIs, and real-time infrastructure - Comfort shipping systems that must be reliable, observable, and measurable Bonus Points - Experience building evaluation harnesses or LLM benchmarking systems - Background in machine learning, applied research, or systems performance optimization - Experience optimizing inference latency and cost at scale - Experience debugging complex agent behaviors in real-world environments Skills and qualifications: - BS, MS, or PhD in Computer Science, Engineering, Machine Learning, or a related technical field from top University - 2+ years of experience building software systems (experience working with LLMs, AI agents, or ML systems highly preferred) - Strong programming ability in Python, with experience in Go or TypeScript a plus - Experience working with modern LLM APIs (OpenAI, Anthropic, etc.) and building applications powered by foundation models - Experience building or contributing to production systems that must be reliable, observable, and scalable - Ability to diagnose and mitigate LLM failure modes such as hallucinations, tool misuse, and reasoning errors - Strong experimental mindset with a data-driven approach to improving system performance - Excellent communication skills (written and verbal) in English - Passion for building cutting-edge AI systems at the speed of a fast-growing startup - Resilient and adaptable in challenging, fast-paced environments - Ability to work in an onsite environment, we move faster when we're in the same room What we offer: - Competitive compensation package - 100% Employer-paid medical, dental, vision, and base life insurance - Flexible paid time off and 9 paid holidays - 401(k) with both Traditional and Roth options - Equity in a rapidly growing company - Referral bonuses - Daily team dinners and regular team off-sites to build connection and momentum - The latest Apple tech and unlimited tools so you can win - Unlimited Cursor and Claude Code credits - Direct exposure to our AI-native GTM machinery We're on a mission to transform a $50B+ legacy industry by bringing the power of cutting-edge multimodal LLMs and computer vision to real-world security and operations. From firearm detection to intelligent access control, our AI-native platform turns every camera and sensor into a smart system that enhances safety, efficiency, and awareness. Founded by Ashesh Jain (ex-Lyft Level 5, PhD Cornell) and Peter Ondruska (ex-Lyft, PhD Oxford), Coram AI is backed by Battery Ventures, Mosaic, and 8VC, have raised over $30M, and were named to the CB Insights AI 100 as one of the most promising AI companies in the world. If you're excited to work on mission-critical AI that makes an impact in the real world, we’d love to meet you.
Related Guides
Related Job Pages
More AI Research Scientist Jobs
Generative AI Specialist - Humanities (English and Dutch)
Innodata IncInnodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider-of-choice for 4 out of 5 of the world’s biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine. By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we’re helping usher in the promise of AI. Our global workforce includes over 7,000 employees in the United States, Canada, United Kingdom, the Philippines, India, Sri Lanka, Israel and Germany. We’re poised for a period of explosive growth over the next few years.
Job Title: Generative AI Specialist - Humanities (English and Dutch) Location: Fully Remote within the U.S. (excluding California, Washington, Alaska, Colorado, Montana, New York, Puerto Rico, Nevada, Nebraska) Employment Type: Part-Time; up to 28 hours per week Who we are: Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider-of-choice for 4 out of 5 of the world’s biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine. By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we’re helping usher in the promise of AI. Innodata offers a powerful combination of both digital data solutions and easy-to-use, high-quality platforms. Our global workforce includes over 7,000 employees in the United States, Canada, United Kingdom, the Philippines, India, Sri Lanka, Israel and Germany. We’re poised for a period of explosive growth over the next few years. About the Role: At Innodata, we’re partnering with the world’s leading technology companies to build the future of generative AI and large language models (LLMs). We’re on the lookout for smart, savvy, and curious Generative AI Specialist to join our global contributor community as part of our Subject Matter Expert (SME) on Demand program. This is not a traditional full-time role. It’s a part-time, remote, flexible, project-specific opportunity designed for those who want to make a real impact—on their schedule. Whether you're a writer, linguist, educator, researcher, or just deeply passionate about language and logic, this role lets you contribute to cutting-edge AI development while maintaining control over your time. You’ll be helping LLMs learn the intricacies of language and reasoning—not just how to write, but how to think. If you’ve ever dreamed of shaping the intelligence behind tomorrow’s technology, this is your chance. This is more than just a gig—it’s a rare chance to help shape the future of AI from anywhere in the world, on your own terms. What You’ll Be Doing: Core tasks would include (any/multiple of) but not limited to the following: - Evaluation: Rating/assessing the performance of AI models or algorithms based on their output or behavior through a set of evaluative questions. - Annotation Labeling: Labeling elements of a piece of content rather than the content as a whole. - Classification: Assigning predefined categories or labels to items. - Content Quality: Evaluating the perceived quality and/or appropriateness of content - Content Understanding: Generating labels to advance understanding of a concept, trend etc. - Data Augmentation: Creation of additional training data for machine learning models by applying transformations to the original data, such as modifying images (rotation, flipping, cropping), generating new text (paraphrasing, summarization), or altering audio/video signals (speed modification, pitch shifting) to reduce overfitting and increase dataset diversity. - Grading: Reviewing data and identifying whether or not a product feature works as intended based on the project's guidelines. - Identification Labeling: Labeling model outputs to identify if a piece of content is or isn't something. Examples: identify clickbait; identifying gaming videos; identifying branded content. - Preference Ranking: Ordering or ranking items based on a set of preferences or criteria. - Prompt Generation: Creating prompts or questions that will be used to generate responses from a language model or other AI system. - Relevance Evaluation: Projects that evaluate the relevance of content based on a relevancy scale (1-3, 1-5, etc.). - Response Generation: Generating responses to prompts or questions using a language model or other AI system. - Response Rewrite: Rewriting existing text while preserving the original meaning, often to improve clarity or style and adherence to guidelines. - Response Summarization: Producing concise summaries of longer pieces of text or data. - Similarity Evaluation: Projects where content is compared in order to drive a determination. - Transcription: Converting spoken language or audio content into written text. - Translation: Converting text or spoken language from one language to another. - Data Collection: Gathering and compiling various forms of data to be used for training, evaluating, or fine-tuning the AI models. This may include text, images, videos, audio files, or other types of digital content. Minimum Qualifications: - A Bachelor’s degree or higher in a humanities specialization is required. Advanced degrees are strongly preferred (Master’s or PhD) - Professional or Expert level proficiency (C1/C2) in English and Dutch Salary rates at Innodata vary depending on a wide array of factors, which may include but are not limited to the role, skill set, educational background and geographic location. Innodata is an equal opportunity employer and values diversity. We do not discriminate on the basis of race, religion, color, national origin, gender, gender identity or expression, sexual orientation, age, marital status, veteran status, disability status, or any other legally protected status. Innodata is committed to creating an inclusive environment for all employees and applicants. If you need assistance or accommodation during the application or recruitment process due to a disability, please contact us and we will be happy to assist. Applicants must be legally authorized to work in the United States at the time of hire. Innodata is unable to provide visa sponsorship now or in the future for this position. Please be aware of recruitment scams involving individuals or organizations falsely claiming to represent employers. Innodata will never ask for payment, banking details, or sensitive personal information during the application process. To learn more on how to recognize job scams, please visit the Federal Trade Commission’s guide at https://consumer.ftc.gov/articles/job-scams. If you believe you’ve been targeted by a recruitment scam, please report it to Innodata at verifyjoboffer@innodata.com and consider reporting it to the FTC at ReportFraud.ftc.gov.
Subject Matter Expert - Prompt Creator
DataForce by TransPerfectDataForce by TransPerfect is part of the TransPerfect family of companies, the world’s largest provider of language and technology solutions for global business, with offices in more than 100 cities worldwide. We offer high-quality data for Human-Machine Interaction to some of the most prestigious technology companies in the world. Our department focuses on gathering, enriching, and processing data for Machine Learning in different AI domains. To learn more about DataForce please visit us at https://www.transperfect.com/dataforce . For more information on the TransPerfect Family of Companies, please visit our website at www.transperfect.com .
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description DataForce by TransPerfect is seeking doctoral-level experts in STEM disciplines (Physics, Chemistry, and Biology) to contribute to a highly specialized, high-impact AI training initiative. In this role, you will leverage your own doctoral research to craft "stumper" prompts—complex, PhD-level scientific problems specifically designed to be unsolvable by current Large Language Models (LLMs). You will be responsible for: - Designing original, highly complex, self-contained STEM problems based on your doctoral research or directly related subfields. - Inputting your prompts into multiple designated LLMs to verify that they consistently fail to produce the correct answer ("stumping" the models). - Writing a flawless derivation or calculation leading to the correct answer, including all relevant steps, explanations, equations, and visual diagrams. - Utilizing our proprietary DataForce Platform (DFP) to draft, test, and format your submissions (e.g., drawing chemical structures, creating physics diagrams, utilizing equation editors). - For Senior/QA roles: Reviewing peer submissions, ensuring scientific accuracy, clarity, and adherence to project guidelines. Qualifications - A Ph.D. (completed or in the final stages of defense) in one of the following subdomains: - Biology: Molecular Biology, Biochemistry, Genetics, Physiology, Ecology, Cell Biology - Chemistry: Organic Chemistry, Inorganic Chemistry, Physical Chemistry, Radiation Chemistry, Polymer Chemistry - Physics: Polymer Physics, Basic Physics, Condensed Matter Physics, Quantum Physics, Computational Physics, Applied Physics, High Energy & Particle Physics, Astrophysics - Prior experience with peer review, academic grading, or scientific publishing. - Deep expertise in a specific doctoral research niche. - Familiarity with interacting with GenAI/LLMs and a critical eye for identifying AI hallucinations or logical leaps in scientific reasoning. Requirements - Exceptional analytical and critical thinking skills. - Proficiency in standardized scientific documentation (e.g., SMILES, InChIKey for Chemistry; standard notation and equations for Physics). - Comfort working in an agile data collection/annotation platform environment. - Meticulous attention to detail and a commitment to strictly original work (AI assistance in generating your own solutions is strictly prohibited). Benefits - Remote work location within the US. - Independent Contractor/Freelance engagement model. - Estimated project duration of approximately 6 weeks. - Flexible commitment of 15 hours per week.
Staff Machine Learning Engineer, AI Researcher
JobgetherWe use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best! Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time. #LI-CL1 We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description This role offers the opportunity to lead cutting-edge AI and machine learning initiatives, working closely with a highly-skilled team of engineers and researchers. You will design, train, and optimize ML models, translating advanced research into scalable, production-ready systems that solve real-world problems. The position involves building robust ML pipelines, running iterative experiments, and collaborating cross-functionally to integrate AI technologies into key products. This is a remote-friendly, high-impact role where innovation, experimentation, and technical rigor are highly valued. You will have the chance to influence AI strategy, contribute to state-of-the-art research, and deliver solutions that directly benefit customers. - Design, develop, train, and evaluate machine learning models across research and applied AI initiatives. - Build and maintain robust ML pipelines for data ingestion, feature engineering, model training, and evaluation. - Run iterative experiments, test hypotheses, and refine models to improve performance and accuracy. - Collaborate with researchers, engineers, and stakeholders to translate academic advances into production-ready solutions. - Optimize model performance through fine-tuning, hyperparameter search, and architecture experimentation. - Document experimental results, track findings, and share learnings with the broader team. - Stay current with AI/ML research trends and identify opportunities to apply new techniques to product solutions. - Participate in stand-by, on-call, or off-hours duties during critical research or deployment milestones as needed. Qualifications - Bachelor’s degree in Computer Science, Mathematics, Statistics, or a related field (Master’s or PhD preferred). - 5+ years of experience in industry or research, with deep hands-on expertise training and evaluating ML models, including language models. - Proficiency in Python and ML frameworks such as PyTorch or TensorFlow. - Experience with MLOps tooling and infrastructure (e.g., MLflow, Weights & Biases, Kubeflow, or similar). - Solid understanding of modern NLP, computer vision, and/or reinforcement learning techniques. - Strong analytical and problem-solving skills with the ability to iterate quickly without sacrificing rigor. - Excellent communication skills, capable of presenting experimental results to technical and non-technical stakeholders. - Ability to work independently and collaboratively in a fast-paced, remote-first environment. Benefits - Competitive salary range: $230,000–$275,000, dependent on experience and location. - Comprehensive health, dental, and vision insurance. - Short-term disability and life insurance coverage. - Paid holidays and generous paid time off. - 401(k) plan and company equity. - Fertility treatment benefits. - Eligibility for a discretionary company-wide bonus. - Remote-first work environment with flexible work arrangements. Company Description
Temporary Research Scientist
Worcester Polytechnic InstituteWPI is a vibrant, active, and diverse community of extraordinary students, world-renowned faculty, and state of the art research facilities. WPI is an Equal Opportunity Employer. All qualified candidates will receive consideration for employment without regard to race, color, age, religion, sex, sexual orientation, gender identity, national origin, veteran status, or disability. It seeks individuals from all backgrounds and experiences who will contribute to a culture of creativity, collaboration, inclusion, problem solving, innovation, high performance, and change making. It is committed to maintaining a campus environment free of harassment and discrimination.
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description We are seeking a Temporary Research Scientist to support the healthcare AI project in developing a trustworthy healthcare large language model. - The role involves hands-on LLM fine-tuning (including instruction tuning, PEFT/LoRA, domain adaptation, and alignment) on clinical datasets. - Designing rigorous evaluation protocols. - Contributing substantially to publications by drafting key manuscript sections and preparing results for top-tier venues. Qualifications - Candidates must hold a PhD in a relevant field. - Demonstrate strong expertise in LLM fine-tuning with frameworks like Hugging Face Transformers and PyTorch. - Have a proven publication record with significant contributions in healthcare AI, NLP, or trustworthy LLM research in premier conferences/journals. Requirements - Pay rate: $25 per hour. - This is a fully remote position, working 10 hours per week. Benefits - WPI is an Equal Opportunity Employer. - All qualified candidates will receive consideration for employment without regard to race, color, age, religion, sex, sexual orientation, gender identity, national origin, veteran status, or disability. - It seeks individuals from all backgrounds and experiences who will contribute to a culture of creativity, collaboration, inclusion, problem solving, innovation, high performance, and change making. - It is committed to maintaining a campus environment free of harassment and discrimination.
