AI Engineer Remote Jobs in Rhode Island (US)
This page tracks remote ai engineer openings that are location-eligible for Rhode Island.
This page tracks remote ai engineer openings that are location-eligible for Rhode Island.
Open jobs
1,429
Hiring companies this week
10
Salary sample
$20 - $169,000
Jobs added last hour
0
1429 Jobs
906 Companies
Hi there! We’re Razorfish. We’ve been leading the marketing industry with our digital expertise since the start of the internet. But in 2020, we did a full reboot. What’s different? It all starts with people. Weird, wonderful, complex people - with diverse backgrounds in strategy, creative and technology. But no matter how different we are, we all have one thing in common. We believe our differences are our strength. So we push for inclusion, challenge convention and bring in new perspectives, to inspire new ideas. Because when we connect by understanding what makes people different, we can create unforgettable experiences that enrich lives. Join us at razorfish.com.
Role Description We are looking for a AI Developer Senior to play a key role in designing, shaping, and evolving the next generation of agent-based architectures. You will work closely with cross-functional teams to build innovative solutions that transform how brands operate, make decisions, and create value. - Lead the design and development of the backend architecture powering conversational agents and LLM-based systems, ensuring scalability, robustness, and long-term evolution. - Work on: - Orchestrating intelligent agents and multi-agent systems - Integrating foundation models (e.g. Google Gemini, Azure OpenAI) - Designing scalable APIs, including real-time streaming capabilities - Building cloud-ready infrastructure for production environments - Your work will directly impact tools used by strategy, media, creative, and data teams. - AI Agent Development: - Design and build AI agents using frameworks such as Google ADK, LangChain, or similar - Implement complex workflows with tool calling (search, retrieval, APIs, databases) - Optimize prompts and evaluate response quality - Backend & APIs: - Develop asynchronous APIs using FastAPI - Design modular and scalable architectures - Implement real-time streaming endpoints (SSE, WebSockets) - LLM Integration: - Integrate with APIs like Google Gemini or Azure OpenAI - Manage context, grounding, citations, and metadata - Optimize token usage and control costs - Infrastructure & Cloud: - Containerize services using Docker - Deploy and manage applications in cloud environments (GCP, Azure, etc.) - Handle secrets securely (Key Vault, Secret Manager, etc.) - Implement advanced observability (logging, metrics, alerting, tracing) - Data & Persistence: - Work with PostgreSQL and SQL-based systems - Use ORMs such as SQLAlchemy - Manage session history, traceability, and data flows - Engineering Quality: - Write clean, maintainable, and well-documented code - Apply testing practices (unit, integration, E2E when needed) - Follow SOLID principles, Clean Architecture, and DDD when relevant - Ensure proper versioning (Git), branching strategies, and code reviews - Build resilient, fault-tolerant systems for production Qualifications - Strong product mindset, focused on business impact and end-user value - Experience working in Agile environments with cross-functional teams - Solid backend experience in Python, building scalable services - Strong experience with asynchronous development (FastAPI or similar) - Hands-on experience integrating LLMs into production systems - Deep understanding of clean code, SOLID principles, and software design - Experience with modern architectures (hexagonal, clean architecture, etc.) - Strong knowledge of SQL databases and data modeling - Ability to make technical decisions with a long-term architectural vision Requirements - Experience with agent frameworks (Google ADK, LangChain, LlamaIndex) - Experience working with Google Cloud - CI/CD experience (GitHub Actions, Azure DevOps) - Real-time streaming implementations - Knowledge of RAG (Retrieval-Augmented Generation) - Experience in marketing, media, or data environments Benefits - Flexible Benefits (Coverflex): Enjoy more than just work with flexible compensation including meal vouchers, health insurance, transportation, and more. - Growth Opportunities: You can advance in your career not only through the experience of working with major clients but also by accessing local and global training programs specialized according to your role, covering both technical and soft skills. - Free Online Training: You can access unlimited courses from LinkedIn Learning and Udemy Catalogs through our artificial intelligence platform "Marcel". - Partner Certifications: You'll have the opportunity to obtain certifications from industry giants such as Meta, Google, or Amazon. - Work from anywhere: Telecommute up to 6 weeks from over 100 countries with our #WorkYourWorld program. - Attractive holidays package including your birthday & Advertising Day off plus some additional days off. Rest is also important! - Well-being: We prioritize the well-being of our staff and organize various health initiatives such as daily meditation or yoga among others.
Role Description This role is responsible for designing, developing, implementing, troubleshooting, and optimizing scalable, high-performance software and product applications. Leveraging industry best practices, the Sr. Software & Product Engineer delivers robust, customer-focused solutions that accelerate product innovation. - Assesses and defines software and product requirements, establishing the specifications and standards that guide scalable, high-quality development. - Executes coding, debugging, testing, and troubleshooting across the full development lifecycle, incorporating AI-assisted and agentic development approaches to maintain quality and delivery efficiency. - Develops and advances software and product capabilities that integrate with design systems, infrastructure, databases, and cloud-based platforms, all with the goal of maximizing operational efficiency. - Evaluates application requirements and architects database solutions that ensure scalability, performance, and data integrity. - Serves as a subject matter expert in AI-assisted development practices; acts as a resource for the engineering team on the effective and disciplined application of AI coding tools, informs development standards around AI use, and reinforces quality expectations through code reviews. Qualifications - Bachelor's degree (or international equivalent) or equivalent experience, required. - 5+ years of related experience, required. - 2+ years of Agentic Engineering required. - Experience with Claude Code, Curser or comparable LLM. - 5+ Python required. - Experience with Git/GitHub, JIRA, Confluence, CircleCI, required. - Experience developing in Agile, SCRUM, or similar iterative methodologies, required. - Experience in fast-growing companies or entrepreneurial environments, required. - 9+ years of related experience, preferred. - Demonstrated knowledge of component-based frontend architecture and modern frontend development principles, enabling scalable, modular front-end development. - Skilled in frontend build tools and development pipeline practices. - Advanced knowledge of software testing methodologies enabling robust, reliable test coverage. - Expert-level knowledge of AI-assisted development tools and agentic coding workflows, with the ability to apply engineering judgment to evaluate, refine, and integrate AI-generated code into production software delivery. - Technical proficiency to translate detailed business requirements into actionable technical specifications and determine the most effective implementation approach using a wide range of tools and technologies. - Industry knowledge of current software engineering practices and emerging development methodologies. - Ability to serve as a technical resource for the engineering team on AI-assisted development practices, guide peer adoption of effective AI tool use, and reinforce quality standards for AI-generated code. - Ability to influence and contribute to architectural design. - Ability to travel less than 5% of the time. - Must be 18 years of age or older. - Must successfully complete pre-employment screening process, as required. - Must successfully complete any required training or orientation courses, as needed. Benefits - Work from anywhere – Thryv is a Remote First company! - Competitive medical, dental, and vision plans, plus a wellness program with added incentives. - 401(k) savings plan with company match and employee stock purchase plan. - Continuing education benefits with tuition assistance programs. - One week of paid time off at the end of the year, in addition to our standard paid time off policy.
The CES Family of Companies is a collection of strong brands and businesses providing food equipment, supplies, service.
• Design and implement advanced AI/GenAI features across applications and SDLC workflows • Build and deploy AI solutions using Azure AI Foundry and LLM platforms • Develop multi-agent workflows using frameworks like LangChain, LangGraph, Semantic Kernel, or LlamaIndex • Implement RAG architectures, prompt engineering, and vector-based retrieval systems • Drive AI-assisted development using GitHub Copilot and establish best practices • Build scalable full-stack applications using Python, TypeScript/JavaScript, and/or C# • Develop APIs, backend services, and work with relational and NoSQL databases • Implement observability, monitoring, and performance optimization for AI systems • Lead troubleshooting, design decisions, and system integrations • Mentor junior engineers and contribute to AI governance and best practices
CES has 26+ years of experience in delivering Software Product Development, Quality Engineering, and Digital Transformation Consulting Services to Global SMEs & Large Enterprises. CES has been delivering services to some of the leading Fortune 500 Companies including Automotive, AgTech, Bio Science, EdTech, FinTech, Manufacturing, Online Retailers, and Investment Banks. These are long-term relationships of more than 10 years and are nurtured by not only our commitment to timely delivery of quality services but also due to our investments and innovations in their technology roadmap. As an organization, we are in an exponential growth phase with a consistent focus on continuous improvement, process-oriented culture, and a true partnership mindset with our customers.
Role Description As an organization, we are in an exponential growth phase with a consistent focus on continuous improvement, process-oriented culture, and a true partnership mindset with our customers. We are looking for the right qualified and committed individuals to play an exceptional role as well as to support our accelerated growth. - Design and implement AI/GenAI features across applications and SDLC workflows - Build AI solutions using platforms like Azure AI Foundry and LLM APIs - Develop agent-based workflows using frameworks such as LangChain, LangGraph, or Semantic Kernel - Implement RAG-based solutions and prompt engineering strategies - Leverage GitHub Copilot for AI-assisted development and productivity - Build full-stack applications using Python, JavaScript/TypeScript, and/or C# (.NET) - Develop APIs, backend services, and integrate with databases (SQL/NoSQL) - Ensure application quality through testing, monitoring, and observability - Collaborate with cross-functional teams (product, UX, data science) - Troubleshoot and optimize AI models, pipelines, and integrations Qualifications - 3+ years of full-stack software development experience - 1–2 years of experience in AI/LLM-based application development - Hands-on experience with AI platforms (Azure OpenAI, OpenAI, or similar) - Knowledge of RAG, embeddings, and vector databases - Experience with AI orchestration frameworks (LangChain, Semantic Kernel, etc.) - Strong programming skills in Python (preferred) and/or JavaScript/C# - Experience with REST APIs and database systems (SQL & NoSQL) - Familiarity with GitHub Copilot or AI-assisted development tools - Strong problem-solving and communication skills - Experience working in Agile environments Benefits - Flexible working hours to create a work-life balance - Opportunity to work on advanced tools and technologies - Global exposure to not only collaborate with the team, but also to connect with the client portfolio and build professional relationships - Highly encouraged for any innovative ideas & thoughts and we support in executing the same - Periodical and on-spot rewards and recognitions on your performance - Provides a better platform for enhancing skills via many different L&D programs - Enabling and empowering atmosphere to work along
From soup to snacks, we've connected people through food they love since 1869.
Role Description An Agentic AI Engineer designs, builds, and deploys autonomous AI systems that can reason, plan, use tools, and execute multi-step workflows with minimal human intervention. Unlike traditional AI/ML engineers who focus on model training and prediction, agentic AI engineers orchestrate goal-driven workflows that integrate models, tools, memory, and business logic to achieve objectives dynamically. Core Responsibilities - Design & Develop Agentic Systems: Build intelligent agents capable of autonomous planning, reasoning, and task execution, often using LLMs (e.g., GPT-class, LLaMA), multi-modal models, and autonomous workflows. - Orchestration & Frameworks: Implement agent orchestration using frameworks like LangChain, AutoGen, CrewAI, Semantic Kernel, or custom solutions. - Retrieval-Augmented Generation (RAG): Design and optimize RAG pipelines for enhanced reasoning with external knowledge, including document ingestion, chunking, embeddings, vector stores, and retrieval ranking. - Tool & Memory Integration: Develop agents that call APIs, databases, and other tools, maintain memory, and adapt based on outcomes. - Evaluation & Monitoring: Create evaluation frameworks for accuracy, grounding, latency, and cost; build observability for agent behavior and failure modes. - Model Adaptation: Fine-tune or adapt foundation models (e.g., via LoRA, adapters) for domain-specific use cases. - Production Deployment: Deploy GenAI/agentic systems in cloud-native environments with CI/CD, versioning, and runtime safeguards. - Cross-Functional Collaboration: Work with data scientists, ML engineers, product teams, and governance/compliance stakeholders. Qualifications - 2+ years in AI/ML system design, deployment, or autonomous agent development. - Programming: Proficiency in Python (and sometimes Java, C#) for AI/ML solution development. - Agent & Workflow Expertise: Experience with agent orchestration frameworks and multi-agent communication protocols. - RAG & LLM Integration: Hands-on with RAG architectures, evaluation methodologies, and LLM integration. - Cloud & DevOps: Experience with cloud platforms (e.g., Azure, AWS) and CI/CD pipelines. - Governance & Compliance: Understanding of responsible AI, security, and compliance in regulated domains (e.g., retail). Company Description The Company is committed to providing equal opportunity for employees and qualified applicants in all aspects of the employment relationship, including consideration for employment, without regard to race, color, sex, sexual orientation, gender identity, national origin, citizenship, marital status, protected veteran status, disability, age, religion, or any other classification protected by law.
Solventum is dedicated to improving healthcare options and health outcomes through cutting-edge solutions in health, materials, and data science. The company ai
Role Description As a Senior AI Engineer, Enterprise Agentic Solution, you will serve as the premier technical authority driving the enterprise-wide architecture, engineering, and deployment of Agentic AI and Generative AI platforms. Operating at a highly senior level, your focus extends beyond data science and model training into the rigorous engineering of scalable, high-performance AI systems. You will architect robust, multi-agent frameworks that integrate seamlessly into mission-critical healthcare operations. Furthermore, you will act as a primary technical liaison, partnering directly with executive stakeholders and healthcare customers to translate complex business challenges into highly reliable, autonomous AI solutions. Key Responsibilities - Agent Development & Engineering: - Build, test, and deploy autonomous, multi-agent systems using frameworks such as AutoGen and LangGraph. - Implement the core logic for agent orchestration, tool utilization, and state management. - Advanced RAG & Data Integration: - Engineer robust data ingestion pipelines capable of processing complex, multi-modal healthcare data. - Implement advanced retrieval techniques, including Graph RAG, and develop solutions for high-accuracy document intelligence (e.g., page-by-page parsing of complex PDFs). - Performance Optimization & Evaluation: - Design and execute prompt engineering strategies. - Establish and monitor rigorous evaluation of metrics for LLM performance to ensure clinical safety, minimize hallucinations, and optimize inference latency in production environments. - Technical Execution & Collaboration: - Partner with data scientists, product managers, and cloud engineers to transition AI models into high-concurrency production environments. - Establish code quality standards, write comprehensive technical documentation, and mentor junior developers. - Operational Rigor: - Build and maintain MLOps pipelines, ensuring secure containerization, CI/CD integration, and comprehensive system telemetry in adherence to healthcare privacy regulations (HIPAA, HITRUST). Qualifications - Bachelor's degree in Computer Science, Software Engineering, AI, or related field AND 8+ years of professional experience in software engineering and ML/AI development OR a Master's degree AND 6+ years of experience. - Deep, hands-on programming expertise in Python and extensive experience building backend systems and APIs. - Direct experience developing and deploying Agentic workflows and orchestration frameworks (e.g., AutoGen, LangChain) in production. - Strong practical understanding of Generative AI paradigms, prompt engineering, and LLM evaluation metrics. - Experience building ML infrastructure and deploying applications using cloud-native technologies (AWS, Azure, or GCP). Additional Qualifications - Hands-on experience implementing Graph RAG and complex document parsing workflows. - Experience with healthcare data standards (e.g., FHIR) and secure data handling practices. - Strong background in system design patterns, microservices, and infrastructure as code. - Demonstrated ability to lead development pods and translate complex business requirements into technical deliverables. Work Location Remote Travel May include up to 10% domestic. Requirements - Must be legally authorized to work in a country of employment without sponsorship for employment visa status (e.g., H1B status). Benefits - Competitive pay and benefits. - Medical, Dental & Vision. - Health Savings Accounts. - Health Care & Dependent Care Flexible Spending Accounts. - Disability Benefits. - Life Insurance. - Voluntary Benefits. - Paid Absences and Retirement Benefits.
To enable broadband service providers of all sizes to simplify, innovate and grow.
Role Description The Calix platform enables Communication Service Providers (CSPs) of all sizes to transform and future-proof their businesses. We are standing up an enterprise AI capability across our product organization, and we are looking for a seasoned technical lead to drive it end to end. This is a hands-on role: you will set the technical direction, architecture, and standards, mentor engineers, and steer external delivery partners while still writing code, building reference implementations, and personally unblocking the hardest problems. This is a remote-based position located in the United States or Canada. What You'll Do: - Enterprise platform rollout & enablement - Own the architecture and rollout of the enterprise AI platform across the product organization. - Build and operate cost-effective infrastructure: model garden setup, budget controls and cost monitoring, model tiering, and fallback open-source models. - Stand up and harden platform foundations: identity federation/SSO, IAM, tenant isolation, security and compliance guardrails, and observability. - Enable users to adopt off-the-shelf capabilities (e.g., NotebookLM, enterprise search, code assistants) and provide the training and ongoing support that drives real adoption. - Design and deliver a self-service “agent factory”: patterns, templates, and in-IDE guardrails so teams can build and maintain their own agents safely. - Product development lifecycle agents - Lead the design and build of agent workflows across the PDLC: - Ideation: requirement ideation and generation, design specification generation. - Development: coding assistants, source-code management. - Deployment: CI/CD, deployment, and monitoring. - Partner directly with product, engineering, and other business teams to understand their requirements and help build out their use-cases. - Establish evaluation, quality, and monitoring practices for agent workflows from sandbox to production. - Cross-platform interoperability - Build agents that interoperate with agents on other enterprise AI platforms, including cross-platform contracts, schema/intent mapping, and cross-perimeter authentication. - Represent the product organization technically when working with other business teams on cross-platform agent designs. - Across all tracks - leadership & hands-on - Own the technical strategy, standards, patterns, and guardrails for agentic workflow development, RAG, security, and governance. - Lead and mentor a team across cloud and enterprise AI tracks; raise the bar through code review, pairing, and design reviews. - Steer external delivery partners: scope work, review deliverables, and hold them to quality and timeline. - Stay hands-on: build reference agents, RAG pipelines, integrations, and MCP connectors, and debug the hardest pieces (including non-deterministic retrieval/generation behavior). Qualifications - 10+ years of software engineering experience, with 4+ years in a technical lead or staff-level role. - Strong, current hands-on coding ability - you still build and ship. Proficiency in Python (and comfort across at least one other modern language). - Experience designing and operating production AI/ML or LLM-based systems: agents, RAG, prompt/eval pipelines, or similar. - Deep familiarity with Google Vertex AI / Gemini and the surrounding services (IAM, networking, observability, cost management). - Experience building developer-facing platforms or internal tooling, and driving adoption with training and support. - Experience integrating systems via APIs and connectors; comfort with authentication and identity federation (OAuth, SSO, workload identity). - A track record of leading technical initiatives across teams and influencing without authority. - Strong communication skills - able to work directly with both engineers and non-technical business stakeholders. Preferred Qualifications - Hands-on with Gemini Enterprise, Vertex AI, NotebookLM, or comparable enterprise AI tooling. - Experience building agentic systems and orchestration (agent development kits, A2A protocols, MCP, tool/function calling). - Experience with LLM gateways and routing (e.g., LiteLLM), model cost optimization, and multi-model fallback strategies. - Experience managing or working alongside systems integrators / delivery partners. - Familiarity with the modern PDLC toolchain (Jira/Confluence, Figma, GitHub, CI/CD) and AI coding assistants. - Experience with AI governance, guardrails, data privacy, and compliance in an enterprise setting. Compensation The base pay range for this position varies based on the geographic location. More information about the pay range specific to candidate location and other factors will be shared during the recruitment process. Individual pay is determined based on location of residence and multiple factors, including job-related knowledge, skills and experience. - San Francisco Bay Area: 156,400 - 265,700 USD Annual - All Other US Locations: 136,000 - 231,000 USD Annual As a part of the total compensation package, this role may be eligible for a bonus. For information on our benefits click here.
Role Description We're hiring a Junior Youth Mentor to help students build the confidence, focus, and resilience they need to perform at a high level in school and in sport. This is a non-clinical coaching and mentorship role focused on mindset, motivation, and everyday skills, not therapy, diagnosis, or treatment. This is an entry-level role. If you're fresh out of school or have a year or two of experience, we want you to apply. What You'll Do - Confidence & Resilience Coaching: Help students build practical skills for handling performance pressure, peer dynamics, and the normal ups and downs of being a young athlete in a high-standards environment. - Partner With Coaches and Guides: Share relevant, day-to-day observations with the adults around each student to support them well while respecting each student's privacy. - Parent Communication: Keep parents informed about their student's engagement, progress, and goals, and discuss how the family can reinforce what's happening at school. - Tech Platform Support: Help students navigate the TSA learning platform to stay focused on academics and athletics without getting stuck on the tools. - Small-Group Workshops: Lead group sessions on practical topics like bouncing back from a loss, managing pre-competition nerves, time management, and healthy competition. Qualifications - Bachelor's Degree: A bachelor's degree is required to apply; psychology, education, kinesiology, social work, sports science, or a related field is a plus. - 0 to 2 years of experience is exactly what we're looking for. Recent graduates are encouraged to apply. - You're Good With Kids: You can engage with children and young adults effectively. - Strong Listening Skills: You allow students to express themselves fully before offering advice. - Comfortable in an Athletic Environment: You understand the dynamics of competition and training. - Clear Written and Verbal Communication: You can write professional updates for parents or coaches. Requirements - Coursework or Experience With K-12 Students: Practicums, school program internships, youth program volunteering, or camp counselor work all count. - Background in Sports or Athletics: Experience in competitive sports is beneficial. - Bilingual (English/Spanish): Helpful for connecting with the full range of families on our campus. Benefits - You'll be part of a small, mission-driven team building something different in K-12 education. - Real mentorship and responsibility. - Opportunity to shape how mental and emotional development is integrated into the daily life of a school. - Great opportunity for someone early in their career who wants to do meaningful work with young people. Compensation $20/hour
Role Description We're looking for a Senior AI Developer Experience Engineer to accelerate the AI tooling transformation that enables our teams to build, test, and ship with speed and confidence. You'll treat developer experience as a product - defining the strategy, driving platform adoption, and delivering measurable improvements to productivity, quality, and reliability across the software development lifecycle. This is a high-impact individual contributor role. You'll partner with senior engineering leaders across the organisation, influence standards and tooling decisions, and champion platform-first thinking - turning developer pain points into durable, scalable solutions that benefit hundreds of engineers. - Define and drive DevEx strategy across engineering organizations, partnering with senior leaders to align investments with business priorities - Design, implement, and optimise CI/CD pipelines and lifecycle automation platforms - Build developer tools and automation to simplify and expedite the software development lifecycle from requirements through release - Define and promote golden paths for building, deploying, and operating services - Reduce tooling fragmentation through consolidation, strong defaults, and principled standardisation - Identify and eliminate friction across build, test, deployment, and operations workflows - Improve and maintain core infrastructure across multi-cloud and hybrid environments - Collaborate with cross-functional teams to drive best practices in reliability, testability, and performance - Represent DevEx strategy in senior technical forums and drive engineering culture improvements - Iterate on evaluation frameworks that keep automation, AI-assisted tooling, and coding agents reliable and aligned with sound software engineering practices Qualifications - 8+ years of professional software engineering experience, with at least 3 years focused on developer productivity, platform engineering, or internal tooling - Strong hands-on coding skills in one or more modern languages such as Python, Go, Java, TypeScript, Kotlin, or Swift - Deep experience with CI/CD systems, build infrastructure, testing frameworks, and deployment automation - Demonstrated ability to influence engineering practices across multiple teams or organisations - Exposure to AI-assisted engineering workflows, coding agents, or LLM evaluation pipelines - Experience improving developer productivity at a company with hundreds or thousands of engineers - Experience identifying, measuring, and systematically eliminating friction in the software development lifecycle - Ability to balance standardisation with developer autonomy - knowing when to enforce defaults and when to empower teams - Proven track record of translating technical strategy into measurable outcomes - Experience operating critical systems with high reliability, observability, and availability requirements - Clear and effective communication skills with the ability to engage both technical and leadership audiences Requirements - Nice to have: - Experience building and scaling internal developer platforms as products (e.g., Backstage or similar portals) - Experience with application lifecycle management or automation at enterprise scale - Passion for engineering excellence and a track record of raising the technical bar across teams Compensation The annual base salary range for this position is as follows, plus equity and benefits: - SF Bay Area, Los Angeles, Seattle, Portland, Boston, New York, and Washington, DC Metro: $187,000 - $220,000 USD - All other US Locations: $169,000 - $198,000 USD - Canada: $172,000 - $202,000 CAD The ranges displayed reflect the target for new hire salaries, and within each range, individual pay is determined by your skills and experience, as well as relevant education. Your recruiter can share more and answer questions about the specific salary range during the hiring process. Salary is just one component of Quo’s total compensation package. Your total rewards package will include equity, extensive medical coverage, a monthly lifestyle stipend, and a flexible PTO policy.
Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid. Project time expectations: Tasks are estimated to require around 10–20 hours per week during active phases, based on project requirements; This is an estimate, not a guaranteed workload, and applies only while the project is active. Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.
Role Description Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this opportunity involves: - Building a dataset to evaluate AI coding agents - how well a model handles real-world developer tasks. - Creating challenging tasks and evaluation criteria within realistic simulated environments: - Build realistic developer environments - a virtual company with codebase, infrastructure, and context (tickets, docs, conversations) that forms a believable development history. - Design tasks from intermediate states of these environments - craft the prompt, define what "solved" means, and ensure the task is solvable by an AI agent. - Write tests that verify agent solutions - accept all valid approaches and reject incorrect ones, neither too strict nor too lenient. - Iterate on tasks and tests based on QA feedback - review agent solutions, analyze failures, and refine until the evaluation is fair and robust. What this is NOT: - Not data labeling. - Not prompt engineering. - Not writing code from scratch - the agent writes most of the code; you guide and evaluate. Qualifications - 5+ years in software development. - Core stack: Python (FastAPI), JavaScript/TypeScript (React), Docker, Postgres, Kafka, Redis. - Experience writing tests (functional, integration). - English proficiency - B2+. Requirements - Deep understanding of where models fail and what scenarios reveal the difference between a good and a bad solution. - Ability to create tasks that genuinely challenge the best models. - Writing tests that accept all correct solutions and reject incorrect ones. Benefits - Compensation up to $50/hr equivalent, depending on level and pace. - Tasks are estimated at ~20 hours each; you set your own schedule. Effort Estimate Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted.
1,419more opportunities are still waiting for you.Log in now and take your next shot before someone else does.
AWS, Azure, JavaScript, Microservices, NoSQL, Python