CI&T logo
CI&T

Navigate Change

Master AI / Machine Learning Specialist, AWS

Machine Learning EngineerMachine Learning EngineerFull TimeRemoteSeniorTeam 5,001-10,000Since 1995H1B No SponsorCompany SiteLinkedIn

Location

Brazil

Posted

11 days ago

Salary

0

Seniority

Senior

Postgraduate DegreeEnglishAWSDynamoDBRayTerraform

Job Description

Master AI / Machine Learning Specialist, AWS

CI&T

• Design and implement multi-agent AI architectures using AWS Bedrock • Develop agent orchestration logic and collaborative agent workflows • Configure and manage AWS Bedrock Agents, Knowledge Bases, and Guardrails • Build Retrieval-Augmented Generation (RAG) pipelines using vector databases and embeddings • Implement tool integrations using Model Context Protocol (MCP) and API-based services • Optimize LLM behavior through prompt engineering, tuning, and context management • Develop observability and monitoring strategies for AI workflows using CloudWatch and X-Ray • Build scalable event-driven architectures and resilient integration patterns • Design error handling, retry strategies, and graceful degradation mechanisms • Collaborate with engineering, product, and architecture teams to deliver production-grade AI solutions • Support infrastructure automation and deployment pipelines using IaC and CI/CD practices • Ensure governance, security, auditability, and compliance standards across AI systems

Job Requirements

  • Proven experience building production AI/ML systems on AWS
  • Strong hands-on expertise with AWS Bedrock Agents (AgentCore)
  • Experience designing multi-agent systems and agent orchestration workflows
  • Experience with AWS Bedrock Knowledge Bases (RAG), vector embeddings, and OpenSearch Serverless
  • Expertise with AWS Bedrock Guardrails, including PII protection and content governance
  • Experience implementing tool calling, function invocation, and state management
  • Strong prompt engineering and LLM optimization experience
  • Deep understanding of AWS observability tools: CloudWatch, X-Ray, Distributed tracing
  • Experience with: API Gateway, DynamoDB, Event-driven architectures
  • Familiarity with Infrastructure as Code: Terraform, AWS CDK
  • Strong knowledge of RESTful APIs and integration patterns
  • Experience with CI/CD pipelines for ML and AI systems
  • Ability to design resilient and fault-tolerant AI applications
  • Strong communication, collaboration, and technical documentation skills
  • Nice to Have: Experience with Model Context Protocol (MCP)
  • Experience with AWS Step Functions for workflow orchestration
  • Familiarity with: CloudFront, S3, AWS WAF
  • Knowledge of conversational AI UX patterns and hybrid interaction models
  • Experience with session persistence and conversation state management
  • Understanding of compliance and governance requirements: PII handling, Audit trails, Data retention
  • Experience optimizing AWS Bedrock and OpenSearch operational costs
  • Familiarity with LLM evaluation frameworks and AI quality metrics
  • Experience with multi-turn dialogue management and context preservation
  • Knowledge of explainability and AI reasoning visualization techniques

Benefits

  • Health and dental insurance
  • Meal and food allowance
  • Childcare assistance
  • Extended paternity leave
  • Partnership with gyms and health and wellness professionals via Wellhub (Gympass) TotalPass;
  • Profit Sharing and Results Participation (PLR);
  • Life insurance
  • Continuous learning platform (CI&T University);
  • Discount club
  • Free online platform dedicated to physical, mental, and overall well-being
  • Pregnancy and responsible parenting course
  • Partnerships with online learning platforms
  • Language learning platform
  • And many more!

Related Job Pages

More Machine Learning Engineer Jobs

Gametime logo

Senior AI & ML Engineer

Gametime

Gametime is an IT startup whose mobile app allows users to purchase last-minute tickets to sports, music, and theater events quickly and easily without the need

• Design, build, and ship LLM-powered features and agentic workflows that serve real Gametime users in production. • Build and maintain evaluation frameworks, prompt testing pipelines, and regression suites that ensure quality and reliability of AI-powered experiences. • Contribute to the orchestration layer, including agent routing, tool use, state management, and multi-step workflow coordination. • Develop and optimize prompt optimization strategies, structured outputs, and LLM integration patterns across the platform. • Propose architecture decisions and technical designs for review by the team's tech lead, balancing speed with long-term maintainability. • Collaborate cross-functionally with product, engineering, and data teams to translate customer needs into AI system design. • Stay current with the rapidly evolving LLM and agentic AI landscape, bringing practical new techniques into the team's toolkit.

United States
$194K - $228K / year
Job Closed
Dailymotion logo

Senior MLOps Engineer

Dailymotion

The home for videos that matter

Full TimeRemoteTeam 201-500Since 2005H1B Sponsor

Role Description We're hiring our first dedicated MLOps Engineer. Our ML team runs +100 active AI projects on GCP and needs the industrialized infrastructure to match. ML and DevExp teams are moving fast on their respective fronts and they need a dedicated expert to connect the dots, align on the right practices, and help both sides deliver at their best. You'll be that person. You'll work at the heart of an AI-first environment, where shipping models fast and reliably is the core business, not a side project. Your Mission - Empower ML Engineers with the tools, infrastructure, and frameworks they need to iterate fast autonomously. - Accelerate time-to-market for production-ready ML products: seamless integration, proper service connections, access to data and resources. - Own ML CI/CD in close collaboration with the DevExp team, adapting existing frameworks to ML-specific needs, not just consuming them. - Keep ML Engineers in control of their models in production: monitor, troubleshoot, iterate, refine directly in prod, no staging/prod mirror. - Enable large-scale ML experimentation: robust, reproducible, scalable environments for both internal tests and A/B testing in production. - Deliver concrete MLOps building blocks (MLflow, Kubeflow, KubeRay...) and manage GPU infrastructure dynamically, you've seen L40 shortages mid-training, you know how to handle them. - Tackle technical debt on existing projects while laying the right foundations for what's next. - Be the technical mediator between ML and DevExp: understand both sides, propose solutions that stick. - Handle run responsibilities: on-call, post-mortems, level-1 failure analysis. Qualifications - Solid MLOps or DevOps background; projects shipped in prod matter more than years on a resume. - GCP expert: Vertex AI, GKE, GCS, BigQuery. - Full GitOps: FluxCD first, ArgoCD accepted. - Kubernetes in prod, not just in a lab. - Hands-on with real MLOps tools: MLflow, Kubeflow, KubeRay. - GPU-aware: you've managed GPU scarcity at scale during mass training runs. - Python is a must, Bash expected, Go or Rust a plus. - IaC (Terraform), containerization (Docker, Helm), observability (Prometheus, Datadog, Looker). - Fluent in French and English. Requirements - Jupyter Notebooks, broader ML/AI ecosystem. - AI Coding Assistants (Claude, Cursor, Dust). - Data pipelines (Airflow, Dataflow, Kestra). - Data lifecycle management (cost, security, encryption). - Redis clusters and infrastructure performance optimization. Benefits - Don't check every box? Apply anyway. - We're looking for the right person, not the perfect resume. If this role excites you, let's talk. - Dailymotion is an equal opportunity employer. All positions are open to people with disabilities. Need accommodations? Just let us know. Interview Process - HR Interview with Marvin (30mn) - Manager interview with Cédric (1h) - Technical Interview with 2 Architects (1h): no coding test, just a real technical conversation. - ML Interview with Brice & Samuel (1h) - Final Interview with Alan, VP Platform (1h) Additional Information - Type of Contract: Permanent - Department: Operations - Compensation: EUR 75000 - EUR 85000 yearly

France
€75K - €85K / year
Instacart logo

Senior Machine Learning Engineer II, Growth Modeling

Instacart

Instacart invites the world to share love through food. This is how homemade is made.

Full TimeRemoteTeam 1,001-5,000Since 2012H1B Sponsor

• Drive major initiatives and define roadmaps for ML/AI solutions • Identify high-leverage opportunities in incentive optimization • Partner with cross-functional teams to build impactful ML applications • Lead and mentor team members to improve execution efficiency

Canada
$206K - $217.5K / year
SweatPals logo

Senior Machine Learning Scientist

SweatPals

We make the world healthier, happier, and more connected by bringing people together around fitness and health.

Full TimeRemoteTeam 11-50Since 2022H1B No Sponsor

• Frame fuzzy product problems as ML problems and pick the right approach: ranking, retrieval, classification, sequence models, LLM agents, or classic stats • Run end-to-end: data exploration, offline evaluation, prototype, online experiment, iteration • Push to the cutting edge when it matters, stay pragmatic when it doesn't • Own offline metrics (NDCG, recall@k, AUC, calibration) and tie them to online metrics (booking lift, retention, GMV) • Ship models to production with our engineering team. Our ML stack is FastAPI, PostgreSQL, BigQuery, AWS App Runner, with retrieval via FAISS and sentence-transformers, and managed LLM APIs (Claude, Gemini) • Build evaluation harnesses and monitoring so we know when models drift • Develop LLM-powered features across HostCopilot (drip campaigns, retention nudges, pricing and content suggestions) and Pal-facing surfaces (AI Concierge, semantic search, recommendations) • Partner with product to size opportunities and translate findings into roadmap decisions • Set the bar for the squad on ML rigor: offline evaluation, experiment design, and writeups

Europe