Lead Software Engineer, AI Platform
Location
Texas
Posted
22 days ago
Salary
0
Seniority
Senior
Job Description
Lead Software Engineer, AI Platform
Lennar
• Lead the adoption of ServiceNow and AWS-native AI capabilities • Identify opportunities where AI reduces manual effort and improves service outcomes • Design a repeatable, domain-aligned pattern as a Spotify Backstage Software Template • Architect production-ready MCP servers with integrated authentication and RBAC • Develop reference Python MCP servers for enterprise domains like Workday • Implement shared authentication libraries for Entra ID and OAuth flows • Build a tool-accuracy eval harness to validate LLM tool behavior in CI • Automate registration with AWS Bedrock AgentCore Gateway • Manage container deployment on AWS ECS Fargate using read-only filesystems • Deploy infrastructure-as-code using Terraform or CloudFormation • Instrument observability using OpenTelemetry tracing and structured JSON logging • Conduct technical design reviews and architecture governance • Transfer knowledge to the platform team through live sessions and documentation
Job Requirements
- Python Mastery: 7+ years of production Python using async frameworks like FastAPI or Starlette
- AWS Infrastructure: Deep knowledge of IAM role chaining, Secrets Manager, ECR, and VPC networking
- AI & Agentic Systems: Hands-on experience with Model Context Protocol (MCP) and AWS Bedrock AgentCore
- Identity & Security: Advanced OAuth 2.0/OIDC execution, including On-Behalf-Of flows and JWT validation
- Platform Engineering: Experience authoring Spotify Backstage templates and GitHub Actions CI/CD pipelines
- Enterprise Integration: Proven history of integrating complex REST APIs such as Workday, Salesforce, or ServiceNow
- 8+ years of overall IT experience
- 6+ years of hands-on architecture and development experience
- Demonstrated ability to lead complex, enterprise-scale implementations
- Excellent technical writing skills for runbooks and architecture docs
Benefits
- Robust health insurance plans, including Medical, Dental, and Vision coverage
- 401(k) Retirement Plan with a $1 for $1 Company Match up to 5%
- Paid Parental Leave
- Associate Assistance Plan
- Education Assistance Program
- Up to $30,000 in Adoption Assistance
- Up to three weeks of vacation annually
- Generous Holiday, Sick Leave, and Personal Day policies
- New Hire Referral Bonus Program
- Significant Home Purchase Discounts
- Everyone’s Included Day
Related Guides
Related Job Pages
More AI Engineer Jobs
• Collaborate directly with senior leadership to develop and deliver AI solutions. • Amplify our development capacity. • Own meaningful pieces of development work and solution delivery from end-to-end, UX to deployment.
Applied AI Engineer
Material BankSearch and sample materials from hundreds of leading brands. Order by midnight, receive by 10:30am.
• Design, build, and deploy end-to-end AI-powered product experiences from concept through production. • Architect and implement scalable AI systems leveraging LLMs, embeddings, multimodal models, retrieval systems, agent frameworks, and modern data infrastructure. • Build production-grade multi-agent workflows and orchestration systems using frameworks such as LangGraph, LangChain, Mastra, and custom tooling. • Develop and optimize Retrieval-Augmented Generation (RAG) systems, including embeddings, vector search, retrieval pipelines, chunking strategies, and relevance tuning. • Build multimodal AI workflows that analyze and reason over images, creative assets, and visual datasets using modern multimodal LLMs, embedding models, and specialized tooling such as SAM2/SAM3. • Create AI-assisted experiences for search, discovery, content generation, personalization, and creative workflows across Material Bank’s platform. • Evaluate, refine, and improve AI-generated outputs for quality, tone, accuracy, and creative alignment through testing, iteration, and human-in-the-loop evaluation strategies. • Partner closely with Product, Design, Engineering, Data, and Executive Leadership to identify high-impact opportunities and translate ambiguous ideas into production-ready AI capabilities. • Make architectural decisions that balance speed, scalability, latency, cost, accuracy, and long-term maintainability. • Continuously evaluate emerging AI technologies, models, frameworks, and workflows to identify opportunities that create meaningful business and user value.
Role Description We are seeking a skilled AI/MLOps Engineer to join the innovative team at 99x Brazil. In this role, you will be responsible for designing, deploying, and maintaining scalable machine learning infrastructure and pipelines that enable rapid development and reliable deployment of AI models. You will work closely with data scientists, engineers, and product managers to ensure seamless integration of AI capabilities into production systems. You will play a crucial part in automating ML workflows, monitoring model performance, and optimizing resource utilization in cloud environments. Join us to help drive the future of AI-powered solutions in a fast-paced, collaborative environment. Responsibilities - Design and maintain monitoring and observability solutions for AI applications and ML pipelines - Track logs, metrics, and traces using tools such as CloudWatch, Datadog, or similar platforms - Develop evaluation and testing frameworks for prompts, models, and AI workflows - Perform regression testing and quality validation for LLM-based systems - Manage prompt experimentation, versioning, and A/B testing processes - Debug AI workflows, including model outputs, orchestration pipelines, and infrastructure failures - Support deployment, scaling, and maintenance of AI/ML infrastructure in production environments - Collaborate with engineering and product teams to improve system reliability and performance - Analyze production data and user feedback to drive continuous improvement of AI systems - Contribute to operational best practices, documentation, and incident response processes Qualifications - Experience with DevOps, SRE, MLOps, or AI infrastructure engineering - Strong understanding of monitoring and observability concepts - Hands-on experience with tools such as Datadog, CloudWatch, Grafana, Prometheus, or similar - Experience supporting AI/ML or LLM-based applications in production - Familiarity with prompt engineering, model evaluation, and experimentation workflows - Knowledge of cloud platforms such as AWS, Azure, or Google Cloud - Experience troubleshooting distributed systems and production pipelines - Proficiency in Python, scripting, or automation tooling - Strong analytical and problem-solving skills - Excellent communication and collaboration abilities Nice to Have - Experience with LLM orchestration frameworks - Familiarity with vector databases and RAG architectures - Experience with CI/CD pipelines for ML systems - Knowledge of Kubernetes, Docker, and infrastructure-as-code tools - Experience with AI governance, security, or compliance practices Benefits - Your pick when it comes to employment models: CLT/PJ/Cooperativa - We provide resources for you to grow and learn on the job, including online courses, mentoring, and the latest-gen laptops - A fully remote work environment with flexible working hours - Bonus for any referrals that we end up hiring
• Design, implement, and deploy ML/AI models end-to-end • Maintain and evolve AI systems in production • Bring an AI-forward coding mindset • Partner closely with product, engineering, and data teams • Translate technical tradeoffs into terms non-specialists can act on • Participate actively in code reviews and design discussions • Contribute to AI architecture decisions • Take ownership of meaningful work end-to-end • Raise the bar on engineering practices by supporting junior engineers



