Job Closed
This listing is no longer active.
Serverless AI Inference - run any model, at any scale, without managing GPUs
Software Engineer – API Gateway
Location
Canada
Posted
145 days ago
Salary
0
Seniority
Senior
Job Description
Software Engineer – API Gateway
Featherless AI
• support and evolve the API gateway to our inference cloud, which is responsible for • authentication and inference to all models • subscription management and subscription entitlement (e.g. context-length, concurrency limits) • providing the necessary API surface for applications and builders • undertake feature development and bug fixes to keep up with clients, resolve user issues, and onboard new models • improve the reliability of the existing API (increasing instrumentation and monitoring, right-sizing infrastructure) • respond to availability incidents • triage and resolve issues of inference quality and reliability • manage the infrastructure on which our gateway runs
Job Requirements
- first-hand experience of the user’s we’re building for (familiarity with popular open LLMs, common clients, and experience building with LLM)
- experience with the technologies and paradigms of the web (REST, websockets, DNS, networking, opentelemetry)
- experience with significant components of our stack (k8s, node, mikro-orm, fastify, redis, mongodb, python, elastic cloud, cloudflare, sentry, otel)
- ability to debug complex issues across a wide stack and build instrumentation as necessary
- desire to work collaboratively as part of a skilled team
- Alignment with team and company values, including
- bias to action
- responsiveness to users (bug-fixes over features)
- instinct to iterate
- subscribing to that done means proven by usage data
Benefits
- This team operates on Eastern Time
- Remote preference to hire in Toronto, Canada
Related Guides
Related Job Pages
More Full-stack Engineer Jobs
• Design, develop, and implement end-to-end SharePoint-based and web solutions, including front-end components, backend services, APIs, and workflows tailored to client needs. • Build modern, responsive user interfaces using SPFx, React, TypeScript, and other modern web technologies to enhance user experience across SharePoint and Microsoft 365 platforms. • Develop and maintain backend services, APIs, and data access layers using Azure services and RESTful architectures to support scalable and secure applications. • Build and manage seamless integrations between Infodash, SharePoint, Microsoft Teams, Microsoft Graph, and external platforms such as financial systems, directories, legal systems, and other third-party data sources. • Design, build, and integrate AI-powered features into Infodash products, including conversational assistants, intelligent search, summarization, and data-driven insights. • Develop solutions leveraging large language models (LLMs) and AI services (e.g., Azure OpenAI, embeddings, vector search) to enhance user workflows and decision-making. • Implement Retrieval-Augmented Generation (RAG) patterns by combining structured data, documents, and search indexes with LLMs. • Collaborate with architects and product leaders to translate AI use cases into secure, scalable production solutions. • Ensure responsible AI practices, including security, privacy, data access controls, and performance monitoring. • Design and implement solutions to aggregate, transform, and present data from multiple sources (e.g., financial, directory, and client/matter systems) using Azure SQL, Azure Data Factory, and other data services. • Optimize performance across the full stack, troubleshoot complex technical issues, and ensure reliable, secure operation of all solutions and integrations. • Support application deployment, configuration, and monitoring using Azure-based tools and services, with an emphasis on scalability, security, and maintainability. • Work closely with project teams, architects, and client stakeholders to translate business requirements into technical solutions aligned with organizational goals. • Maintain high standards of code quality, documentation, testing, and version control throughout the development lifecycle.
• Lead design and implementation of control-plane services (APIs, schedulers, workflow/rules engines, config & policy management). • Build features around reporting/analytics, global alert filters/policies, re-investigation flows, and customer-visible auditability. • Evolve multi-tenant, role-aware “organization & permissions” models; ensure safe, idempotent, and rate-limited operations. • Define and version public/internal APIs. • Partner with AI Core, Security Integrations, and GTM to prioritize roadmap and translate ambiguity into shipped outcomes. • Provide mentorship and raise the bar on code quality, design reviews, and operational excellence. • Drive quality through automated testing, performance monitoring, and secure coding practices. • Periodically participate in a 24x7 on-call rotation.
• Work with the team to help solve business problems. • Write high quality distributed system software. • Work closely with architects and technical product managers to translate overall system architecture and product requirements into well-designed and implemented software components. • Take ownership of the implementation of individual software components, with high emphasis on quality, test-driven development, and sound software engineering practices. • Participate in software design reviews, you conduct peer code reviews, and provide input and feedback to other members of the development team. • Design, implement, test, deploy and maintain innovative software solutions to transform service performance, durability, cost, and security. • Use software engineering best practices to ensure a high standard of quality for all the team deliverables.
Senior Software Engineer – Authorization AI
Climb Channel Solutions NAA different breed of specialty technology distributor. #ClimbWithUs
• Design, develop, and deploy scalable and highly available cloud-based applications using C#/.NET Core and Azure App Services in performant Kubernetes clusters • Innovate on the Authorization AI team, bringing AI-enabled features and security to our platform • Using Scrum processes, collaborate with cross-functional teams (PM, UX, Dev) to understand requirements, propose technical solutions, and contribute to architectural decisions • Participate in collaborative engineering environment. Mentor and provide technical guidance to other engineers, fostering a culture of continuous learning and growth • Consistently look for ways to improve the development and automation processes (code quality, E2E testing, security scans, etc.), monitoring, and tiered incidence response




