Founded in 2000 and based in Mississauga, Ontario, Canada, PointClickCare offers comprehensive services to assist long-term healthcare providers. One of the fir
Principal AI Platform Engineer
Location
United States
Posted
143 days ago
Salary
$179K - $199K / year
Seniority
Lead
Job Description
Principal AI Platform Engineer
PointClickCare
• Design, build, and maintain the core infrastructure layer supporting GenAI products, including model gateways, prompt/versioning stores, vector databases, and LLM evaluation tools. • Implement secure access controls and authentication mechanisms integrated by default into the AI platform components. • Develop and manage observability, monitoring, and logging solutions for GenAI workloads and infrastructure. • Collaborate closely with product and engineering teams to integrate GenAI infrastructure with agent frameworks, and downstream applications. • Optimize infrastructure for scalability, high availability, cost efficiency for production workloads.
Job Requirements
- Extensive experience building and maintain AI platform infrastructure, Kubernetes, and container security.
- Demonstrated expertise in observability, and monitoring frameworks, with a focus on real-time performance (i.e: experience with OpenTelemetry, MLFlow).
- Experience with AI infrastructure components such as vector databases, prompt/versioning stores, and AI IDEs.
- Familiarity with vLLM, SGLang or similar framework to host LLM inference workloads.
- Experience with CI/CD pipelines and automation for AI model deployment and platform operations
- Strong knowledge of authentication and authorization frameworks integrated into AI platforms.
Related Guides
Related Categories
Related Job Pages
More Platform Engineer Jobs
Staff Platform Engineer
KentikKentik is an information technology company specializing in network intelligence. Seeking curious, driven professionals who share its passion for "unlocking the
• Build self-service, declarative and API-driven infrastructure components in go, nodejs • Contribute to our internal deployment tooling (mostly python CLI tools) and service orchestration platform based on Envoy, Nomad and other Hashicorp components • Help formulate and execute our strategy for datastores such as postgres, kafka, redis (reliability, performance, overhead, capacity planning, …) • Improve the reliability of our services, with code and testing improvements as well as internal advocacy and education • Mentoring of junior team members • Create and update technical documentation for infrastructure • Be on the on-call escalation path for services owned by the team
Principal Salesforce Platform Engineer
SuperlanetAdvisory, Staffing, and Multi-State Employer of Record Solutions for Clinicians, by Clinicians.
• Serve as the primary technical owner of the Salesforce platform, including data model, security architecture, integrations, and overall system design • Design and maintain scalable, secure Salesforce solutions aligned with institutional priorities and long-term growth • Own platform governance, technical standards, and best practices • Lead hands-on configuration and customization, including: Custom objects, fields, page layouts, flows, validation rules, and permissions • Implement enhancements that improve operational efficiency, data integrity, and user experience • Partner with IT, data, and analytics teams to support integrations with enterprise systems (e.g., ERP, data platforms, marketing tools) • Ensure data quality, reporting accuracy, and appropriate data access across teams • Work closely with functional leaders to gather requirements and translate them into technical solutions • Ensure Salesforce configuration aligns with healthcare, nonprofit, and university compliance requirements (HIPAA, data privacy, security standards)
• Own and evolve our AWS infrastructure across compute, networking, storage, and managed services • Design and maintain infrastructure that supports high availability, predictable performance, and financial correctness • Lead platform-level architectural decisions, including service migrations and runtime changes (e.g., Redis → Valkey, EKS → ECS/Fargate) • Ensure infrastructure choices align with reliability, cost, and operational simplicity—not just trend adoption • Design and maintain deployment pipelines that are safe, repeatable, and observable • Own system reliability through capacity planning, failure modeling, and controlled change management • Lead incident response and root-cause analysis for infrastructure-level failures • Participate in on-call rotations and continuously improve operational ergonomics • Build and maintain strong observability across infrastructure and services (metrics, logs, tracing, alerting) • Ensure secure configuration of AWS resources, IAM policies, secrets management, and network boundaries • Proactively identify infrastructure risks related to scale, cost, or security and address them before they become incidents • Partner closely with application engineers to ensure platform constraints and capabilities are well understood • Drive infrastructure changes through hands-on implementation • Establish standards and best practices for infrastructure, deployment, and operations as the team grows • Mentor other platform engineers and help raise the overall operational maturity of the organization
Senior Staff Platform Engineer
Veeam SoftwareYour Single Backup and Data Management Platform for Cloud, Virtual and Physical
• Design, build, and evolve cloud infrastructure using modern Infrastructure-as-Code tools (Terraform, Pulumi) • Architect and manage core cloud services, including networking, identity/access control, account/subscription hierarchy, and resource governance • Define and implement cloud-based disaster recovery strategies that meet enterprise RTO/RPO requirements • Drive standards for high availability, security, scalability, and cost-efficiency across our cloud environments (Azure and/or AWS) • Lead initiatives that improve developer experience, platform reliability, and operational excellence • Build internal tooling and automation to simplify cloud adoption and resource provisioning for engineering teams • Serve as a technical leader and mentor, guiding engineers across platform, SRE, and product teams • Partner with security, compliance, and operations teams to enforce cloud architecture policies and guardrails • Represent the platform team in architecture reviews, planning discussions, and cross-functional programs




