Kipu Systems logo
Kipu Systems

In an environment of rapid change, millions are struggling to cope. Kipu is here to help. Having shaped the industry for 10 years, today we focus on advancing our New Vision for the behavioral health ecosystem, evolving how it operates, interacts, communicates, and heals. We are an equal opportunity employer and highly value diversity at our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Director of DevOps

Location

United States

Posted

53 days ago

Salary

0

Seniority

Lead

No structured requirement data.

Job Description

Director of DevOps

Kipu Systems

Role Description We’re looking for a Director of DevOps and CloudOps to own the reliability, security, and scalability of Kipu’s platform infrastructure across AWS and Azure. You’ll lead the team responsible for keeping our systems running, our deployments fast and safe, and our infrastructure ready to support the next phase of Kipu’s growth. This is a hands-on leadership role—you write scripts, build automation, maintain and upgrade internal tools, and architect solutions alongside your team every day. Kipu is the leading technology platform for behavioral health, operating in a HIPAA-regulated environment where uptime and data security are non-negotiable. You’ll manage two established teams and individual contributors—approximately 13 engineers across DevOps Engineering, DevOps Production Support, and specialized infrastructure roles. A critical part of this role is enabling the broader engineering organization: multiple product teams are spinning up new services at a fast pace, and your team is responsible for standing up all CI/CD pipelines, container orchestration (Kubernetes, AWS ECS), cloud infrastructure, and observability for every new service that ships. You’ll partner closely with engineering, product, and security to ensure our cloud infrastructure is a competitive advantage, not a constraint. What you’ll do - Infrastructure strategy and operations: - Own Kipu’s cloud infrastructure strategy across AWS and Azure, including architecture decisions, cost optimization, and capacity planning. - Drive reliability and availability targets, establishing and maintaining SLAs/SLOs that align with customer and business expectations. - Lead incident response, root cause analysis, and post-incident review processes to continuously improve system resilience. - Manage infrastructure budgets and optimize cloud spend without sacrificing performance or security. - Write Python, Bash, and other scripts daily to automate operations, solve problems, and improve workflows. Own and evolve infrastructure-as-code (Terraform, CDK, Ansible). - Maintain, upgrade, and develop internal DevOps applications and automation tools used across the organization. - CI/CD and release engineering: - Design and maintain CI/CD pipelines (Jenkins, GitHub Actions) that enable engineering teams to ship with speed and confidence. - Establish release engineering standards, including deployment strategies (blue-green, canary, feature flags) and rollback procedures. - Reduce build times, flaky tests, and deployment friction across the engineering organization. - Serve as the infrastructure partner for product engineering teams spinning up new services—own the process of onboarding each service into CI/CD, container platforms (Kubernetes, ECS), and cloud infrastructure. - Drive standardization of service deployment patterns, infrastructure templates, and operational runbooks across all teams. - Security, compliance, and governance: - Ensure infrastructure meets HIPAA, SOC 2, and other regulatory requirements, partnering with security and compliance teams on audits and remediation. - Implement and enforce infrastructure security best practices, including network segmentation, IAM policies, secrets management, and encryption at rest and in transit. - Maintain disaster recovery and business continuity plans, including regular testing and validation. - Own security risk identification, assessment, and remediation across the infrastructure—proactively identify vulnerabilities and drive fixes across cloud resources. - Manage security patching, hardening, and compliance remediation at scale across AWS and Azure environments. - Observability and platform reliability: - Build and evolve Kipu’s observability stack: monitoring, alerting, dashboards, logging, and distributed tracing (Datadog, CloudWatch, Azure Monitor). - Establish a data-driven approach to reliability, using SLIs and error budgets to balance velocity with stability. - Proactively identify and address infrastructure risks before they become customer-facing incidents. - Design and enforce observability standards for every new service—ensure teams ship with proper metrics, logging, and alerting from day one. - Provide production support and operational guidance to other engineering teams across the organization. - Team leadership: - Lead and mentor two managers and their teams, plus direct IC reports (~13 total headcount), fostering a culture of ownership, accountability, and continuous improvement. - Define team structure, hiring plans, and career development paths as the organization scales. - Collaborate cross-functionally with engineering, product, and security leadership to align infrastructure priorities with business goals. Qualifications - 8+ years of experience in DevOps, CloudOps, SRE, or infrastructure engineering, with at least 3 years leading teams. - Deep expertise in AWS (EC2, ECS/EKS, RDS, S3, Lambda, VPC, IAM, CloudWatch, Secrets Manager), including networking, compute, storage, and cost optimization. - Strong background in CI/CD pipeline design, release engineering, and deployment automation. - Experience operating infrastructure in a HIPAA-compliant or similarly regulated environment. - Proven track record building and maintaining observability stacks (monitoring, alerting, logging, tracing). - Infrastructure-as-code fluency: Terraform (required), AWS CDK, with familiarity in CloudFormation or Pulumi. Experience with configuration management tools (Ansible preferred). - Experience managing containerized workloads at scale (Kubernetes, ECS, or similar). - Demonstrated ability to recruit, develop, and retain strong infrastructure engineering talent. - Working experience with Azure cloud services (Azure DevOps, AKS, Azure Monitor, or equivalent). - Strong scripting, coding, and automation skills in Python and Bash—you write code daily, not occasionally. - Experience building, maintaining, and upgrading internal tools and applications. - Experience with security risk management, vulnerability remediation, and compliance-driven patching across cloud infrastructure at scale. - Demonstrated ability to manage managers and lead through others while remaining technically engaged. - High personal integrity, strong work ethic, and a commitment to doing the right thing under pressure. Nice to have - Experience with Azure, particularly in a multi-cloud or hybrid environment. - Healthcare SaaS or multi-tenant platform experience. - SOC 2 or HITRUST audit experience, including evidence collection and control implementation. - Background supporting data-intensive or AI/ML infrastructure workloads. - Experience leading platform migrations or major infrastructure modernization efforts. - Familiarity with FinOps practices and cloud cost governance at scale. - Experience with PostgreSQL administration and performance tuning. - Familiarity with Datadog, Grafana, PagerDuty, and building observability-as-code. - AWS certifications (Solutions Architect Professional, DevOps Engineer Professional) or Azure equivalents. - Experience with service mesh, API gateways, or zero-trust networking models. - Familiarity with Ruby on Rails, Node.js, or Spring Boot application ecosystems (the services your team will support). Leadership qualities and culture fit - Leads by example—rolls up their sleeves and works alongside the team, not from a distance. - Takes ownership and accountability for outcomes, not just tasks. - Operates with high ethical standards and transparency in all decisions. - Demonstrates commitment and reliability—follows through on promises and is present when it matters. - Builds trust through technical credibility and genuine care for team growth. - Communicates directly and honestly, escalating risks and issues proactively.

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Full Scale logo

DevOps, Infrastructure Engineer

Full Scale

Build software development teams quickly and affordably.

DevOps Engineer53 days ago
Full TimeRemoteTeam 201-500Since 2018H1B No Sponsor

• Design, build, and maintain cloud infrastructure on AWS • Manage and optimize services such as ECS/Fargate, RDS, ElastiCache, S3, and CloudFront • Build and maintain CI/CD pipelines using GitHub Actions • Implement infrastructure as code using Terraform or AWS CDK • Manage SSL/TLS certificates and ensure secure communication across services • Set up and improve monitoring and alerting using CloudWatch, Datadog, or similar tools • Support system reliability, scaling, and performance optimization • Collaborate with developers to improve deployment workflows and production stability

Philippines
Job Closed
Full TimeRemoteTeam 201-500Since 2018H1B No Sponsor

• Manage and improve cloud infrastructure with a strong focus on reliability and scalability • Implement and maintain bot protection and anti-scraping systems • Work with and optimize solutions such as Cloudflare, DataDome, or Akamai • Support CI/CD pipelines and deployment automation • Monitor system performance, incidents, and security threats • Collaborate with engineering teams to strengthen application security and uptime • Troubleshoot production issues and implement long-term fixes

Philippines
Full TimeRemoteTeam 10,001

IQVIA is seeking a Sr. Site Contracts Associate with experience working on Clinical Trial Agreements and negotiating the language and budgets with sites in Spain. Please note this role is fully remote so you can be based anywhere but must be fluent in both Spanish and English. This is a full time, temporary position. KNOWLEDGE, SKILLS, AND ABILITIES: - Has advanced knowledge of site contracts within the clinical research area. - Has Site Contracting experience, especially Spain. - Excellent communications skills (verbal and written) in both Spanish and English. - Take part in client meetings. - Manage CTA and budget negotiations with sites. - Ability to exercise discretion and judgment while negotiating with clients and sites. - Must be willing to work in a fast-paced environment with time sensitive material. - Demonstrated ability to work effectively at all levels of an organisation. - Ability to work independently, prioritise and work in a team environment is essential - Strong organisational and interpersonal skills. - Ability to work flexible hours to ensure timelines are met. - Demonstrated ability to perform multiple tasks effectively. - Proficient in Excel, Word and Windows If this role sounds of interest, please apply today! #LI-HCPN #LI-CES #LI-DNP #LI-CT1 IQVIA is a leading global provider of clinical research services, commercial insights, and healthcare intelligence to the life sciences and healthcare industries. We create intelligent connections to accelerate the development and commercialization of innovative medical treatments to help improve patient outcomes and population health worldwide. Learn more at https://jobs.iqvia.com. IQVIA is committed to integrity in our hiring process and maintains a zero tolerance policy for candidate fraud. All information and credentials submitted in your application must be truthful and complete. Any false statements, misrepresentations, or material omissions during the recruitment process will result in immediate disqualification of your application, or termination of employment if discovered later, in accordance with applicable law. We appreciate your honesty and professionalism. At IQVIA, we believe that diversity, inclusion, and belonging empower our mission to accelerate innovation for a healthier world. We create a culture of belonging by valuing the perspectives of all talented employees worldwide and providing them with the opportunity to power smarter healthcare for everyone, everywhere. When our talented employees bring their authentic selves and their diverse experiences to work, they enable us to accomplish extraordinary things. Multifaceted thought processes spark innovation. Multi-talented collaboration harnesses innovation to deliver superior outcomes. Likewise, as part of this culture, IQVIA is committed to ensuring effective equality between women and men, integrating it as a strategic principle in its corporate and human resources policies.

Spain
Public Partnerships | PPL logo

AppSec, DevSecOps Engineer

Public Partnerships | PPL

PPL is an Equal Opportunity Employer dedicated to celebrating diversity and intentionally creating a culture of inclusion. We believe that we work best when our employees feel empowered and accepted, and that starts by honoring each of our unique life experiences. At PPL, all aspects of employment regarding recruitment, hiring, training, promotion, compensation, benefits, transfers, layoffs, return from layoff, company-sponsored training, education, and social and recreational programs are based on merit, business needs, job requirements, and individual qualifications. We do not discriminate on the basis of race, color, religion or belief, national, social, or ethnic origin, sex, gender identity and/or expression, age, physical, mental, or sensory disability, sexual orientation, marital, civil union, or domestic partnership status, past or present military service, citizenship status, family medical history or genetic information, family or parental status, or any other status protected under federal, state, or local law. PPL will not tolerate discrimination or harassment based on any of these characteristics.

DevOps Engineer53 days ago
Full TimeRemoteTeam 1,001-5,000H1B Sponsor

• Integrate security at every phase of the software development lifecycle. • Collaborate with engineering and product teams in Agile/Scrum environments to prioritize, track, and remediate security issues during sprint cycles. • Develop and maintain threat models and perform design reviews. • Lead threat modeling sessions and conduct in-depth security architecture reviews. • Educate development teams on secure coding practices. • Actively support the organization’s secure software development lifecycle (SDLC) initiatives by integrating security controls, processes, and testing into development workflows and CI/CD pipelines. • Integrate security testing tools (SAST, DAST, SCA, IaC scanning) into CI/CD pipelines. • Perform and manage vulnerability assessments, code reviews, and penetration testing. • Secure containerized environments (Docker, Kubernetes). • Ensure cloud infrastructure security (AWS/GCP/Azure) using infrastructure-as-code (IaC) tools like Terraform or CloudFormation. • Support documentation and evidence collection for SOC 2 Type II audits and HIPAA security risk assessments.

United States
$120K - $135K / year