Movable Ink logo
Movable Ink

Movable Ink personalizes every customer engagement through automation and artificial intelligence. The world’s most innovative brands rely on Movable Ink to maximize revenue, simplify workflow and achieve the optimal customer experience. Headquartered in New York City with 600 employees, Movable Ink serves its global client base with operations throughout North America, Central America, Europe, and Australia.

Lead Site Reliability Engineer

DevOps EngineerDevOps EngineerFull TimeRemoteSeniorTeam 501-1,000Since 2010H1B No SponsorCompany SiteLinkedIn

Location

California + 1 moreAll locations: California | Canada

Posted

27 days ago

Salary

$154K - $200K / year

Seniority

Senior

Job Description

Lead Site Reliability Engineer

Movable Ink

• Define and drive the automation strategy for infrastructure tooling, establishing standards that minimize manual work, increase performance and reduce incident frequency and severity of incidents • Own the design, reliability and evolution of core platform applications, mentoring team members on best practices and ensuring systems meet long-term business objectives • Architect and lead the logging platform strategy, driving its design and balancing availability, retention and cost optimization • Establish capacity planning and performance management frameworks, proactively identifying scaling opportunities and guiding teams through complex troubleshooting scenarios • Lead cross-functional reliability initiatives with SRE and service engineering teams, influencing architectural decisions and championing practices that ensure resilient service delivery • Demonstrate a high level of autonomy in anticipating, identifying, and addressing systemic weaknesses and opportunities for platform improvement without direct supervision.

Job Requirements

  • Proven track record in Site Reliability or Software Engineering, designing, building, and owning scalable, resilient services with a focus on long-term reliability strategy
  • Deep expertise in architecting and operating complex distributed systems such as Apache Pulsar, Apache Kafka, Grafana Loki, ScyllaDB/Cassandra, with the ability to guide teams through distributed system challenges
  • Designing and owning automation strategies to manage services at scale, with expertise in establishing performance analysis frameworks and mentoring others on diagnostics and resolution
  • Deep, hands-on experience (6+ years) in Site Reliability or Software Engineering, specifically leading and shaping multi-cloud architecture and strategy (AWS and GCP).
  • Experience architecting and leading large-scale observability platforms, including defining observability standards and SLO frameworks. We use Prometheus and Thanos with Grafana Alloy, Loki and Tempo
  • Experience leading on-call excellence, including driving improvements to monitoring and alerting strategies, automating runbooks and mentoring team members on incident response best practices. Every member of the SRE team does a week long on-call rotation
  • Expert-level proficiency with infrastructure as code, including defining IaC standards and patterns across teams. We use Terraform and Chef
  • Advanced Kubernetes expertise, including cluster architecture design, multi-tenancy strategies, and guiding teams on container orchestration best practices. We use EKS and GKE
  • Proficiency in multiple programming languages with the ability to design and review code that meets reliability standards. We use NodeJS, Golang, Ruby, Python and shell scripting
  • Advanced Linux systems expertise, with the ability to diagnose complex system-level issues and mentor others on performance tuning and troubleshooting.

Benefits

  • full range of medical
  • financial
  • other benefits

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Full TimeRemoteTeam 1,001-5,000Since 2007H1B No Sponsor

Role Description Triumph is looking for a hands-on, forward-thinking Lead DevOps Engineer to help scale and strengthen our cloud infrastructure. In this role, you’ll lead the design and evolution of secure, reliable, and high-performing systems while partnering closely with engineering, security, and operations teams. If you enjoy solving complex problems, improving systems at scale, and driving meaningful technical change, this is a great opportunity to make an impact. You’ll spend your day ensuring our cloud platforms run smoothly and efficiently optimizing Kubernetes clusters, improving CI/CD pipelines, and partnering with teams to deliver scalable, secure applications. From troubleshooting complex issues to rolling out automation and guiding infrastructure strategy, your work will directly impact performance and developer productivity across the organization. What You’ll Be Doing - Design, build, and maintain AWS-based cloud infrastructure, CI/CD pipelines, and automation tools - Partner with development teams to ensure applications are scalable, reliable, and secure - Optimize and manage Kubernetes clusters for performance, scalability, and consistency - Develop and maintain Helm charts for containerized applications - Manage and support Kafka clusters and streaming infrastructure - Collaborate with security teams to meet compliance standards (SOC2, SOX, FFIEC) - Monitor system performance using tools like Grafana, Loki, and other observability platforms - Automate deployments, configurations, and operational processes - Recommend and implement improvements to infrastructure design and DevOps practices - Define and implement SLOs and SLIs to enhance system reliability - Continuously improve DevOps workflows and platform efficiency Qualifications - 5+ years of experience in DevOps, cloud engineering, or infrastructure roles - Strong hands-on experience with AWS services (EC2, S3, EKS, RDS, IAM, VPC, and more) - Deep expertise in Kubernetes and Helm - Experience with Terraform (Terragrunt is a plus) and CI/CD tools like Argo CD or GitHub Actions - Familiarity with Kafka, Redis, and Postgres in production environments - Experience managing MSSQL Server and performing database migrations - Solid understanding of networking in cloud and hybrid environments - Comfortable working in Linux environments - Experience supporting large-scale, multi-account AWS environments - Familiarity with Snowflake or Looker is a plus - Strong troubleshooting and problem-solving skills - Ability to communicate technical concepts clearly to different audiences - Experience working in Agile teams - Motivated to learn, grow, and pursue technical certifications Benefits - Medical, Dental, Vision - Paid Time Off - 401k - And much more Compensation Range Annual Salary: $173,464.00 - $281,010.00 Location: Dallas, TX or Remote U.S. excluding the following states: AK, DE, ID, ND, RI, VT, WY Apply now and take the next step in your career. We’re excited to meet you!

United States
$173.5K - $281.0K / year
General Dynamics logo

DevSecOps Engineer

General Dynamics

General Dynamics is a global aerospace and defense company offering products designed to provide safety and security to people around the world. In the past, General Dynamics has p

DevOps Engineer27 days ago

• Responsible for the set-up, maintenance and ongoing development of continuous build/integration infrastructure • Creating and maintaining fully automated CI build processes for multiple environments • Developing build and deployment scripts • Supporting CI/CD tools integration/operations/change management, and maintenance • Support full automation of CI/CD Development and Testing • Supporting policies, standards, guidelines, governance and related guidance for both CI/CD operations and for work of developers • Enable successful release management by moving code from Development and Testing environments to Staging and Production

United States
$129.8K - $161K / year
Job Closed
Full TimeRemoteTeam 10,001+H1B Sponsor

Role Description The Site Visit Coordinator will support Medicaid operations in the Austin, TX region. This role involves conducting in-person site visits to provider offices to ensure adherence to Medicaid policies as part of the enrollment process. The ideal candidate will independently manage their territory, stay current on policy updates, utilize internal tools and reports, and actively participate in team meetings and training while working remotely. Must live within 45 minutes of Austin, TX with ability to travel to/from regional locations daily. - Manage and serve a designated territory in Austin, TX. - Conduct scheduled and unscheduled face-to-face visits with Medicaid providers to verify compliance with program requirements. - Travel up to 3 hours each way using a personal, reliable vehicle. - Stay informed on Medicaid policies, procedures, and available provider support resources. - Independently manage workflow using internal tracking and reporting systems. - Build and maintain positive relationships with providers, internal teams, and peers. - Participate in remote team meetings, training sessions, and collaborative workgroups. - Represent the organization professionally within the provider community. Qualifications - Proficiency in Microsoft Office Suite (Word, Excel, Outlook). - Familiarity with Adobe Acrobat. - Strong interpersonal and communication skills. - Self-motivated with excellent time management and the ability to manage a travel-based schedule. - Attention to detail and the ability to deliver high quality of work. - Organizational and problem-solving skills. - Ability to work independently and collaboratively in a remote environment. - Valid driver’s license and access to a dependable vehicle. Requirements - Minimum of 6 months’ experience in provider-facing or healthcare roles. Preferred Qualifications - Working knowledge of Texas Medicaid programs and policies. Benefits - Market competitive suite of benefits including medical, dental, vision, life, and long-term disability coverage. - 401(k) plan. - Bonus opportunities. - Paid holidays and paid time off.

United States
Amentum logo

Senior Secure DevOps Engineer

Amentum

A Premier Leader in Global Engineering, Project Management, and Solutions Integration.

DevOps Engineer27 days ago
Full TimeRemoteTeam 10,001+H1B No Sponsor

• Design, implement, and maintain secure CI/CD pipelines to support DevOps workflows • Work closely with development, operations, and security teams to integrate security tools and best practices into the software development lifecycle • Automate infrastructure deployment using Infrastructure as Code (IaC) while maintaining security and scalability • Develop and enforce security policies and ensure continuous monitoring of vulnerabilities and risks in the systems • Manage and secure cloud infrastructure (Azure, or GCCH) to optimize performance and compliance • Collaborate with the security team to perform threat modeling and risk assessments, and address identified vulnerabilities • Monitor systems, logs, and events to detect security threats, misconfigurations, and other operational or security issues • Stay current with industry trends in DevSecOps tools, cloud security, and cybersecurity practices • Create technical documentation and workflows for DevOps processes and security implementations • Provide mentorship and promote secure DevOps best practices across development and operations teams.

Virginia
$116K - $144K / year