DevOps Team Lead
Location
United States
Posted
58 days ago
Salary
0
Seniority
Senior
Job Description
DevOps Team Lead
ida (Innovations- und Digitalagentur GmbH)
• Responsible for the organizational and operational management as well as the strategic development of the DevOps team • Consideration of infrastructure projects, structured planning and sensible allocation of capacity • Reliable operation of technical platforms • Continuous development of internal platforms and infrastructure processes • Supporting the team in technical decisions
Job Requirements
- Experience operating modern cloud and infrastructure platforms
- Experience taking responsibility for technical teams, platforms or infrastructure projects
- Team management, capacity planning and prioritization in technical projects
- Operation of cloud and container platforms (e.g., Kubernetes, GCP or comparable environments)
- Strong Linux skills
- Solid understanding of networking (DNS, routing, firewall, ingress/egress)
- Experience with CI/CD pipelines (GitLab)
- Automation (Ansible)
- Infrastructure-as-Code (Terraform/OpenTofu)
- Agile working methods (Scrum, Kanban)
- Clear and structured communication
- Ability to explain technical concepts in an understandable way
- Solution-oriented working style
Benefits
- We strive for diversity.
- We explicitly encourage FLINTA*, BIPoC, LGBTQIA+ people and people with disabilities to apply to ida.
- We have listed some requirements, but if you don’t meet all of them, that’s fine — we look forward to receiving your application.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior SRE
CisionCision is the global leader in consumer and media intelligence, engagement, and communication solutions. We equip PR and corporate communications, marketing, and social media professionals with the tools they need to excel in today's data-driven world. Our deep expertise, exclusive data partnerships, and award-winning products, including CisionOne, Brandwatch, and PR Newswire, enable over 75,000 companies and organizations, including 84% of the Fortune 500, to see and be seen, understand and be understood by the audiences that matter most to them. Cision is committed to fostering an inclusive environment where all employees can be their authentic selves and perform at their best. We believe diversity, equity, and inclusion is vital to driving our culture, sparking innovation and achieving long-term success.
• Design and implement reliability solutions for data ingestion, processing, and delivery pipelines • Define and maintain SLIs/SLOs for data licensing services and manage error budgets • Build automation for deployment, monitoring, and incident response • Enhance system observability through metrics, logging, and tracing • Develop and maintain dashboards and alerts to proactively detect and resolve issues • Participate in on-call rotations and lead incident response efforts • Conduct root cause analysis and drive post-incident improvements • Maintain runbooks and operational documentation • Partner with software and data engineers to embed reliability into system design • Contribute to blameless postmortems and reliability reviews • Share knowledge and mentor junior team members
Join Triumph! At Triumph, our vision is a world where freight transactions are accurate and seamless on the most modern and secure freight transaction network. That’s why we’re looking for passionate, innovative, solutions-oriented people to join our team. We thrive on providing exceptional customer service and we look for team members with an entrepreneurial spirit and a passion to build successful partnerships with our clients. Because at the end of the day our goal is to help our partners businesses run better. We are a fast-growing FinTech company that is looking for a highly skilled Senior DevOps Engineer to join our team. In this role you will be responsible for designing, implementing, and maintaining our organization's DevOps infrastructure. You will work closely with development, security, and operations teams to ensure that our systems are secure, reliable, and scalable. We expect you to be quick to grasp new concepts, thoroughly explore the depths of an issue, and be persistent in understanding the root cause of issues. We hop you have strong interpersonal skills due to continual interaction with managers and users with varying technical backgrounds in a fast-paced work environment. What You’ll Be Doing: - Design, implement, and maintain our organization's cloud infrastructure, including CI/CD pipelines, automation tools, and monitoring systems in AWS. - Work closely with development teams to ensure that our applications are reliable and scalable in a secure and compliant manner. - Own the deployment and maintenance of Kubernetes clusters, ensuring efficient resource utilization, scalability, and high availability. - Develop, maintain, and optimize Helm charts for deploying and managing containerized applications. - Collaborate with security teams to ensure that our systems meet security standards and regulations such as SOX, SOC2, and FFIEC. - Monitor and optimize systems and applications performance. - Automate the deployment and configuration of systems and applications. - Recommend impactful design and process changes to ensure the success of platforms and infrastructure services. - Design and deploy technical implementations to achieve SLOs and SLIs to ensure we are meeting our customer’s expectations. - Continuously evaluate and implement new technologies and tools to improve our DevOps processes. What Makes You a Great Fit: We hope you’ll possess business operations experience and skills, leadership expertise, analytical and critical thinking skills, and attention to detail. Additionally, you should possess the following: - Technical certifications in AWS or related technologies highly desirable - Minimum of 5 years of experience in an IT support role with strong troubleshooting skills. - Comprehensive experience with AWS services including a solid understanding of S3, EBS, EC2, ECS, EKS, ELB/ALB, IAM, VPC, RDS, CloudTrail, and CloudWatch, KMS, Secrets Manager, Route53, SSM, Redshift, Cloudfront, Elastic Beanstalk. - Must have Experience with Kubernetes, Helm Charts. - Experience working with Redis and Postgres. - Experience with CICD platforms like Argo CD, Github Actions. - Experience with Machine Learning, Snowflake, and Looker are a big plus. - Experience running with migration projects and technology consolidation efforts. - Recent experience with Terraform and CI/CD processes in a production environment. - Knowledge of networking fundamentals and network requirements for a hybrid cloud environment. - Experience working within Agile software development methodology. - Experience with Linux. - Ability to write technical documentation and present materials to mixed audiences is required. Skills and Competencies We Value: - Knowledge and understanding of IT concepts, best practices and procedures. - Strong troubleshooting experience. - Self motivated with the ability to work individually or in a team. - Ability to leverage tools to perform day-to-day administration tasks, root-cause analysis and service restoration (such as backup, restore, failover, log interpretation, and performance monitoring). - Ability to multitask and manages work effectively by prioritizing own assignments, schedules, and meetings resulting in timely completion of work. - High degree of personal integrity. - The applicant should eager to learn and obtain technical certification. - Must be able to receive and follow instructions given by management. - Must have the ability develop solutions to unique problems. Work Environment The work environment characteristics described here are representative of those an employee may encounter while performing the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions. - Moderate noise level typical of a business office environment. - Ability to work in a confined office or workstation area. - Ability to sit at a computer terminal for extended periods of time. - Occasional stooping, kneeling, or light physical activity may be required. - Regular use of hands, fingers, and vision for computer and phone work. - Light to moderate lifting may be required. - Regular, predictable attendance is required. - Travel or additional physical requirements may be added as needed. #LI-JC1 Compensation Range Annual Salary: $151,038.00 - $234,109.00 ***Location: Dallas, TX or Remote U.S. excluding the following states: AK, DE, ID, ND, RI, VT, WY *** We offer Medical, Dental, Vision, Paid Time Off, 401k and much more. Go on. Do it. Apply Today!
• Lead current-state assessments of cloud infrastructure practices, team capabilities, delivery bottlenecks, and governance gaps across a globally distributed organization • Design a Cloud Center of Excellence (CoE) operating model that defines ownership boundaries, responsibilities, and escalation paths between the central platform team and product engineering teams • Define a cross-cutting standards framework for AWS account structure, multi-region strategy, security baselines, resilience, and scalability • Design exception-handling and design-checkpoint processes that enable innovation beyond standard patterns without creating bottlenecks • Create a transition roadmap and enablement strategies to reduce product team dependency on the central Cloud team • Partner with security architects to embed security guardrails and IAM governance into the CoE framework • Provide strategic guidance and manage stakeholders across technical and executive audiences
Manager II, DevOps
InComm PaymentsQuando você pensar na InComm Payments, pense em tecnologia inovadora de pagamentos. Fomos fundados há mais de 30 anos e continuamos a ser pioneiros na indústria de pagamentos (FinTech). Desde a nossa criação estamos em continuo crescimento e somos uma equipe de mais de 3.000 funcionários em mais de 34 países ao redor do mundo. Possuímos mais de 400 patentes técnicas globais e uma rede que inclui mais de 525.000 pontos de distribuição no varejo que apontam para nossa experiência no setor. A InComm Payments está altamente focada em nosso pessoal e em seu crescimento, e trabalhamos duro para tornar a sua carreira significativa e gratificante. Valorizamos a inovação, a qualidade, a paixão, a integridade e a responsabilidade em tudo o que fazemos e procuramos pessoas excelentes para se juntarem à nossa equipa à medida que avançamos em direção a um futuro muito brilhante. Antecipamos o desenvolvimento de futuros líderes para nossas equipes no Brasil!
Role Description As a Senior Manager of Dev Ops, you will lead the design, governance, and evolution of an enterprise Kubernetes platform supporting mission‑critical workloads, while shaping the company’s containerization strategy end‑to‑end. This role blends deep technical leadership with team mentorship, cross‑functional influence, and hands‑on platform ownership. - Own the design, build, and operation of enterprise Kubernetes platforms supporting mission-critical workload. - Lead architecture decisions for cluster design, networking, security, scalability, and resiliency. - Oversee daily operations of the Enterprise Containerization Team, ensuring timely, accurate, and customer-focused support for Application Development and Infrastructure teams. - Manage and mentor a group of engineers to meet day-to-day needs and project requirements. - Maintain strong relationships with AppDev Teams to assist and support them in their containerization efforts. - Collaborate closely with Infrastructure teams to drive Kubernetes cluster requirements and resource planning. - Work with other Enterprise IT teams to drive Kubernetes enterprise requirements across enterprise domains (monitoring, support, patching, etc). - Work with EA and Sr Leaders to provide technical guidance on overall containerization direction and roadmaps, driving the enterprise container strategy. - Act as a technical representative in architecture reviews, governance forums, and leadership briefings. - Assist in troubleshooting and diagnosing issues and impacts on Kubernetes cluster and container environments. - Provide training to internal teams on k8s and container coding practices. - Ensure documentation around containerization tooling and processes are up to date and available to internal team members. - Drive automation using Infrastructure as Code and Configuration Management Tools. - Standardize CI/CD integration and Git based workflows for platform and application teams. - Partner with Security and Compliance teams to ensure platforms meet enterprise and regulatory requirements (e.g., PCI, SOC, internal controls). Qualifications - Experience with Kubernetes and Kubernetes management platforms (Rancher, OpenShift) is a requirement. - Experience with Ansible or other automated configuration tools. - Working knowledge of git source control tools such as GitLab, GitHub or Bitbucket is required. - Ability to learn new technologies and translate into working processes. - Demonstrated ability to advance project timelines and effectively communicate status updates to stakeholders. - Communicate effectively with Internal Teams and represent the team effectively and independently including leading in the presentation of technical documentation to various audiences such as architectural standards, blueprints and new capability debriefs. - Ability to communicate with technical and non-technical team members. - Ability to excel within an agile environment (iterative development, continuous integration, shared ownership, etc.) as well as waterfall environments. - Customer service-oriented communication skills including the ability to communicate directly or indirectly with customers, partners, stakeholders and leadership. - Attention to detail, flexible, open to suggestions and possess good communicative interpersonal skills. - Ability to work within a multi-cultural global team environment or independently as needed with little guidance. Requirements - Experience with monitoring and APM tools in a k8s context. - Technical implementation knowledge and experience with compliance-driven environments is a plus (PCI, SOX, HIPAA, etc.). - Experience with infrastructure as code tools (Terraform) is a plus. - Background in cloud platforms (Azure, AWS, or GCP). - Kubernetes or cloud-related certifications (CKA, CKAD, CKS, etc.). Benefits - This position is eligible for the Employee Referral Bonus Program - Tier IV.



