Microsoft, SAP, and general IT recruiters | Talent Acquisition Worldwide | Recruitment Outsourcing
Senior DevOps Engineer
Location
United States
Posted
2 days ago
Salary
0
Seniority
Senior
Job Description
Senior DevOps Engineer
Talentuch
• Design, build, and maintain scalable cloud infrastructure. • Manage and optimize AWS environments. • Develop and maintain Infrastructure as Code using Terraform. • Build and improve CI/CD pipelines for multiple engineering teams. • Automate deployment, provisioning, and operational processes. • Support containerized workloads and cloud-native applications. • Monitor system health, performance, and reliability. • Improve observability through monitoring, logging, and alerting solutions. • Implement security best practices across infrastructure and deployment pipelines. • Troubleshoot production issues and participate in root-cause analysis. • Collaborate with engineering and data teams to support AI and analytics workloads. • Contribute to infrastructure architecture and platform evolution.
Job Requirements
- 5+ years of experience in DevOps, Cloud Engineering, or Infrastructure Engineering.
- Strong hands-on AWS experience.
- Solid experience with Terraform and Infrastructure as Code.
- Strong Linux administration skills.
- Experience with Docker and containerized environments.
- Experience building and maintaining CI/CD pipelines.
- Experience with GitHub Actions, GitLab CI, Jenkins, CircleCI, or similar tools.
- Strong scripting skills using Python, Bash, or similar languages.
- Experience with monitoring and observability tools such as Prometheus, Grafana, ELK, CloudWatch, or similar.
- Good understanding of networking, security, VPNs, firewalls, and load balancing.
- Experience supporting production systems in cloud environments.
- Strong troubleshooting and problem-solving skills.
- Fluent English communication skills.
- Nice to Have Kubernetes experience.
- Experience supporting AI, machine learning, or data analytics platforms.
- Experience with Azure or GCP.
- Experience with security and compliance tooling.
- Experience in SaaS or high-scale data platforms.
- Previous exposure to SRE practices and reliability engineering.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior DevSecOps Engineer
SAICSAIC® is a premier mission integrator focused on advancing the power of technology and innovation to serve and protect our world. Our robust portfolio of offerings across the defense, space, intelligence, and civilian markets includes secure high-end solutions in mission IT, enterprise IT, engineering services, and professional services. We integrate emerging technology, rapidly and securely, into mission critical operations that modernize and enable critical national imperatives. We are approximately 23,000 strong; driven by mission, united by purpose, and inspired by opportunities. SAIC is an Equal Opportunity Employer. Headquartered in Reston, Virginia, SAIC has annual revenues of approximately $7.3 billion. For more information, visit saic.com . For ongoing news, please visit our newsroom .
Role Description We are seeking a highly skilled and motivated Senior DevSecOps Engineer with proven experience leading Agile teams. The selected candidate will play a critical role in designing, developing, and implementing DevSecOps practices and tools to enable efficient and secure software delivery pipelines. This position requires strong leadership skills, an in-depth understanding of cloud technologies, security, and Agile frameworks, as well as hands-on expertise in automated software delivery and deployment. The ideal candidate will be passionate about fostering collaboration, driving innovation through automation, ensuring security is embedded throughout the SDLC (Secure Software Development Lifecycle), and mentoring cross-functional teams in an Agile environment. - Establish, maintain, and enforce DevSecOps best practices throughout the software development lifecycle (SDLC). - Evaluate, integrate, and maintain DevSecOps tools and technologies to build and improve automated CI/CD pipelines. - Lead Agile teams by facilitating daily stand-ups, sprint planning, reviews, retrospectives, and other Agile ceremonies. - Ensure that security is a top priority in the design, implementation, and delivery of all development projects. - Collaborate across teams, including software developers, security engineers, and IT operations, to ensure seamless integration of security practices within workflows. - Drive team collaboration and communication, fostering a culture of innovation, accountability, and continuous improvement. - Provide mentorship to development team members on DevSecOps practices, tooling, and Agile methodologies. - Generate reports and provide updates to leadership on project progress, risks, and implemented security measures. Company Description SAIC® is a premier mission integrator focused on advancing the power of technology and innovation to serve and protect our world. Our robust portfolio of offerings across the defense, space, intelligence, and civilian markets includes secure high-end solutions in mission IT, enterprise IT, engineering services, and professional services. We integrate emerging technology, rapidly and securely, into mission critical operations that modernize and enable critical national imperatives. - We are approximately 23,000 strong; driven by mission, united by purpose, and inspired by opportunities. - SAIC is an Equal Opportunity Employer. - Headquartered in Reston, Virginia, SAIC has annual revenues of approximately $7.3 billion. - For more information, visit saic.com . - For ongoing news, please visit our newsroom .
• Own and drive end-to-end operational delivery for cloud-native data platform environments (AWS, Azure) across multiple managed services clients — including environment configuration, infrastructure automation, and platform reliability. • Translate business and technical requirements into resilient, cost-effective platform solutions aligned with phData methodologies, architecture standards, and best practices. • Build, deploy, and maintain infrastructure-as-code configurations and CI/CD pipelines that support repeatable, governed platform delivery. • Monitor and support production data jobs and pipelines (ETL/ELT), ensuring timely resolution of failures and minimizing business impact as you develop depth in data platform patterns. • Ensure engagements are delivered on time, within scope, and with measurable business value for clients. • Collaborate with Solutions Architects, data engineering teams, analytics teams, and client stakeholders to deliver successful, well-integrated client engagements. • Provide technical leadership during troubleshooting sessions, infrastructure reviews, and platform deployments, particularly across cloud services on AWS and Azure. • Ensure high quality in deliverables through clear documentation, deployment guides, runbooks, and adherence to governance and change management processes. • Partner with practice and account leaders to improve operational maturity, standardize delivery patterns, and enhance client satisfaction across a large user base. • Contribute to internal initiatives such as building and enhancing IaC templates, automation scripts, CI/CD frameworks, and operational playbooks for Elastic Platform Operations. • Mentor peers by sharing best practices in cloud engineering, leading knowledge-sharing sessions, and helping up-skill team members on new tools and technologies. • Represent phData with professionalism in all interactions, communicating clearly with both technical and non-technical stakeholders. • Act as a trusted advisor to senior client stakeholders on cloud platform reliability, infrastructure strategy, and performance optimization. • Lead complex infrastructure and platform delivery efforts, coordinating across multiple teams and driving long-term improvements. • Mentor and coach junior engineers, fostering a culture of learning, feedback, and continuous improvement. • Help define and refine Elastic Operations standards, reusable IaC assets, and delivery frameworks for managed services.
Senior Network Deployment Engineer – EU Hours
AstreyaIT services that put people at the center of your business
• Design, plan, and coordinate the implementation of network technologies in support of business and growth requirements. • Validate project requirements, define project scope, develop project schedules, and produce detailed network designs for assigned projects. • Produce work breakdown structures (WBS) that demonstrate understanding of proposed changes and how they will be implemented with minimal service impact. • Perform analysis and diagnosis of highly complex networking problems; build simulated networks in test labs to resolve significant issues and compatibility challenges. • Plan and drive complex network upgrade and migration activity, including highly automated environments and quarterly maintenance events. • Prepare and maintain up-to-date documentation detailing the configuration of deployed solutions; generate network configurations and run books. • Provide mentorship and technical leadership to existing network team members and partner teams during outages and downtimes. • Collaborate with vendors to manage circuit delivery, problem resolution, and network migrations.
Role Description This role exists to ensure the Hyperstack platform — including Hyperstack GPU Cloud, AI Studio and the Investor Portal — is kept running, automated and observable as it scales. As the DevOps team acts as a bridge across every function in the business, we need a capable engineer who can own automation, incident response, observability and internal tooling without waiting to be directed. This is a role for someone who builds first and documents second — someone who finds a manual process and replaces it, who picks up a production incident and drives it to resolution, and who enjoys the visibility that comes from working across an entire business. What You'll Be Doing - Own core DevOps engineering tasks across the Hyperstack platform: automation, incident response, release pipeline support and internal tooling. - Maintain and improve observability tooling (Prometheus, Grafana and the broader monitoring stack) to ensure platform health and early incident detection. - Support Kubernetes operations across two contexts: as a managed product sold to customers, and as the underlying infrastructure powering NexGen Cloud’s own platform. - Act as a first responder for platform incidents alongside the CX team — triaging issues, reviewing code, and confirming whether problems are bugs or expected behaviour. - Build and improve internal tools consumed by other teams including Revenue Ops, Finance, Engineering and CX. - Identify and eliminate manual workload through automation and self-service tooling as the business continues to scale. - Collaborate across a globally distributed, remote-first team and communicate clearly with non-technical stakeholders. Qualifications - Hands-on Kubernetes experience in production — both managed/hosted K8s as a product and self-managed clusters. - Active experience with Prometheus, Grafana and related observability tooling; able to maintain and improve monitoring of a live platform. - Strong automation and scripting skills — able to build or improve tooling that reduces manual workload across multiple teams. - Proven incident response experience in live environments; comfortable being a first responder alongside non-technical colleagues. - Cross-functional mindset — comfortable building tools and processes that serve Engineering, CX, Revenue Ops and Finance without being siloed. Nice to Have - Experience in a SaaS, cloud infrastructure or GPU/AI compute environment. - Familiarity with GitOps workflows and release pipeline tooling. - Exposure to OpenStack-based infrastructure or GPU cloud environments. - Experience working in a distributed, remote-first team across multiple time zones. Benefits - Competitive salary and annual discretionary bonus scheme. - Employee wellbeing benefits. - 25 days of holiday, plus public holidays. - Fully remote working — no office requirement, no geographic constraint. - Real ownership and autonomy, with the trust to take initiative and experiment. - Broad scope — this role touches every team in the business, giving you exposure well beyond a typical DevOps position. - Greenfield opportunities to improve tooling, automation and observability — not just maintenance. - Clear career progression and growth opportunities in a fast-growing company. - A collaborative, international culture built on trust, transparency and ownership. - The chance to work on a cutting-edge GPU cloud platform used for real AI, ML and HPC workloads — where Kubernetes is central to how the product is built and sold.



