NetApp / ONTAP Storage Engineering — FSx for ONTAP provisioning, volume and SVM management, snapshot policies, tiering policies, ONTAP CLI/REST API operations, and performance tuning AWS Storage Architecture — FSx for ONTAP sizing and deployment, throughput capacity planning, integration with VPCs, and cost optimization (capacity pool vs. SSD tier) Data Migration & Replication — SnapMirror configuration for cross-region replication, NetApp XCP or robocopy for bulk data migration, cutover planning, and data validation Cloud Network Architecture — VPC subnet design, security groups for NFS/SMB/iSCSI protocols, cross-region VPC peering for replication traffic, and DNS configuration for file system endpoints Linux / Windows Systems Engineering — NFS mount configuration on Linux, SMB share mapping on Windows, multi-protocol access testing, and client-side performance tuning Backup, DR & Data Protection — AWS Backup integration with FSx for ONTAP, snapshot scheduling, cross-region DR strategy, and RTO/RPO validation Security & Compliance — Encryption at rest (KMS), encryption in transit, IAM policies for FSx access, ONTAP export policies, and data governance controls
Sr TechOps Lead Engineer (AWS Cloud)- REMOTE
Location
United States
Posted
81 days ago
Salary
0
Seniority
Lead
No structured requirement data.
Job Description
Sr TechOps Lead Engineer (AWS Cloud)- REMOTE
Simple Solutions
Sr TechOps Lead Engineer (AWS Cloud) Department: Technology / Engineering Role Overview We are seeking a highly experienced TechOps SME/Lead Engineer with deep expertise in Cloud to lead our cloud infrastructure, DevOps practices, reliability engineering, and operational excellence initiatives. This role is both strategic and hands-on — responsible for designing scalable architectures, improving automation, ensuring system reliability, and leading the TechOps team. Key Responsibilities - Architect and manage secure, scalable, and highly available infrastructure on AWS. - Design multi-account AWS environments using AWS Organizations. - Implement VPC architecture, IAM policies, networking, and security best practices. - Oversee EC2, ECS/EKS, Lambda, RDS, S3, CloudFront, and related AWS services. - Optimize AWS cost management and resource utilization. Reliability & Production Operations - Implement Site Reliability Engineering (SRE) best practices. - Define SLIs, SLOs, and error budgets. - Manage monitoring and alerting (CloudWatch, Datadog, Prometheus, Grafana). - Lead incident response, root cause analysis (RCA), and postmortems. - Ensure 24/7 uptime and operational resilience. Security & Compliance - Implement IAM best practices and least-privilege access controls. - Manage secrets and key management (AWS KMS, Secrets Manager). - Conduct vulnerability management and patching. - Support compliance initiatives (SOC 2, ISO 27001, GDPR as applicable). - Lead disaster recovery planning and backup strategies. Leadership & Strategy - Lead and mentor a team of DevOps/TechOps engineers. - Establish operational KPIs and performance benchmarks. - Manage on-call rotations and escalation processes. - Collaborate with Engineering, Product, Security, and Data teams. - Contribute to long-term infrastructure strategy and cloud roadmap. <>Required Qualifications - Bachelor’s degree in Computer Science, Engineering, or equivalent experience. - 10+ years in DevOps, Cloud Engineering, or Infrastructure roles. - 5+ years leading technical teams. - Strong hands-on experience with AWS services (EC2, EKS, RDS, S3, IAM, VPC, Lambda). - Deep knowledge of networking, Linux systems, and distributed systems. - Experience with Infrastructure-as-Code (Terraform or CloudFormation). - Strong scripting skills (Python, Bash, or similar). - Experience with containerization (Docker) and Kubernetes (EKS preferred). Key Competencies - Strong architectural thinking - Hands-on technical leadership - Crisis and incident management - Strategic planning and execution - Excellent cross-functional communication Success Metrics - 99.9%+ production uptime - Reduced deployment lead time - Reduced incident frequency and MTTR - Improved cost efficiency - High-performing and scalable TechOps function
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Engineer
PanoptoStop Typing. Start Recording. Panopto is trusted by millions as the easiest way to record and share videos.
• Modernize Pipelines: Evaluate existing legacy build and deployment workflows to identify inefficiencies and replace manual gates with high-speed automation. • Engineer Automation: Design and implement end-to-end CI/CD pipelines that incorporate automated testing and security scanning. • Implement Infrastructure as Code (IaC): Transition manual setups into reproducible code using tools like Terraform or CloudFormation. • Standardize "Golden Paths": Establish consistent build artifacts and deployment patterns across environments to help developers move faster. • Drive Observability: Implement robust logging and alerting to ensure developers get immediate, actionable feedback on their builds. • Collaborate with Purpose: Partner with engineers to solve problems as they arise, balancing technical rigor with the agility needed in a high-growth environment.
Build Engineer
LeidosLeidos is an innovation company rapidly addressing the world’s most vexing challenges in national security and health.
Leidos has an opening for an experienced Build Engineer for Decision Advantage Division. This is an exciting opportunity to bring your experience to support across all-domain large-scale weapon systems, Information Technology Systems, and Command and Control Systems to realize the Department of Defense Joint All-Domain Command and Control (JADC2). In this role you will support the Advanced Battle Management System (ABMS) Enterprise Systems Engineering Team (ESET) and Deployable Digital Infrastructure (DDI) Team to design solutions that can be delivered at speed, scale, and with the necessary security to deliver operational advantages to the joint warfighter. ABMS is a top modernization priority for the Department of the Air Force and will be the backbone of a network-centric approach to battle management in partnership with all the services across JADC2. This position will work closely with Program Managers, other domain engineers, and Government counterparts across Government and Industry partners. Primary Responsibilities - Design and establish configuration management documentation. - Identify and establish baselines and configuration items. - Perform software builds as well as authorize the release of software builds and changes specified by Program or Engineering leadership. - Create and maintain build environments and develop and execute build scripts and test automation (DevOps). - Develop scripts and automate processes to improve efficiencies and accuracy of the software build and release processes. - Provide software configuration management support to software teams. - Perform administration and user support, development, and maintenance for the configuration management toolset. - Perform as configuration management support on various program assignments to include Software CM and/or Hardware CM and Data Management responsibilities. - Research, design and implement new software configuration management tools from the ground-up to automate and release software application. - Participate in Working Groups and IPTs, informal and formal technical interchanges, and formal reviews. - Additional responsibilities as needed by the program. Basic Qualifications - Bachelor’s degree in a related field and12-15 years of related experience or a Master’s degree in a related field with 10 years of related experience. - US citizenship and an active Secret security clearance, with ability to obtain a Top Secret clearance. - Experience managing Git-based software repositories (versioning, branching, merging) and binary artifact repositories. - Development experience with scripting languages (e.g., Bash, Python, PowerShell, Perl, Groovy, Make). - Experience in fast-paced continuous change integration environments. - Continuous integration/continuous deployment (CI/CD) experience in software release automation in Agile development process. - Proven ability to define and implement Software Configuration Management processes in an enterprise environment. - Ability to develop effective cross-functional working relationships. Preferred Qualifications - Experience with software CM in a Product Line Engineering (PLE) process If you're looking for comfort, keep scrolling. At Leidos, we outthink, outbuild, and outpace the status quo — because the mission demands it. We're not hiring followers. We're recruiting the ones who disrupt, provoke, and refuse to fail. Step 10 is ancient history. We're already at step 30 — and moving faster than anyone else dares. Original Posting: March 13, 2026 For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above. Pay Range: Pay Range $116,350.00 - $210,325.00 The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
Director of Engineering – Product Infrastructure, Release Engineering
Mechanical OrchardMechanical Orchard combines software development and managed cloud operations in one offering.
• Report to the VP of Engineering, providing strategic direction, product thinking, and cross-functional alignment across Platform, Forward-deployed Engineering, Sales, Security, and Product Management • Manage the Product Infrastructure and Release Engineering teams • Ensure that the Imogen Platform is designed as a scalable, repeatable, and evolving product surface that accelerates customer adoption and long-term platform success • Ensure all of Imogen's components are released and deployable into a diverse set of customer environments, meeting the needs of enterprise customers • Turn infrastructure into a first-class product capability, opinionated where possible, flexible where necessary, and aligned with Mechanical Orchard’s modernization strategy and GTM motion
Senior DevOps Engineer
Sword HealthSword Health is the world’s fastest growing virtual MSK care provider, on a mission to free two billion people from pain
• Design, implement, and maintain scalable, resilient infrastructure to support Sword Health’s high-demand applications and services. • Automate and streamline deployment processes, CI/CD pipelines, and routine maintenance tasks to enhance efficiency and reduce downtime. • Monitor and optimize system performance, proactively identifying and resolving issues to ensure high availability and reliability. • Collaborate closely with development, data, and security teams to ensure seamless integration of infrastructure and code changes. • Drive security best practices by implementing and managing access control, network security, and compliance-related policies across the infrastructure. • Lead incident response and troubleshooting for infrastructure-related issues, ensuring rapid and effective resolution to maintain service continuity. • Mentor and guide junior team members, sharing DevOps best practices and fostering a culture of continuous learning and improvement within the team. • Stay up-to-date with industry trends and emerging technologies, bringing innovative solutions to Sword Health’s DevOps processes and toolchains.



