Job Closed

This listing is no longer active.

Leidos logo
Leidos

Leidos is an innovation company rapidly addressing the world’s most vexing challenges in national security and health.

AWS Cloud Infrastructure Engineer

Infrastructure EngineerInfrastructure EngineerFull TimeRemoteMid LevelTeam 10,001+Since 1969H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

44 days ago

Salary

0

Seniority

Mid Level

Job Description

AWS Cloud Infrastructure Engineer

Leidos

Leidos was awarded the U.S. Air Force Cloud One Architecture and Common Shared Services contract, and currently has an opening for Cloud Engineers across AWS, Azure, Google, and Oracle clouds. This is an exciting opportunity to use your experience to modernize a leading, global-scale multi-cloud environment in support of a critical mission, supporting USAF system resiliency, security, and cost effectiveness. Location: This position will be remote. Preferred candidates will be located near Hanscom AFB (Boston, MA) or work in Huntsville, AL. Primary Responsibilities: We are seeking an AWS Cloud Operation and support Engineer with expertise in multiple cloud platforms. A successful individual will be responsible for developing in a scalable cloud-native solutions, and ensuring best practices across architecture, development, deployment, and security from design, test, integration, production, sustainment and maintenance. This is a hands-on technical role that requires rolling up your sleeves to architect, code, debug, and mentor.  - Perform cloud operations and engineering tasks to enhance, sustain, and maintain scalable, resilient, and secure cloud solutions for AWS cloud environment - Perform AWS cloud operations, sustainment, and maintenance activities to maintain optimum cloud - Adopt and utilize DevSecOps practices, infrastructure as code, and automation frameworks  - Through development and sustainment activities, optimize application performance and reliability in cloud environments  - Design, implement and sustain secure cloud architectures and networks implementing zero-trust principles and defense-in-depth strategies  - Maintain compliance with industry standards (SOC 2, HIPAA, PCI-DSS, etc.) and regulatory requirements  - Architect, implement and maintain cloud networking security controls including STIG requirements - Implement identity and access management solutions and security monitoring frameworks  - Support development of migration methodologies and ensure minimal organizational disruption during transitions  - Utilize CI/CD workflows and infrastructure-as-code development using Jenkins, Terraform, Ansible, Kubernetes, Jira, Confluence, Artifactory, and Guacamole to support DevSecOps practices. - Containerize applications to enhance scalability and deployment efficiency. - Support the design and development of Shared Services. - Configure and troubleshoot cloud, virtual, and physical hardware and software systems. - Establish and maintain SQL and NoSQL databases, ensuring their performance and reliability. - Support preparation of detailed technical documentation of development and operational processes. - Work in cross-functional teams including development, operations, security, and product management  Minimum Qualifications - Bachelors and 4+ years or more of experience; Masters and 2+ years or more of experience. Additional experience may be accepted in lieu of degree. - Secret clearance required - US citizenship required - Certifications: CompTIA Security+ or equivalent (IAT-2) - Practiced verbal and written communications skills - Ability to participate in team efforts to accomplish assigned tasks  - Demonstrated experience in cloud operations and sustainment and performing tasks and actions described in the primary responsibilities section Preferred Qualifications - Experience with USAF Cloud One or Platform 1 - Knowledge of Zero Trust Architecture. Experience a plus. - Capable of working in high powered teams and maintaining positive interpersonal relationships while delivering products and services to the customer - Understanding Active Directory, AWS AD, SAML and the standards, procedures, and processes  - Experience with Ansible, AWS console, Elastic, AWS, Jira, Confluence, Git, Bitbucket and various cloud Software as a Service (SaaS) offerings to conduct DEV/SEC/OPS pipeline development activities - Administration experience with cloud-based applications (MS O365, SharePoint, AWS AD, AWS)  - ​Experience administering Windows Server, and related services  - Cloud certifications in AWS, Azure, Google, or Oracle clouds - Certification Examples - ​AWS Certified Solutions Architect (Professional), Azure Solutions Architect (Expert), ​MCSE (Server), Certified AWS SysAdmin, AWS Certified Cloud Practitioner, AWS Certified Developer, AWS Certified Solutions Architect (Dev/Associate), AWS Certified DevOps Engineer, AWS Certified Advanced Networking, AWS Certified Security, Azure Developer Associate, Azure Solutions Architec If you're looking for comfort, keep scrolling. At Leidos, we outthink, outbuild, and outpace the status quo — because the mission demands it. We're not hiring followers. We're recruiting the ones who disrupt, provoke, and refuse to fail. Step 10 is ancient history. We're already at step 30 — and moving faster than anyone else dares. Original Posting: April 13, 2026 For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above. Pay Range: Pay Range - The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.

Related Categories

Related Job Pages

More Infrastructure Engineer Jobs

Senior or Staff AI Infrastructure Engineer

TRM Labs

TRM Labs specializes in blockchain investigations and risk management, empowering organizations to detect, investigate, and prevent crypto-related fraud and financial crime. Founde

Build a Safer World. TRM Labs provides blockchain analytics and AI solutions to help law enforcement and national security agencies, financial institutions, and cryptocurrency businesses detect, investigate, and disrupt crypto-related fraud and financial crime. TRM’s blockchain intelligence and AI platforms include solutions to trace the source and destination of funds, identify illicit activity, build cases, and construct an operating picture of threats. TRM is trusted by leading agencies and businesses worldwide who rely on TRM to enable a safer, more secure world for all. The AI Engineering Team is chartered with enabling next-generation AI applications, with a special focus on Large Language Models (LLMs) and agentic systems. Our mission is to build robust pipelines, high-performance infrastructure, and operational tooling that allow AI systems to be deployed with speed, safety, and scale. We manage petabyte-scale pipelines, serve models with millisecond-level latency, and provide the observability and governance needed to make AI production-ready. We’re also deeply involved in evaluating and integrating cutting-edge tools in the LLM and agent space — including open-source stacks, vector databases, evaluation frameworks, and orchestration tools that unlock TRM’s ability to innovate faster than the market. As a Senior or Staff AI Infrastructure Engineer, you’ll be at the core of building and scaling the technical infrastructure for AI/ML systems. You will: - Build reusable CI/CD workflows for model training, evaluation, and deployment — integrating Langfuse, GitHub Actions, and experiment tracking, etc. - Automate model versioning, approval workflows, and compliance checks across environments. - Build out a modular and scalable AI infrastructure stack — including vector databases, feature stores, model registries, and observability tooling. - Partner with engineering and data science to embed AI models and agents into real-time applications and workflows. - Continuously evaluate and integrate state-of-the-art AI tools (e.g. LangChain, LlamaIndex, vLLM, MLflow, BentoML, etc.). - Drive AI reliability and governance, enabling experimentation while ensuring compliance, security, and uptime. - Build and enhance AI/ML Model Performance - Ensure data accuracy, consistency and reliability, leading to better model training and inferencing - Deploy infrastructure to support offline and online evaluation of LLMs and agents — including regression testing, cost monitoring, and human-in-the-loop workflows. - Enable researchers to iterate quickly by providing sandboxes, dashboards, and reproducible environments. What We’re Looking For - Write high-quality, maintainable software — primarily in Python, but we value engineering ability over language familiarity. - Have a strong background in scalable infrastructure, including: - Containerization and orchestration (e.g. Docker, Kubernetes) - Infrastructure-as-code and deployment (e.g. Terraform, CI/CD pipelines) - Monitoring and logging frameworks (e.g. Datadog, Prometheus, OpenTelemetry) - Understand and implement ML Ops best practices, including: - Model versioning and rollback strategies - Automated evaluation and drift detection - Scalable model and agent serving infrastructure (e.g. vLLM, Triton, BentoML) - Deploy and maintain LLM and agentic workflows in production, including: - Monitoring cost, latency, and performance - Capturing traces for analysis and debugging - Optimizing prompt/response flows with real-time data access - Demonstrate strong ownership and pragmatism, balancing infrastructure elegance with iterative delivery and measurable impact. Learn about TRM Speed in this position: - Rapid Issue Resolution. TRM Engineers identify and resolve critical onsite issues in minutes to hours, not weeks. We create virtual war rooms, implement fixes, and share lessons with both customer stakeholders and internal teams within 48 hours. - Navigating Bureaucracy. We anticipate and address procedural hurdles, build trust with key stakeholders, and find alternative pathways to approvals. This keeps projects moving even in complex environments. - Efficient Knowledge Transfer. Engineers document and share updates in real time, ensuring the entire team—onsite and remote—has full visibility into plans, blockers, and resolutions. Knowledge sharing sessions and clear documentation reduce friction and accelerate delivery. About TRM's Engineering Levels: Engineer: Responsible for helping to define project milestones and executing small decision decisions independently with the appropriate tradeoffs between simplicity, readability, and performance. Provides mentorship to junior engineers, and enhances operational excellence through tech debt reduction and knowledge sharing. Senior Engineer: Successfully designs and documents system improvements and features for an OKR/project from the ground up. Consistently delivers efficient and reusable systems, optimizes team throughput with appropriate tradeoffs, mentors team members, and enhances cross-team collaboration through documentation and knowledge sharing. Staff Engineer: Drives scoping and execution of one or more OKRs/projects that impact multiple teams. Partners with stakeholders to set the team vision and technical roadmaps for one or more products. Is a role model and mentor to the entire engineering organization. Ensures system health and quality with operational reviews, testing strategies, and monitoring rigor. The following represents the expected range of compensation for this role: - Individual pay is determined by skills, qualifications, experience, and location. The compensation details listed in this posting reflect the US base salary only. - The estimated base salary range for this role is $200,000 - $275,000. - Additionally, this role may be eligible to participate in TRM’s equity plan. - Please note – we factor in the different costs for geographies outside the United States. Life at TRM We are building a safer world. That promise shows up in how we work every day. TRM moves quickly. We are a high velocity, high ownership team that expects clarity, follow-through, and impact. People who thrive here are energized by hard problems, experimentation, and continuous feedback. If something takes months elsewhere, it will ship here in days. Our work sits at the intersection of AI, national security, and fighting financial crime. The problems are complex, the stakes are real, and the environment evolves quickly. The pace and intensity of the work reflect the importance of the mission. As a result, the way we operate requires a high level of ownership, adaptability, collaboration, and creative problem-solving. At TRM, you should expect: - Priorities and targets to change quickly as we experiment and iterate - Work that often requires operating with a high degree of ambiguity - A high level of personal ownership and accountability - Close collaboration across teams and functions - Frequent, high-touch communication - Creative problem solving and out-of-the-box thinking - A pace that rewards urgency, adaptability, and outcomes This environment is energizing for people who enjoy building, solving hard problems, and making progress in situations that are not always fully defined. It also requires comfort navigating ambiguity, adjusting course as new information emerges, and maintaining focus and positivity in a fast-moving and intense environment. We also recognize that this style of operating is not for everyone. If you are primarily optimizing for predictability or a consistently balanced workload, we encourage you to use the interview process to pressure test whether this environment is truly the right fit. We want teammates who thrive here, not just survive here. At the same time, many people find this work deeply rewarding. If you are excited by meaningful problems, motivated by ambitious goals, and energized by working alongside mission-driven colleagues, there is a good chance you will find TRM to be an exceptional place to grow and contribute. Learn more: Interviewing at TRM: How We Hire and What Success Looks Like AI Fluency at TRM AI fluency is a baseline expectation at TRM. We believe AI meaningfully changes how top performers operate. We expect every team member to use AI to accelerate and reimagine their craft, not just automate surface tasks. At TRM, AI fluency means you are among the top 10 percent of operators in your function in how you apply AI to: - Accelerate repeatable workflows - Structure and solve problems - Improve output quality - Increase speed and leverage You will be evaluated on applied AI fluency during the interview process. Leadership Principles We hire and grow against three leadership principles. They’re the standards for how we operate, treat each other, and make decisions. - Impact-Oriented Trailblazer: We put customers first and move with speed, focus, and adaptability. We treat every plan like an experiment – test, ship, measure, and iterate quickly. - Master Craftsperson: We care deeply about our craft. We balance speed with high standards, own outcomes end‑to‑end, and invest in getting better everyday. - Inspiring Colleague: We add clarity and energy, not noise. We bring humility, candor, and a one‑team mindset — giving and receiving feedback to make the team stronger. Join our Mission At TRM we care deeply about our craft. We are looking for individuals who want their work to matter, who experiment with speed and rigor, and who take pride in building a safer world for billions of people. If you’re excited by TRM’s mission but don’t check every box, we encourage you to apply — we hire for slope, judgment, and the will to learn fast. TRM is a Series C company with $220M in total funding, backed by Blockchain Capital, Goldman Sachs, Bessemer, Y Combinator, Thoma Bravo, and others. Headquartered in San Francisco, TRM operates as a distributed-first company with hubs in Los Angeles, San Francisco, New York, Washington D.C., London, and Singapore. Privacy Policy and Additional Information By submitting your application, you are agreeing to allow TRM to process your personal information in accordance with the TRM Privacy Policy. Our typical hiring cycles for specialized roles span 24 to 36 months. Accordingly, we retain your personal information for up to 36 months to evaluate your application and to consider you for current and future employment opportunities, unless you request earlier deletion or a different retention period is required or permitted by law. To notify TRM Labs that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance. The use of AI tools of any kind (including but not limited to notetakers, interview assistants, and real-time coaching tools such as Otter.ai, Fireflies, Fathom, Cluey, or similar) during TRM interviews is not permitted without prior approval from TRM. TRM uses its own internal tools for note-taking to ensure a consistent and confidential experience for all candidates. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this form. Recruitment agencies TRM Labs does not accept unsolicited agency resumes. Please do not forward resumes to TRM employees. TRM Labs is not responsible for any fees related to unsolicited resumes and will not pay fees to any third-party agency or company without a signed agreement. Learn More: Company Values | Interviewing | FAQs

United States
$200K - $275K / year
Full TimeRemoteTeam 11-50Since 2025

• Lead, design, build, and operate large-scale compute clusters to power AI scientific research. • Write software that orchestrates large GPU and CPU clusters, manages resource allocation and automates cluster lifecycle operations. • Work on bringup, operations and maintenance of all aspects of these clusters. • Build tools and get directly involved in large scale frontier research experiments.

California
Job Closed
NTT DATA logo

Desktop Infrastructure Engineer

NTT DATA

NTT DATA is a $30 billion business and technology services leader, serving 75% of the Fortune Global 100. We are committed to accelerating client success and positively impacting society through responsible innovation. We are one of the world's leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-scale AI, cloud, security, connectivity, data centers and application services. Our consulting and industry solutions help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have experts in more than 50 countries. We also offer clients access to a robust ecosystem of innovation centers as well as established and start-up partners. NTT DATA is a part of NTT Group, which invests over $3 billion each year in R&D.

Full TimeRemoteTeam 10,001+H1B Sponsor

Role Description We are seeking a highly skilled Remote Desktop Infrastructure Engineer to design, optimize, and support a next-generation remote trading environment. The ideal candidate will be responsible for upgrading a Citrix-based infrastructure, ensuring low-latency, high-performance remote access for traders working with multi-4K display setups. This role requires expertise in Citrix HDX, Mechdyne TGX, HP Anyware (Teradici PCoIP), RDP, and VDI solutions, with a focus on real-time financial market applications and secure remote desktop performance. What you'll be doing: - Design & Implementation - Architect and deploy high-performance remote desktop solutions for a multi-4K trading environment. - Evaluate Citrix, RDP, Mechdyne TGX, HP Anyware (Teradici PCoIP), or alternative solutions. - Optimize GPU acceleration (NVIDIA GRID, AMD vGPUs) to enhance remote rendering speeds. - Implement network optimizations for low-latency, high-bandwidth trading applications. - Develop and maintain a scalable and secure remote access infrastructure. - Performance & Optimization - Ensure real-time market data visualization with minimal compression loss and ultra-low latency. - Configure multi-4K monitor support across remote desktops with optimal frame rates. - Perform load testing, failover planning, and redundancy configuration. - Troubleshoot and enhance WAN performance, bandwidth efficiency, and display streaming quality. - Security & Compliance - Implement secure authentication (MFA, Active Directory, Zero Trust) for trader remote access. - Work with financial industry security standards to ensure compliance with data privacy and regulatory requirements. - Manage out-of-band remote control solutions for troubleshooting and recovery. - Support & Maintenance - Provide technical support to traders and resolve issues related to remote desktop performance. - Proactively monitor and address latency, connectivity, and rendering issues. - Develop documentation and best practices for IT teams managing the infrastructure. Qualifications - 5+ years of experience in remote desktop infrastructure for financial services or trading environments. - Expertise in Citrix HDX, Mechdyne TGX, HP Anyware (Teradici PCoIP), VMware Horizon, RDP, VDI solutions. - Strong understanding of GPU acceleration technologies (NVIDIA GRID, AMD vGPUs) for remote workstations. - Hands-on experience with multi-4K monitor setups in a remote trading environment. - Advanced networking knowledge – TCP/IP, VPN, WAN optimization, bandwidth management. - Experience with Windows Server, Active Directory, MFA security protocols, PowerShell automation. - Ability to design and support Blade Server or Virtualized Workstation (On-Prem or Cloud) solutions. Benefits - We offer a range of tailored benefits that support your physical, emotional, and financial wellbeing. - Continuous growth and development opportunities through our Learning and Development team. - Flexible work options. Company Description At NTT DATA, you have endless opportunities to think big, act bold and take ownership. As a $30+ billion business and technology services, AI and digital infrastructure leader, we co-innovate solutions with clients and partners globally for business and societal impact. Serving 75% of the Fortune Global 100, with experts in over 70 countries, we encourage experimentation and recognize great work. Proudly a Global Top Employer, NTT DATA is part of NTT Group, which invests over $3 billion annually in R&D.

United Kingdom
Job Closed
Baker Hughes logo

Senior Specialist - Infrastructure Architecture

Baker Hughes

We take energy forward – making it safer, cleaner, and more efficient for people and the planet.

Full TimeRemoteTeam 10,001+Since 1907H1B Sponsor

Senior Specialist Infrastructure Architect Would being part of a digital transformation excite you? Are you passionate about infrastructure security? Join our digital transformation team We operate at the heart of the digital transformation of our business. Our team is responsible for the cybersecurity, architecture and data protection for our global organization. We advise on the design and validation of all systems, infrastructure, technologies and data protection. Partner the best As a Senior Specialist Infrastructure Architect, you will be responsible for: - Participate in the domain technical and business discussions relative to future architect direction. - Assist in the analysis, design and development of a roadmap and implementation based upon a current vs. future state in a cohesive architecture viewpoint. - Gather and analyze data and develop architectural requirements at project level. - Participate in the infrastructure architecture governance model. - Support design and deployment of infrastructure solutions meeting standardization, consolidation, TCO, security, regulatory compliance and application system qualities, for different businesses. - Research and evaluate emerging technology, industry and market trends to assist in project development and/or operational support activities. - Coach and mentor team members Fuel your passion To be successful in this role you will: - Bachelor's Degree. A minimum 8 years of professional experience. - Have an experience in Azure infra services and automating deployments. - Have an experience working in DevOps and Data bricks. - Have some exposure to MLOPS. - Have hands on experience working with database technologies, including ETL tools including Databricks Workflows using Pyspark / Python, and an ability to learn new technologies. - Have strong proficiency in writing and optimizing SQL queries and working with databases. - Skilled level expertise in design of computing or network or storage to meet business application system qualities. - Understands technical and business discussions relative to future architecture direction aligning with business goals. - Understands concepts of setting and driving architecture direction. - Familiar with elements of gathering architecture requirements. - Understands architecture standards concepts to apply to project work. Work in a way that works for you We recognize that everyone is different and that the way in which people want to work and deliver at their best is different for everyone too. In this role, we can offer the following flexible working patterns: · Working remotely from home or any other work location · Flexibility in your work schedule to help fit in around life! · Talk to us about your desired flexible working options when you apply Working with us Our people are at the heart of what we do at Baker Hughes. We know we are better when all of our people are developed, engaged and able to bring their whole authentic selves to work. We invest in the health and well-being of our workforce, train and reward talent and develop leaders at all levels to bring out the best in each other. The Baker Hughes internal title for this role is: Digital Technology Senior Specialist - Infrastructure Architecture

India