Job Closed

This listing is no longer active.

Systems Engineer – Container Platform

Systems EngineerSystems EngineerOtherRemoteSeniorTeam 501-1,000H1B No SponsorCompany SiteLinkedIn

Location

Virginia

Posted

116 days ago

Salary

0

Seniority

Senior

Job Description

Systems Engineer – Container Platform

ARETUM

• Lead the deployment, hardening, and operational management of containerized applications on AWS ECS or OpenShift platform for a Federal cloud environment • Responsible for evaluating and hardening vendor-supplied containers, implementing container orchestration infrastructure-as-code, and establishing secure, compliant container operations • Deploy, configure, and manage AWS ECS or OpenShift container orchestration platform in production Federal environment • Evaluate vendor-supplied container images for security vulnerabilities, compliance gaps, and operational requirements • Implement container hardening strategies applying CIS benchmarks, DSTI STIGs, and federal security baselines • Configure container orchestration including task/service definitions (ECS) or deployments/operators (OpenShift) • Manage container lifecycle including image versioning, updates, patching, and rollback procedures • Implement horizontal auto-scaling policies based on CPU, memory, custom metrics, and workload patterns • Establish container networking including service discovery, ingress/egress controls, and inter-container communication • Perform container image scanning using tools such as Prisma Cloud, Aqua Security, Twistlock, or AWS ECR scanning • Remediate container vulnerabilities identified through scanning and security assessments • Implement runtime security controls including container isolation, resource limits, and security contexts • Configure secrets management for containerized applications using AWS Secrets Manager or HashiCorp Vault • Apply least-privilege principles to container IAM roles and service accounts • Implement container image signing and verification workflows • Document container security controls and provide evidence for RMF/ATO security assessment • Develop and maintain infrastructure-as-code using Terraform or AWS CloudFormation for container platform • Build automated deployment pipelines for container infrastructure and application updates • Create repeatable, version-controlled infrastructure patterns for scaling to 130+ system instances • Implement GitOps workflows for infrastructure change management and audit trails • Develop automation scripts for container platform management and troubleshooting • Establish configuration baselines and drift detection mechanisms • Design and implement multi-AZ container deployments ensuring high availability during infrastructure failures • Configure health checks, readiness probes, and liveness probes for container self-healing • Implement disaster recovery procedures including backup strategies for persistent container data • Establish resource reservation and quality-of-service policies to prevent resource contention • Design capacity planning and scaling strategies to handle variable workloads serving millions of clients • Implement zero-downtime deployment strategies including blue-green and rolling updates • Create comprehensive operational runbooks for container platform management, troubleshooting, and incident response • Document deployment procedures, configuration baselines, and security hardening steps • Develop standard operating procedures (SOPs) for routine maintenance and emergency procedures • Maintain container platform architecture diagrams and configuration documentation for RMF compliance • Create knowledge transfer materials for scaling operations team

Job Requirements

  • Bachelor's degree in Computer Science, Information Systems, Information Technology, or related technical field
  • Relevant professional certifications and demonstrated experience may supplement education
  • 5-7 years in systems engineering, DevOps, or infrastructure roles
  • 3+ years hands-on experience with container platforms (ECS, OpenShift, or Kubernetes) in production environments
  • 2+ years working with AWS infrastructure and services
  • Experience with container hardening, security scanning, and vulnerability remediation
  • Strong analytical and troubleshooting skills with systematic problem-solving approach
  • Attention to detail and commitment to security-first operations
  • Ability to work independently and manage multiple concurrent infrastructure workstreams
  • Effective written and verbal communication for documentation and cross-team collaboration
  • Adaptable to fast-paced, deadline-driven environment with changing requirements
  • Proactive mindset for identifying and resolving potential issues before they impact operations
  • AWS Certified Solutions Architect - Associate or Professional (Preferred)
  • Certified Kubernetes Administrator (CKA) or Red Hat Certified Specialist in OpenShift (Preferred)
  • Docker Certified Associate (Preferred)
  • Experience with service mesh technologies (Istio, AWS App Mesh) (Preferred)
  • Knowledge of container vulnerability management platforms (Prisma, Aqua, Twistlock) (Preferred)
  • Federal government contracting or DoD infrastructure experience (Preferred)
  • Experience with immutable infrastructure and GitOps methodologies (Preferred)

Benefits

  • Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (401k)
  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off
  • Family Leave (Maternity, Paternity)
  • Short Term & Long-Term Disability
  • Training & Development

Related Categories

Related Job Pages

More Systems Engineer Jobs

OtherRemoteTeam 501-1,000H1B No Sponsor

About Aretum Aretum is a mission-driven organization committed to delivering innovative, technology-enabled solutions to our customers across defense, civilian, and homeland security sectors. Our teams work at the intersection of strategy, technology, and transformation, helping agencies solve their most critical challenges. We believe in investing in our people and creating a culture where collaboration, inclusion, and professional growth are at the forefront.  Job Summary Lead the deployment, hardening, and operational management of containerized applications on AWS ECS or OpenShift platform for a Federal cloud environment. Responsible for evaluating and hardening vendor-supplied containers, implementing container orchestration infrastructure-as-code, and establishing secure, compliant container operations that support millions of client transactions while meeting RMF/ATO requirements. Due to the nature of our work as a federal consulting organization, employees may be expected to handle Controlled Unclassified Information (CUI) and must adhere to applicable safeguarding and compliance requirements.  Responsibilities - Deploy, configure, and manage AWS ECS or OpenShift container orchestration platform in production Federal environment - Evaluate vendor-supplied container images for security vulnerabilities, compliance gaps, and operational requirements - Implement container hardening strategies applying CIS benchmarks, DSTI STIGs, and federal security baselines - Configure container orchestration including task/service definitions (ECS) or deployments/operators (OpenShift) - Manage container lifecycle including image versioning, updates, patching, and rollback procedures - Implement horizontal auto-scaling policies based on CPU, memory, custom metrics, and workload patterns - Establish container networking including service discovery, ingress/egress controls, and inter-container communication - Perform container image scanning using tools such as Prisma Cloud, Aqua Security, Twistlock, or AWS ECR scanning - Remediate container vulnerabilities identified through scanning and security assessments - Implement runtime security controls including container isolation, resource limits, and security contexts - Configure secrets management for containerized applications using AWS Secrets Manager or HashiCorp Vault - Apply least-privilege principles to container IAM roles and service accounts - Implement container image signing and verification workflows - Document container security controls and provide evidence for RMF/ATO security assessment - Develop and maintain infrastructure-as-code using Terraform or AWS CloudFormation for container platform - Build automated deployment pipelines for container infrastructure and application updates - Create repeatable, version-controlled infrastructure patterns for scaling to 130+ system instances - Implement GitOps workflows for infrastructure change management and audit trails - Develop automation scripts for container platform management and troubleshooting - Establish configuration baselines and drift detection mechanisms - Design and implement multi-AZ container deployments ensuring high availability during infrastructure failures - Configure health checks, readiness probes, and liveness probes for container self-healing - Implement disaster recovery procedures including backup strategies for persistent container data - Establish resource reservation and quality-of-service policies to prevent resource contention - Design capacity planning and scaling strategies to handle variable workloads serving millions of clients - Implement zero-downtime deployment strategies including blue-green and rolling updates - Create comprehensive operational runbooks for container platform management, troubleshooting, and incident response - Document deployment procedures, configuration baselines, and security hardening steps - Develop standard operating procedures (SOPs) for routine maintenance and emergency procedures - Maintain container platform architecture diagrams and configuration documentation for RMF compliance - Create knowledge transfer materials for scaling operations team

Virginia
Job Closed
OtherRemoteTeam 501-1,000H1B No Sponsor

About Aretum Aretum is a mission-driven organization committed to delivering innovative, technology-enabled solutions to our customers across defense, civilian, and homeland security sectors. Our teams work at the intersection of strategy, technology, and transformation, helping agencies solve their most critical challenges. We believe in investing in our people and creating a culture where collaboration, inclusion, and professional growth are at the forefront.  Job Summary Lead comprehensive testing, validation, and quality assurance activities for a complex Federal cloud integration solution serving millions of clients. Our delivery is for a polypharmacy solution in a complex, multi-system cloud integration solution for Department of Veterans Affairs healthcare system that services millions of veterans. You are responsible for developing and executing test strategies across infrastructure, integration, security, and performance domains to ensure the solution meets functional requirements, security controls, and compliance standards necessary for RMF/ATO approval within a 6-month timeline. Due to the nature of our work as a federal consulting organization, employees may be expected to handle Controlled Unclassified Information (CUI) and must adhere to applicable safeguarding and compliance requirements.  Responsibilities - Develop comprehensive test strategy covering functional, integration, performance, security, and compliance testing - Create detailed test plans aligned with RMF/ATO requirements and federal security control validation - Define test environments, test data requirements, and testing infrastructure needs - Establish test schedules, milestones, and success criteria for pilot deployment across 10 systems - Identify testing tools, frameworks, and automation opportunities to meet aggressive timeline - Coordinate testing dependencies across development, infrastructure, and security teams - Test container platform deployments (ECS/OpenShift) including orchestration, scaling, and failover capabilities - Validate infrastructure-as-code deployments for consistency, repeatability, and compliance with baselines - Test application load balancer configurations including health checks, routing rules, and SSL/TLS termination - Verify network segmentation, security group rules, and multi-AZ high availability configurations - Validate backup and disaster recovery procedures including RTO/RPO compliance - Test auto-scaling behaviors under various load conditions and failure scenarios - Design and execute integration test cases for data flows across 10+ disparate source systems - Test API interfaces, data orchestration workflows, and error handling/retry logic - Validate data transformation accuracy, completeness, and consistency across integration points - Test vendor container integration with custom infrastructure and security controls - Verify Databricks pipeline processing and PowerBI report accuracy against expected outcomes - Execute end-to-end business process testing for risk analysis and service coordination workflows - Validate implementation of NIST 800-53 security controls required for ATO - Coordinate and support security assessment testing activities with ISSO/ISSM - Test authentication, authorization, and access control mechanisms across all system components - Validate encryption implementation (at-rest and in-transit) for sensitive client data and PII - Execute vulnerability scanning and verify remediation of identified security findings - Test audit logging completeness, accuracy, and compliance with federal requirements - Validate container hardening against CIS benchmarks and DSTI STIGs - Support penetration testing activities and coordinate remediation verification - Design and execute performance tests simulating millions of client records and concurrent users - Test system performance under peak load, sustained load, and stress conditions - Validate horizontal scaling effectiveness and resource utilization efficiency - Identify performance bottlenecks in integration workflows, database queries, and API endpoints - Test monitoring and alerting thresholds under various load scenarios - Measure and validate response times, throughput, and latency against defined SLAs - Develop automated test suites for regression testing of infrastructure and application components - Integrate automated testing into CI/CD pipelines for continuous validation - Create reusable test scripts and frameworks for scaling to 130+ system instances - Automate security compliance checks and configuration validation - Build synthetic monitoring and health check automation for production readiness - Document test cases, test results, and defect findings with clear reproduction steps - Track defects through resolution and verify fixes through regression testing - Maintain traceability matrix linking requirements to test cases and security controls - Create test summary reports for stakeholder reviews and compliance evidence - Document test environments, configurations, and procedures for audit and knowledge transfer - Provide testing evidence and artifacts for RMF/ATO security assessment package

Virginia
Job Closed
OtherRemoteTeam 501-1,000H1B No Sponsor

About Aretum Aretum is a mission-driven organization committed to delivering innovative, technology-enabled solutions to our customers across defense, civilian, and homeland security sectors. Our teams work at the intersection of strategy, technology, and transformation, helping agencies solve their most critical challenges. We believe in investing in our people and creating a culture where collaboration, inclusion, and professional growth are at the forefront. Job Summary Lead AWS networking architecture, application load balancing, and enterprise monitoring/observability implementation for a Federal cloud integration solution. This is for a polypharmacy solution in a complex, multi-system cloud integration solution for Department of Veterans Affairs healthcare system that services millions of veterans. Responsible for designing secure network segmentation, configuring high-availability load balancing, and establishing comprehensive monitoring across Splunk, Dynatrace, and DataDog platforms to ensure visibility, compliance, and operational excellence for RMF/ATO approval. Due to the nature of our work as a federal consulting organization, employees may be expected to handle Controlled Unclassified Information (CUI) and must adhere to applicable safeguarding and compliance requirements.  Responsibilities - Design and implement AWS networking architecture including VPC design, subnets, route tables, security groups, and NACLs - Configure Application Load Balancer (ALB) with target groups, health checks, SSL/TLS termination, path-based routing, and WAF integration - Implement network security controls for federal compliance including network segmentation, encryption in transit, and zero-trust principles - Design multi-AZ high availability architecture ensuring resilience during infrastructure failures - Understand and coordinate Transit Gateway, PrivateLink, and VPC peering for secure multi-system connectivity - Implement container networking including service discovery, ingress controllers, and network policies - Manage VPC Flow Logs and network traffic analysis for security monitoring and troubleshooting - Create network diagrams, boundary protection documentation, and data flow diagrams for RMF compliance - Implement and configure enterprise monitoring platforms (Splunk, Dynatrace, and/or DataDog) for comprehensive system visibility - Design monitoring architecture covering containers, load balancers, APIs, databases, and data pipelines - Configure audit logging and SIEM integration for federal compliance requirements including who-did-what-when traceability - Establish alert design, escalation policies, and incident response integration for operational excellence - Create dashboards for technical teams, operations, and compliance stakeholders - Integrate AWS CloudWatch, CloudTrail, and VPC Flow Logs with enterprise monitoring platforms - Implement performance monitoring, capacity planning, and baseline establishment for anomaly detection - Configure distributed tracing and application performance monitoring (APM) for multi-tier applications - Design network architecture supporting zero-downtime deployments and automatic failover - Configure load balancer health checks, connection draining, and traffic distribution algorithms - Implement DNS failover strategies and multi-region considerations for disaster recovery - Test and validate network failover scenarios and recovery procedures - Monitor network performance metrics and optimize for latency, throughput, and reliability - Implement network security controls aligned with NIST 800-53 requirements - Configure encryption in transit (TLS 1.2+) across all network communication paths - Apply least-privilege network access policies using security groups and NACLs - Implement network intrusion detection and prevention monitoring - Document network security controls and monitoring capabilities for RMF/ATO security assessment - Configure compliance logging with appropriate retention policies for audit requirements - Monitor and alert on security events, anomalous network traffic, and compliance violations - Create comprehensive network architecture diagrams, IP addressing schemes, and routing documentation - Develop operational runbooks for network troubleshooting, load balancer management, and monitoring response procedures - Document monitoring alert thresholds, escalation procedures, and incident response playbooks - Maintain network and monitoring configuration baselines for compliance and change management - Collaborate with container platform team on networking requirements and service mesh integration - Work with developers on application health check design and monitoring instrumentation - Partner with testing team on performance monitoring and load testing metric collection - Support security teams with network traffic analysis and security event investigation

Virginia
Job Closed
OtherRemoteTeam 10,001+Since 2015H1B Sponsor

• Develop technical proficiency across the combined HPE Networking portfolio • Partner with sales teams to understand customer needs and design solutions • Create architectures and proposals aligned with customer objectives and lead proof-of-concept activities • Participate in advanced training programs and certifications to deepen expertise in HPE and Juniper technologies • Collaborate with internal teams and partners to ensure successful solution positioning and adoption

California
$166K - $343K / year
Job Closed