Mission focused engineering for space and national security.
DevOps Engineer
Location
United States
Posted
3 days ago
Salary
$140K - $160K / year
Seniority
Senior
Job Description
DevOps Engineer
Sphinx Defense
• Maintain and support existing AWS-based Kubernetes infrastructure and services. • Ensure platform reliability, availability, and security across cloud environments. • Develop and maintain GitLab CI/CD pipelines to support software delivery. • Build and maintain Terraform modules and infrastructure-as-code solutions. • Create and manage Helm charts for Kubernetes application deployments. • Troubleshoot infrastructure, platform, and application issues across AWS and Kubernetes environments. • Manage AWS IAM permissions and cross-account access configurations. • Partner with software and platform engineering teams to support operational needs and platform improvements. • Continuously improve system performance, automation, observability, and operational efficiency.
Job Requirements
- Experience maintaining and supporting production AWS-based Kubernetes (EKS) environments.
- Experience writing, maintaining, and optimizing GitLab CI/CD pipelines.
- Experience developing and maintaining Infrastructure as Code using Terraform.
- Experience creating, deploying, and maintaining Helm charts for Kubernetes applications.
- Strong understanding of AWS IAM, role-based access controls, and cross-account access management.
- Experience troubleshooting applications and services running in AWS and Kubernetes environments.
- Proficiency with Python or Go for automation, tooling, and operational support.
- Experience maintaining highly available cloud infrastructure while meeting performance, reliability, and security requirements.
- Ability to support cross-functional engineering teams and respond to operational issues as needed.
- Strong communication skills and a collaborative approach to problem solving.
Benefits
- Quarterly bonus structure
- Equity ownership in the company
- 100% Premium coverage for medical, dental, and vision for you and dependants
- Health Savings Accounts
- Life Insurance
- 401k employer contribution plan
- Flexible paid time off policy
- Flexible work/remote work policy
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Design, implement, and support CI/CD pipelines, build automation, and deployment workflows across development, test, and production environments. • Engineer secure and scalable cloud platform capabilities using infrastructure-as-code, container platforms, and automation tooling. • Support platform reliability, observability, access controls, secrets management, and operational readiness. • Partner with software engineers to improve developer experience, deployment efficiency, and release repeatability. • Implement and maintain DevSecOps controls across build, deploy, scanning, monitoring, and incident response workflows. • Manage platform components such as containers, orchestration platforms, artifact repositories, and shared runtime services. • Troubleshoot environment, pipeline, and deployment issues and support root cause analysis and service restoration. • Contribute to platform standards, runbooks, technical documentation, and continuous improvement initiatives. • Support change management, patching, upgrades, and operational governance for shared services. • Collaborate with architects and leadership to align platform evolution with future-state delivery needs.
Senior Site Reliability Engineer, Government
SentinelOneSecure your enterprise with the autonomous cybersecurity platform. Endpoint. Cloud. Identity. XDR. Now.
• Drive continuous software delivery, resolve incidents, run post mortems, and create automation strategies for deployment, self-testing, and alerting. • Lead and execute incident management for production issues, ensuring rapid recovery, root cause analysis, and preventative follow-up actions. • Improve and optimize the observability strategy by collaborating with application engineering teams to design monitoring solutions that enhance alerting capabilities and reduce noise. • Define, implement, and monitor SLOs, SLIs, and SLAs in collaboration with product and engineering teams to align with business objectives. • Design, develop, and maintain software solutions that address operational, compliance, and pipeline challenges. • Own and coordinate all government environment releases, driving process improvements to enhance the release pipeline's efficiency, reliability, and visibility. • Partner cross-functionally with engineering, product, SecOps, compliance, and leadership teams to align priorities, define testing strategies, and resolve challenges. • Ensure all infrastructure and deployments meet FedRAMP, government regulations, and industry standards, while maintaining required release documentation and risk assessments.
Senior DevOps Engineer
UnqorkUsing CaaS (Codeless-as-a-Service) to accelerate time-to-market & eliminate legacy code for the enterprise 🚀
• Reporting to the Director of DevOps • Build the next-generation control plane that provisions, configures, and manages Unqork's Kubernetes fleet across commercial, government, and edge customer environments, continuing to push toward an architecture that is automated, modular, and built to scale • Design and deliver self-service infrastructure tooling that enables Ops and Support teams to execute common operational workflows without engineering intervention, shifting operational work left and freeing engineers to build • Drive observability improvements across the fleet, establishing alerting and instrumentation that produces cleaner signal, enables faster mitigation, and supports deeper root cause analysis when incidents occur • Improve the internal engineering experience by building faster CI/CD pipelines, better tooling, and paved-road patterns that reduce cognitive load and make the right way to build and ship the obvious way • Collaborate closely with the Principal Architect to shape technical direction across the control plane, infrastructure automation, and cross-cutting infrastructure concerns
Role Description Build out a cloud-native team that owns the entire software delivery life cycle on Amazon Web Services. You will combine deep Kubernetes expertise with Python and shell scripting to automate, monitor, and continuously improve the Linqia platform while driving FinOps practices to keep our cloud footprint efficient. Work in a GitOps culture where every change is delivered through pull requests and rolled out by automated pipelines. What You Will Do - Design, maintain, and evolve our AWS account structure, VPC networking, IAM policies, security boundaries, and cost-management controls using Terraform and the AWS console. - Maintain secure networking layers with AWS load balancers, ingress controllers, service-mesh policies, network policies, and zero-trust principles. - Operate and harden production-grade Kubernetes clusters on AWS EKS, including upgrades, service mesh, policy management, and multi-cluster architectures driven by Argo CD. - Build reusable infrastructure-as-code modules with Terraform that provision cloud resources in minutes while enforcing tagging standards and least-privilege access. - Create self-service CI/CD pipelines in Jenkins and GitHub Actions for fast, safe releases with automated testing and promotion across environments. - Deliver real-time observability with Datadog, Prometheus, Grafana, CloudWatch, and OpenTelemetry, and use these tools to assist in solving production bugs and issues. - Administer and maintain purpose-built Linux VMs via configuration management tools like Puppet, Ansible, or Chef. - Deploy, scale, and maintain databases on AWS (Aurora, PostgreSQL, MySQL, OpenSearch, etc.), maintaining high database performance/uptime, optimizing tables and datasets, and ensuring disaster recovery protocols are in place. - Support developers by maintaining Podman-based local dev boxes and Kubernetes staging environments that mirror production, ensuring smooth hand-off from local code to cloud-native deployments. - Implement FinOps practices: track and forecast AWS spend, enforce cost-allocation tagging, identify rightsizing opportunities, manage Savings Plans or Reserved Instances, and build cost-optimization dashboards for engineering and finance stakeholders. - Write automation utilities and command-line tools in Python and craft shell scripts that glue components and workflows together. - Champion reliability through incident reviews, capacity planning, game days, chaos testing, and service-level objective tracking. - Collaborate in Agile rituals, plan sprints, refine backlog tickets, and pair with peers to spread DevOps and FinOps best practices. Qualifications - Bachelor degree in Computer Science or equivalent practical experience. - Three plus years working with cloud infrastructure or platform engineering focused on AWS. - Deep hands-on experience with Kubernetes, preferably EKS, covering upgrades, networking, storage, RBAC, and custom resources. - Proficiency in Python and Bash or Zsh scripting. - Strong understanding of core AWS services EC2, VPC, IAM, ALB, S3, RDS, CloudFormation, and CloudWatch. - Demonstrated experience applying FinOps principles: cost monitoring, forecasting, and optimization on AWS. - Solid experience with Docker and container runtimes, with emphasis on Podman for local development environments. - Hands-on practice with configuration-management tools such as Ansible or Puppet and infrastructure-as-code with Terraform. - Proven use of Datadog for metrics, logs, and APM, plus familiarity with Prometheus and Grafana dashboards. - Comfortable with Git-based workflows, feature branching, and pull-request reviews. - Strong SQL skills and a deep understanding of relational database internals. - Competent in Linux administration, process troubleshooting, and performance tuning. - Practical knowledge of TCP/IP, HTTP, TLS, DNS, and common networking tools. - Clear communication skills and an ability to translate complex technical topics to diverse audiences. - Familiarity with Scrum or Kanban and a continuous-improvement mindset. Extra Credit - AWS certifications such as Solutions Architect, DevOps Engineer, or FinOps Practitioner. - Experience with AWS security tooling GuardDuty, Security Hub, IAM Access Analyzer, and KMS. - Building data pipelines with Apache Spark, Flink, or similar frameworks. - Implementing event-driven architectures with Kafka Streams or KSQL. - Applying SRE practices such as error budgets and service-level dashboards. - Exposure to machine-learning workflows, ModelOps, or MLOps in production.




