Using CaaS (Codeless-as-a-Service) to accelerate time-to-market & eliminate legacy code for the enterprise 🚀
Senior DevOps Engineer
Location
United States
Posted
3 days ago
Salary
$120K - $170K / year
Seniority
Senior
Job Description
Senior DevOps Engineer
Unqork
• Reporting to the Director of DevOps • Build the next-generation control plane that provisions, configures, and manages Unqork's Kubernetes fleet across commercial, government, and edge customer environments, continuing to push toward an architecture that is automated, modular, and built to scale • Design and deliver self-service infrastructure tooling that enables Ops and Support teams to execute common operational workflows without engineering intervention, shifting operational work left and freeing engineers to build • Drive observability improvements across the fleet, establishing alerting and instrumentation that produces cleaner signal, enables faster mitigation, and supports deeper root cause analysis when incidents occur • Improve the internal engineering experience by building faster CI/CD pipelines, better tooling, and paved-road patterns that reduce cognitive load and make the right way to build and ship the obvious way • Collaborate closely with the Principal Architect to shape technical direction across the control plane, infrastructure automation, and cross-cutting infrastructure concerns
Job Requirements
- 5+ years operating production Kubernetes clusters or equivalent container orchestration experience at scale in multi-tenant or regulated environments
- Strong AWS fundamentals (EKS, IAM, VPC networking, ACM) as the primary cloud; working knowledge of GCP and Azure in a multi-cloud operating environment is a plus
- Hands-on IaC experience (Terraform, OpenTofu, or Pulumi); comfort treating infrastructure as software, with real opinions on testability and safe change management at scale.
- Strong software engineering skills using general-purpose programming language (Go, TypeScript, Python, or similar) to model and manage the infrastructure lifecycle in an API and workflow driven control plane
- High autonomy in ambiguous problem spaces, defining the path forward when one doesn't exist, and taking full ownership of outcomes rather than waiting to be handed a spec.
- A track record of building internal tooling or self-service workflows that non-engineering teams can actually use in production
- Experience improving observability posture, including instrumentation strategy, alert quality, distributed tracing, and reducing time from detection to resolution
- Experience in regulated or compliance-sensitive environments where security, auditability, and change management are non-negotiable
- Bias toward fewer moving parts, not more, and can articulate why every layer of complexity earns its place.
- An AI-forward mindset: you actively use AI tools to accelerate your own work and think critically about where automation creates real leverage
Benefits
- 💻 Work from home with a remote-first community
- 🏝 Unlimited PTO (and the encouragement to use it)
- 📝 Student loan payback program
- 🏥 100% employer-covered medical, dental, and vision options available to you and your dependents
- 💸 Flexible Spending Account (FSA)
- 🏠 Monthly stipend toward your WFH setup, vacation, development and more
- 💰 Employer-sponsored 401(k) with contribution match
- 🏋🏻♀️ Subsidized ClassPass Membership
- 🍼 Generous Paid Parental Leave
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Role Description Build out a cloud-native team that owns the entire software delivery life cycle on Amazon Web Services. You will combine deep Kubernetes expertise with Python and shell scripting to automate, monitor, and continuously improve the Linqia platform while driving FinOps practices to keep our cloud footprint efficient. Work in a GitOps culture where every change is delivered through pull requests and rolled out by automated pipelines. What You Will Do - Design, maintain, and evolve our AWS account structure, VPC networking, IAM policies, security boundaries, and cost-management controls using Terraform and the AWS console. - Maintain secure networking layers with AWS load balancers, ingress controllers, service-mesh policies, network policies, and zero-trust principles. - Operate and harden production-grade Kubernetes clusters on AWS EKS, including upgrades, service mesh, policy management, and multi-cluster architectures driven by Argo CD. - Build reusable infrastructure-as-code modules with Terraform that provision cloud resources in minutes while enforcing tagging standards and least-privilege access. - Create self-service CI/CD pipelines in Jenkins and GitHub Actions for fast, safe releases with automated testing and promotion across environments. - Deliver real-time observability with Datadog, Prometheus, Grafana, CloudWatch, and OpenTelemetry, and use these tools to assist in solving production bugs and issues. - Administer and maintain purpose-built Linux VMs via configuration management tools like Puppet, Ansible, or Chef. - Deploy, scale, and maintain databases on AWS (Aurora, PostgreSQL, MySQL, OpenSearch, etc.), maintaining high database performance/uptime, optimizing tables and datasets, and ensuring disaster recovery protocols are in place. - Support developers by maintaining Podman-based local dev boxes and Kubernetes staging environments that mirror production, ensuring smooth hand-off from local code to cloud-native deployments. - Implement FinOps practices: track and forecast AWS spend, enforce cost-allocation tagging, identify rightsizing opportunities, manage Savings Plans or Reserved Instances, and build cost-optimization dashboards for engineering and finance stakeholders. - Write automation utilities and command-line tools in Python and craft shell scripts that glue components and workflows together. - Champion reliability through incident reviews, capacity planning, game days, chaos testing, and service-level objective tracking. - Collaborate in Agile rituals, plan sprints, refine backlog tickets, and pair with peers to spread DevOps and FinOps best practices. Qualifications - Bachelor degree in Computer Science or equivalent practical experience. - Three plus years working with cloud infrastructure or platform engineering focused on AWS. - Deep hands-on experience with Kubernetes, preferably EKS, covering upgrades, networking, storage, RBAC, and custom resources. - Proficiency in Python and Bash or Zsh scripting. - Strong understanding of core AWS services EC2, VPC, IAM, ALB, S3, RDS, CloudFormation, and CloudWatch. - Demonstrated experience applying FinOps principles: cost monitoring, forecasting, and optimization on AWS. - Solid experience with Docker and container runtimes, with emphasis on Podman for local development environments. - Hands-on practice with configuration-management tools such as Ansible or Puppet and infrastructure-as-code with Terraform. - Proven use of Datadog for metrics, logs, and APM, plus familiarity with Prometheus and Grafana dashboards. - Comfortable with Git-based workflows, feature branching, and pull-request reviews. - Strong SQL skills and a deep understanding of relational database internals. - Competent in Linux administration, process troubleshooting, and performance tuning. - Practical knowledge of TCP/IP, HTTP, TLS, DNS, and common networking tools. - Clear communication skills and an ability to translate complex technical topics to diverse audiences. - Familiarity with Scrum or Kanban and a continuous-improvement mindset. Extra Credit - AWS certifications such as Solutions Architect, DevOps Engineer, or FinOps Practitioner. - Experience with AWS security tooling GuardDuty, Security Hub, IAM Access Analyzer, and KMS. - Building data pipelines with Apache Spark, Flink, or similar frameworks. - Implementing event-driven architectures with Kafka Streams or KSQL. - Applying SRE practices such as error budgets and service-level dashboards. - Exposure to machine-learning workflows, ModelOps, or MLOps in production.
• Build highly interactive, single-page React apps that can scale with both increased interaction complexity and volume. • Design, implement, and maintain deployments at scale, infrastructure, reliability, and scalability; then iterate and optimize continual improvements. • Manage always-available infrastructure, deployment pipelines, and platform tooling to eliminate downtime and improve the manageability of services and systems. • Collaborate with Software Engineering teams to architect and develop infrastructure and automated deployments for cloud-native SaaS applications. • Research and integrate new technologies and innovative solutions to continuously enhance platform functionality and performance. • Partner with peers on product development to define and execute the company’s roadmap and to address critical technical challenges.
• Helping migrate a self-managed Kubernetes cluster onto Amazon EKS. • Managing and improving AWS infrastructure defined in Terraform. • Supporting the migration of self-hosted Kafka onto Amazon managed services. • Ensuring platform stability, observability, and security during changes. • Collaborating closely with a senior internal team and taking initiative on tasks. • Documenting work for team maintenance.
• Design and implement Infrastructure as Code practices • Build and improve observability (monitoring, logging, tracing) • Stabilize and evolve production environments • Support multi-environment deployments (Azure, private cloud, on-premise) • Improve platform reliability and system health • Participate in incident response and post-mortem analysis




