BlackSky is a leading provider of real-time geospatial intelligence.
Senior Infrastructure Engineer
Location
Virginia
Posted
3 days ago
Salary
$135K - $150K / year
Seniority
Senior
Job Description
Senior Infrastructure Engineer
BlackSky
• Design and operate AWS infrastructure (VPC, subnets, NLB/ALB, IAM, EKS, EC2, S3, Route 53) and the hybrid connectivity that ties cloud to on-premises and private/air-gapped networks. • Stand up and run production-grade Kubernetes clusters on EKS, Rancher (RKE2) and/or Red Hat OpenShift 4, including upgrades, capacity planning, networking, storage, and day-2 operations. • Implement and own GitOps workflows with Argo CD — declarative cluster and application state, app-of-apps patterns, sync policies, drift detection, and progressive rollout strategies. • Author, version, and maintain Helm charts for internal and third-party workloads, including values management, chart dependencies, and templating standards across environments. • Build repeatable delivery into disconnected environments using Zarf (and equivalent packaging/mirroring tooling) — bundling images, charts, and manifests for air-gapped installs and reproducible deployments. • Codify infrastructure and platform configuration as code (Terraform, Helm, Kustomize) with a clear build-once / promote-per-environment strategy. • Build and harden CI/CD pipelines that move artifacts safely from dev through to restricted production and BCP targets. • Integrate platform services — certificate management (cert-manager), secrets management, container registries, storage, and observability — as shared, reusable building blocks. • Establish operational standards: monitoring, alerting, logging, runbooks, incident response, and capacity/cost management. • Other responsibilities as assigned.
Job Requirements
- At least five years years in infrastructure, platform, DevOps, or SRE engineering, with at least 3 years running Kubernetes in production.
- Bachelor's degree in a relevant field of study or equivalent experience (four years).
- Strong hands-on AWS experience across networking, compute, storage, and IAM, including hybrid/on-prem connectivity patterns.
- Production experience operating Kubernetes in one or more enterprise distributions — Amazon EKS, Rancher/RKE2, or OpenShift 4.
- Demonstrated GitOps experience with Argo CD (or Flux) as the primary deployment mechanism.
- Proficiency authoring and maintaining Helm charts, and a solid grasp of Kubernetes primitives (workloads, networking, RBAC, storage, CRDs/operators).
- Experience with the Kubernetes Operator deployment model — deploying and managing workloads via operators and CRDs (OLM/OperatorHub).
- Strong infrastructure-as-code skills, ideally with Terraform.
- Comfort with Linux systems administration and scripting (Bash, plus Python or Go).
- Experience building on hardened, non-CVE / zero-known-vulnerability base images (e.g., Chainguard, Iron Bank, or distroless/minimal baselines) and supply-chain security practices.
- Production monitoring and observability with Prometheus and Grafana (exporters, PromQL, alerting, dashboards).
- Clear written and verbal communication, and the ability to work independently across the full lifecycle of a platform component.
Benefits
- Medical, dental, vision, disability, group term life and AD&D, voluntary life and AD&D insurance
- BlackSky pays 100% of employee-only premiums for medical, dental and vision and contributes $100/month for out-of-pocket expenses!
- 15 days of PTO, 11 Company holidays, four Floating Holidays, one day of paid volunteerism leave per year, parental leave and more
- 401(k) pre-tax and Roth deferral options with employer match
- Flexible Spending Accounts
- Employee Stock Purchase Program
- Employee Assistance and Travel Assistance Programs
- Employer matching donations
- Professional development
- Mac or PC? Your choice!
- Awesome swag
Related Guides
Related Categories
Related Job Pages
More Infrastructure Engineer Jobs
Senior Platform Engineer, Voice Infrastructure
ConquerClose more deals by connecting with your buyers faster, where they want to talk, without leaving Salesforce.
• Own and scale Conquer’s telephony systems powering real-time sales workflows in Salesforce. • Solve reliability, performance, and infrastructure challenges while shaping the future of our platform. • Design, build, and scale Conquer’s telephony infrastructure and supporting services • Own integrations with domestic and international SIP Trunking providers • Work directly in production environments, troubleshooting real-time call quality and reliability issues • Improve observability across the voice stack, including SIP, RTP, and media paths • Develop and evolve call routing, dialing workflows, and real-time communication services • Partner with Product, Engineering, and Customer teams to shape the calling experience • Contribute to architectural decisions, including ongoing platform and orchestration improvements • Document systems and create runbooks that improve team understanding and operational readiness
Platform Architect, Server Infrastructure
Phononic IncCooling the Data Centers and Optics that Power AI
• Define end-to-end thermal architecture strategies for GPU servers, optical interconnects, and CPO-based systems • Develop system-level approaches to balance performance, heat dissipation, reliability, and energy efficiency • Design and optimize solutions for: High-power GPUs and accelerators, Dense optical I/O (pluggable and co-packaged optics), Rack- and cluster-level thermal constraints • Optimize cooling strategies for high-density AI workloads and optical bandwidth scaling • Analyze and improve thermal resistance, junction temperatures, and cooling efficiency • Lead design and evaluation of advanced cooling approaches, including: Air cooling (high-performance heatsinks, airflow optimization), Liquid cooling (direct-to-chip, cold plates), Immersion cooling and emerging techniques • Architect thermal solutions for: High-speed optical transceivers (400G/800G/1.6T+), Co-packaged optics (CPO) integrated with switch or GPU ASICs • Collaborate with silicon photonics teams to co-design thermal-aware optical packaging architectures • Design GPU server platforms optimized for thermal efficiency, including: Multi-GPU configurations and interconnect density, Power delivery and cooling integration, Airflow and liquid loop design • Drive innovations in rack-level and data center-level cooling, including: High-density rack (>50–100kW) thermal strategies, Integration with facility cooling systems, Optimization for power usage effectiveness (PUE)
IT Cyber Security Architect, Plant Infrastructure
Recurrent EnergyDelivering clean, reliable and affordable power to the world, today and tomorrow.
• Develop and execute holistic cybersecurity strategies tailored to the unique challenges of Operational Technology environments, focusing on protecting critical assets, ensuring availability, and preventing unauthorized access. • Stay abreast of relevant regulations and standards, particularly NERC CIP (Critical Infrastructure Protection) standards, and ensure the organization's systems, processes, and procedures are aligned with compliance requirements. • Design, review, and enhance network architectures for both IT and OT environments, incorporating security measures that prevent unauthorized intrusion, data breaches, and other cyber threats. • Conduct thorough risk assessments to identify vulnerabilities and potential threats within the OT landscape. Translate findings into actionable security recommendations and solutions. • Lead the deployment of advanced security solutions, including intrusion detection systems, firewalls, access controls, and encryption mechanisms, to safeguard critical infrastructure. • Collaborate with cross-functional teams, including IT, operations, engineering, and compliance, to align cybersecurity initiatives with business goals, operational needs, and regulatory requirements. • Develop and maintain robust incident response plans specific to OT environments. Coordinate with incident response teams to ensure a swift and effective response to security incidents. • Raise awareness and provide training to employees, contractors, and partners about OT cybersecurity best practices, policies, and procedures. • Evaluate the security posture of third-party vendors and partners, ensuring that their solutions and services meet cybersecurity standards.
• Paperpile runs on data at scale, with a literature database of 250M+ academic papers and a growing body of user data accumulated over more than a decade. You'll work across the systems that ingest, process, store, and serve this data reliably: building pipelines, optimizing search, handling PDFs at scale, and exposing clean APIs.




