Job Closed

This listing is no longer active.

VOLT AI

Improving security effectiveness while reducing costs through advanced AI

Field & Cloud DevOps Engineer – Edge Infrastructure

DevOps Engineer, Other, Remote, Senior, Team 11-50, H1B No Sponsor

Location

California

Posted

134 days ago

Salary

$135K - $160K / year

Seniority

Senior

Job Description

  • Own end-to-end infrastructure operations spanning customer sites, edge devices, and AWS production environments
  • Plan and execute customer deployments in collaboration with customer IT teams, including leading technical discussions around network architecture, security requirements, and deployment constraints; translating customer IT policies into executable deployment plans; and driving deployment readiness and timelines to ensure on-time launches
  • Deploy, configure, and maintain edge compute hardware in customer environments, including installing and maintaining Linux-based operating systems; creating, deploying, and updating standardized golden images; managing OS patches, drivers, firmware, and security updates; and diagnosing hardware, OS, and performance issues remotely and on-site
  • Design, integrate, and troubleshoot customer-side networking, including VLANs, firewall rules, NAT, routing, and bandwidth constraints; IP camera and video infrastructure (RTSP, ONVIF, PoE, managed switches); and operating effectively within locked-down or highly regulated networks
  • Build and operate AWS-based infrastructure supporting edge systems, including Kubernetes (EKS) clusters and containerized workloads; CI/CD pipelines for deployment and upgrades; and logging, monitoring, and alerting across distributed systems
  • Develop and maintain automation and internal tooling, including Python for deployment automation, operational tooling, and system workflows; shell scripting for provisioning, configuration, and diagnostics; and tools that reduce manual effort and improve deployment repeatability
  • Ensure reliable edge-to-cloud connectivity and data flow, debugging failures across networking, software, and infrastructure boundaries
  • Lead incident response and operational debugging across edge and cloud systems, driving issues to root cause and permanent resolution
  • Own infrastructure cost visibility and optimization, including monitoring and analyzing AWS spend; implementing right-sizing, scaling, and cost controls; and working with external cost-optimization partners and tooling
  • Feed real-world deployment and operations insights back into product and infrastructure design, improving reliability and scalability

Job Requirements

  • Experience with edge computing or on-prem deployments
  • Familiarity with IP camera systems, RTSP, video infrastructure, and Video Management Systems
  • Experience with infrastructure-as-code (Terraform or CloudFormation)
  • Experience managing or optimizing cloud infrastructure costs
  • Background working in fast-moving startup environments
