The #1 field service management app for contractors.
Senior Infrastructure Engineer
Location
United States
Posted
7 days ago
Salary
$200K - $210K / year
Seniority
Senior
Job Description
Senior Infrastructure Engineer
CompanyCam
Role Description We’re looking for a sharp, self-motivated, problem-solving Senior Infrastructure Engineer to join our team. As a Senior Infrastructure Engineer, you’ll: - Work with a small team to build solutions in AWS that keep CompanyCam’s web and mobile applications humming. - Collaborate with product engineers to deploy new features and improve our existing infrastructure to make it more reliable, resilient, and secure. - Bring depth in one or more of the following areas: - Database operations and performance tuning - Search infrastructure (OpenSearch/Elasticsearch) - Observability and monitoring - Help scale and harden the systems that underpin CompanyCam's products as we grow. Qualifications - Strong knowledge of AWS, including its networking, relational database management, and cloud storage services. - Experience operating and tuning PostgreSQL or similar relational databases in production. - Experience deploying containerized applications and working with Github Actions and Terraform. - Capable of scripting in bash, ruby, and/or python. - Able to break down a complex task and document its solution. - Understands how to analyze an existing system to identify improvements. - Experience with AWS ECS/EKS Fargate. - Experience operating or tuning OpenSearch/Elasticsearch clusters. - Be technically savvy and hungry to learn. - Reside permanently and currently somewhere in the U.S. Requirements Must-haves: - Show up: give us your best and have the bravery to do difficult but necessary stuff. - Grow up: take responsibility, learn continuously, and have a growth mindset. - Do good: treat your co-workers and our customers the way you want to be treated. Nice-to-haves: - Expertise in one or more of the following: - Datadog, Grafana, OpenTelemetry - PGBouncer or similar connection pooling tools - AWS Fargate, Aurora PostgreSQL or OpenSearch - Designing systems for high availability and resiliency - Experience using AI/LLM tools to improve infrastructure or engineering workflows - Producing and presenting educational content of technical concepts Benefits - This is a salaried/hourly position at CompanyCam. - Our starting salary range is $200,000 - $210,000 per year and is based on experience. - We offer meaningful equity and other benefits.
Related Guides
Related Categories
Related Job Pages
More Infrastructure Engineer Jobs
Senior System Engineer / Cloud & Infrastructure
Intercon Solutions GmbHWir verbinden IT Fach- und Führungskräfte und Unternehmen
Role Description Wir suchen einen erfahrenen System Engineer, der nicht nur betreibt, sondern mitdenkt, mitgestaltet und mit aufbaut. Du übernimmst Verantwortung für geschäftskritische IT-Infrastrukturen und moderne Cloud-Umgebungen – pragmatisch, lösungsorientiert und mit Blick fürs Ganze. - Betrieb, Weiterentwicklung und Härtung von Microsoft 365-, Azure- und Windows-Infrastrukturen - Verantwortung für Entra ID, Intune, Exchange Online, Teams (inkl. Telefonie) sowie Tenant-Konfiguration - Administration klassischer Windows-Server-Umgebungen (AD, GPO, DNS, DHCP, Fileservices) - Planung und Betrieb von Netzwerk- und Security-Infrastrukturen (VLAN, VPN, Firewalls, PKI) - Sicherstellung von Backup-, Virtualisierungs- und VDI-Lösungen (Veeam, ESXi/Proxmox, Citrix) - Aktive Mitarbeit an Security-Maßnahmen, Audits und Härtungskonzepten - Dokumentation, Automatisierung und strukturierter Betrieb mit modernen Tools Qualifications - Mehrjährige Erfahrung im Betrieb komplexer IT-Infrastrukturen - Tiefe Praxis in Microsoft- und Cloud-Technologien - Sehr gutes Verständnis für Netzwerk, Security und Identitätsmanagement - Solide Linux-Kenntnisse in der Serveradministration - Freude daran, Strukturen aufzubauen, Standards zu definieren und Verantwortung zu übernehmen - Hands-on-Mentalität, Eigeninitiative und Qualitätsbewusstsein Benefits - Gestaltungsspielraum statt reiner Abarbeitung - Moderne Toollandschaft und klare technische Standards - Verantwortung, Vertrauen und die Möglichkeit, nachhaltig etwas aufzubauen - Ein Umfeld, in dem Erfahrung geschätzt und Initiative ausdrücklich gewünscht ist
• Serve as subject matter expert in at least two areas: Cloud & infrastructure orchestration (automation scripting, integration) and O365/Exchange management • Lead deployment and support of hybrid cloud environments (Azure/on-prem) • Perform advanced support (Tier 3), including root-cause analysis and performance tuning • Contribute to automation efforts using scripting (PowerShell, Python, etc.) • Ensure security, backups, monitoring, patching, and documentation are rock solid • Support SOX-compliant operations and participate in audits as needed • Partner with global teams to align on standards and drive toward 'One Signode'
Role Description As a Cloud Systems Engineer, you will join our team on a journey to help eliminate barriers for patients, increase their access to medications, and help them receive lifesaving treatments while working in an environment that nurtures you. As a part of the infrastructure group, you will help shape the core technologies used at Valeris. The role will have a primary focus on project work, with escalation from other teams as needed. This is a fast-paced environment with new challenges and technologies added to the mix daily. Teamwork and communication are key to be successful in this position. Responsibilities - Architect, design, and lead the implementation of enterprise-scale, Azure-first cloud solutions, with supporting workloads across AWS and hybrid on-prem environments. - Own, define, and drive cloud platform strategy and architecture, including governance, security, cost optimization, and operational best practices. - Lead the consolidation, standardization, and optimization of multiple cloud environments to improve scalability, efficiency, and overall operational maturity. - Serve as a senior technical escalation point for complex infrastructure, cloud, and platform-level issues. - Design, implement, and maintain highly available, secure, and resilient cloud infrastructure aligned with enterprise standards. - Lead system and cloud architecture design efforts, translating business requirements into scalable technical solutions. - Oversee capacity planning, workload forecasting, and performance optimization across cloud and hybrid environments. - Ensure consistent execution of change management, patching, and lifecycle management processes. - Participate in on-call rotation and perform off-hours maintenance as required to support platform reliability. - Utilize Valeris’ values as the driving force behind the team’s success. - On time adherence to training deadlines for all corporate policies and procedures. - Ensure all SOPs are followed with consistency. - Perform additional tasks or projects as assigned. Qualifications - Bachelor’s degree in computer science or related field; equivalent combination of education and work experience is acceptable. - 7+ years of experience in enterprise infrastructure, systems, or cloud engineering. - 5+ years of deep, hands-on Microsoft Azure experience, including architecture, deployment, and operations. - 3+ years of practical AWS experience supporting enterprise production workloads. - Proven ability to independently own and lead initiatives end-to-end. - Strong communication skills with the ability to translate complex technical concepts to technical and non-technical stakeholders. - Strong security-first mindset across design, implementation, and operations. - Extensive experience managing Windows Server environments (2012–2025). - 5+ years of experience managing VMware vSphere and enterprise hybrid virtualization environments. - Hands-on experience with containerization and orchestration technologies such as Docker and Kubernetes (preferred). - Linux administration experience in production or hybrid environments preferred. - Advanced scripting and automation experience (PowerShell, MS Graph, Bash, Terraform, or equivalent). - Strong knowledge of enterprise monitoring, backup, and disaster recovery solutions. - Demonstrated ability to work with PowerPoint and Visio for infrastructure and workflow architectural diagrams. Requirements - Prefer candidates who can type at least 35 words per minute with 97% accuracy. - Although very minimal, flexibility to travel as needed is preferred. - This job operates in a professional office environment. This role routinely uses standard office equipment such as computers, phones, photocopiers, etc. Benefits - Medical, dental, and vision plans, including HSA- and FSA-eligible options, with Valeris contributing toward premium costs. - Additional health support, including telehealth and Employee Assistance Program (EAP) services. - Company match on Health Savings Account contributions. - Free Basic Life and AD&D coverage equal to your annual earnings, with a minimum of $50,000 and a maximum of $300,000. - Company-paid Short-Term Disability coverage, with the option to purchase Long-Term Disability. - 401(k) Retirement Savings Plan with 100% match on the first 5% you contribute, with immediate vesting. - Paid Time Off (PTO) and Sick Leave to support work-life balance. - Team members receive nine paid holidays plus two floating holidays. - Opportunities for advancement in a company that supports personal and professional growth. - A challenging, stimulating work environment that encourages new ideas. - Work for a company that values diversity and makes deliberate efforts to create an inclusive workplace. - A mission-driven, inclusive culture where your work makes a meaningful impact.
Founding ML Infrastructure Engineer
uRunWe build the stage, not the show. We're an infrastructure company, a developer-tools company, and a production partner for model labs, and focus is a deliberate choice we've made and hold to. Day-to-day, that means a small team, a high bar, and real ownership. You won't wait for permission or inherit a backlog of someone else's decisions. In a founding security role, the function is what you make it. It also means ambiguity: priorities shift, not everything is documented. You'll often be the person who decides what "secure enough, for now" means.
Role Description We are building the next generation of AI inference infrastructure. As our ML Infrastructure and Platform Engineer, you will own the architecture and scaling of our GPU compute platform from the ground up. This is a founding technical hire with end-to-end ownership across the full infrastructure stack, from bare metal to model serving. You will work directly with the founding team and define how we build. What you'll actually be doing day-to-day: - Design and scale our GPU compute platform to support 1,000+ GPU clusters, ensuring high availability and low-latency inference across the fleet. - Build and maintain the infrastructure layer for our compute marketplace, including multi-tenant scheduling, isolation, and billing-aware resource allocation. - Own production reliability for ML systems end-to-end: observability, incident response, and SLA achievement across model serving and infrastructure. - Architect feature stores and model registry systems that support rapid iteration and reproducibility at scale. - Design an experiment tracking infrastructure capable of handling thousands of concurrent runs with full auditability. - Build resource orchestration and scheduling systems that optimise for throughput, cost, and latency across heterogeneous hardware. - Set engineering standards for infrastructure reliability, capacity planning, and operational excellence as an early technical leader. Qualifications - Proven experience designing and operating large-scale distributed infrastructure at 1,000+ nodes or equivalent complexity, in any domain. - Deep expertise in distributed systems, cluster orchestration (Kubernetes, Slurm, or custom schedulers), and large-scale resource scheduling. - Strong production reliability instincts: observability, incident response, capacity planning, and SLA ownership across complex systems. - Experience building infrastructure that other engineers build on top of, not just operating it. - Ability to operate as a technical lead: set direction, make tradeoffs under uncertainty, and raise the bar for the team around you. - Startup orientation. You are energised by ambiguity, move fast, and build for scale from day one. Requirements - Exposure to ML infrastructure concepts: GPU networking (NCCL, InfiniBand, RoCE), model serving frameworks (vLLM, SGLang, TensorRT-LLM), or hardware-aware performance tuning (CuTe, Triton, TileLang). - Experience with multi-cloud GPU procurement and capacity management across AWS, GCP, Azure, and bare metal providers. - Familiarity with inference marketplace architectures, dynamic routing, or spot/preemptible workload management. - Prior experience at a Series A or earlier stage company scaling from early infrastructure to production. Benefits - Competitive salary and meaningful equity in an early-stage AI infrastructure company. The band above is our target; for an exceptional candidate we'll go higher. Equity is real — you're early, and the grant reflects that. - Health, dental, and vision — full coverage. - 401(k) — company-supported retirement savings. - FSA/HSA — flexible spending accounts for healthcare costs. - Paid time off — we trust you to manage your time. - Top-tier tooling — access to the best AI tools available: Claude, Codex, Kimi, and whatever else helps you move faster. - MacBook Pro and AirPods — the hardware you need, on us.


