Leading the Incremental Computing Revolution
Backend Infrastructure Engineer – Control Plane, Rust
Location
United States
Posted
4 days ago
Salary
0
Seniority
Senior
Job Description
Backend Infrastructure Engineer – Control Plane, Rust
Feldera
• Control plane engineering: Own and evolve the control-plane services that orchestrate pipelines across diverse customer environments. • Kubernetes platform: evolve the Kubernetes layer that runs Feldera pipelines reliably, including resource management and self-hosted deployment workflows. • Enterprise readiness: Build and harden capabilities to support security, isolation, and access-control needs of large enterprises. • Cloud & self-hosted deployment: Develop the capabilities that let customers run Feldera across environments, from single-node laptops to multi-node clusters, on their own infrastructure. • API design: Design clean, stable, well-documented APIs for the control plane that internal teams and customers build workflows against. • Reliability & operability: Make the control plane observable, debuggable, and resilient at scale. • Troubleshooting: Debug complex distributed-systems issues across the control plane and Kubernetes layer.
Job Requirements
- Strong proficiency in Rust or strong systems-programming experience in a comparable language (C++, Go) with a demonstrated ability to ramp quickly on Rust.
- Experience building backend services, APIs, and distributed systems in production.
- Hands-on experience with Kubernetes — building on top of it, writing controllers/operators, or designing services that run reliably within it.
- Familiarity with authentication, SSO, and multi-tenancy, compliance standards (FIPS etc.), and the security considerations of enterprise/self-hosted software.
- Solid Linux fundamentals and comfort operating in containerized environments (Docker).
- Strong troubleshooting skills and the ability to debug complex distributed-systems issues.
- Self-directed with excellent communication skills and the ability to work effectively in a remote team.
- Candidates must have authorization to work in their country of residence. **We are unable to sponsor employment visas at this time.**
Benefits
- Competitive salary & meaningful equity
- Medical, dental & vision - 90% of premiums covered by Feldera
- HSA & FSA
- 401(k)
- Fully remote
Related Guides
Related Categories
Related Job Pages
More Infrastructure Engineer Jobs
Role Description We are seeking a hands-on Cloud Infrastructure Engineer to act as the technical builder responsible for deploying, managing, and automating complex Google Cloud solutions for both internal and external client projects. In this role, you will work across a wide variety of cloud domains—including networking, compute, storage, and CI/CD—while ensuring that every environment you build is inherently secure. Beyond deploying core infrastructure using Infrastructure as Code (IaC), you will be instrumental in configuring the critical identity, access, and logging foundations required for enterprise-grade, secure-by-design deployments. Duties and Responsibilities - Core Infrastructure & Automation - Infrastructure Deployment: Design, build, and maintain highly available GCP architectures (Compute Engine, Kubernetes Engine, VPCs, Cloud Storage) using Infrastructure as Code (primarily Terraform). - Automation & CI/CD: Assist in building and maintaining deployment pipelines to automate infrastructure provisioning and application rollouts. - GCP Project Provisioning: Set up foundational cloud environments, including establishing organizational policies, creating logical folder structures, and configuring baseline resource hierarchies. - Identity, Access & Security - Federated Identity Management: Configure and troubleshoot integrations and establish Google Workforce Identity Federation to allow external IdPs secure access to GCP resources. - Security Controls: Implement robust IAM and security controls, ensuring least-privilege access across service accounts, users, and groups. - Network Security: Configure firewalls, VPC peering, Cloud NAT, and basic network security protocols to protect client workloads. - Observability & Operations - Monitoring & Logging Setup: Deploy comprehensive observability tools using Cloud Monitoring to ensure system health, performance, and uptime. - Audit & Compliance: Configure Cloud Logging to capture, route, and retain system and security events, ensuring centralized audit trails for compliance. - Troubleshooting: Act as an escalation point to diagnose and resolve complex infrastructure, network, and IAM-related issues. Qualifications - Requires a BA/BS degree in Information Technology, Computer Science or related field of study or equivalent experience. - Experience: 3+ years of hands-on experience provisioning and managing infrastructure on Google Cloud Platform (GCP). - Core GCP Knowledge: Strong understanding of fundamental GCP services (VPC Networking, Compute Engine, Cloud Storage, Load Balancing). - Infrastructure as Code (IaC): Proficient in writing and deploying infrastructure using Terraform. - Identity & Security: Practical experience configuring IAM roles, organizational policies, and SSO/Identity Federation. - Scripting: Ability to write automation scripts in Python, Bash, or Go. - Client-Facing Skills: Strong communication skills with the ability to explain technical decisions and infrastructure designs to clients and internal Project Managers. - A positive outlook, a passion for development and the ability to work as part of a high-performance team. Benefits - Competitive salary and bonus plan. - 14 Paid holidays. - Medical Insurance. - Dental Insurance, 100% company paid premiums. - Vision Insurance, 100% company paid premiums. - Fully Company Paid Short term Disability Insurance. - Fully Company Paid Long Term Disability Insurance. - Fully Company Paid Life Insurance and AD&D. - 401k plan: Promevo offers a Safe Harbor 401K plan for full time employees with immediate eligibility. Promevo matches the first 3% of earnings you contribute at 100% and matches the next 2% of earnings you contribute at 50%. - Home office setup allowance. - Cell phone allowance. - 4 weeks of PTO and a healthy work/life balance. - Development and Training opportunities to help you grow.
• Provide Level 2 support for infrastructure-related incidents and service requests. • Diagnose and resolve issues across Windows Server environments, Active Directory, Group Policy and Microsoft 365. • Monitor and maintain core infrastructure services, including backups, patch management, and endpoint security solutions. • Investigate networking issues involving DNS, DHCP, and TCP/IP. • Escalate complex technical issues to senior engineers or third-party vendors when required. • Maintain accurate technical documentation, procedures, and knowledge base articles. • Support hardware deployments, upgrades, and routine infrastructure maintenance. • Collaborate with Service Desk and wider IT teams to ensure timely resolution of tickets and incidents. • Assist with ongoing infrastructure improvement projects and technology upgrades.
AI & Automation Cloud Infrastructure Engineer
JedoxThe world’s most adaptable planning and performance management platform.
• As an AI & Automation Cloud Infrastructure Engineer, you will shape and constantly develop Jedox's in-house AI cloud platform, allowing expandable, protected, and automated AI, ML, and GenAI workloads. • This highly technical role involves designing, building, and operating robust cloud infrastructures, defining reusable standards and ensuring operational excellence in production. • Design AI cloud solution architectures: Design and develop strong architectures for infrastructure, data, security and integration patterns for AI workloads. • Enable AI, ML and GenAI workloads: Define reusable templates, standards and architectures for efficient cloud-based deployment. • Collaborate across teams: Work closely with the Cloud Platform, SRE, Architecture and Engineering teams to ensure scalable and reliable AI services. • Implement GitOps workflows: Promote controlled environment PR-based processes for models, services and infrastructure. • Support MLOps lifecycle: Contribute to model deployment, promotion, rollback and operational support. • Ensure production readiness: Design for scalability, performance and high availability in production environments, with the aim of ensuring maximum efficiency and reliability. • Strengthen observability, reliability, and security: Enhance monitoring, incident response, and ensure compliance, data protection, and best practices.
Role Description We are looking for a hands-on infrastructure automation engineer to join a 6-month engagement supporting platform modernization and operational continuity at a leading global financial institution. The role is fully remote with required EST working hours (9am–6pm EST). You will lead the migration and modernization of infrastructure automation from Puppet to Ansible, support the stabilization of a critical structured finance platform (Intex/SPG), and help transition the environment toward future integration readiness. This role combines infrastructure automation, operational support, ETL/integration coordination, and structured finance platform support. Qualifications - Ansible — strong hands-on experience (playbooks, roles, inventories, Jinja2 templating) - Puppet-to-Ansible migration experience - Linux/Unix administration - Scripting — Python and/or Bash/Shell - SQL-based environment experience - Familiarity with ETL/data integration workflows and backend pipelines - Full fluency in English Requirements - Design, build, and maintain Ansible playbooks, roles, and inventories for infrastructure automation - Lead the migration of existing Puppet configurations to Ansible - Stabilize and optimize the current Intex platform setup supporting SPG and broader SANCAP processes - Coordinate and support ETL/data integration workflows and backend pipelines - Manage scheduled jobs and batch processing operations - Create and maintain operational documentation to support transition from a single-resource support model - Prepare the environment and processes for future ARDA integration Benefits - Duration: 6 months (July – December 2026) - Location: Remote — full EST hours required (9am–6pm EST) - Client: Leading global financial institution


