BRG logo
BRG

BRG combines world-leading academic credentials with world-tested business expertise purpose-built for agility and connectivity, which sets us apart—and gets you ahead. At BRG, our top-tier professionals include specialist consultants, industry experts, renowned academics, and leading-edge data scientists. Together, they bring a diversity of proven real-world experience to economics, disputes, and investigations; corporate finance; and performance improvement services that address the most complex challenges for organizations across the globe. Our unique structure nurtures the interdisciplinary relationships that give us the edge, laying the groundwork for more informed insights and more original, incisive thinking from diverse perspectives that, when paired with our global reach and resources, make us uniquely capable to address our clients’ challenges. We get results because we know how to apply our thinking to your world. At BRG, we don’t just show you what’s possible. We’re built to help you make it happen. BRG is proud to be an Equal Opportunity Employer.

AI Infrastructure Engineer

Infrastructure EngineerInfrastructure EngineerFull TimeRemoteMid LevelTeam 1,001-5,000

Location

Argentina

Posted

41 days ago

Salary

0

Seniority

Mid Level

Job Description

AI Infrastructure Engineer

BRG

Role Description BRG’s Ai Department is seeking an AI Infrastructure junior Engineer to contribute to the development of our Virtual Ai Lab initiative. Following the successful completion of Phase 01 (physical Ai Lab build-out), this role will focus on creating a virtual access layer that makes our high-performance Ai Lab remotely accessible to teams across BRG. The ideal candidate will design and implement scalable infrastructure to support processing 100,000+ documents daily using state-of-the-art LLMs from OpenAI and Anthropic. You will work alongside our Senior AI Infrastructure Engineer to deliver and maintain the cloud infrastructure that supports BRG’s AI workloads: - Terraform / OpenTofu modules - AKS workloads - CI/CD and the observability layer The role is hands-on from day one—you will take ownership of well-scoped infrastructure tasks while learning the platform’s patterns deeply enough to grow into more complex work. Your focus is the infrastructure that enables AI pipelines, not the AI applications themselves. Key Responsibilities - Contribute to Terraform / OpenTofu modules—write, review, and update IaC under the Senior Engineer’s direction. - Help operate AKS clusters and supporting Azure services (Key Vault, storage, networking): deployments, configuration changes, and triage. - Support CI/CD and GitOps workflows—PR reviews, pipeline fixes, ArgoCD / Flux manifests. - Instrument services for observability (OpenTelemetry traces, metrics, logs) and build dashboards and alerts under guidance. - Assist with distributed document-processing pipelines: debugging, performance analysis, and reliability improvements. - Monitor platform health, investigate incidents, and document runbooks. Qualifications - Bachelor’s degree in Computer Science (or equivalent practical experience) and 1–3 years of engineering experience. - Hands-on experience with at least one of: Terraform (or OpenTofu), Kubernetes, or a major cloud platform—Azure strongly preferred. - Solid Python fundamentals; comfortable writing scripts and small services. - Working understanding of REST APIs, HTTP, and data pipelines. - Curious, self-directed learner; comfortable asking for help and taking feedback. - Strong written and verbal English; works well in a distributed team. Nice to Have - Exposure to GitOps tooling (ArgoCD, Flux) or CI/CD systems (GitHub Actions, Azure DevOps). - Familiarity with OpenTelemetry, Prometheus / Grafana, or similar observability stacks. - Curiosity about AI/ML pipelines and frameworks such as Haystack, LangChain, or LangGraph. - Docker and containers, basic networking (VNet, subnets, security groups). Company Description BRG combines world-leading academic credentials with world-tested business expertise and purpose-built emerging technologies. Our culture centers on agility and connectivity which sets us apart and gets you ahead. At BRG, our professionals include specialist consultants, industry experts, renowned academics, and leading-edge data scientists. Together, they bring a diversity of real-world experience, data, and human and artificial intelligence, to economics, disputes, and investigations; corporate finance; and performance improvement services that address the most complex challenges facing organizations across the globe. Our unique structure nurtures the interdisciplinary relationships that give us the edge, laying the groundwork for more informed insights and more original, incisive thinking. When paired with our global reach and resources, our diverse perspectives and technical capabilities make us uniquely capable to address our clients’ challenges. We get results because we know how to apply our thinking to your world. At BRG, we don’t just show you what’s possible. We’re built to help you make it happen. BRG is proud to be an Equal Opportunity Employer. Our hiring practices provide equal opportunity for employment without regard to race, religion, color, sex, gender, national origin, age, United States military veteran status, ancestry, sexual orientation, marital status, family structure, medical condition including genetic characteristics or information, veteran status, or mental or physical disability so long as the essential functions of the job can be performed with or without reasonable accommodation, or any other protected category under federal, state, or local law.

Related Categories

Related Job Pages

More Infrastructure Engineer Jobs

Full TimeRemoteTeam 11-50

Job DetailsJob Location: Work From Home - McLean, VA 22012Position Type: Full TimeSalary Range: $128,000.00 - $145,000.00 Salary/yearCORAS is a secure, cloud-native SaaS platform that enables government and defense organizations to manage risk, compliance, and security operations in highly regulated environments. We’re looking for a Cloud Infrastructure Engineer to help operate and evolve the infrastructure behind CORAS. In this role, you’ll work closely with senior engineers to support the reliability, security, and performance of a production system used by Federal customers. This is a hands-on, mid-level opportunity for someone with strong Cloud and Linux fundamentals. You’ll contribute to day-to-day operations while also helping improve the platform’s scalability and resilience over time. Key Responsibilities Cloud & Infrastructure Support and maintain AWS infrastructure (EC2, VPC, IAM, storage, load balancing) Monitor system performance, troubleshoot issues, and improve reliability Assist with infrastructure changes and deployments in a production environment Systems Administration Administer and maintain Linux-based systems (RHEL or similar) Support patching, updates, and system hardening best practices (STIGs) Assist with Windows Server and identity systems, as needed Containers & Platform Work with containerized workloads (Docker, ECS/Fargate, or similar) Support application deployments and infrastructure operations Security & Compliance Follow security best practices and support compliance requirements in a regulated environment Assist with vulnerability remediation and system monitoring Gain exposure to frameworks such as FedRAMP, NIST, and DoD security standards Collaboration Partner with engineering and security teams to support a stable, secure platform Participate in on-call rotation for production support Required Qualifications Experience 3–6 years of experience in cloud infrastructure, systems engineering, and/or DevOps roles Experience working with AWS (Commercial or GovCloud) Experience supporting production systems or applications, preferably a SaaS offering Technical Skills Strong Linux administration skills (RHEL or similar) Familiarity with core AWS services (compute, networking, storage) Basic experience with containers (Docker, ECS, Kubernetes, or similar) Understanding of networking fundamentals (VPCs, subnets, security groups) Exposure to monitoring, logging, or troubleshooting tools Clearance & Compliance Must be eligible to obtain a DoD security clearance US Citizenship required Nice to have experience Experience in regulated environments (FedRAMP, DoD, healthcare, fintech, etc.) Familiarity with security practices (system hardening, vulnerability scanning) Experience with tools such as Splunk, Nessus, or similar Exposure to identity systems (Active Directory, SSO) Experience with databases (e.g., MongoDB or similar) AWS certifications or relevant technical certifications Work Environment This is a fully remote position. Candidates must be able to work standard US business hours and be available for on-call support as required by a production mission-critical environment. All work is performed within a DoD-compliant, FedRAMP High authorized AWS GovCloud environment. Candidates must be comfortable operating within the security constraints and change management processes of a government-facing platform. Experience with databases (e.g., MongoDB or similar) AWS certifications or relevant technical certifications Benefits Medical, Dental and Vision Coverage 401(k) Matching PTO Qualifications

United States
$128K - $145K / year
Full TimeRemoteTeam 11-50

Job DetailsJob Location: Work From Home - McLean, VA 22012Position Type: Full TimeSalary Range: $128,000.00 - $145,000.00 Salary/yearCORAS is a secure, cloud-native SaaS platform that enables government and defense organizations to manage risk, compliance, and security operations in highly regulated environments. We’re looking for a Cloud Infrastructure Engineer to help operate and evolve the infrastructure behind CORAS. In this role, you’ll work closely with senior engineers to support the reliability, security, and performance of a production system used by Federal customers. This is a hands-on, mid-level opportunity for someone with strong Cloud and Linux fundamentals. You’ll contribute to day-to-day operations while also helping improve the platform’s scalability and resilience over time. Key Responsibilities Cloud & Infrastructure Support and maintain AWS infrastructure (EC2, VPC, IAM, storage, load balancing) Monitor system performance, troubleshoot issues, and improve reliability Assist with infrastructure changes and deployments in a production environment Systems Administration Administer and maintain Linux-based systems (RHEL or similar) Support patching, updates, and system hardening best practices (STIGs) Assist with Windows Server and identity systems, as needed Containers & Platform Work with containerized workloads (Docker, ECS/Fargate, or similar) Support application deployments and infrastructure operations Security & Compliance Follow security best practices and support compliance requirements in a regulated environment Assist with vulnerability remediation and system monitoring Gain exposure to frameworks such as FedRAMP, NIST, and DoD security standards Collaboration Partner with engineering and security teams to support a stable, secure platform Participate in on-call rotation for production support Required Qualifications Experience 3–6 years of experience in cloud infrastructure, systems engineering, and/or DevOps roles Experience working with AWS (Commercial or GovCloud) Experience supporting production systems or applications, preferably a SaaS offering Technical Skills Strong Linux administration skills (RHEL or similar) Familiarity with core AWS services (compute, networking, storage) Basic experience with containers (Docker, ECS, Kubernetes, or similar) Understanding of networking fundamentals (VPCs, subnets, security groups) Exposure to monitoring, logging, or troubleshooting tools Clearance & Compliance Must be eligible to obtain a DoD security clearance US Citizenship required Nice to have experience Experience in regulated environments (FedRAMP, DoD, healthcare, fintech, etc.) Familiarity with security practices (system hardening, vulnerability scanning) Experience with tools such as Splunk, Nessus, or similar Exposure to identity systems (Active Directory, SSO) Experience with databases (e.g., MongoDB or similar) AWS certifications or relevant technical certifications Work Environment This is a fully remote position. Candidates must be able to work standard US business hours and be available for on-call support as required by a production mission-critical environment. All work is performed within a DoD-compliant, FedRAMP High authorized AWS GovCloud environment. Candidates must be comfortable operating within the security constraints and change management processes of a government-facing platform. Experience with databases (e.g., MongoDB or similar) AWS certifications or relevant technical certifications Benefits Medical, Dental and Vision Coverage 401(k) Matching PTO Qualifications

United States
$128K - $145K / year

Infrastructure Engineer, Data & Automations

ElevenLabs

ElevenLabs is a young voice AI research and deployment company on a mission to make content universally accessible. Specifically, the company provides a text-to

• Owning the infrastructure underpinning our Data and Automations teams - setting up internal services, building and maintaining ETLs, and connecting systems with one another. • Taking end-to-end ownership of platform reliability and security, with a particular focus on improving security across our internal systems. • Collaborating closely with the Infrastructure team to bridge platform needs with infra capabilities. • Partnering with Growth, Finance and other internal teams to ensure they have the data and tooling they need.

United Kingdom
Job Closed
Full TimeRemoteTeam 11-50Since 2022H1B No Sponsor

• Build and operate production-grade model serving infrastructure using frameworks such as vLLM, TGI, Triton, or equivalent • Design and implement robust deployment pipelines with blue/green and canary rollout strategies for ML models • Develop and maintain auto-scaling systems, multi-model serving architectures, and intelligent request routing layers • Optimize GPU utilization, memory efficiency, network throughput, and model artifact storage performance • Design observability systems for tracking inference latency, throughput, GPU usage, cost metrics, and system health • Manage model registries and CI/CD pipelines enabling automated and reproducible model deployments • Own the full lifecycle of ML systems from development through production, including operational support and on-call responsibilities • Define engineering best practices and contribute to platform scalability in a fast-moving startup environment

Ukraine