Job Closed

This listing is no longer active.

Jensen Hughes

Platform Engineer

Platform EngineerPlatform EngineerOther Remote Mid LevelTeam 1,001-5,000Since 1939H1B SponsorCompany Site LinkedIn

Location

United States

Posted

85 days ago

Salary

$100K - $125K / year

Seniority

Mid Level

AWS Terraform CI/CD GitHub Actions Splunk Observability / Monitoring Linux Python Amazon Bedrock OpenTelemetry Databricks Unity

Job Description

Company Overview Throughout our worldwide network of experts, clients and communities, we are renowned for our leadership in fire protection engineering – a legacy of responsibility we have proudly upheld since 1939. Today, our expertise extends broadly across closely related security and risk-based fields – from accessibility consulting and risk analysis to process safety, forensic investigations, security risk consulting, emergency management, digital innovation and more. Our engineers and consultants collaborate to solve complex safety and security challenges, ensuring our clients can protect what matters most. For over 80 years, we have helped mitigate risks that threaten lives, property and reputations. Through technology, expertise and industry-leading research, we remain dedicated to our purpose of making our world safe, secure and resilient. At Jensen Hughes, we believe that creating and sustaining a culture of trust, integrity and professional growth starts with putting our people first. Our employees are our greatest strength, and we value the unique perspectives and talents they bring to our organization. Our wide range of Global Employee Networks connect people from across the organization, supporting career development and providing forums for individuals to share experiences on topics they're passionate about. Together, we are cultivating a connected culture where everyone has the opportunity to learn, grow and succeed together. Job Overview We’re hiring a Platform Engineer to build and operate our cloud platform on AWS, with Terraform as the infrastructure control plane. You will design and implement Infrastructure CI/CD and PR-driven infrastructure delivery (GitOps principles) for cloud infrastructure and platform configuration (Git as source of truth, automated checks, controlled applies, strong auditability). You’ll also own platform-grade observability (Splunk preferred) and help enable secure, production-ready agentic AI capabilities using Amazon Bedrock and Bedrock AgentCore, partnering with application teams to establish reliable patterns and guardrails. Responsibilities: AWS platform engineering (multi-account) - Design, build, and operate secure, reliable AWS foundations across a multi-account AWS environment (AWS Organizations / Control Tower where applicable), including networking, IAM, KMS, secrets, tagging, and shared services - Establish scalable patterns for compute, storage, and networking; enable repeatable environments across dev/stage/prod - Improve developer experience through standards, templates, and clear platform documentation Terraform (deep expertise) - Own Terraform architecture end-to-end: module strategy, state design, environment separation, provider/version management. - Build and maintain a production-grade Terraform SDLC: - PR-driven workflows with plan previews, approvals, and promotion across environments - Controlled apply mechanisms with audit trails and rollback plans - Drift detection and safe reconciliation strategy • Import/migration/refactor patterns without downtime - Implement baseline guardrails (tagging, encryption, access controls) as code wherever feasible Infrastructure CI/CD + PR-driven infrastructure delivery (GitOps principles) - Implement PR-driven infrastructure delivery using GitOps principles (not Kubernetes-only): - Git as the source of truth; PRs as change requests - Automated validation/testing/security checks on every change - Safe promotion model (dev → stage → prod) with appropriate gates - Controlled applies for production (approval gates / break-glass procedures), with full traceability - Standardize pipelines in the team’s primary CI/CD platform (GitHub Actions) and integrate with existing enterprise tooling where needed - Establish repo structure, branching strategy, and operational runbooks for the infrastructure delivery workflow Observability (Splunk preferred) - Own the Splunk observability operating model: dashboards, alerting standards, SLOs/SLIs, runbooks, and on-call readiness - Build/operate telemetry pipelines for reliability and cost efficiency (noise reduction, sampling/cardinality strategies, retention and routing). - Partner with application teams to improve visibility, reduce MTTR, and drive incident learnings into platform improvements Agentic AI enablement (Amazon Bedrock + AgentCore) - Partner with engineering teams to enable agentic AI use cases using Amazon Bedrock and AgentCore (tool integration patterns, secure operation, production readiness) - Help establish foundational patterns for agent deployment and operations (environments, permissions, observability, and evaluation/reliability practices) aligned to enterprise controls Operational excellence - Participate in incident response; lead postmortems and drive systemic, preventive fixes - Measure and improve platform reliability, security posture, and cost efficiency over time Requirements (must have): - 8–10 years of experience in Platform Engineering / SRE / DevOps (or equivalent experience delivering platform outcomes)AWS expertise, including multi-account patterns (AWS Organizations / Control Tower preferred), networking, IAM/security, and operations - Terraform expert with proven ownership of org-scale infrastructure-as-code (modules, state, CI controls, large refactors) - Proven experience designing Infrastructure CI/CD and PR-driven infrastructure delivery (GitOps principles) for Terraform and cloud configuration: - PR-based automation with plan previews and security/policy checks - Controlled apply processes with approvals and auditability - Environment promotion patterns and rollback strategies - Strong production experience with observability platforms such as Splunk, Datadog, Grafana, or Dynatrace, including building and operating dashboards, alerting standards, and telemetry pipelines (logs/metrics/traces) in production - Strong Linux and troubleshooting skills; proficiency in automation (Python or Go preferred) Preferred Qualifications: - Experience building agentic AI solutions using Amazon Bedrock Agents and/or Amazon Bedrock AgentCore (deployment/operations, tool integration patterns) - OpenTelemetry at scale (standards, collectors/gateways, sampling, correlation across logs/metrics/traces) - Policy-as-code experience (Conftest/Sentinel or similar) applied to Terraform and platform guardrails - Experience building an Internal Developer Platform (IDP) / self-service workflows (golden paths, templates, paved roads). - Databricks on AWS platform support (workspace/cluster policies, reliability, cost controls; Unity Catalog familiarity a plus) Please note that the salary range provided is a good faith estimate for the position at the time of posting and not a guarantee of compensation. Final compensation may vary based on factors, including but not limited to, responsibilities of the job, education, experience, knowledge, skills, and abilities, geographic location, internal equity, alignment with market data. Jensen Hughes offers a competitive total rewards package, which includes a retirement plan, healthcare coverage, and a broad range of other benefits. Incentives and/or benefit packages may vary depending on the position and location. National Pay Range $100,000—$125,000 USD Jensen Hughes is an Equal Opportunity Employer. Qualified candidates will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. At Jensen Hughes, we embrace innovation and understand that people are increasingly using artificial intelligence (AI) tools like ChatGPT and other generative platforms to learn, prepare and communicate. We have provided some guidelines regarding the responsible use of AI in the recruitment process. Please click here to review. The security of your personal data is important to us. Jensen Hughes has implemented reasonable physical, technical, and administrative security standards to protect personal data from loss, misuse, alteration, or destruction. We protect your personal data against unauthorized access, use, or disclosure, using security technologies and procedures, such as encryption and limited access. Only authorized individuals may access your personal data for the purpose for which it was collected, and these individuals receive training about the importance of protecting personal data. Jensen Hughes is committed to compliance with all relevant data privacy laws in all areas where we do business, including, but not limited to, the GDPR and the CCPA. Additionally, our service providers are contractually bound to maintain the confidentiality of personal data and may not use the information for any unauthorized purpose. *Policy on use of 3rd party recruiting agency for direct placements Jensen Hughes will occasionally augment a recruiting search through agencies for certain positions when business conditions warrant. Jensen Hughes will not accept resumes, inquiries or proposals from recruiting agencies as an acceptable method to consider a candidate. 3rd party recruiting agencies must sign a standard Jensen Hughes agreement after being evaluated and accepted by a Human Resources or Talent Acquisition manager, or member of the talent acquisition team. Hiring managers and employees of Jensen Hughes are not authorized to accept resumes, engage in fee-based searches through recruiting firms or sign a search agreement. Please note this policy does not apply to “staffing firms” or firms that are involved with hiring temporary staff. Any recruiting agency interested in being considered may contact our recruiting team at jensenhughesrecruiting.com.

Related Categories

Platform Engineer

Related Job Pages

Remote Python Jobs (US)More Remote Jobs

More Platform Engineer Jobs

Platform Engineer

Jensen Hughes

Platform Engineer85 days ago

Full Time RemoteTeam 1,001-5,000Since 1939H1B Sponsor

Company Site LinkedIn

• Build and operate cloud platform on AWS using Terraform • Design and implement Infrastructure CI/CD and PR-driven infrastructure delivery • Own platform-grade observability • Enable secure, production-ready agentic AI capabilities

AWS Grafana Linux Python Splunk Terraform

View details: Platform Engineer

United States

$100K - $125K / year

Apply

Job Closed

Platform Engineer – Kubernetes, GitOps, AI interest

Cloudiax

Global Business Cloud provider for SAP B1, Cloud Infrastructure, AI Server & more - made in Germany, available worldwide

Platform Engineer85 days ago

Full Time RemoteTeam 11-50Since 2002H1B No Sponsor

Company Site LinkedIn

• You act as the interface between development and cloud administration • Your responsibility is to design our Kubernetes platform and deployment processes so developers can work efficiently, reproducibly, and with standardized workflows • You work closely with development, infrastructure, and security teams to provide a reliable platform • Enhance and operate our Kubernetes platform in on‑premise data centers, ensuring stability, scalability, and performance • Build and optimize self‑service processes that enable developers to perform efficient, standardized deployments • Optimize resource usage (CPU, RAM, storage, GPU) and develop fair workload strategies • Operate and evolve our PostgreSQL platform, including high availability, backup/recovery, and performance tuning • Automate infrastructure and deployment tasks using Terraform, Ansible, or similar tools • Build and run modern CI/CD and GitOps processes (e.g., Argo CD, Flux) • Define platform standards, templates, and best practices to ensure a consistent developer experience • Integrate and operate authentication and authorization systems (Keycloak) as well as API gateways (Kong) • Ensure multi‑tenancy, isolation, and compliance with security requirements • Implement and operate monitoring, logging, and tracing solutions (Prometheus, Grafana, OpenTelemetry) • Develop intelligent scaling mechanisms (e.g., HPA, custom metrics) beyond classical CPU/RAM metrics • Support development teams in using the platform and continuously improve the developer experience • Serve as the interface between development and cloud administration: gather requirements, improve workflows, and provide feedback • Bring your interest in AI topics and explore automation and orchestration opportunities

Ansible Flux Grafana Kubernetes PostgreSQL Prometheus Terraform

View details: Platform Engineer – Kubernetes, GitOps, AI interest

Germany

Apply

Job Closed

Senior Platform Engineer, CVML

Blue River Technology

Our mission is to create intelligent machinery that solves monumental challenges for our customers.

Platform Engineer85 days ago

Full Time RemoteTeam 201-500Since 2011H1B Sponsor

Company Site LinkedIn

• Design, build, and evolve platform capabilities that support ML training, batch inference, and model deployment workflows at scale. • Own and improve core platform components (e.g., compute orchestration, data pipelines, inference systems) used by multiple teams across Blue River and John Deere. • Continuously enhance platform reliability, scalability, and performance, with a focus on real-world ML workloads. • Enable ML engineers to move faster by building intuitive, well-documented platform tools and workflows across the model lifecycle (experimentation, deployment, and iteration). • Improve model inference performance and throughput while balancing trade-offs among cost, latency, and reliability. • Support and scale distributed training and inference systems, including frameworks such as Ray and related tooling. • Develop and optimize hybrid compute environments (cloud + on-prem/GPU infrastructure) to support large-scale ML workloads. • Build and maintain infrastructure leveraging Kubernetes, Slurm, and cloud platforms (AWS preferred). • Identify and resolve bottlenecks in compute, storage, and data movement pipelines. • Evaluate existing platform systems and make thoughtful decisions on when to extend, refactor, or rebuild components. • Drive improvements in system architecture, balancing short-term delivery with long-term platform health. • Contribute to shaping the platform roadmap and technical direction in response to evolving business and ML needs. • Partner closely with ML engineers, robotics teams, infrastructure teams, and product stakeholders to translate requirements into scalable platform solutions. • Act as a technical bridge between teams, ensuring platform capabilities align with real-world use cases and constraints. • Influence platform adoption and best practices across multiple teams. • Support platform capabilities that enable simulation-based testing and validation of ML systems, including synthetic data workflows. • Improve tooling that allows teams to test and validate models before production deployment. • Provide technical guidance and mentorship to junior engineers on platform and systems design. • Lead implementation efforts for key platform initiatives and ensure high-quality execution. • Demonstrate strong ownership and accountability for delivering impactful platform improvements.

AWS Kubernetes Python Ray

View details: Senior Platform Engineer, CVML

United States

$160K - $287K / year

Apply

Job Closed

Sr Platform Engineer, CVML

Blue River Technology

Our mission is to create intelligent machinery that solves monumental challenges for our customers.

Platform Engineer85 days ago

Other RemoteTeam 201-500Since 2011H1B Sponsor

Company Site LinkedIn

We’re Blue River, a team of innovators driven to create intelligent machinery that solves monumental problems for our customers. We empower our customers – farmers, construction crews, and foresters - to implement safer and more sustainable solutions, driving increased profitability with less reliance on scarce labor. We believe that focusing on the small stuff – pixel-by-pixel and task-by-task - leads to big gains. Blue River Technology aligns with John Deere’s vision to “innovate on behalf of humanity” by quickly identifying and solving high-value, high-uncertainty challenges in AI, machine learning, computer vision, and robotics. BRT acts as a research and development flywheel, building not only new products but also new platforms that reliably create value for both Deere and its customers. From fully autonomous machines to highly precise farming equipment, BRT and Deere are partnering to create technical breakthroughs in industries like agriculture and construction. Summary We are seeking a Senior CVML Platform Engineer to help design, build, and evolve the platforms that support computer vision and ML workloads at scale. This role focuses on enabling ML teams through well-designed infrastructure, tooling, and workflows, rather than developing models or conducting ML research. The ideal candidate brings strong technical judgment, is comfortable navigating existing and evolving platforms, and can incrementally improve systems while maintaining reliability. We strongly prefer engineers with a DevOps or platform engineering background who have moved into ML-adjacent systems and are motivated by building durable foundations that other teams rely on. This role requires both hands-on engineering and the ability to influence platform direction through collaboration and thoughtful design. - Employment Type: Full-Time - Work Location: Remote in the United States - Visa sponsorship will be considered on a case-by-case basis. Job Responsibilities A combination, not necessarily all-inclusive, of the following: - Platform Development & Ownership - Design, build, and evolve platform capabilities that support ML training, batch inference, and model deployment workflows at scale. - Own and improve core platform components (e.g., compute orchestration, data pipelines, inference systems) used by multiple teams across Blue River and John Deere. - Continuously enhance platform reliability, scalability, and performance, with a focus on real-world ML workloads. - ML Systems Enablement - Enable ML engineers to move faster by building intuitive, well-documented platform tools and workflows across the model lifecycle (experimentation, deployment, and iteration). - Improve model inference performance and throughput while balancing trade-offs among cost, latency, and reliability. - Support and scale distributed training and inference systems, including frameworks such as Ray and related tooling. - Infrastructure & Compute Systems - Develop and optimize hybrid compute environments (cloud + on-prem/GPU infrastructure) to support large-scale ML workloads. - Build and maintain infrastructure leveraging Kubernetes, Slurm, and cloud platforms (AWS preferred). - Identify and resolve bottlenecks in compute, storage, and data movement pipelines. - System Evolution & Technical Judgment - Evaluate existing platform systems and make thoughtful decisions on when to extend, refactor, or rebuild components. - Drive improvements in system architecture, balancing short-term delivery with long-term platform health. - Contribute to shaping the platform roadmap and technical direction in response to evolving business and ML needs. - Cross-Functional Collaboration - Partner closely with ML engineers, robotics teams, infrastructure teams, and product stakeholders to translate requirements into scalable platform solutions. - Act as a technical bridge between teams, ensuring platform capabilities align with real-world use cases and constraints. - Influence platform adoption and best practices across multiple teams. - Testing, Simulation & Validation - Support platform capabilities that enable simulation-based testing and validation of ML systems, including synthetic data workflows. - Improve tooling that allows teams to test and validate models before production deployment. - Technical Leadership - Provide technical guidance and mentorship to junior engineers on platform and systems design. - Lead implementation efforts for key platform initiatives and ensure high-quality execution. - Demonstrate strong ownership and accountability for delivering impactful platform improvements. Required Experience and Skills - 5+ years of professional engineering experience, with a focus on platform, infrastructure, or systems engineering. - Strong technical judgment, balancing the evolution of legacy platforms with the design and delivery of new, greenfield components shared across multiple teams and workloads. - Excellent Python skills, used in production systems, tooling, and platform components. - Solid understanding of ML systems and the end-to-end model development lifecycle, from experimentation to deployment and iteration. - Hands-on experience or strong familiarity with cloud platforms (AWS preferred) and container orchestration systems such as Kubernetes and Slurm. - Ability to partner effectively with ML engineers, infra teams, and product stakeholders to translate requirements into platform capabilities. - Ability to quickly ramp up on new domains, tools, and complex existing systems. Preferred Experience and Skills - Golang experience, particularly for platform or infrastructure components. - Experience building or integrating ML pipelines using tools such as Kubeflow and/or Airflow. - Understanding of model inference architectures, including performance, scalability, reliability, and cost considerations. - Experience enabling distributed training and inference through platforms and frameworks such as Ray. - Experience supporting ML systems in computer vision or robotics environments. Only individual applicants will be considered. We do not work with unsolicited third-party agencies or proxy interview services. At Blue River, we’re passionate about creating an inclusive workplace that promotes and values diversity. While we have more work to do to advance diversity and inclusion, we’re investing in our programs, including recruiting, mentorship, career development, and learning & development to ensure they support our Diversity, Equity, and Inclusion goals. We support each employee in living a full life, enabling a thriving career, and accomplishing a meaningful, challenging mission while collaborating with incredible people. We are dedicated to building a diverse and inclusive workplace, so if you’re excited about this role but your experience doesn’t align completely with the job description, we encourage you to apply anyway. We are an equal-opportunity employer and do not discriminate based on race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, perform essential job functions, and receive other benefits and privileges of employment. Please contact us to request an accommodation. The US annual base salary range for this position is $160,000 - $287,000, along with eligibility for Blue River’s bonus and benefit programs. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your location during the hiring process. During the recruitment process, we may identify an alternative role or level to which you are more suited. If your ideal role at Blue River differs from the advertised position, we will provide an updated pay range as soon as possible during the hiring process. #LI-AN1

Python AWS Kubernetes Ray Kubeflow Airflow

View details: Sr Platform Engineer, CVML

United States

$160K - $287K / year

Apply