Primer

Powerful no-code automation for payments and commerce.

Senior DevEx Engineer – Infrastructure

EngineerEngineerFull Time Remote SeniorTeam 51-200H1B SponsorCompany Site LinkedIn

Location

Poland

Posted

10 days ago

Salary

Seniority

Senior

Bachelor DegreeEnglishAnsible AWS Cloud Google Cloud Platform Kubernetes Terraform

Job Description

• Own Primer's internal developer platform end to end: the CI/CD pipelines, deployment workflows (canary, blue/green), self-service tooling and ephemeral environments that every engineer here depends on to ship. • Build the human-AI development loop. This is the most open part of the role: the tooling and automation that let engineers work effectively alongside coding agents, and the patterns that make agent-driven workflows fast and safe at Primer. • Treat developer productivity as a measurable system. Instrument it, find where delivery actually slows down, and ship the changes that fix it instead of guessing. • Take real operational ownership. You'll join the Core Infrastructure on-call rotation and own the reliability of what you build. • Set the technical direction for how Primer's engineers build and ship, and bring less experienced engineers along with you as you do it. • Improve how engineers write code, not just how they ship it. Partner with product teams to make the tools, frameworks and internal APIs they use everyday more ergonomic, and help reduce the friction of building features at Primer. • Build Primers paved roads. Reduce the application boilerplate engineers write, shape standards around how services are built and configured, and own the golden-path tooling that lifts efficiency across every team. • Work in the open across a fully distributed team, making your decisions and trade-offs visible rather than holding them in your head.

Job Requirements

Strong cloud infrastructure experience. AWS is preferred; we'll also consider strong GCP backgrounds.
Hands-on Kubernetes experience.
Infrastructure-as-code as a working tool, not a buzzword. Terraform is preferred, but Pulumi, CloudFormation, Ansible or similar are fine. Not having Terraform specifically is not a blocker.
CI/CD as a core competency. You've owned the build, test and deploy workflow and preview environments, not just used them.
A history of measuring and improving delivery, using DORA, SPACE or a similar framework to back the work up.
Comfortable owning production systems and being part of an on-call rotation.
A genuine software engineering background. You've built and maintained applications or products, not just infrastructure tooling.
You can reason about what makes code pleasant to work with. This is what lets you build AI and developer tooling that engineers actually want to use.
An async-first communicator who works well in a remote, distributed team.
Nice to have:
Hands-on experience building with AI or agent tooling, or working directly with LLM-based development workflows. This field is new, so real curiosity counts alongside real experience.
Familiarity with MCP servers and agent integration patterns.
Payments, fintech or other regulated-industry experience.

Benefits

We are fully remote and globally distributed; and have been since day one
Competitive share options
Uncapped holiday, with 25 days minimum to be taken
Co-working space access
Workations & Company Retreat
The best equipment for your role
£500 towards your home office setup
Generous learning budget
Private Medical Insurance
A broad set of additional perks and benefits (*depending on location)

Related Categories

Engineer

Related Job Pages

Remote Full-time Jobs (US)More Remote Jobs

More Engineer Jobs

Model Serving Engineer

Bright Vision Technologies

Engineer10 days ago

Full Time Remote

Company Site

Role Description We are seeking a Model Serving Engineer to design, build, and operate high-performance, highly reliable inference platforms for serving large machine learning models in production. The role focuses on the systems engineering side of AI deployment, including: - Request routing - Batching - Caching - Autoscaling - GPU utilization - End-to-end observability across diverse model workloads The ideal candidate brings strong distributed systems and performance engineering expertise, has shipped serving systems at scale, and understands the trade-offs between latency, throughput, cost, and quality in ML serving. Qualifications - Bachelor’s or Master’s degree in Computer Science or a related field - Six or more years of experience in distributed systems, infrastructure, or ML platform engineering - Strong proficiency in Python and a systems language such as Go, Rust, or C++ - Deep experience operating high-throughput, low-latency services in production - Hands-on experience with LLM or large model inference frameworks such as vLLM or TensorRT-LLM - Strong understanding of GPU architecture, memory hierarchies, and accelerator utilization - Familiarity with Kubernetes, autoscaling, and modern cloud platforms - Experience with observability stacks including metrics, tracing, and structured logging - Solid grounding in performance engineering and capacity planning - Strong communication and incident response skills Requirements - Design and operate model serving platforms supporting diverse workloads including LLMs, vision models, and recommendation systems - Optimize inference performance using continuous batching, paged attention, speculative decoding, and request multiplexing - Implement multi-tenant routing, rate limiting, and quality-of-service policies across model endpoints - Build autoscaling and capacity management systems that balance latency, throughput, and cost - Tune GPU utilization, memory management, and KV cache strategies for LLM serving workloads - Integrate model serving with API gateways, identity systems, and observability platforms - Implement caching, prompt deduplication, and response reuse strategies where appropriate - Drive end-to-end observability including latency histograms, queue dynamics, GPU utilization, and error tracking - Develop deployment workflows including canary releases, shadow testing, and automated rollback - Operate incident response for high-availability AI services and drive durable reliability improvements - Collaborate with ML and product teams to support new model releases and capability rollouts - Implement security controls including request signing, content filtering, and abuse detection at the serving layer - Document operational procedures, performance characteristics, and tuning guidance for internal teams - Stay current with AI serving research and translate advances into production capabilities Benefits - Competitive base salary commensurate with experience - Benefits package How to Apply For immediate consideration, please send your resume to [email protected] or contact us at (908) 676-4399. Learn more about Bright Vision Technologies at www.bvteck.com .

AI/ML AI Observability/Monitoring Distributed Systems Python Rust C++LLM Kubernetes

View details: Model Serving Engineer

United States

100K - 150K / year

Apply

Job Closed

Demo Engineer

Okta

The World's Identity Company

Engineer10 days ago

Full Time RemoteTeam 5,001-10,000Since 2010H1B Sponsor

Company Site LinkedIn

Role Description Demo Engineering sits at the intersection of go-to-market and product, serving as a global accelerator for Okta's technical sales success. Our team provides our internal and partner technical sellers worldwide with the capabilities to quickly and securely showcase how Okta secures identity for employees, customers, and AI agents. As a member of this team, you'll have a global impact on Okta’s go-to-market technology strategy: the demo components and customer experiences you build will be used by hundreds of Solution Engineers across dozens of countries, reaching thousands of customers and prospects. You'll work in a highly collaborative, engineering-driven culture where we prioritize reusability, automation, and self-service enablement. We're building the platform and tooling that makes Okta's field organization more effective, scalable, and customer-focused. What you get to do in this role: - Design and build reusable demonstration assets that encapsulate product configurations used across multiple demonstrations for solution-oriented outcomes. - Work directly with field teams to understand customer perspectives to capture their expectations, preferences, and aversions in a “Voice of the Customer.” - Work with Field Readiness team to drive adoption of the Demo Platform within the go-to-market organization. - Build analytics and reporting to track demo usage and effectiveness. - Collaborate with Product Managers and Engineering to prepare assets to support demonstrations of new product introductions. - Participate in the release planning process to influence the product direction based on customer feedback. - Create supporting documents for demos like technical demonstration guides and video examples. Qualifications - Bachelor’s degree or equivalent experience. - 5+ years of developer experience with Enterprise SaaS products. - Full Stack development experience: React.js, Node.js, AWS Serverless patterns (Lambda, DynamoDB, SQS and SNS). - Experience designing and building for multi-tenancy and tenant isolation. - Experience with infrastructure-as-code or configuration management tools. - API integration experience for connecting applications to identity platforms. - Have working technical knowledge of digital identity and authentication (OAuth, OIDC & SAML). Requirements - Global first: comfortable working in and supporting a globally distributed team. - Self-directed and proactive: identifies opportunities to improve and drives solutions without waiting for direction. - Collaborative: thrives in cross-functional environments working with field teams, product, and engineering. - Systems thinker: balances immediate needs with long-term scalability and reusability. - Customer-centric approach: ability to gather and synthesize "Voice of Customer" feedback into actionable requirements. Nice to Have - Experience with demo engineering teams or technical marketing. - Experience with presales. - Experience with Okta or Auth0. Benefits - The OTE range for this position for candidates located in the San Francisco Bay area is between $179,000 — $246,000 USD. - The annual OTE range for this position for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York, and Washington is between $160,000 — $220,000 USD. - Okta offers equity (where applicable) and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies.

Okta AI Agents React Node.js AWS Amazon Lambda DynamoDB Amazon SQS Amazon SNS OAuth OpenID Connect SAML Auth0

View details: Demo Engineer

United States

$160K - $246K / year

Apply

Job Closed

Observability Engineer

Bright Vision Technologies

Engineer10 days ago

Full Time Remote

Company Site

Role Description We are looking for an Observability Engineer to design and operate the metrics, logging, tracing, and alerting platforms that give engineering teams confidence in the systems they run. The role spans the full observability stack — from collection agents and pipelines to long-term storage, dashboards, and alerting workflows — with a strong focus on usability, signal quality, and operational ROI. The ideal candidate has built and operated observability platforms at scale, understands the trade-offs between open-source and SaaS approaches, and can translate noisy telemetry into actionable insight for both engineers and business stakeholders. Key Responsibilities - Design and operate enterprise-grade observability platforms covering metrics, logs, traces, events, and synthetic monitoring. - Architect Prometheus / Thanos / Mimir, Grafana, Loki, Tempo, OpenTelemetry, and Datadog deployments for high availability and scale. - Develop standards for service instrumentation, including OpenTelemetry adoption, metric naming, label cardinality, and structured logging conventions. - Define and enforce SLOs, SLIs, and error budgets, and build the dashboards and alerts that operationalize them. - Build alerting strategies that minimize noise, surface actionable signals, and integrate cleanly with on-call workflows in PagerDuty, Opsgenie, or similar tools. - Operate large-scale time-series and log storage platforms, balancing retention, query performance, and cost. - Design distributed tracing pipelines and help teams use traces to diagnose latency and reliability issues. - Develop self-service tooling, paved-road libraries, and templates that make adoption of observability standards easy for product teams. - Drive cost management and label-cardinality discipline across the observability estate. - Lead incident response readiness improvements through better dashboards, alerting hygiene, and post-incident analysis tooling. - Partner with SRE and platform teams to integrate observability into deployment pipelines, canary analysis, and progressive delivery workflows. - Evaluate and recommend observability vendors and open-source tools based on cost, capability, and operational maturity. - Mentor engineering teams on observability fundamentals, debugging techniques, and SLO-driven operations. - Maintain documentation, onboarding guides, and runbooks for the observability platform. Qualifications - Bachelor’s degree in Computer Science or a related field. - Five or more years of experience in SRE, platform engineering, or observability roles. - Deep hands-on experience with Prometheus, Grafana, and at least one major commercial observability platform such as Datadog, New Relic, or Splunk. - Strong understanding of OpenTelemetry, distributed tracing, and structured logging. - Proficiency in at least one general-purpose language such as Go, Python, or Java. - Experience operating high-cardinality, high-throughput metrics and log pipelines. - Strong understanding of SLOs, error budgets, and SRE principles. - Experience integrating observability with CI/CD and incident management tooling. - Solid grasp of Linux internals, networking, and container platforms. - Excellent communication and collaboration skills. Preferred Qualifications - Experience with Thanos, Mimir, Cortex, Loki, or Tempo at scale. - Contributions to OpenTelemetry or observability open-source projects. - Familiarity with eBPF-based observability tooling. - Experience driving observability cost optimization initiatives. - Exposure to regulated environments with audit-grade logging requirements. How to Apply Would you like to know more about this opportunity? For immediate consideration, please send your resume to [email protected] . Learn more about Bright Vision Technologies at www.bvteck.com .

Observability/Monitoring Prometheus Thanos Grafana OpenTelemetry Datadog New Relic Splunk Python Java CI/CD Linux

View details: Observability Engineer

United States

$100K - $150K / year

Apply

Job Closed

Team Leader – Fire Damper Engineer

INDEPTH HYGIENE SERVICES LIMITED

Engineer10 days ago

Full Time RemoteTeam ,

Company Site LinkedIn

• Lead a two-person fire damper and grease extraction team in Reading/Slough • Carry out fire damper inspections and drop testing • Clean commercial kitchen extract systems to TR19 Grease standards • Identify and report defects • Ensure all works are completed safely and to a high professional standard • Manage team productivity and maintain health & safety • Complete accurate reports with photographic evidence

View details: Team Leader – Fire Damper Engineer

United Kingdom

£32.5K / year

Apply

Senior DevEx Engineer – Infrastructure

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More Engineer Jobs

Model Serving Engineer

Demo Engineer

Observability Engineer

Team Leader – Fire Damper Engineer