Senior Site Reliability Engineer
Location
Europe
Posted
6 days ago
Salary
0
Seniority
Senior
Job Description
Senior Site Reliability Engineer
Trigger.dev
Role Description We're hiring a Senior Site Reliability Engineer to keep Trigger.dev fast, observable and hard to break as we scale. You'll work across our open source codebase and the Cloud product that runs it in production. We're handling hundreds of millions of executions a month on infrastructure we run ourselves, and the next order of magnitude needs someone who thinks in distributed systems and treats observability and security as part of the product, not bolted on later. Day to day you'll be chasing bottlenecks, hardening services like the sandbox runtime that executes untrusted user code, and making the platform legible to the engineers running it at 3am. What you'll be doing - Owning observability across the platform. - Designing and operating the distributed systems primitives we lean on (queues, schedulers, checkpoints, idempotency, backpressure) under real production load. - Architecting and tuning the auto-scaling infrastructure that runs untrusted customer code at high throughput. - Hunting bottlenecks across the stack, from Postgres query plans and Redis hot keys down to kernel, cgroup and network behaviour. - Hardening the security posture of our multi-tenant runtime: sandbox isolation, secrets handling, network policy, supply chain. - Owning Terraform and IaC as the source of truth for our cloud-native footprint, rather than an afterthought. - Working on runtime internals: CPU/RAM snapshotting, cold-start optimization, live migration between hosts, resilient distributed file storage. - Designing and running our on-call practice: runbooks, SLOs, blameless postmortems, paging hygiene. - Making the rest of engineering faster and safer by keeping the platform easy to reason about. - Contributing to architectural decisions and the technical roadmap. Requirements - Strong observability chops. - Production experience with OpenTelemetry, Prometheus or equivalent, and opinions about cardinality, sampling and signal-to-noise. - Distributed systems experience. - Cloud-native fluency. - Self-managed Kubernetes in production, not just clicking around managed control planes. - Performance and scaling debugging instincts. - Terraform fandom. - Security mindset. - Expertise with Postgres and Redis under load. - Experience with Go. - Familiarity with Linux. - Cloud infrastructure experience. AWS strongly preferred, GCP/Azure considered. - OK with being on call and understanding reliability is a shared responsibility for the engineering team. You'll be an amazing fit if you have: - Experience running container orchestration at scale. - Worked with MicroVMs (Firecracker, gVisor) or other sandbox runtimes for executing untrusted code. - A proven track record of contributing to open source projects, especially in the observability or cloud-native ecosystem. - Expertise in Node.js and TypeScript. - Experience with React, or better still, Remix. - Designed SDKs for developers. - Worked at a developer tools company or commercial open source company. - You've previously been a venture-backed startup founder. Benefits - Generous, transparent compensation and equity. - Async working. - Home office support. - Generous vacation policy. - Training budget. - Pension and 401k contributions. Our values - We are proud to be open source. - We ship uncomfortably fast. - Working autonomously. Interview process - Application review. - Screening call. - Hiring manager call. - Paid task day. - Final interview. - References & offer.
Related Guides
Related Categories
Related Job Pages
More Engineer Jobs
Role Description - Definir e executar estratégia de testes para jornadas críticas e integrações. - Implementar e manter suíte de testes UI E2E utilizando Playwright com TypeScript. - Automatizar testes de API REST e validar autenticação, contratos, paginação e versionamento. - Implementar testes de contrato entre Frontend, BFF e APIs. - Integrar testes automatizados aos pipelines CI/CD. - Apoiar definição de critérios de aceite, DoR/DoD e rastreabilidade de testes. - Atuar em conjunto com times de Frontend, Backend, Arquitetura e UX/UI. - Realizar análise de causa raiz, triagem de defeitos e melhoria contínua dos testes e pipelines. - Garantir evidências e relatórios auditáveis de execução de testes. - Documentar padrões e apoiar evolução da automação de testes. Qualifications - Experiência de pelo menos 5 anos em QA de aplicações web complexas. - Domínio de Playwright para automação UI E2E com TypeScript. - Experiência com testes de API REST. - Conhecimento em autenticação OAuth2/OIDC e validação de schemas. - Experiência com testes de contrato (Pact ou equivalente). - Vivência com Git e pipelines CI/CD. - Conhecimento em Docker para execução e troubleshooting de testes. - Experiência com execução cross-browser e testes em CI/headless. - Conhecimento em observabilidade e geração de evidências de testes. - Experiência com Jira e ferramentas de rastreabilidade de testes. - Boa comunicação em português e leitura/escrita em inglês. - Perfil sênior com autonomia e atuação colaborativa. Requirements - Experiência com SAP Commerce Cloud (Hybris) e Spartacus/Composable Storefront. - Vivência em projetos B2B complexos. - Experiência com integrações SAP ECC/S/4HANA. - Conhecimento em testes de acessibilidade e qualidade web. - Experiência com testes de performance utilizando k6 ou JMeter. - Conhecimento em API Gateway (Kong) e IdP (Keycloak). - Vivência com GraphQL. - Experiência com ferramentas como Allure, Zephyr, Xray ou TestRail. Benefits - Oportunidades 100% remotas 👨🏻💻 - Vale home office 💻 - Feedbacks periódicos 💬 - Programa de indicações 🏅 - Acolhimento psicológico 🙋🏻♂️ - Ginástica laboral 🏋️ - Academia de conhecimento 🧠 - Convênio com escola de inglês 🔤 - Reuniões mensais de transparência 🔃 - Happy hour online 🍻 - Kit de boas-vindas 🎁
Telecom Engineer II - Wireless Core
GCI Communication CorpAt GCI, we foster an environment where the unique perspectives of our employees, customers, and fellow Alaskans are celebrated. We add value to our community by nurturing and empowering each member of our workforce, ensuring equal opportunities for every Trailblazer. GCI is an equal opportunity employer. Qualified applicants are considered for employment without regard to race, color, religion, national origin, age, sex, sexual orientation, gender identity, marital status, mental or physical disability, veteran status, or any other status or classification protected under applicable state or federal law.
Role Description GCI's Telecom Engineer II will apply engineering principles across Technology Planning & Engineering to design, implement, optimize, and support telecommunications network architectures that meet industry standards and business needs. Responsible for delivering scalable, reliable, and secure solutions, supporting project execution, maintaining accurate network documentation, and monitoring and optimizing network performance while resolving complex deployment and operational issues. Qualifications - A combination of relevant work experience and/or education sufficient to perform the duties of the job may substitute to meet the total years required on a year-for-year basis. - High School diploma or equivalent. - Bachelor’s degree in Electrical Engineering, Computer Science, Computer Engineering, Telecommunications, or relevant field. - Minimum of four (4) years of progressive engineering experience in information technology, development, and managing moderate to complex technical projects within telecom environments, or related background. - Experience within the telecommunications industry (preferred). - Relevant telecom industry or job specific certifications (preferred). Requirements - Solid understanding of 5G Core architecture (PCC/PCG, AMF, SMF, UPF, UDM, AUSF, PCF, NRF, NSSF, NEF). - Strong working knowledge of 4G EPC (MME, SGW, PGW, HSS) and experience supporting migration strategies to 5G core environments. - Working proficiency of 3G Core architecture (MSC, MediaGateway, SGSN, and GGSN) and familiarity with circuit switch and packet switch topologies. - Experience supporting IMS-based services including VoLTE, VoWiFi, and VoNR, with exposure to service integration and optimization. - Functional knowledge of ancillary telecom systems such as SMSC/MMSC, Prepaid platforms (OCS, IVR, CAMEL), Voicemail/VVM, and RTT/TTY, with experience supporting integrations, monitoring service performance, and troubleshooting service impacts. - Familiarity with network slicing and cloud-native or virtualized core deployments. - Proficiency with core protocols including Diameter, IP, SIP, GTP-C/U, PFCP, SCTP, HTTP/2, and TLS, with the ability to troubleshoot protocol-level issues. - Ability to contribute to end-to-end core network designs and implementations. Benefits - Some travel to remote sites throughout Alaska and to lower 48 States may be required. - Work is primarily sedentary, requiring daily routine computer usage. - Ability to work shifts as assigned, work in standard office/home office setting, and operate standard office equipment. - Must work well in a team environment and be able to work with a diverse group of people and customers. - Virtual workers must comply with remote work policies and agreements. Company Description At GCI, we foster an environment where the unique perspectives of our employees, customers, and fellow Alaskans are celebrated. We add value to our community by nurturing and empowering each member of our workforce, ensuring equal opportunities for every Trailblazer. GCI is an equal opportunity employer. Qualified applicants are considered for employment without regard to race, color, religion, national origin, age, sex, sexual orientation, gender identity, marital status, mental or physical disability, veteran status, or any other status or classification protected under applicable state or federal law.
Sr. ASIC EDA Workflow Engineer
TensordyneTensordyne is a system solution company that specializes in the design of industry-leading high-performance, low-power AI inferencing. Our mission is to enable multimodal Generative AI inference acceleration at scale by providing safe, sustainable, high-performance AI-driven solutions for many markets. We are at the leading edge of advancing the latest research and product improvements for AI inference solutions that will make AI even more advantageous for compelling new applications. Well-funded, fast-paced startup company with headquarters in Sunnyvale, CA, and Munich, Germany. Many talented team members working remotely. Prioritize employees' well-being and their families. Value contributions and offer tailored benefits.
Role Description In this hands-on, technology leadership role, you will lead EDA tool flow management, and associated engineering workflow development for Tensordyne's multimodal generative AI inference acceleration products. As a valued senior member of our ASIC team, you will: - Guide and assist colleagues to improve and invent EDA workflows within a fast-paced, agile HPC development environment. - Drive Tensordyne’s optimization, implementation, and exploration of new EDA tools and technologies for the full ASIC chip design process. - Continuously innovate and improve scalable, reliable, high-performance systems and tools for the next generation of Tensordyne products. - Work closely with ASIC team members engaged in the design and verification of Tensordyne products to understand and improve their workflows and EDA needs. Qualifications - Experience leading the development and support for compilation, build automation, testing, packaging, and installation project generators (CMake, GNU make, Ninja). - Hands-on ASIC engineering experience, including knowledge of VLSI/SoC chip design and verification workflows, with ASIC EDA tool suites from Synopsys and/or Cadence. - Knowledge of Linux system administration and familiarity with cloud-based DevOps, with experience in supporting EDA tools. - Programming and debugging skills with key languages to automate tasks and improve efficiency using scripts. - Prior work experience supporting ASIC engineers with EDA workflows, including installation of new tool versions, FlexLM license management, and debugging/fixing issues with EDA vendors. - Excellent analytical, written, and verbal interpersonal skills, with the ability to collaborate productively within a global engineering team. - Bachelor’s or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or a related technical field. Requirements - Experience with CI/CD and modern Git Branching workflows. Benefits - Comprehensive benefits. - Competitive compensation. - Flexible spending options. - Recognition programs. Company Description Tensordyne is an AI system solution company that builds very high-performance, low-power generative AI inference systems. Our mission is to enable multimodal Generative AI inference acceleration at scale, with safe, sustainable, high-performance systems for our hyperscaler and neocloud data center customers. We are a well-funded, fast-paced startup with headquarters in Sunnyvale, CA, and Munich, Germany, and many talented team members working remotely across North America and Europe.
Context Engineer
CapIntelWe're an investment sales platform for wealth enterprises and professionals. Sign up for free and grow your practice!
• Design and implement LLM-powered features into our core application via model APIs (e.g. Anthropic, OpenAI, Cohere), with a focus on reliability and production-readiness • Architect and maintain retrieval-augmented generation (RAG) pipelines, connecting language models to internal knowledge bases, databases, and live data sources • Manage context window strategy, determining what information enters the model, when, in what format, and at what level of compression to optimise for accuracy, cost, and latency • Design and implement agentic workflows enabling the platform to handle multi-step, autonomous tasks • Build guardrail and output validation layers that constrain model behaviour and ensure AI features act within well-defined, compliant boundaries • Develop reusable agent primitives, prompt templates, and workflow components that other engineers can build on independently • Build evaluation frameworks to measure context effectiveness, output quality, and agent reliability in production • Monitor deployed AI systems for failure patterns and implement mitigation strategies, feeding learnings back into continuous improvement cycles • Collaborate with Product, Product Engineering, Implementation, and Data teams to translate business requirements, and proof of concepts into production AI system specifications • Act as an internal practitioner and resource helping upskill the broader engineering team on context engineering principles and agentic best practices
