Job Closed
This listing is no longer active.
Modern Data Workspace ✨ | A Leader in The Forrester Wave™️ | Follow for resources, blogs, and more from the data world.
Staff Software Engineer
Location
Worldwide
Posted
73 days ago
Salary
0
Seniority
Lead
Job Description
Staff Software Engineer
Atlan
Role Description As an engineer on the Foundation Platform team, you will be responsible for the production infrastructure that powers Atlan’s context layer for AI across AWS, Azure, and GCP, using AI-assisted development tools (such as Claude Code and Cursor) as a natural part of your daily workflow. - Own and evolve our multi-tenant infrastructure on Kubernetes, including dedicated clusters per customer and the full tenant lifecycle (provisioning, scaling, migration, and offboarding). - Make our GitOps deployments faster and safer by improving the ArgoCD and Helm based pipeline that deploys 1,000+ applications across hundreds of tenants. - Replace manual infrastructure runbooks (for example, Kubernetes upgrades, Private Link setups, DR drills, cluster onboarding) with reliable automation using Infrastructure-as-Code and workflow engines. - Strengthen observability and efficiency by improving our logging, metrics, and alerting stack and using it to drive better reliability, visibility, and meaningful cloud cost reduction. - Lead customer-facing infrastructure work and incidents end to end, and turn what you learn into clear runbooks, dashboards, and Claude Skills that help both humans and AI agents operate the platform. Qualifications - 7+ years in platform engineering, infrastructure, SRE, or backend systems at a SaaS company, with high ownership, strong written/async communication, and enthusiasm for AI-native development tools. - Deep hands-on experience operating Kubernetes in production: managing clusters, upgrades, networking and RBAC, and multi-tenant concerns, not just deploying apps. - Strong GitOps and Helm experience (for example ArgoCD or similar) at meaningful scale, including dealing with sync failures, drift, chart complexity, and improving deployment safety and speed. - Production-quality infrastructure automation skills in Go or Python; familiarity with TypeScript is a plus. - Solid cloud and Infrastructure-as-Code foundation: deep experience with at least one major cloud (AWS, GCP, or Azure), and having designed, written, and reviewed substantial Terraform or Crossplane modules. - Comfort debugging end to end across GitOps pipelines, Kubernetes, and cloud provider layers when deployments or tenants are stuck. - Experience with multi-tenant SaaS infrastructure, observability stacks (logs, metrics, traces, dashboards), and practical cloud cost optimisation (for example autoscaling, instance strategy, or savings mechanisms), ideally with exposure to workflow engines such as Temporal or internal self-service / developer platform tooling. Requirements - Has led at least one major, cross-cutting infrastructure initiative end to end, such as a deployment pipeline overhaul, Kubernetes upgrade framework, multi-tenant migration, DR architecture, or cost programme, and can explain the system-level impact across reliability, cost, and developer experience. - Can walk through a Kubernetes and GitOps platform they have built or significantly evolved, and how they improved deployment safety, speed, and operability for other teams. - Has clear examples of turning manual runbooks into automation for workflows like upgrades, DR, or networking, and making these safe, repeatable, and well-documented. - Uses observability and cost signals together to drive better reliability and meaningful cloud savings, and is confident owning customer-facing infrastructure and incidents end to end. - Acts as a technical multiplier through thoughtful design docs, reviews, documentation, and tools or Claude Skills that reduce “how do I do X?” questions for the whole team. Benefits - Competitive Compensation: We benchmark at the top of the market and keep compensation simple: strong base salary, performance‑based variable pay, and impact‑driven equity (for most roles), so your total rewards grow in step with the value you create over time. - AI Native Culture: Atlan is where AI-native builders come to build the systems the future of work will run on. AI isn’t an add-on, it’s woven into how we build, think, and work every day, empowering every Atlanian to move faster and create a bigger impact. - Health & Wellness: From Day‑1 health, dental, vision, and mental health to flexible health stipends, we design benefits offerings that lead in each country we're in. - Flexible Time Off & Leave Policies: We trust you to own your energy: flexible time off and modern leave so you can unplug properly, support yourself and your loved ones, and come back ready to drive an impact. - Accelerated Growth & Learning: Develop at an uncommon velocity through cutting-edge tech, complex implementations, and an experienced team that values mastery. - Global, Remote-First, High-Trust: Work from anywhere with a diverse team across 15+ countries, in a trust-first, async environment that gives you true flexibility and ownership over how you work.
Job Requirements
- 7+ years in platform engineering, infrastructure, SRE, or backend systems at a SaaS company, with high ownership, strong written/async communication, and enthusiasm for AI-native development tools.
- Deep hands-on experience operating Kubernetes in production: managing clusters, upgrades, networking and RBAC, and multi-tenant concerns, not just deploying apps.
- Strong GitOps and Helm experience (for example ArgoCD or similar) at meaningful scale, including dealing with sync failures, drift, chart complexity, and improving deployment safety and speed.
- Production-quality infrastructure automation skills in Go or Python; familiarity with TypeScript is a plus.
- Solid cloud and Infrastructure-as-Code foundation: deep experience with at least one major cloud (AWS, GCP, or Azure), and having designed, written, and reviewed substantial Terraform or Crossplane modules.
- Comfort debugging end to end across GitOps pipelines, Kubernetes, and cloud provider layers when deployments or tenants are stuck.
- Experience with multi-tenant SaaS infrastructure, observability stacks (logs, metrics, traces, dashboards), and practical cloud cost optimisation (for example autoscaling, instance strategy, or savings mechanisms), ideally with exposure to workflow engines such as Temporal or internal self-service / developer platform tooling.
- Has led at least one major, cross-cutting infrastructure initiative end to end, such as a deployment pipeline overhaul, Kubernetes upgrade framework, multi-tenant migration, DR architecture, or cost programme, and can explain the system-level impact across reliability, cost, and developer experience.
- Can walk through a Kubernetes and GitOps platform they have built or significantly evolved, and how they improved deployment safety, speed, and operability for other teams.
- Has clear examples of turning manual runbooks into automation for workflows like upgrades, DR, or networking, and making these safe, repeatable, and well-documented.
- Uses observability and cost signals together to drive better reliability and meaningful cloud savings, and is confident owning customer-facing infrastructure and incidents end to end.
- Acts as a technical multiplier through thoughtful design docs, reviews, documentation, and tools or Claude Skills that reduce “how do I do X?” questions for the whole team.
Benefits
- Competitive Compensation: We benchmark at the top of the market and keep compensation simple: strong base salary, performance‑based variable pay, and impact‑driven equity (for most roles), so your total rewards grow in step with the value you create over time.
- AI Native Culture: Atlan is where AI-native builders come to build the systems the future of work will run on. AI isn’t an add-on, it’s woven into how we build, think, and work every day, empowering every Atlanian to move faster and create a bigger impact.
- Health & Wellness: From Day‑1 health, dental, vision, and mental health to flexible health stipends, we design benefits offerings that lead in each country we're in.
- Flexible Time Off & Leave Policies: We trust you to own your energy: flexible time off and modern leave so you can unplug properly, support yourself and your loved ones, and come back ready to drive an impact.
- Accelerated Growth & Learning: Develop at an uncommon velocity through cutting-edge tech, complex implementations, and an experienced team that values mastery.
- Global, Remote-First, High-Trust: Work from anywhere with a diverse team across 15+ countries, in a trust-first, async environment that gives you true flexibility and ownership over how you work.
Related Guides
Related Job Pages
More Software Engineer Jobs
Senior Java Software Developer
RadixA Radix está sempre no topo das Melhores Empresas para se trabalhar porque: Temos profissionais comprometidos, dedicados, curiosos e inovadores. O espírito de equipe é a nossa maior força. Trabalhamos de forma cooperativa e sabemos que estamos juntos, remando na mesma direção. Temos um ambiente diverso, que valoriza equidade e inclusão. Nossa jornada de trabalho é flexível e em quase todos os projetos é possível trabalhar de qualquer lugar do Brasil. Valorizamos o bem-estar e o cuidado com as nossas pessoas, com programas de apoio à saúde mental, psiquiatra e médico consultor disponíveis.
Role Description A primeira coisa que você precisa saber é que aqui você não vai cair na rotina. A Radix desenvolve soluções para empresas de diferentes setores e indústrias. Cada projeto tem suas tecnologias, soluções e prazos e você terá oportunidade de atuar e experimentar diferentes desafios. Como Profissional de Desenvolvimento de Software você vai: - Atuação como desenvolvedor sênior em projetos de arquitetura de microserviços, com foco na construção, evolução e modernização de sistemas distribuídos em ambiente cloud (AWS + Kubernetes). - Participar ativamente da quebra de monolitos legados (aplicações web e/ou procedures PL/SQL Oracle), implementação de mensageria assíncrona robusta e adoção de boas práticas de cloud-native e arquitetura moderna. - Contribuição estratégica para times ágeis em demandas de alta complexidade e criticidade. Responsabilidades: - Projetar, desenvolver e evoluir microserviços em Java 17+ com Spring Boot e Spring Cloud. - Liderar ou apoiar processos de quebra de monolitos (monolith-to-microservices), incluindo extração de domínios de aplicações web legadas e/ou lógica pesada em PL/SQL Oracle. - Implementar comunicação assíncrona entre serviços utilizando mensageria (Kafka, RabbitMQ ou AWS SQS). - Desenvolver soluções cloud-native preparadas para rodar em Kubernetes na AWS (EKS), com observabilidade, resiliência e escalabilidade horizontal. - Aplicar princípios de arquitetura moderna: Twelve-Factor App, Reactive Manifesto, e (quando aplicável) ports & adapters (arquitetura hexagonal). - Produzir e manter documentação arquitetural clara (C4 Model, diagramas de contexto/container/componente/código, fluxos de eventos). - Participar de decisões técnicas, code reviews, spikes arquiteturais e definição de padrões para o time. - Trabalhar em ambiente ágil (Scrum/Kanban), com acompanhamento de entregas via backlog e cards. Qualifications - Experiência sólida em desenvolvimento Java backend com Java 8++, Spring Boot e Spring Cloud (config, gateway, circuit breaker, service discovery, etc.). - Experiência comprovada em quebra de monolitos (migração de aplicações web legadas Java e/ou reengenharia de lógica de negócio de procedures PL/SQL Oracle para microserviços). - Conhecimento sólido em mensageria assíncrona: Kafka, RabbitMQ (fanout/topic/direct) ou AWS SQS/SNS. - Vivência com deploy e operação de aplicações em Kubernetes (preferencialmente na AWS e observabilidade). - Conhecimento dos princípios Twelve-Factor App e Reactive Manifesto. - Capacidade de documentar arquitetura de forma estruturada (preferencialmente com C4 Model). - Experiência em times ágeis e trabalho colaborativo em projetos de grande porte. Requirements - Conhecimento prático de arquitetura hexagonal (ports & adapters) em projetos reais. - Experiência avançada com PL/SQL Oracle (tuning, migração de lógica para Java). - Participação em projetos cloud-native na AWS (EKS, ECS, Lambda, etc.). - Familiaridade com padrões de resiliência (retry, circuit breaker, bulkhead, saga, event sourcing/outbox). - Experiência com observabilidade distribuída. Benefits - Assistência Médica Nacional (para o titular e dependentes, com quarto privativo). - Assistência odontológica nacional (para o titular e dependentes). - Vale refeição / alimentação flexível. - Auxílio home office. - Day off (no mês do aniversário). - Wellhub (antigo Gympass). - Licença Maternidade (6 meses) e Paternidade (20 dias) estendidas. - Auxílio creche para filhos de até 3 anos (por filho). - Apoio em saúde mental com a Wellz. - Clube de Vantagens com descontos em diversos parceiros. - Convênio com instituições de ensino e cursos de idioma. - Desenvolvimento Profissional (Universidade Corporativa). - Parceria com empresa de coworkings no Brasil. - Programa de Qualidade de Vida e Bem-Estar. - Médico consultor para acompanhamento de radixers. - Planos de incentivos.
Multi-Sport Social Programmer – Temporary
Warner Bros. DiscoveryWarner Bros. Discovery (WBD) is a prominent global media and entertainment conglomerate, renowned for its expansive television, film, streaming, and gaming port
• The Multi-Sport Social Programmer role is responsible for populating Bleacher Report and TNT Sports US social feeds with a focus across Instagram, YouTube, TikTok, X and FB • This individual will be tasked with helping curate and package content from social onto TNT Sports US & Bleacher Report social platforms, including highlights, interviews, studio show content and original and user-generated content • They will also contribute to content and packaging ideation with a focus on social programing optimization • Identify and curate content that will engage with the Bleacher Report Sports Portfolio audience • Edit, package, write titles and captions, and create thumbnails optimized for both vertical and longform content across YouTube, Instagram, TikTok, X and Facebook • Identify and apply latest trends to content • Brainstorm and collaborate with content and programming teams on key initiatives • Cover sports events as they’re happening (weekends/night)
Role Description We are seeking a Software Engineer Instructor to own and evolve our internal technical training program for MLE and SWE engineers. This person will design and deliver curriculum spanning software engineering fundamentals and AI engineering. The role combines live instruction, asynchronous content development, and hands-on coaching to accelerate engineer growth and placement readiness. - Design, build, and maintain the end-to-end technical training curriculum for SWE engineers, covering: - System design - Coding best practices - Software architecture - Testing strategies - AI engineering topics - Deliver live workshops and sessions, and interactive coding labs to groups of engineers at varying seniority levels. - Develop high-quality asynchronous learning content: - Recorded lectures - Written guides - Hands-on exercises - Project-based assessments - Reference materials - Provide 1-on-1 and small-group coaching to engineers, helping them close specific skill gaps and accelerate professional growth. - Track and report on engineer skill progression metrics, using data to identify trends, measure training effectiveness, and prioritize curriculum improvements. - Continuously update training materials to reflect evolving industry standards, new frameworks, emerging AI tools, and client demand patterns. - Collaborate with Engineering Leads and Expertise Group leads to align training content with real-world project needs. - Contribute to onboarding programs to ensure new Factored engineers ramp up quickly on standards, tools, and expectations. Qualifications - 5+ years of professional software engineering experience, with hands-on work in system design, backend/full-stack development, and software architecture. - Demonstrated experience teaching, mentoring, or leading technical training for software engineers (formal or informal). - Experience in deploying and managing Cloud Infrastructure (AWS / GCP / Azure) – AWS Preferred. - Strong proficiency in at least two of: Python, JavaScript/TypeScript, Java, or Go. - Solid understanding of software architecture patterns: microservices, event-driven, layered architecture, and domain-driven design. - Practical knowledge of testing strategies: unit testing, integration testing, TDD/BDD, and test automation frameworks. - Excellent written and verbal communication skills in English; able to explain complex technical topics clearly to diverse audiences. - Experience designing structured curriculums, learning paths, or technical programs. - Proactive and self-directed: Identifies skill gaps, designs interventions, and drives outcomes without waiting for direction. - Comfortable working with data to measure training impact and iterate on content. - Ability to adapt teaching style across seniority levels—from junior engineers to senior developers. - Strong documentation habits: create reusable, well-organized training assets. - Collaborative mindset: works effectively across teams. - Comfort navigating ambiguity and managing competing priorities in a fast-paced environment. Requirements - Strong plus: Proven expertise in integrating Generative AI frameworks and APIs into production-grade applications. - Strong plus: Working knowledge of AI engineering concepts: LLM APIs, RAG patterns, Prompt engineering, Agentic Workflows. - Experience with MLOps tools and pipelines. - Experience creating technical video content or running live-streamed technical sessions. - Prior experience in nearshore, distributed, or remote-first engineering teams. - Contributions to open-source projects, tech blogs, or conference talks. Benefits - Ownership through equity participation. - Annual company retreat. - Education bonus for continuous learning. - Company-wide winter break. - Paid time off. - Optional in-person events and meetups. - Tailored career roadmaps. - High-performance culture.
Role Description We are seeking an experienced Core Architect to define and drive the technical architecture, engineering standards, and long-term technology strategy for advanced AI-powered platforms. In this role, you will lead multiple engineering teams responsible for AI infrastructure, model orchestration, vector knowledge systems, and developer tooling. The ideal candidate will combine deep expertise in distributed systems, AI infrastructure, and cloud-native platforms with strong leadership capabilities to guide engineering teams and deliver scalable, production-ready GenAI solutions. Key Responsibilities - Platform Architecture & Technical Leadership - Define and own the overall architectural vision across data ingestion, vector retrieval, orchestration, inference, and developer tooling. - Establish engineering standards, design patterns, and best practices for building scalable AI platforms. - Lead architecture discussions and design reviews across engineering teams. - AI Systems & Workflow Design - Architect agentic workflows using modern frameworks such as MCP, LangGraph, CrewAI, and related orchestration systems. - Design and optimize retrieval systems, vector knowledge architectures, and multi-tool AI pipelines. - Build scalable frameworks for integrating embeddings, vector databases, and Retrieval-Augmented Generation (RAG) systems. - Engineering Leadership & Mentorship - Lead and mentor engineers across machine learning, AI application development, and full-stack engineering teams. - Provide technical guidance on system scalability, performance optimization, and architecture decisions. - Platform Reliability & Cloud Infrastructure - Define best practices for MLOps, observability, monitoring, and cost optimization across AI platforms. - Architect secure, cloud-native deployments across multiple regions using modern infrastructure frameworks. - Ensure systems meet reliability, scalability, and operational standards. - Cross-Functional Collaboration - Work closely with product, data, and research teams to convert experimental GenAI concepts into production-ready solutions. - Align architecture decisions with product roadmaps and business goals. Qualifications - 10–15+ years of experience in distributed systems, platform engineering, or large-scale infrastructure architecture. - Deep expertise in AI frameworks, vector databases, embeddings, and Retrieval-Augmented Generation (RAG) systems. - Proven experience leading architecture initiatives and scaling complex platforms. Technical Expertise - Strong proficiency with LangChain, LlamaIndex, and related AI orchestration frameworks. - Hands-on experience with vector databases, embeddings, and AI retrieval architectures. - Experience with agentic frameworks and orchestration tools. - Deep expertise in cloud-native environments including AWS, Azure, Kubernetes, and containerized platforms. - Familiarity with Kafka, Terraform, and observability systems for distributed architectures. Key Skills - AI Platform Architecture - LangChain / LlamaIndex - Vector Databases & RAG Systems - Distributed Systems Design - Cloud-Native Infrastructure (AWS / Azure / Kubernetes) - Kafka & Terraform - MLOps & Observability - Scalable AI System Design


