MongoDB logo
MongoDB

MongoDB, originally called 10gen, is a software development company. Since 2007, MongoDB has created an open-source, document-oriented database to help clients

Team Lead, Site Reliability Engineering

DevOps EngineerDevOps EngineerFull TimeHybridLeadTeam 5,550Since 2008Company Site

Location

New York

Posted

46 days ago

Salary

$151K - $297K / year

Seniority

Lead

No structured requirement data.

Job Description

Team Lead, Site Reliability Engineering

MongoDB

Title: Team Lead, Site Reliability Engineering - Storage Layer Service Location: New York City United States Job Description: MongoDB's Storage Layer Services (SLS) team is re-architecting the MongoDB cloud storage layer and sits at the heart of our next-generation cloud storage architecture. This relatively new team is building performant, multi-tenant distributed storage services that both enhance today's Atlas storage stack and enable more customer workloads to run more efficiently. As the Lead Site Reliability Engineer for SLS, you will partner with the teams building these storage services to define SLOs, shape capacity plans, and ensure the reliability, durability, and operational safety of the storage layer that underpins Atlas. You'll help grow and lead a small, senior team of SREs as founding members of this organization, playing a crucial role in executing on a multi-year roadmap for MongoDB's cloud storage architecture. We are looking to speak to candidates who are based in New York City for our hybrid working model. Responsibilities - Build and lead a team of 6-8 engineers, fostering a positive culture, handling career growth and performance conversations, and proactively removing blockers - Define and drive a clear technical vision and comprehensive roadmap for our multi-tenant distributed storage systems, balancing long-term strategic infrastructure goals with immediate engineering needs - Contribute through hands-on technical work, such as leading architectural design reviews, reviewing PRs, and stepping in to guide the team through complex operational challenges - Act as the primary liaison for the Storage Layer Services SRE team, collaborating closely with other engineering leaders to ensure platform alignment and manage stakeholder expectations You may be a good fit if you - Have 10+ years of experience working on software and operating distributed systems, with 2+ years managing engineering teams - Possess a customer-focused mindset, treating internal developers as your primary users - Value efficiency in processes and operations, and have a track record of optimizing team workflows - Prefer automation over manual processes, fostering a culture of building software solutions to eliminate toil - Have deep technical familiarity with Kubernetes ecosystems, containerization technologies, and modern IaC tooling (e.g., Terraform, Crossplane, or Operators) so you can effectively guide the team's technical decisions - Have operated or supported stateful storage or database systems at scale and are comfortable with durability, consistency and recovery trade-offs - Excel at translating complex business and engineering requirements into actionable, phased technical roadmaps - Have a high level of empathy, responsibility, ownership, and accountability - Excellent verbal and written technical communication skills Strong candidates may also have experience with - Leading major architectural shifts, such as moving from legacy storage stacks to new multi-tenant storage architectures, including planning and executing large-scale data and workload migrations with tight availability and durability requirements - Managing and scaling infrastructure across multi-cloud environments (AWS, GCP, or Azure) - Designing secure, multi-tenant runtime environments at scale About MongoDB MongoDB is built for change, empowering our customers and our people to innovate at the speed of the market. We have redefined the database for the AI era, enabling innovators to create, transform, and disrupt industries with software. MongoDB's unified database platform, the most widely available, globally distributed database on the market, helps organizations modernize legacy workloads, embrace innovation, and unleash AI. Our cloud-native platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available across AWS, Google Cloud, and Microsoft Azure. With offices worldwide and over 60,000 customers, including 75% of the Fortune 100 and AI-native startups, relying on MongoDB for their most important applications, we're powering the next era of software. Our compass at MongoDB is our Leadership Commitment, guiding how and why we make decisions, show up for each other, and win. It's what makes us MongoDB. To drive the personal growth and business impact of our employees, we're committed to developing a supportive and enriching culture for everyone. From employee affinity groups, to fertility assistance and a generous parental leave policy, we value our employees' wellbeing and want to support them along every step of their professional and personal journeys. Learn more about what it's like to work at MongoDB, and help us make an impact on the world! MongoDB is committed to providing any necessary accommodations for individuals with disabilities within our application and interview process. To request an accommodation due to a disability, please inform your recruiter. MongoDB, Inc. provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type and makes all hiring decisions without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Req ID: 1273396229 MongoDB's base salary range for this role is posted below. Compensation at the time of offer is unique to each candidate and based on a variety of factors such as skill set, experience, qualifications, and work location. Salary is one part of MongoDB's total compensation and benefits package. Other benefits for eligible employees may include: equity, participation in the employee stock purchase program, flexible paid time off, 20 weeks fully-paid gender-neutral parental leave, fertility and adoption assistance, 401(k) plan, mental health counseling, access to transgender-inclusive health insurance coverage, and health benefits offerings. Please note, the base salary range listed below and the benefits in this paragraph are only applicable to U.S.-based candidates. MongoDB's base salary range for this role in the U.S. is: $151,000-$297,000 USD

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Broadridge logo

Java Developer

Broadridge

Broadridge Financial Solutions, Inc., founded in 1962 as a division of ADP, became a publicly-traded company in 2007. Now an award-winning business services fir

DevOps Engineer46 days ago

Title: Java Developer (Hybrid- Flexible Options) Location: Newark United States Job Description: At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team. Broadridge is growing! Our team is seeking an experienced Java Developer who is passionate about the craft and art of developing mission-critical software. You love to learn and work with leading-edge technologies in a collaborative work environment. Superb opportunity for applicants that are proficient in Java. Main responsibility of this role will be to write clean code, data design, testing, technical design and troubleshooting. This is a contract role to work remotely. Are you looking for a dynamic and creative environment where you can build applications from ground up? Are you seeking an excellent opportunity to drive the future of this emerging and dynamic IT development organization? If that sounds like you, we'd love to hear from you. Work Mode: We are made up of high performing teams that meet in person to learn and collaborate as needed. This role is considered hybrid, which means you'll be coming into the office 2 days a week and given the flexibility to work remotely the rest of the time. Key Responsibilities: - Design, develop, test, and deploy high-performance, scalable full stack applications using Java (Spring Boot) and React. - Architect and implement microservices and RESTful APIs. - Develop and integrate solutions using Apache Kafka and AWS MSK. - Design and optimize relational database solutions (PostgreSQL, Oracle PL/SQL). - Create and maintain CI/CD pipelines using Jenkins and related DevOps tools. - Build and deploy cloud-native applications using AWS services including MSK, Aurora PostgreSQL, API Gateway, EKS, ECS, and S3. - Containerize applications using Docker and deploy to Kubernetes (EKS/ECS). - Implement secure coding practices and ensure compliance with security standards. - Develop automated unit, integration, and end-to-end tests using JUnit, Mockito, Cucumber, and Karate. - Apply TDD and test automation best practices. - Ensure non-functional requirements such as scalability, resiliency, maintainability, and performance are incorporated into system design. - Participate in architectural design discussions and contribute to technical decision-making. - Collaborate within Agile teams to deliver high-quality software solutions. - Support production deployments, troubleshooting, and performance tuning. - Mentor junior developers and promote engineering best practices. Qualifications: - Bachelor of Science in Computer Science or equivalent education and experience. - 8+ years of professional experience in Java development. - Expert proficiency with Spring Framework and Spring Boot. - Strong experience developing front-end applications using React. - Deep understanding of RESTful API design and integration. - Strong hands-on experience with Apache Kafka and AWS MSK. - Extensive experience with PostgreSQL and writing complex SQL queries; Oracle PL/SQL experience required. - Strong knowledge of AWS cloud services including MSK, Aurora PostgreSQL, API Gateway, EKS, ECS, and S3. - Experience building, deploying, and operating applications in AWS. - Hands-on experience with Docker and Kubernetes. - Proficiency with CI/CD tools such as Jenkins. - Experience with build tools such as Maven and Gradle. - Proven experience leveraging DevOps practices using tools such as Git, Jenkins, and Nexus. - Strong understanding of secure coding practices and relational database design. - Experience with JUnit, Mockito, Cucumber, and Karate testing frameworks. - Working knowledge of TDD and automated testing methodologies. - Proven experience designing and integrating highly complex enterprise software systems. - Experience working in Agile development environments. - Strong analytical, problem-solving, and communication skills. Preferred Qualifications - Experience with data modeling. - AWS certifications. - Experience designing highly resilient and distributed systems. - Prior experience in large-scale enterprise environments. "Broadridge considers various factors when evaluating a candidate's final salary including, but not limited to, relevant experience, skills, and education." Salary Range: 125,000.00 - 140,000.00 USD annual Bonus Eligible Please visit www.broadridgebenefits.com for more information on our comprehensive benefit offerings. #LI-MR1 #LI-Hybrid We are dedicated to fostering a collaborative, engaging, and inclusive environment and are committed to providing a workplace that empowers associates to be authentic and bring their best to work. We believe that associates do their best when they feel safe, understood, and valued, and we work diligently and collaboratively to ensure Broadridge is a company-and ultimately a community-that recognizes and celebrates everyone's unique perspective. Use of AI in Hiring As part of the recruiting process, Broadridge may use technology, including artificial intelligence (AI)-based tools, to help review and evaluate applications. These tools are used only to support our recruiters and hiring managers, and all employment decisions include human review to ensure fairness, accuracy, and compliance with applicable laws. Please note that honesty and transparency are critical to our hiring process. Any attempt to falsify, misrepresent, or disguise information in an application, resume, assessment, or interview will result in disqualification from consideration. US applicants: Click here to view the EEOC "Know Your Rights" poster. Disability Assistance We recognize that ensuring our long-term success means creating an environment where everyone is welcome, where everyone's strengths are valued, and where everyone can perform at their best. Broadridge provides equal employment opportunities to all associates and applicants for employment without regard to race, color, religion, sex (including sexual orientation, gender identity or expression, and pregnancy), marital status, national origin, ethnic origin, age, disability, genetic information, military or veteran status, and other protected characteristics protected by applicable federal, state, or local laws. If you need assistance or would like to request reasonable accommodations during the application and/or hiring process, please contact us at 888-237-7769 or by sending an email to BRcareers@broadridge.com.

New Jersey
$125K - $140K / year
CAI logo

Azure DevOps Power Automate Administrator

CAI

WHEN YOU NEED TO MEET A HIGHER STANDARD® in US | ASIA | EUROPE | AUSTRALIA

DevOps Engineer46 days ago
Full TimeRemoteTeam 501-1,000H1B Sponsor

Role Description Technical resource responsible for monitoring, optimizing, and improving operational reliability within our Microsoft Dataverse / Power Platform environment. This role will lead monitoring, observability, DevOps enablement, CI/CD governance, and automation initiatives to improve system reliability, integration stability, and backend IT support efficiencies. This position will be full-time and remote. What You'll Do - Scheduled Job & Workflow Monitoring - Monitor scheduled jobs and background processes in Dataverse - Track: - Failed Power Automate flows - Long-running flows - Workflow execution anomalies - Build proactive alerting mechanisms - Perform root cause analysis and corrective action planning - Improve retry logic and resiliency patterns - Dashboard & Observability Development - Design and implement centralized monitoring dashboards displaying: - Failed workflows - Slow-running automations - Tools may include: - Power BI - Azure Monitor - Application Insights - Log Analytics - Azure Cost Management - Dataverse analytics APIs - Integration health and latency - System account lockouts - Job schedules and failure trends - Long-running background jobs - Email error logs - Azure DevOps & GitHub (CI/CD Governance) - Implement and manage CI/CD pipelines for Power Platform solutions - Manage solution versioning and environment promotion strategies (Dev → Test → Prod) - Configure Azure DevOps pipelines for: - Solution export/import automation - Automated deployments - Validation testing - Maintain GitHub repositories for: - Source control of Power Platform solutions - Infrastructure-as-Code (IaC) scripts - Automation scripts - Enforce branching strategies and pull request governance - Integrate automated quality checks into deployment pipelines - Enable automated environment provisioning where applicable - Source Control & Environment Governance - Support Git-based source control best practices - Support: - Branching models (GitFlow or trunk-based) - Automated solution packaging - Maintain deployment documentation and release runbooks - Ensure secure credential and secret management within pipelines - Automation & Operational Efficiency - Identify repetitive backend support tasks suitable for automation - Implement self-healing automation where feasible - Reduce alert fatigue through improved monitoring configuration - Develop operational runbooks and knowledge base documentation Qualifications - Required: - Monitoring & Observability - Azure Monitor - Log Analytics - Application Insights - Power BI dashboard development - Structured logging and alerting frameworks - Power Platform & Dataverse - Azure DevOps Pipelines - GitHub (branching strategies, PR governance) - YAML-based pipeline configuration - Power Platform Build Tools - Environment promotion strategies - Advanced Power Automate - Dataverse administration - Solution management - Power Platform ALM best practices - Preferred: - DevOps & CI/CD - Experience implementing DevOps for enterprise Power Platform environments - Familiarity with Infrastructure as Code (ARM, Bicep, Terraform) - Understanding of SRE principles - ITIL or service management experience - Experience reducing production incidents through automation Physical Demands - Ability to safely and successfully perform the essential job functions consistent with federal, state and local standards - Sedentary work that involves sitting or remaining stationary most of the time with occasional need to move around the office to attend meetings, etc. - Ability to conduct repetitive tasks on a computer, utilizing a mouse, keyboard and monitor Reasonable Accommodation Statement If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employment selection process, please direct your inquiries to application.accommodations@cai.io or (888) 824 – 8111.

Philippines
Dun & Bradstreet logo

Senior Engineer - Cloud DevOps

Dun & Bradstreet

Leading global provider of business decisioning data and analytics, enabling companies to improve business performance.

DevOps Engineer46 days ago
Full TimeHybridTeam 5,001-10,000Since 1841H1B Sponsor

Title: Senior Engineer -Cloud DevOps (R-19112) Location: Hyderabad, India Workplace: hybrid Full-time Category: Technology Job Description: Shape the Future with Dun & Bradstreet At Dun & Bradstreet, we believe data has the power to create a better tomorrow. As a global leader in business decisioning data and analytics, we help companies worldwide grow, manage risk, and innovate. For over 180 years, businesses have trusted us to turn uncertainty into opportunity. We’re a diverse, global team that values creativity, collaboration, and bold ideas. Are you ready to make an impact and help shape what’s next? Join us! Explore opportunities at dnb.com/careers. Job Summary: As a Cloud Platform / DevOps Engineer within our Cloud Engineering & Operations organization, you will play a pivotal role in designing, engineering, and operating the hybrid‑cloud platform capabilities that power our internal developer platform (IDP) and shared cloud runtimes. You will help define and scale the enterprise’s cloud‑first transformation by enabling secure, automated, self‑service infrastructure and establishing standards that accelerate delivery across AWS and GCP environments. You will partner closely with development teams, product owners, cybersecurity, and architecture groups to build reusable patterns, improve operational maturity, and drive automation at scale. This is a mid– to senior‑level engineering role requiring strong ownership, technical depth, and the ability to influence engineering practices across the organization. Key Roles & Responsibilities: • Design, engineer, operate, and continuously evolve hybrid cloud platform runtimes across AWS and GCP • Establish and enforce standards for infrastructure provisioning, application deployment, CI/CD pipelines, container images, compute images, and configuration management within the internal developer platform • Design, implement, and maintain Infrastructure as Code using Terraform and Terraform Cloud for Business • Build, optimize, and support enterprise‑grade CI/CD pipelines using tools such as Harness, Jenkins, and GitHub Enterprise • Enable and scale developer self‑service capabilities using platforms such as GitHub Enterprise, JFrog Artifactory, SonarQube, and Backstage • Contribute to the architecture, monitoring, scalability, and lifecycle management of cloud platforms, including Kubernetes runtimes • Drive automation across infrastructure, CI/CD, image lifecycle, and operational processes using Python, Bash, and configuration management tools • Partner closely with application development teams, product owners, cybersecurity, and architecture teams to understand requirements and prioritize platform enhancements • Apply AI‑assisted engineering tools (such as Gemini or equivalent) to improve development velocity, troubleshooting, and platform operations • Operate within Agile delivery frameworks (Jira) and established ITSM processes (ServiceNow) Key Skills & Qualifications: • 8–12 years of experience in platform engineering, DevOps engineering, or cloud engineering in hybrid‑cloud environments (AWS & GCP). • 5–8 years of hands‑on experience supporting hybrid‑cloud environments across AWS and GCP. • Proficiency in designing and deploying large‑scale API Gateway and/or Apigee X solutions. • Strong proficiency with Terraform and Terraform Cloud for Business (TFC/TFCB) for Infrastructure‑as‑Code. • Strong CI/CD expertise using Harness CI/CD and/or Jenkins. • Experience enabling developer self‑service platforms using GitHub Enterprise, Artifactory, SonarQube, and Backstage. • Advanced experience with containerization and runtime platforms (Kubernetes, Docker), including scaling automated or self‑service Kubernetes environments. • Strong automation skills using Python and/or Bash and configuration management tools such as Ansible. • Strong Linux administration and troubleshooting skills across applications, networking, and systems. • Experience integrating with internal development teams and supporting APIs and integration solutions. • Proficiency with AI‑assisted engineering tools (Gemini preferred). • Excellent verbal and written communication skills suitable for collaboration and presenting to leadership. Preferred Skills & Certifications: • AWS Certified Solutions Architect (Associate or Professional) • AWS Certified DevOps Engineer (Associate or Professional) • Google Cloud Professional Cloud Architect or Professional Cloud DevOps Engineer • Experience with data or streaming platforms such as RedPanda, Databricks, Elasticsearch, or similar "This position is internally titled as Senior Engineer" All Dun & Bradstreet job postings can be found at https://jobs.lever.co/dnb. Official communication from Dun & Bradstreet will come from an email address ending in @dnb.com. Notice to Applicants: Please be advised that this job posting page is hosted and powered by Lever, a subsidiary of Employ Inc. Your use of this page is subject to Employ's Privacy Notice and Cookie Policy, which governs the processing of visitor data on this platform.

TG + 1 moreAll locations: TG | India

Role Description O time de SRE da Appmax é o guardião da alta disponibilidade e confiabilidade dos nossos sistemas. Você vai atuar de perto com os times de engenharia, integrando práticas de operação e desenvolvimento para garantir entregas robustas e alinhadas aos objetivos do negócio. Buscamos alguém apaixonado por arquiteturas Cloud Native, ferramentas open source e pelo universo SRE, com disposição constante para aprender e evoluir (Lifelong learning). O que você fará (Responsabilidades): - Administrar e evoluir ambientes AWS, garantindo alta disponibilidade e performance - Atuar em resposta a incidentes: identificar causa-raiz, implementar ações preventivas e colaborar com outros times na resolução - Identificar gargalos de performance e conduzir otimizações em sistemas e aplicações - Configurar e manter sistemas de monitoramento e observabilidade (alertas, dashboards, relatórios de status) - Desenvolver e manter automações, scripts e ferramentas para agilizar deploys, monitoramento e troubleshooting - Integrar práticas de DevOps e DevSecOps nos pipelines de CI/CD - Criar e manter documentação técnica da infraestrutura e processos operacionais - Disseminar conhecimento e boas práticas com outros times Qualifications - Cloud AWS (administração de ambientes, multi-account) - Kubernetes – EKS - Infra como Código: Terraform - Observabilidade: Elasticsearch, Zabbix, New Relic, AWS CloudWatch, Grafana - Docker e containers - CI/CD (Bitbucket Pipelines ou similar) - Bancos de dados: MySQL, PostgreSQL, Aurora, DynamoDB, ElastiCache Redis, DocumentDB - Scripting: Python e/ou Shell - Servidores Linux e Windows Requirements - Experiência em ambientes financeiros regulados (PCI Compliant, Banco Central/Pix/RSFN) - Certificações (AWS, Kubernetes, Terraform ou FinOps) - Visão de FinOps e otimização de custos em cloud - Experiência com GitOps - Segurança: HTTPS, SSL, SAST e DAST, PCI, segurança defensiva - Ensino superior em TI ou áreas correlatas (em andamento ou completo) Benefits - 💻 Todos os equipamentos e recursos necessários para realizar o trabalho em modelo presencial, híbrido ou remoto - 🌎 Ajuda de custo para despesas para quem atua no formato híbrido ou remoto - 🍟 Flexfood, assim você não precisa escolher entre VR ou VA - 🏥 Plano de Saúde e Odontológico - 🏋️‍♀️ Wellhub - 💊 Avus - 🧠 Starbem - 🩹 Convênio farmácia - 🚌 Vale transporte - ❤️‍🩹 Seguro de vida - 🐶 Plano Pet Guapeco - 📚 Upmaxter para auxiliar nos seus estudos - 🚀 Um ambiente que favorece e incentiva o desenvolvimento e o alto desempenho com checkpoints mensais de performance, práticas de 1:1, rotinas de feedbacks contínuos, acompanhamento do PDI e muito mais...

Brazil
Job Closed