MongoDB, originally called 10gen, is a software development company. Since 2007, MongoDB has created an open-source, document-oriented database to help clients
Site Reliability Engineer (Senior or Staff), Atlas
Location
New York
Posted
42 days ago
Salary
$127K - $249K / year
Seniority
Senior
Job Description
Site Reliability Engineer (Senior or Staff), Atlas
MongoDB
The TeamThis role can sit in our NYC HQ on a hybrid basis, or it can be fully remote while working from a location based in either Eastern or Central time zones. We are looking for an experienced Senior Engineer for our SRE, Atlas team to support, maintain and grow the Atlas platform. As a senior SRE, you will be expected to be able to design & build complex systems, operate with autonomy and act as owner for everything you do. The SRE Atlas team works alongside the various Atlas software engineering teams to provide expertise about running systems at scale, build new tooling and automation and perform essential maintenance of the Atlas fleet. This is an SRE team, which means you can expect a highly hands-on approach, tackling the technical challenges of implementing large scale solutions that have the ability to impact our customer’s most crucial workloads. Role OverviewWe are seeking a talented Site Reliability Engineer (SRE) with a strong infrastructure background. This role requires engineers to have a customer-first mindset to ensure that everything we do results in a stronger product and a better experience for all Atlas customers. The ideal candidate should - Have 5+ years of experience running critical systems at scale - Value efficiency in processes and operations, and display a preference for automation over manual processes (“allergic to ops work”) - Be familiar with a major cloud provider (AWS, Azure, or GCP) and possess the ability to build and operate systems in a multi-cloud environment - A strong understanding of how to run a large scale Linux environment, including low level fundamentals - Firm grasp of at least one modern programming language, beyond basic scripting (Go, Ruby, Python) - Solid understanding of web and network protocols and standards (HTTP, TLS, DNS, etc) Expectations - Participate in the development of a reliable and resilient multi-cloud platform that hosts business critical applications for a wide & varied range of customer applications - Collaborate with service-owning teams to provide internal support, solve technical challenges and adapt or build tooling to solve novel use cases in a generic fashion - Participate in a 24/7 on-call rotation to swiftly resolve issues related to any disruption of our customer facing Atlas fleet, ensuring minimal disruption and high availability About MongoDBMongoDB is built for change, empowering our customers and our people to innovate at the speed of the market. We have redefined the database for the AI era, enabling innovators to create, transform, and disrupt industries with software. MongoDB’s unified database platform—the most widely available, globally distributed database on the market—helps organizations modernize legacy workloads, embrace innovation, and unleash AI. Our cloud-native platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available across AWS, Google Cloud, and Microsoft Azure. With offices worldwide and nearly 60,000 customers—including 75% of the Fortune 100 and AI-native startups—relying on MongoDB for their most important applications, we’re powering the next era of software. Our compass at MongoDB is our Leadership Commitment, guiding how and why we make decisions, show up for each other, and win. It’s what makes us MongoDB. To drive the personal growth and business impact of our employees, we’re committed to developing a supportive and enriching culture for everyone. From employee affinity groups, to fertility assistance and a generous parental leave policy, we value our employees’ wellbeing and want to support them along every step of their professional and personal journeys. Learn more about what it’s like to work at MongoDB, and help us make an impact on the world! MongoDB is committed to providing any necessary accommodations for individuals with disabilities within our application and interview process. To request an accommodation due to a disability, please inform your recruiter. MongoDB, Inc. provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type and makes all hiring decisions without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Req ID: 426187 MongoDB’s base salary range for this role is posted below. Compensation at the time of offer is unique to each candidate and based on a variety of factors such as skill set, experience, qualifications, and work location. Salary is one part of MongoDB’s total compensation and benefits package. Other benefits for eligible employees may include: equity, participation in the employee stock purchase program, flexible paid time off, 20 weeks fully-paid gender-neutral parental leave, fertility and adoption assistance, 401(k) plan, mental health counseling, access to transgender-inclusive health insurance coverage, and health benefits offerings. Please note, the base salary range listed below and the benefits in this paragraph are only applicable to U.S.-based candidates. MongoDB’s base salary range for this role in the U.S. is: $127,000—$249,000 USD
Benefits
- 401(K), Adoption Assistance, Childcare benefits, Commuter benefits, Company equity, Company-sponsored outings, Customized development tracks, Dental insurance, Disability insurance, Volunteer in local community, Employee stock purchase plan, Fitness stipend, Flexible Spending Account (FSA), Flexible work schedule, Generous parental leave, Generous PTO, Company-sponsored happy hours, Health insurance, Job training & conferences, Open door policy, Life insurance, Mentorship program, Open office floor plan, Paid holidays, Pair programming, Paid sick days, Onsite office parking, Partners with nonprofits, Performance bonus, Pet insurance, Promote from within, Recreational clubs, Lunch and learns, Relocation assistance, Remote work program, Return-to-work program post parental leave, Sabbatical, Free snacks and drinks, Team based strategic planning, OKR operational model, Vision insurance, Wellness programs, Some meals provided, Mental health benefits, Home-office stipend for remote employees, Fertility benefits, Employee resource groups, Employee-led culture committees, Hybrid work model, President's club, Employee awards, Transgender health care benefits, Abortion travel benefits, Meditation space, Mother's room, Flexible time off, Bereavement leave benefits
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Site Reliability Engineer III
RenishawLexisNexis® Risk Solutions provides customers with solutions and decision tools that combine public and industry specific content with advanced technology and analytics to assist them in evaluating and predicting risk and enhancing operational efficiency. We use the power of data and advanced analytics to help our customers make better, timelier decisions. By bringing clarity to information, we ultimately help make communities safer, insurance rates more accurate, commerce more transparent, business decisions easier and processes more efficient. You can learn more about LexisNexis Risk at the link below: LexisNexis Risk Solutions
Are you ready to design and operate cloud-native, Kubernetes-driven systems at scale—while leading best practices in automation, reliability, and infrastructure as code? Do you want to be the senior technical voice shaping real-time transaction screening platforms that protect global financial systems through resilient, secure cloud infrastructure? About the Business: LexisNexis® Risk Solutions provides customers with solutions and decision tools that combine public and industry specific content with advanced technology and analytics to assist them in evaluating and predicting risk and enhancing operational efficiency. We use the power of data and advanced analytics to help our customers make better, timelier decisions. By bringing clarity to information, we ultimately help make communities safer, insurance rates more accurate, commerce more transparent, business decisions easier and processes more efficient. You can learn more about LexisNexis Risk at the link below, risk.lexisnexis.com About the Team: This team supports a product suite to support Real-time transaction screening systems designed to monitor payments and trades, ensuring compliance with sanctions, watchlists, PEPs, and other financial crime regulations About the Role As a Sr Developer/SRE you will provide assistance and input to management, lead large multifunctional development activities, solve complex technical problems, write complex code for computer systems, and serve as a senior source of expertise. Responsibilities - Deliver cloud-native solutions and patterns that are highly elastic (AWS, Azure, GCP). - Manage Kubernetes deployment using Helm and empower stakeholders and reduce toil through self-service pipelines. - Drive best practices in cloud-native, security, and DRY through your team. - Mentor your team in solving deep technical issues, advanced cloud infrastructure topics, and complex coding problems. - Set an example of methodical, systematic task execution for your team. - Work with project managers and stakeholders to provide status and reporting. - Act as an ambassador to other teams, finding common ground and defining clear agreements. - Drive projects to schedule and perform code reviews with an eye toward rigor and best practice. - Apply continuous process improving techniques across the operation and automate everything. - You will ensure that everything is IAC (Terraform), and that your team’s infra can be rebuilt from the ground up without relying on manual configurations. Requirements - Proven experience in Application development, DevOps, SRE, or Cloud Infrastructure roles. - Hands on experience in Kubernetes, Helm & Terraform - Proven experience managing AKS/Azure clusters in production environments. - Hands-on experience with Azure cloud services, GitHub workflows, and infrastructure automation. - Strong understanding of networking, security, and monitoring in cloud-native environments. - Good communication skills and ability to work effectively in a collaborative team setting. - Experience developing/maintaining Java/Spring Boot applications would be a value add. Working for You: We know that your wellbeing and happiness are key to a long and successful career. These are some of the benefits we are delighted to offer: - Medical Inpatient and Outpatient Insurance: Coverage for your healthcare needs. - Life Assurance Policies: Providing financial security for your loved ones. - Modern Family Benefits: Support for maternity, paternity, and adoption needs. - Long Service Award: Recognition for your dedication and loyalty. - Celebratory Allowance/Gifts: Marking special occasions to celebrate with you. - Flexible Benefits Plan : Offering you wider choice of services and products - Employee Assistance Program : Access support for personal and work-related challenges. - Flexible Working Arrangements: Balance work and personal life effectively. - Access to Learning and Development Resources: Empowering your professional growth. U.S. National Base Pay Range: $86,600 - $144,400. Geographic differentials may apply in some locations to better reflect local market rates. This job is eligible for an annual incentive bonus. We know your well-being and happiness are key to a long and successful career. We are delighted to offer country specific benefits. Click here to access benefits specific to your location. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-855-833-5120. Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here. Please read our Candidate Privacy Policy. We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. USA Job Seekers: EEO Know Your Rights.
Site Reliability Engineer III
RemitlyRemitly is a global digital financial services company providing fast, affordable, and secure remittance services with the aim of making it easier for people to
Are you ready to design and operate cloud-native, Kubernetes-driven systems at scale—while leading best practices in automation, reliability, and infrastructure as code? Do you want to be the senior technical voice shaping real-time transaction screening platforms that protect global financial systems through resilient, secure cloud infrastructure? About the Business: LexisNexis® Risk Solutions provides customers with solutions and decision tools that combine public and industry specific content with advanced technology and analytics to assist them in evaluating and predicting risk and enhancing operational efficiency. We use the power of data and advanced analytics to help our customers make better, timelier decisions. By bringing clarity to information, we ultimately help make communities safer, insurance rates more accurate, commerce more transparent, business decisions easier and processes more efficient. You can learn more about LexisNexis Risk at the link below, risk.lexisnexis.com About the Team: This team supports a product suite to support Real-time transaction screening systems designed to monitor payments and trades, ensuring compliance with sanctions, watchlists, PEPs, and other financial crime regulations About the Role As a Sr Developer/SRE you will provide assistance and input to management, lead large multifunctional development activities, solve complex technical problems, write complex code for computer systems, and serve as a senior source of expertise. Responsibilities - Deliver cloud-native solutions and patterns that are highly elastic (AWS, Azure, GCP). - Manage Kubernetes deployment using Helm and empower stakeholders and reduce toil through self-service pipelines. - Drive best practices in cloud-native, security, and DRY through your team. - Mentor your team in solving deep technical issues, advanced cloud infrastructure topics, and complex coding problems. - Set an example of methodical, systematic task execution for your team. - Work with project managers and stakeholders to provide status and reporting. - Act as an ambassador to other teams, finding common ground and defining clear agreements. - Drive projects to schedule and perform code reviews with an eye toward rigor and best practice. - Apply continuous process improving techniques across the operation and automate everything. - You will ensure that everything is IAC (Terraform), and that your team’s infra can be rebuilt from the ground up without relying on manual configurations. Requirements - Proven experience in Application development, DevOps, SRE, or Cloud Infrastructure roles. - Hands on experience in Kubernetes, Helm & Terraform - Proven experience managing AKS/Azure clusters in production environments. - Hands-on experience with Azure cloud services, GitHub workflows, and infrastructure automation. - Strong understanding of networking, security, and monitoring in cloud-native environments. - Good communication skills and ability to work effectively in a collaborative team setting. - Experience developing/maintaining Java/Spring Boot applications would be a value add. Working for You: We know that your wellbeing and happiness are key to a long and successful career. These are some of the benefits we are delighted to offer: - Medical Inpatient and Outpatient Insurance: Coverage for your healthcare needs. - Life Assurance Policies: Providing financial security for your loved ones. - Modern Family Benefits: Support for maternity, paternity, and adoption needs. - Long Service Award: Recognition for your dedication and loyalty. - Celebratory Allowance/Gifts: Marking special occasions to celebrate with you. - Flexible Benefits Plan : Offering you wider choice of services and products - Employee Assistance Program : Access support for personal and work-related challenges. - Flexible Working Arrangements: Balance work and personal life effectively. - Access to Learning and Development Resources: Empowering your professional growth. U.S. National Base Pay Range: $86,600 - $144,400. Geographic differentials may apply in some locations to better reflect local market rates. This job is eligible for an annual incentive bonus. We know your well-being and happiness are key to a long and successful career. We are delighted to offer country specific benefits. Click here to access benefits specific to your location. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-855-833-5120. Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here. Please read our Candidate Privacy Policy. We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. USA Job Seekers: EEO Know Your Rights.
Systems Reliability Operator (3PM - 12AM)
MEMXMEMX is an exchange operator and market technology platform dedicated to delivering transparent, efficient, and cost-effective securities trading services designed to revolutionize
Description MEMX is searching for a Systems Reliability Operator who will be responsible for providing support for MEMX exchange platforms. Shift time for this role will be 3 PM ET to 12 AM ET. MEMX currently has a U.S. presence in these states: California, Colorado, Connecticut, Delaware, Florida, Georgia, Illinois, Kansas, Maine, Maryland, Michigan, Nevada, New Jersey, New York, North Carolina, Pennsylvania, South Carolina, & Utah. *If you live outside of the above states, please list in your application and our team will evaluate. What You’ll Do - Responsible for providing support of MEMX exchange platforms including on-call, respond to incidents and support triaging the issue - Help isolate and resolve unplanned system outages - Work with cross-functional teams to support the availability of all MEMX exchange platforms. This includes market operations, systems, networking and development teams - Help improve operational processes (such as deployments and upgrades) by identifying areas which need improvement - Document every action so that the findings turn into repeatable actions which eventually can be automated - Debug issues as they arise, across the different services and interaction points - Enhance monitoring and alerting based on symptoms - Run nightly processes that are essential to exchange operations. We automate as much as possible but there are processes that require a level of manual input and attention Requirements - Good understanding of Linux and know your way around Linux Shell - Mid to advanced Linux administration, scripting skills - Proficiency in Bash scripting skills (Python is nice to have) - Proficiency in a configuration management tool (Ansible, Chef, Puppet) - Experience with monitoring tools - Familiar with incident tracking / ticketing systems and escalation procedures - 2 years or more of experience in an operation support role with incident response - Highly curious, driven and have attention to detail - Seek problems to solve so to help make the platform better - Strong urge to collaborate and improve existing processes - Have an urge for delivering quickly and iterating fast - Trading and/or exchange experience a plus but not required - Share our values and work in accordance to those values Benefits At MEMX you will have the ability to work with a talented team of professionals who bring diversity of thought and background. You will have the opportunity to shape the future of our company and the impact MEMX will have on our clients and the broader markets. We offer competitive employee benefits and perks and will continue to make this a priority to attract the best. - Work From Home - Health Care Plan (Medical, Dental & Vision) - Retirement Plan (401k) - Life Insurance (Basic, Voluntary & AD&D) - Unlimited Paid Time Off - Generous Paid Family Leave - Short Term & Long-Term Disability - Training & Development - Wellness Resources Pay Range: $90,000 to $120,000 *Pay ranges are a general guideline only and not a guarantee of compensation. Compensation may vary depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Equal Opportunity Statement MEMX is an equal opportunity employer. We are committed to creating a diverse and inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. Diversity Inclusion Statement At MEMX, we believe that diversity and inclusion are essential to driving innovation and success. We welcome and celebrate individuals from all backgrounds and perspectives, and we strive to create an inclusive culture where everyone can thrive.
Senior SRE
LiveRampThe leading data enablement platform for the safe, easy, and effective use of data.
LiveRamp is the data collaboration platform of choice for the world’s most innovative companies. A groundbreaking leader in consumer privacy, data ethics, and foundational identity, LiveRamp is setting the new standard for building a connected customer view with unmatched clarity and context while protecting precious brand and consumer trust. LiveRamp offers complete flexibility to collaborate wherever data lives to support the widest range of data collaboration use cases—within organizations, between brands, and across its premier global network of top-quality partners. Hundreds of global innovators, from iconic consumer brands and tech giants to banks, retailers, and healthcare leaders turn to LiveRamp to build enduring brand and business value by deepening customer engagement and loyalty, activating new partnerships, and maximizing the value of their first-party data while staying on the forefront of rapidly evolving compliance and privacy requirements. LiveRamp powers exceptional experiences by making it safe and easy to connect the world's data, people, and applications. We are the industry pace-setter and one of the fastest growing SaaS businesses—the enabling product behind many of the world's biggest brands and technology platforms. The Global SRE team is responsible for owning and supporting deployments of global products, and providing first line operational support. We are looking for a Senior Site Reliability Engineer who is excited about establishing and advocating for best practices for product deployments and SRE. Relevant industry experience is important (Software Engineer, Site Reliability Engineer (SRE), Systems Engineer, DevOps Engineer, Network Engineer, Database Administrator or similar role), but ultimately less so than your demonstrated abilities and attitude. You will be able to leverage your software engineering expertise to understand the needs of teams and guide them in improving their systems. You will: - Support and/or own the deployment of global products including setting up production and internal environments - Provide 24/7 first line of Engineering support (via follow the sun teams in all regions) for any issues related to global product deployment, availability and internal operations support. - Drive effective resolutions of core product issues with Engineering teams - Setup and maintain Infrastructure & Product Reliability monitoring and alerting - Maintain and enhance CI/CD Tooling and Terraform scripts in support of the mission in close collaboration with DevOps team - Maintain and enhance Engineering Operational Documentation for supported products. - Provide expertise to build and maintain products operational documentation and setting up product SRE practices - Experience working with real-time and NoSQL Databases such as SingleStore DB, ScyllaDB, Cassandra or Dynamodb - Optimize the performance and cost of the systems and rightsize Kubernetes containers. - Work in close collaboration with SRE team members and Engineering organizations based in California, Paris, Nantong, Singapore, Australia and others. Required Skills: - 5+ years of experience in the fields of SRE, DevOps or production engineering - Experience in Infrastructure as code (IaC) using Terraform - Experience in building continuous integration declarative pipelines in Jenkins or CircleCI - Experience with platforms like Kubernetes, Containers and public clouds (GCP or AWS) - Experience with deployment and monitoring of highly scalable products. - Hands on experience on FinOps and autoscaling Kubernetes clusters. - Experience in Python or Go programming language. - Experience with SRE best practices, working knowledge of observability principles is a big plus - Ability to lead and mentor other engineers in the team for SRE best practices - Ability to diagnose technical problems, debug code, and automate routine tasks - Experience with securing systems in a public cloud environment - Understands how to engage other engineers as stakeholders - Enjoy working as part of a distributed team: smart, ethical, friendly, hard-working, and productive - Bachelor’s degree in Computer Science, Engineering, Mathematics, or a related technical field, or equivalent practical experience Benefits: - People. Work with talented, collaborative, and friendly people who love what they do. - Food. Enjoy catered meals, boundless snacks, and the occasional food truck. - Fun. We host events such as game nights, happy hours, camping trips, and sports leagues. - Stock. Every employee is a stakeholder in our future. Health and Saving. Receive the benefits of comprehensive health, dental, vision and disability insurance along with a 401k matching plan. More about us: For All NYC POSTINGS & SF POSTINGS The approximate annual base compensation range is $135,000 to $159,000. The actual offer, reflecting the total compensation package and benefits, will be determined by a number of factors including the applicant's experience, knowledge, skills, and abilities, geography, as well as internal equity among our team. LiveRamp is the leader in data connectivity, helping the world’s largest brands use their data to improve customer interactions on any channel and device. We thrive on mind-bending technical challenges and value entrepreneurship, humility, and constant personal growth. There is so much more that we want to build and that we could continue to improve. We value strong engineers who are agile enough to hit the ground running and tackle challenges. To all recruitment agencies: LiveRamp does not accept agency resumes. Please do not forward resumes to our jobs alias, LiveRamp employees or any other company location. LiveRamp is not responsible for any fees related to unsolicited resumes. LiveRamp is an affirmative action and equal opportunity employer (AA/EOE/W/M/Vet/Disabled) and does not discriminate in recruiting, hiring, training, promotion or other employment of associates or the awarding of subcontracts because of a person's race, color, sex, age, religion, national origin, protected veteran, disability, sexual orientation, gender identity, genetics or other protected status. Qualified applicants with arrest and conviction records will be considered for the position in accordance with the San Francisco Fair Chance Ordinance. Benefits: - People: Work with talented, collaborative, and friendly people who love what they do. - Fun: We host in-person and virtual events such as game nights, happy hours, camping trips, and sports leagues. - Work/Life Harmony: Flexible paid time off, paid holidays, options for working from home, and paid parental leave. - Comprehensive Benefits Package: LiveRamp offers a comprehensive benefits package designed to help you be your best self in your personal and professional lives. Our benefits package offers medical, dental, vision, life and disability, an employee assistance program, voluntary benefits as well as perks programs for your healthy lifestyle, career growth and more. - Savings: Our 401K matching plan—1:1 match up to 6% of salary—helps you plan ahead. Also Employee Stock Purchase Plan - 15% discount off purchase price of LiveRamp stock (U.S. LiveRampers) - RampRemote: A comprehensive office equipment and ergonomics program—we provide you with equipment and tools to be your most productive self, no matter where you're located More about us: LiveRamp’s mission is to connect data in ways that matter, and doing so starts with our people. We know that inspired teams enlist people from a blend of backgrounds and experiences. And we know that individuals do their best when they not only bring their full selves to work but feel like they truly belong. Connecting LiveRampers to new ideas and one another is one of our guiding principles—one that informs how we hire, train, and grow our global team across nine countries and four continents. Click here to learn more about Diversity, Inclusion, & Belonging (DIB) at LiveRamp. LiveRamp is an affirmative action and equal opportunity employer (AA/EOE/W/M/Vet/Disabled) and does not discriminate in recruiting, hiring, training, promotion or other employment of associates or the awarding of subcontracts because of a person's race, color, sex, age, religion, national origin, protected veteran, disability, sexual orientation, gender identity, genetics or other protected status. Qualified applicants with arrest and conviction records will be considered for the position in accordance with the San Francisco Fair Chance Ordinance. We use automated decision systems (ADS) as part of our recruitment and hiring process. If you require an accommodation or believe that the use of an ADS may create a barrier to your application or participation in the hiring process due to a disability or other protected characteristic, please let us know. We are committed to providing reasonable accommodations and ensuring an equitable hiring experience for all candidates. California residents: Please see our California Personnel Privacy Policy for more information regarding how we collect, use, and disclose the personal information you provide during the job application process. To all recruitment agencies: LiveRamp does not accept agency resumes. Please do not forward resumes to our jobs alias, LiveRamp employees or any other company location. LiveRamp is not responsible for any fees related to unsolicited resumes.

