GitLab, founded in 2011 and based in San Francisco, California, maintains a distributed team of professionals that work remotely across multiple continents. GitLab advocates for pr
Intermediate Site Reliability Engineer, Environment Automation
Location
United States
Posted
45 days ago
Salary
$103.6K - $222K / year
Seniority
Senior
Job Description
GitLab
- Contribute to the design and evolution of infrastructure automation using Terraform, Ansible, and Kubernetes
- Help debug and resolve production issues across Kubernetes clusters
- Assist in creating and maintaining deployment and orchestration tools
- Contribute to automating operational tasks across many GitLab environments
- Help build and refine the observability stack for multi-tenant GitLab environments
- Assist in responding to platform alerts and incidents
- Support planning and implementation of infrastructure changes
- Develop and maintain scripts, automation tools, and infrastructure-as-code workflows
- Apply best practices for running GitLab on Kubernetes and cloud platforms
- Participate in the on-call rotation for production GitLab environments
- Document operational tasks, runbooks, and lessons learned
Job Requirements
- Experience working as an SRE or in a similar role operating production infrastructure
- Hands-on experience with backend programming languages such as Golang
- Hands-on experience running Kubernetes-based workloads in production
- Familiarity with infrastructure automation and configuration management tools such as Terraform and Ansible
- Solid understanding of Git-based workflows and infrastructure-as-code practices
- Experience working in distributed systems or cloud-based production environments
- A proactive mindset focused on automation and documentation
- Comfort working asynchronously across distributed teams
Benefits
- Benefits to support your health, finances, and well-being
- Flexible Paid Time Off
- Team Member Resource Groups
- Equity Compensation & Employee Stock Purchase Plan
- Growth and Development Fund
- Parental leave
- Home office support
More DevOps Engineer Jobs
About Binagora
At Binagora, we are a fully remote community of software crafters with over a decade of experience partnering with international clients. We collaborate with bold organizations to deliver high-quality, custom solutions that achieve tangible results. Our expertise spans various sectors, including Media & Entertainment, Solar Energy, Healthcare, Marketing, Audit & Compliance, and Diversity & Inclusion, among many others. From initial strategy to final delivery, we go above and beyond, infusing creativity and aligning with business objectives to develop innovative products that challenge conventions and propel businesses forward.
Our Client
Our client is on a mission to be the category-defining global platform for connected risk, continually elevating customers through innovation. Founded by enterprise audit, risk, and compliance professionals who experienced the challenges of managing internal controls and fieldwork manually, they developed a platform to transform how organizations manage risk. Today, the platform is trusted by over 50% of the Fortune 500, empowering them to streamline their risk management processes.
Solution Consultant, DevOps
Atlassian
Atlassian is a publicly-traded computer software business specializing in collaboration, development, and issue-tracking software for teams. As an employer, Atlassian maintains a t
Solution Consultant, DevOps
Sales | New York, United States | Remote | Full-Time
Working at Atlassian
Atlassians can choose where they work, whether in an office, from home, or a combination of the two. That way, Atlassians have more control over supporting their family, personal goals, and other priorities. We can hire people in any country where we have a legal entity. Interviews and onboarding are conducted virtually, a part of being a distributed-first company.
The Atlassian Advisory Services team is a globally distributed team of Atlassian solutions advisors who are passionate about creating customer success. Advisory Services team members engage with enterprise organizations with some of the most complex business challenges and help them deliver a delightful solution to their users. We provide teams of trusted advisors helping orchestrate successful outcomes with customers to help them get the most out of their Atlassian investment and unlock their ideal solution for team collaboration.
At Atlassian, you'll have an impact on millions of users, fast! We don't just want to know your opinions, we want to see your ideas in action. We hire great people and then trust them to be great.
We're hiring a Solution Consultant with a DevOps focus to join our Advisory Services Delivery team, reporting to a Delivery Manager. This is not a managerial role. Atlassian Solution Consultants are technical experts with Atlassian solutions, delivering guidance to drive value realization for clients that have purchased Advisory Services from Atlassian. Solution Consultants are accomplished at delivering performant technical guidance at scale, aligning product capabilities with business needs and desired outcomes. They partner to provide the solutions and technical guidance to help the customer achieve their desired outcomes and goals.
You will help our strongest promoters showcase successes to their peers, and serve as the tip of the spear in growing the reach of our technologies for new use cases and markets. This role aims to help enterprise customers get the most out of their Atlassian investment.
- Collaborate with your peers in Advisory Services to align on strategic outcomes that deliver exceptional service to our customers
- Partner with customers to help solve their business challenges and achieve their internal goals through Atlassian products, practices, and solutions
- Identify and promote opportunities for service and product expansion within a client’s organization
- Cultivate deep industry and solution expertise, staying up-to-date with evolving best practices that support different types of teams
- Demonstrated expertise in DevSecOps, including:
  - Direct experience with popular best-in-breed products in the DevSecOps space and Atlassian ecosystem that support enterprise-grade SDLC and CI/CD pipelines
  - Background in software development and/or IT operations roles
  - Experience directing stakeholders through Atlassian Cloud transformation and adoption at an enterprise scale
  - Experience identifying, testing, deploying, and/or integrating best-in-breed tooling that satisfies customer requirements and improves the end-user experience across their entire SDLC pipeline
  - Experience with application security, compliance, hardening techniques, and best practices to secure software and infrastructure
- Spend up to 30% of your time traveling domestically, and in some cases internationally, for both internal and customer-facing events
- 4-6 years of experience within SaaS companies
- Demonstrated expertise in our Cloud Platform, including:
  - Experience administering Atlassian Cloud ecosystems, including applications like Guard, Bitbucket, Jira Software, Jira Service Management, Confluence, Focus, Rovo, etc.
  - Background in SaaS-based architectures, integration methods, and associated security considerations
  - Experience performing Atlassian’s Cloud migration process at varying customer scales, and expertise in associated migration utilities and hybrid SaaS and on-premise deployment models
- 5+ years of experience in customer-facing roles where you’ve engaged with and influenced customer stakeholders that range from technical administrators to executive leadership
Compensation
At Atlassian, we strive to design equitable, explainable, and competitive compensation programs. To support this goal, the baseline of our range is higher than that of the typical market range, but in turn we expect to hire most candidates near this baseline. Base pay within the range is ultimately determined by a candidate's skills, expertise, or experience. In the United States, we have three geographic pay zones. For this role, our current base pay ranges for new hires in each zone are:
Zone A: $128,700 - $168,025
Zone B: $116,100 - $151,575
Zone C: $107,100 - $139,825
This role may also be eligible for benefits, bonuses, commissions, and equity. Please visit go.atlassian.com/payzones for more information on which locations are included in each of our geographic pay zones. However, please confirm the zone for your specific location with your recruiter.
Benefits & Perks
Atlassian offers a wide range of perks and benefits designed to support you and your family and to help you engage with your local community. Our offerings include health and wellbeing resources, paid volunteer days, and so much more. To learn more, visit go.atlassian.com/perksandbenefits.
About Atlassian
At Atlassian, we're motivated by a common goal: to unleash the potential of every team. Our software products help teams all over the planet and our solutions are designed for all types of work. Team collaboration through our tools makes what may be impossible alone, possible together.
We believe that the unique contributions of all Atlassians create our success. To ensure that our products and culture continue to incorporate everyone's perspectives and experience, we never discriminate based on race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status. All your information will be kept confidential according to EEO guidelines.
To provide you the best experience, we can support with accommodations or adjustments at any stage of the recruitment process. Simply inform our Recruitment team during your conversation with them.
To learn more about our culture and hiring process, visit go.atlassian.com/crh.
- Collaborate with development and IT teams to design and implement automation solutions for infrastructure provisioning, configuration, and deployment.
- Maintain and improve CI/CD pipelines to ensure efficient and reliable software delivery.
- Propose, plan, and execute projects to improve existing solutions and processes.
- Implement and manage containerization technologies (Docker, Kubernetes) to enhance scalability and resource utilization.
- Troubleshoot and resolve issues related to system and application performance, ensuring high availability.
- Manage and optimize cloud-based infrastructure, including resource provisioning, cost optimization, and security.
- Work on enhancing monitoring and logging solutions to provide insights into system health and performance.
- Collaborate with cross-functional teams to ensure alignment between development and operations.
Are you someone who thrives on helping others succeed, enjoys making an impact, and takes pride in guiding customers to the right solutions for their projects? If you’re also naturally curious and eager to keep learning, consider starting or growing your career with us at The Home Depot.
Position: Manager, Technology Site Reliability Engineering (SRE) - eCommerce
Position Overview: The Manager, SRE will lead a team of Site Reliability Engineers to ensure the reliability, performance, and operational support of our eCommerce systems, with a focus on Google Cloud Platform (GCP) environments. This role requires a strong background in reliability reviews, performance engineering practices, production engineering, and operational support, with emphasis on DevOps principles and GCP expertise.
Responsibilities:
- Leadership & Management:
  - Lead and mentor a team of Site Reliability Engineers
  - Foster a culture of continuous improvement and innovation
  - Collaborate with cross-functional teams to align SRE practices with business objectives
- Reliability & Performance:
  - Conduct reliability reviews to identify areas for improvement and implement solutions to enhance system reliability, particularly in GCP environments
  - Implement and promote performance engineering practices to ensure optimal system performance on GCP
  - Develop and maintain service level objectives (SLOs) and error budgets
- Production Engineering & Operational Support:
  - Oversee production engineering efforts to ensure systems are designed for operational excellence and reliability, leveraging GCP services and best practices
  - Manage incident response and post-incident reviews to minimize downtime and improve system resilience
  - Implement monitoring, alerting, and observability solutions to proactively identify and address issues
  - Develop and maintain runbooks and playbooks for common operational tasks
  - Coordinate with security teams to ensure compliance with security policies and best practices
- DevOps & Continuous Improvement:
  - Drive DevOps initiatives to improve collaboration between development and operations teams, with a focus on GCP-native tools and services
  - Implement and maintain CI/CD pipelines to streamline deployment processes in GCP environments
  - Identify and implement automation opportunities to reduce manual tasks and improve efficiency
  - Promote the use of Infrastructure as Code (IaC) to manage and provision cloud resources
  - Continuously evaluate and integrate new tools and technologies to enhance DevOps practices
- Release Management:
  - Implement and maintain release management best practices to minimize disruptions and maximize system stability
  - Collaborate with DevOps teams to integrate release management into CI/CD pipelines
  - Oversee release schedules, ensuring minimal impact on business operations
  - Ensure there is a rigorous release readiness process in place that includes reviews and post-release retrospectives
  - Maintain a release calendar and communicate release plans to stakeholders
- Strategic Planning:
  - Create and maintain a strategic roadmap for SRE initiatives, aligning with business goals and technological advancements
  - Refine and standardize Standard Operating Procedures (SOPs) to enhance operational efficiency and consistency
  - Address customer pain points by developing and implementing solutions that improve user experience and system reliability
  - Engage with stakeholders to understand their needs and incorporate feedback into strategic planning and execution
  - Monitor industry trends and best practices to ensure the SRE team remains at the forefront of technology
Experience:
- Bachelor’s degree in Computer Science, Engineering, or a related field
- Strong problem-solving and analytical abilities
- Excellent communication and collaboration skills
- 4-6 years of relevant work experience, including significant experience with GCP
- Extensive experience with cloud infrastructure, GCP services, and architecture
- Proven track record of managing and optimizing large-scale systems on GCP
- Proven ability to effectively communicate with individuals at all levels of the organization
- Ability to maintain relationships and negotiate with vendors
- Ability to operate in and leverage resources in a matrixed environment
- Ability to analyze and present data to support ideas
- Ability to clearly communicate to all levels of the organization
The pay range for this position is between $89,600.00 and $108,800.00




