Job Closed
This listing is no longer active.
The cannabis retail platform for modern dispensaries. Making safe cannabis products accessible to every adult on Earth.
Software Engineer II – SRE/DevOps
Location
United States
Posted
104 days ago
Salary
$115K - $145K / year
Seniority
Senior
Job Description
Software Engineer II – SRE/DevOps
Flowhub
• Own the stability and reliability of all environments, including production, by implementing and supporting SRE practices across the organization. • Lead the execution and maintenance of our Observability stack, developing and maintaining comprehensive monitoring, alerting, and logging capabilities. • Manage and tune core database systems (OLAP, OLTP) to ensure high availability and reliability as application and business needs evolve. • Lead performance testing and engineering efforts for key services, conducting load tests and identifying bottlenecks to maintain system scalability. • Maintain and optimize our CDN/Edge infrastructure, ensuring optimal performance and security feature configuration. • Provide critical cross-support for core infrastructure, assisting with automation, cluster maintenance, disaster recovery, and CI/CD troubleshooting. • Develop internal tooling and platform capabilities to accelerate engineering productivity. • Mentor and guide junior engineers and developers to ensure application and infrastructure needs are tightly aligned.
Job Requirements
- 3+ years of hands-on experience in an SRE, DevOps, or Platform Engineering role.
- Strong Experience in Observability and SRE Practices, utilizing modern tooling (e.g., Prometheus, Grafana, Datadog).
- Strong experience managing and optimizing databases, specifically focusing on high availability, scaling, and reliability of OLAP (Analytical) and OLTP (Transactional) systems.
- Solid foundation in core infrastructure technologies including Kubernetes (or similar orchestration), GCP (or other cloud services), and Infrastructure-as-Code (Terraform, etc.).
Benefits
- Health insurance
- Flexible work arrangements
- Professional development opportunities
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Site Reliability Engineer
PartlyBuilding the first global platform for replacement parts, starting with auto parts.
• Reliability Engineering: Ensure the stability, scalability, and security of our cloud infrastructure, Partly & 3rd party applications in our Kubernetes powered clusters. Leverage Infrastructure-as-Code and automation (Terraform for GCP, GitOps with ArgoCD, Custom scripts in Python/Bash, etc.) to deploy and manage workloads and resources in a repeatable, automated way. • Cost Optimisation: Monitor and optimise costs across our cloud and on-prem infrastructure, ensuring we get maximum value from our investments. Make recommendations for resource allocation or architecture changes to improve cost-efficiency without sacrificing reliability or performance. • Cross-Functional Collaboration: Work closely with developers, data engineers, and leadership to plan infrastructure needs and improvements. Provide tooling, guidance and training to the engineering team on SRE practices, and collaborate during software delivery to ensure smooth integrations from code to production. • Software Engineering: Make sure our software meets high production readiness standards. When you see a problem or an opportunity to improve, you drive the solution. • Troubleshooting: participate in incidents resolutions, give developers helping hand in debugging applications, networks, databases, compute systems.
DevOps Engineer
Booz Allen HamiltonBooz Allen Hamilton is an award-winning provider of strategic innovation, management consulting, technology, and engineering services. Founded in 1914, the comp
DevOps Engineer The Opportunity: Everyone is trying to “harness the cloud,” but not everyone knows how. As a DevOps engineer, you’re eager to develop, manage, and secure a container platform that meets your client’s needs and takes advantage of cloud capabilities. We need you to help us develop container management sof tware to solve some of our clients’ toughest challenges. As a platform DevOps Engineer at Booz Allen, you can use your technical skills to affect mission-forward change. On our team, you’ll strengthen your skills using the latest cloud technologies as you look for ways to improve your client’s environment with current container sof tware to ensure seamless orchestration. Using your DevOps platform knowledge, you’ll support your team as you inform strategy and design while ensuring standards are met throughout the containerization process. You’ll work with your team to recommend resources that will help your client manage and securely adopt containers. Additionally, you’ll gain DevOps skills and experience while supporting the development of critical cloud platforms. Work with us to use cloud platform technology for good. Join us. The world can’t wait. You Have: 2+ years of experience with containerization technologies 2+ years of experience with container orchestration platforms 2+ years of experience managing sof tware deployments through CI / CD pipelines Experience developing enterprise cloud-native solutions and applying basic principles, theories, and concepts Experience with OOP scripting or program languages Ability to work with AWS or Azure Ability to work with container orchestration platforms Secret clearance Bachelor's degree Nice If You Have: Knowledge of automation, programming and scripting languages, infrastructure automation, and microservices Knowledge of triaging and resolving issues related to both open source and commer cia l tools in public cloud environments Top Secret clearance Clearance: Applicants selected will be subject to a security investigation and may need to meet eligibility requirements for access to classified information Compensation At Booz Allen, we celebrate your contributions, provide you with opportunities and choices, and support your total well-being. Our offerings include health, life, disability, financial, and retirement benefits, as well as paid leave, professional development, tuition assistance, work-life programs, and dependent care. Our recognition awards program acknowledges employees for exceptional performance and superior demonstration of our values. Full-time and part-time employees working at least 20 hours a week on a regular basis are eligible to participate in Booz Allen’s benefit programs. Individuals that do not meet the threshold are only eligible for select offerings, not inclusive of health benefits. We encourage you to learn more about our total benefits by visiting the Resource page on our Careers site and reviewing Our Employee Benefits page. Salary at Booz Allen is determined by various factors, including but not limited to location, the individual’s particular combination of education, knowledge, skills, competencies, and experience, as well as contract-specific affordability and organizational requirements. The projected compensation range for this position is $61,900.00 to $141,000.00 (annualized USD). The estimate displayed represents the typical salary range for this position and is just one component of Booz Allen’s total compensation package for employees. This posting will close within 90 days from the Posting Date. Identity Statement As part of the application process, you are expected to be on camera during interviews and assessments. We reserve the right to take your picture to verify your identity and prevent fraud. Work Model Our people-first culture prioritizes the benefits of flexibility and collaboration, whether that happens in person or remotely. If this position is listed as remote or hybrid, you’ll periodically work from a Booz Allen or client site facility. If this position is listed as onsite, you’ll work with colleagues and clients in person, as needed for the specific role. Commitment to Non-Discrimination All qualified applicants will receive consideration for employment without regard to disability, status as a protected veteran or any other status protected by applicable federal, state, local, or international law.
ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE As a Site Reliability Engineer, you'll envision and build robust systems and processes that ensure our infrastructure is scalable, reliable, and efficient. This can range from automating deployments and monitoring systems to optimizing performance and managing incidents. We all work closely with our users, learning from their past struggles in operationalizing ML, onboarding them onto our platform, and turning our learnings into ideas for improving Baseten. EXAMPLE INITIATIVES You'll get to work on these types of projects as part of our Infrastructure team: Multi-cloud capacity management Inference on B200 GPUs Multi-node inference Fractional H100 GPUs for efficient model serving RESPONSIBILITIES Build and maintain scalable infrastructure to support the deployment and operation of machine learning models. Establish standards and best practices for reliability and performance across the infrastructure. Automate processes when relevant, particularly for managing CI/CD pipelines. Own products and projects end-to-end, functioning as both an engineer and a project manager, with a focus on user empathy, project specification, and end-to-end execution. Collaborate with cross-functional teams to understand project requirements and translate them into technical solutions. REQUIREMENTS Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field. 5+ years of professional work experience in a fast-paced, high-growth environment. Extensive experience with Kubernetes. Experience in building and maintaining scalable infrastructure. Experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation, Pulumi) and CI/CD tooling (e.g., GitHub Actions, GitLab CI, Circle CI, Jenkins). Relevant OSS observability experience (Prometheus, ELK stack, Grafana stack, Opentelemetry) is a plus. Ability to own projects end-to-end, from project specification to execution. No prior machine learning experience required, but should be open to learning about it. BENEFITS Competitive compensation, including meaningful equity. 100% coverage of medical, dental, and vision insurance for employee and dependents Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!) Paid parental leave Company-facilitated 401(k) Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities. Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you. At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.
• Build and maintain the Multigres Operator - Maintain our Go-based Kubernetes operator that orchestrates distributed Postgres deployments • Architect cloud deployment infrastructure - Design and implement robust deployment patterns for EKS and other Kubernetes platforms • Manage storage and networking layers - Work with CSI drivers, persistent volumes, and cross-cloud networking to ensure data reliability and connectivity • Develop deployment tooling - Create internal tools and automation for provisioning, scaling, and managing Multigres clusters • Ensure operational excellence - Build monitoring, alerting, and diagnostic capabilities into the deployment layer • Collaborate across teams - Work with database engineers, SRE, and product teams to deliver seamless deployment experiences



