Backblaze is the cloud storage innovator delivering a modern alternative to traditional cloud providers.
Cluster & Systems Capacity Engineer
Location
United States
Posted
3 days ago
Salary
$123K - $175K / year
Seniority
Senior
Job Description
Cluster & Systems Capacity Engineer
Backblaze
• Develop and maintain short, medium, and long-term capacity demand and hardware deployment forecasts across storage, compute, and network domains within the platform • Build predictive models that translate business demand signals into infrastructure requirements using historical utilization, growth trends, product sales plans, hardware lifecycle roadmaps, and other key business inputs • Partner with Infrastructure, Production, and Network Engineering teams to align capacity plans with system design and scaling initiatives • Develop and automate forecasting pipelines, simulation calculators and tools, and capacity dashboards to improve data quality, reduce manual analysis, and provide stakeholders clear visibility into platform usage and cluster health metrics • Monitor and analyze cluster and system-level utilization and performance across CPU, memory, IOPS, and network resources • Adjust deployment plans and recommended configurations in real-time to maintain adequate headroom and system stability in support of delivering a world-class customer experience • Partner with service and platform owners to develop headroom and live buffer policies, optimize hardware BoMs, leverage virtualized orchestration, and reduce product cost • Work in lockstep with Operations and Finance peers to align capacity plans and hardware requirements with capital budgets, cost targets, and financial outcomes • Support strategic optimization initiatives across infrastructure investments, engineering development, and operations processes, contributing to long-term infrastructure strategy and capital planning • Lead efforts to evaluate, procure, and provision requests for new or additional hardware, working with Systems and Network Engineering, SRE, NOC, and Data Center Operations teams to identify and deliver optimal solutions • Maintain alignment with Product and Sales to support customer onboarding, growth, and demand variability • Communicate complex capacity and infrastructure insights clearly to technical and non-technical stakeholders
Job Requirements
- Bachelor’s degree in Computer Science, Engineering, Mathematics, Data Science, Information Systems, Statistics or a related, technical field (or equivalent experience).
- 3-6+ years of experience in Site Reliability Engineering, Infrastructure Capacity Planning, Systems/Infrastructure Engineering, Production Engineering, Data Center Operations or similar Cloud Operations role
- Familiarity and experience working with Cloud Storage infrastructure, particularly highly-available, large-scale distributed systems supporting large amounts of data with high throughput and complex performance requirements
- Background in capacity modeling, performance analysis, scenario modeling, and/or infrastructure cost optimization, with an ability to quantify ideas within financial frameworks and forecasts.
- Proficiency in database and data analysis tools (preferably Snowflake, Metabase, Grafana, Python, SQL, Prometheus, Victoria Metrics, and Excel/Google Sheets)
- Demonstrated deep, creative, and logical thinking complimented by a strong data analysis skillset
- Excellent communication and documentation skills, with the ability to share knowledge and explain concepts accurately and concisely
- Desire to work on a highly-autonomous team that cares deeply about quality, cost, and the customer experience
Benefits
- Healthcare for family, including dental and vision
- Competitive compensation and 401K
- RSU grants for full-time employees
- ESPP program
- Flexible vacation policy
- Maternity & paternity leave
- MacBook Pro to use for work, plus a generous stipend to personalize your workstation
- Childcare bonus (human children only)
- Fertility treatment and support
- Learning & development program
- Commuter benefits
- Culture that supports a healthy work-life balance
Related Guides
Related Categories
Related Job Pages
More Systems Engineer Jobs
Senior Systems Engineer London, United Kingdom Full-time, Permanent 3 Days Onsite - Hybrid The Senior Systems Engineer is a Tier 2 support role responsible for the stability, performance, and continuous improvement of business critical applications supporting the Global Financial Solutions (GFS) business unit. This role acts as the technical escalation point from Tier 1, provides deep application and systems expertise, and partners closely with development, infrastructure, and business stakeholders to resolve incidents, prevent recurrence, and improve service quality. A critical aspect of this role is strong Microsoft SQL Server expertise, including the ability to analyze data issues, support reporting, troubleshoot performance problems, and safely execute data related activities in line with operational and regulatory controls. The ideal candidate combines strong technical troubleshooting skills with a service oriented mindset and a solid understanding of financial services operational requirements. Key Responsibilities: - Provide Tier 2 support for GFS applications, including investigation, diagnosis, and resolution of complex incidents not resolved at Tier 1 - Act as an escalation point for application-related issues, ensuring timely resolution in line with SLAs and business priorities - Investigate incidents across application, database, and infrastructure layers - Manage incidents through to resolution, ensuring accurate documentation and stakeholder communication - Participate in major incident bridges, providing clear technical leadership and communication - Provide advanced support for MS SQL Server, including: writing and analyzing SQL queries, investigating data integrity and data quality issues, supporting application reporting and extracts, and troubleshooting performance and blocking issues - Analyze SQL logs, queries, indexes, and execution plans to diagnose issues - Work closely with DBAs and development teams on database‑related incidents and improvements - Monitor application health, performance, and availability using enterprise monitoring tools. Identify trends and proactively address potential issues before they impact the business - Partner with infrastructure and platform teams to ensure systems are resilient, scalable, and secure - Support application releases, patches, and configuration changes, including validation and post‑deployment monitoring - Review and assess changes for risk and operational readiness - Collaborate closely with GFS business users to understand application usage, pain points, and operational needs - Provide clear, concise communication to both technical and non‑technical stakeholders - Create and maintain technical documentation, runbooks, and support procedures - Contribute to knowledge articles to improve Tier 1 resolution rates and reduce incident volumes - Identify opportunities to improve application reliability, supportability, and operational efficiency - Support automation efforts for monitoring, alerting, and routine operational tasks - Promote best practices in application support, security, and compliance Required Skills & Experience: - Strong experience supporting enterprise applications in a production environment - Advanced Microsoft SQL Server Skills, including: - Complex SQL querying and data analysis - Understanding of indexing, query optimization, and performance tuning - Experience supporting reporting and data extracts - Solid understanding of: - Application architecture and integrations - Operating systems (Windows) - APIs, batch processing, and job scheduling - Experience with monitoring, logging, and alerting tools - Ability to troubleshoot across application, infrastructure, and integration layers - Typically 5+ years in application support, or a similar role - Proven experience in a Tier 2 or Tier 3 support function - Experience supporting systems in a financial services or regulated environment is strongly preferred - Strong analytical and problem‑solving skills - Calm, methodical approach when working under pressure - Excellent written and verbal communication skills - Ability to manage multiple priorities and incidents concurrently - Strong sense of ownership and accountability - Bachelor’s degree in Computer Science, Information Systems, or a related field, or equivalent experience - ITIL Foundation or higher - Relevant technical certifications (cloud, database, OS, or application platforms) are a plus About Us CSC is a global business, legal, and financial services company based in Wilmington, Delaware, USA, providing knowledge-based solutions to clients worldwide. We have offices and capabilities in over 140 jurisdictions in the Americas, Europe, Asia Pacific, and the Middle East, and more than 8,000 colleagues. We are the business behind business.® Visit our careers site to learn more about CSC and our commitment to our clients, communities, and each other. CSC is committed to creating a feeling of belonging through a diverse and growth-oriented environment where everyone is valued. CSC colleagues have global career opportunities and excellent benefits, including annual success-sharing bonuses or commission plans based on individual performance. To learn more, visit cscglobal.com/service/careers. We offer a range of support to colleagues with disabilities, ensuring people have the necessary resources to thrive in their roles. We encourage candidates to work closely with our talent acquisition partners to convey their specific needs. Our commitment to accessibility reflects our broader dedication to diversity and belonging, CSC only accepts resumes from employment agencies that are part of our approved supplier program. Resumes submitted from other agencies either to talent acquisition, our hiring leaders, employees, or through any other mechanism other than our supplier process, will not be eligible to claim related fees and the submitted resumes will be considered property of CSC. We encourage candidates to apply directly to our website and not through third-party sources. Disclaimer: The information above describes the general nature and level of work performed by employees in this role. It is not intended to describe all duties, responsibilities, and qualifications. About the Team At CSC®, we’re always looking ahead, finding ways to innovate, challenge the status quo, and anticipate the needs of our clients. We exceed expectations by adapting client ambitions and goals as our own. This Fierce Client Spirit has helped us adapt and create solutions that have enabled businesses to run smoother and smarter for more than 125 years. It’s also the reason we’re the trusted partner of many of the world’s most successful organizations. CSC is committed to attracting, developing, and retaining talented people whose values align with ours. We empower our colleagues to bring the right solutions to market to meet client demand. That’s why we are the leading provider of business administration and compliance solutions. - CSC is a great place to work with smart and dedicated people. - We have won several employer recognition awards, including Top Workplace USA, Great Places to Work India, and Built In’s Best Places to Work. - We offer fulfilling work and career opportunities. Most positions are filled with internal moves and employee referrals. - Employees are eligible for Success Sharing, bonuses, or commission plans based on role and individual performance. - CSC offers a competitive and comprehensive benefits package that includes annual leave, tuition reimbursement, referral bonuses, and more. - As business needs allow, CSC offers hybrid or remote work schedules in alignment with local regulations. Specific details for this position will be discussed during the interview process.
Business Systems Analyst
The AES GroupWe bring businesses and talent together to deliver the most innovative technology solution that create the most positive
• Support the development of Statements of Work for system integration and platform implementation activities • Participate in requirements gathering sessions with business and technical stakeholders • Analyze business needs and translate them into detailed System Requirements Documents • Create process flows, system impact assessments, functional specifications, and documentation artifacts • Identify and document functional and non-functional system requirements • Ensure requirements are clearly communicated to development and delivery teams • Identify stakeholders and collaborate with Information Systems teams to design effective solutions • Facilitate solution design sessions with developers, architects, analysts, and QA teams • Create screen layouts, wireframes, and user interface mockups to support development • Conduct peer reviews and incorporate feedback into requirements documentation • Lead requirements walkthroughs with development and testing teams • Review test coverage and help prioritize QA scenarios • Analyze defects, identify root causes, and recommend corrective actions • Track deliverables, milestones, and requirements-related activities • Provide effort estimates and support project planning • Manage change requests and maintain requirements traceability • Support governance processes and ensure documentation accuracy
• Supports and governs assigned finance/accounting applications • Responsible for system audits and reviewing application outputs • Builds and maintains dashboards, reports, and presentations • Assists with month-end close processes and analytical reporting
• EDI Guardian: Manage and ensure the smooth operation of Finnet's EDI software, acting as a technical reference and troubleshooter. • Connectivity Lead: Configure and maintain secure communications via Site-to-Site VPN (Juniper and AWS), ensuring data integrity and confidentiality. • Automation Architect: Develop and implement automation and process monitoring routines to optimize workflows and minimize errors. • Cloud Specialist: Configure and manage servers in cloud environments (AWS, GCP, Huawei, Azure), ensuring high availability and scalability. • Strong EDI experience: Minimum of 2 years of hands-on EDI experience, with knowledge of standards such as EDIFACT, X12, RND, CNAB, NF-e CONEMB and NOTIFIS. • EDI/B2B project experience: Active participation in EDI/B2B implementation projects, with a focus on the financial and banking sector. • Financial process expertise: Deep understanding of financial-sector business processes, with the ability to interpret and analyze related EDI documents. • Planning skills: Ability to create and manage project schedules, ensuring deadlines and objectives are met. • Sterling B2B Integrator expertise: Hands-on experience with Sterling B2B Integrator, including configuration, maintenance and troubleshooting. • File transfer tools: Proficiency with tools such as Connect:Direct, XFB, CFT, STCP and Control Center, ensuring secure and efficient file transfers. • Batch process experience: Knowledge of pre-batch and post-batch processes, including management, generation, import and file consistency analysis. • Workflow familiarity: Experience with workflows such as file reception, delivery and mapping, among other related functions. • Education: Bachelor's degree in IT-related fields such as Computer Science, Information Systems or Computer Engineering.




