Job Closed
This listing is no longer active.
Grafana Labs supports organizations’ monitoring, visualization and observability goals. 950,000+ active installations
Staff Software Engineer – Grafana Cloud, k6
Location
United States
Posted
108 days ago
Salary
$175.0K - $210.0K / year
Seniority
Lead
Job Description
Grafana Labs
- Build and scale a strong culture of operational excellence by defining standards and coaching teams to own reliability and availability.
- Drive mature DevOps/SRE practices, including incident response and PIRs, on-call readiness, runbooks, alerting, observability, and release/change management.
- Establish reliability frameworks such as SLIs/SLOs and error budgets, and use them to guide prioritization and engineering trade-offs.
- Provide visibility into system health through clear operational metrics and reliability reporting.
- Guide teams in the design, development, evolution, and operation of large-scale, distributed cloud systems.
- Influence product and system direction through design reviews, architectural discussions, and cross-team collaboration.
- Share knowledge through clear, high-quality documentation and technical communication (internally and, where appropriate, externally) to help teams build and operate systems more effectively.
- As the reliability foundation matures, grow into broader application and product development leadership, contributing architectural and technical depth beyond operations.
Job Requirements
- Strong experience with DevOps/SRE practices, including operating and evolving production systems at scale
- Strong programming background in a modern language (Python and Go are the primary languages used, but prior experience with them is not required)
- Experience designing, building, and operating large-scale distributed systems
- Strong understanding of reliability engineering concepts (e.g. incident management, observability, and failure modes)
- Experience with test automation, including performance and functional testing
- Ability to influence engineering practices through clear technical communication, reviews, and collaboration
- Strong interpersonal skills and ability to work effectively across teams
- Familiarity with modern software engineering processes and delivery practices
- Self-driven and comfortable operating with a high degree of autonomy and ambiguity
Benefits
- 100% Remote, Global Culture
- Scaling Organization – Tackle meaningful work in a high-growth, ever-evolving environment.
- Transparent Communication – Expect open decision-making and regular company-wide updates.
- Innovation-Driven – Autonomy and support to ship great work and try new things.
- Open Source Roots – Built on community-driven values that shape how we work.
- Empowered Teams – High trust, low ego culture that values outcomes over optics.
- Career Growth Pathways – Defined opportunities to grow and develop your career.
- Approachable Leadership – Transparent execs who are involved, visible, and human.
- Passionate People – Join a team of smart, supportive folks who care deeply about what they do.
- In-Person Onboarding – We want you to thrive from day 1, learning alongside your fellow new ‘Grafanistas’ all about what we do and how we do it.
- Balance Is Key – We operate a global annual leave policy of 30 days per annum. Three days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect.



