
Jetbro
Remote Jobs
Hiring now
1 Job
• Audit Prometheus scrape targets, exporters, and metric endpoints
• Review Grafana dashboards, alert rules, and data sources
• Assess log coverage across Kibana and Loki
• Map monitoring coverage across application, infrastructure, database, ingress, and platform layers
• Identify missing exporters, stale dashboards, broken panels, and alert gaps
• Analyze historical metrics to establish performance baselines
• Define SLOs, KPIs, warning thresholds, and breach thresholds
• Suggest Prometheus alert rules and Alertmanager routing strategies
• Implement KPI and SLO alerts within Grafana alert management
• Evaluate Kubernetes cluster topology and infrastructure usage patterns
• Recommend architecture optimizations based on observed load and behavior
• Document findings in structured audit and advisory reports
• Participate in weekly syncs and structured handover sessions