Skip to main content

Observability Overview

observability overview infographic

Bloodraven exposes operational state through Kubernetes status, Events, structured logs, Prometheus metrics, and Grafana dashboards.

Signals by audience

AudiencePrimary signals
On-callAlerts, runbooks, MysqlFailoverGroup conditions, Events
PlatformOperator health, reconcile errors, RBAC errors, DNS updates
Database ownerReplication lag, backup freshness, verification freshness, PITR archive lag
App teamActive-site DNS, connection errors, write success rate, pool reconnect behavior

Setup path

  1. Enable the metrics Service in the Helm chart.
  2. Configure Prometheus scraping with a ServiceMonitor or raw scrape config.
  3. Install Grafana dashboards.
  4. Install the alert package used by your environment.
  5. Link each alert to Runbooks.
  6. Use the Observability Change Checklist for PRs or releases that change observability signals.

Pages