Monitoring stack — teaches AI agents to deploy, configure, operate, and troubleshoot Prometheus metrics collection, Grafana dashboards, Alertmanager notifications, and the exporter ecosystem.
Install:
pipx install agentic-stacks # if you haven't already
agentic-stacks pull agentic-stacks/prometheus-grafana
Skills
concepts
Metrics model, time series, labels, metric types, PromQL preview
architecture
Component overview, data flow, network requirements, deployment patterns
prometheus
Install and configure Prometheus server (binary, Docker, Helm)
grafana
Install and configure Grafana (package, Docker, Helm)
alertmanager
Install and configure Alertmanager, routing, receivers
exporters
node_exporter setup, common exporters reference, Pushgateway
promql
PromQL query language — selectors, operators, functions, examples
dashboards
Grafana dashboards — panels, variables, provisioning, as code
alerting
Alert rules (Prometheus and Grafana), notification channels
recording-rules
Recording rules for performance and aggregation
storage
TSDB, retention, remote write/read, federation
upgrades
Upgrade procedures for Prometheus, Grafana, Alertmanager
prometheus-diagnose
Troubleshoot targets, cardinality, OOM, WAL, config errors
grafana-diagnose
Troubleshoot datasource, auth, performance, plugin issues
alertmanager-diagnose
Troubleshoot routing, notifications, silences, clustering
known-issues
Version-specific issues and breaking changes
compatibility
Version compatibility matrix for all components
best-practices
Naming, instrumentation, histogram, alerting, dashboard best practices