Simon Pasquier
e36ab7e192
prometheus-mixin: improve description of sample alerts ( #6050 )
...
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
5 years ago
Callum Styan
a98599bea8
Update remote write max shards alert; properly template/query for max
...
shards in description.
Signed-off-by: Callum Styan <callumstyan@gmail.com>
5 years ago
Callum Styan
3b75614892
Add a warning alert, since the remote write behind alert will probably
...
already be going off, about desired shards being higher than max shards.
Signed-off-by: Callum Styan <callumstyan@gmail.com>
5 years ago
Simon Pasquier
dd174963a2
prometheus-mixin: remove PrometheusTSDBWALCorruptions
...
The counter is only increased when tsdb.Open() is called which
Prometheus does only once in its lifetime (when it initializes). If the
corruption can't be recovered, tsdb.Open() returns an error and
Prometheus exits. Hence the metric is either 0 (no corruption) or 1
(corruption detected and repaired). If the latter, the alert isn't
actionable and the only way to resolve it is to restart Prometheus which
would reset the counter.
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
5 years ago
Matthias Loibl
20d12ff1c7
Fix prometheus-mixin dashboards to use grafanaDashboards
...
Signed-off-by: Matthias Loibl <mail@matthiasloibl.com>
5 years ago
beorn7
4825585834
Tweak tenses
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
9a2177949d
Protect gauge-based alerts against failed scrapes
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
52707535b8
Remove/improve unused variables and weird doc comments
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
7a25a2586d
Sync with alerts from kube-prometheus
...
While doing so, re-introduce the summary/description
annotations. Also, add a few more rules and tweak a few of the
existing ones.
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
ded0705bdc
Update remote repo for grafana-builder dependency
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
1336a28848
Use a config variable for the Prometheus name
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
613cb5430c
Add a "work in progress" disclaimer.
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
e34af6d4d3
Address various comments from the review
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
23c03207e9
Fixed indentation
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
d5845ad05b
Fix formatting
...
This is the outcome of `make fmt`.
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
d45e8a0f61
Adjust to jsonnet v0.13
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
5c04ef3935
Make README.md immediately useful
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
ddfabda152
Add Makefile and suitable jsonnet files
...
This makes the mixins usable as abvertised.
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
e943803a3c
Add .gitignore file
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
Callum Styan
a5762f3681
Add dashboard for remote write to prometheus-mixin.
...
Signed-off-by: Callum Styan <callumstyan@gmail.com>
6 years ago
Tom Wilkie
38a9bbbec2
Loosen off PrometheusRemoteWriteBehind alert.
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
6 years ago
Tom Wilkie
b615069289
Update metric names.
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
6 years ago
Tom Wilkie
e248ffb220
Add alert for WAL remote write falling behind.
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
6 years ago
Tom Wilkie
638204c775
Typo
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
6 years ago
Tom Wilkie
8f42192e52
Add Prometheus alerts from kube-prometheus, remove the alertmanager alerts.
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
6 years ago
Tom Wilkie
dfbdf8d3bb
Add a basic readme with link to the mixin docs.
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
6 years ago
Tom Wilkie
5fd712b210
copypasta.
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
6 years ago
Tom Wilkie
50861d586a
Alert if more than 1% of alerts fail for a given integration.
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
6 years ago
Tom Wilkie
266ba185fe
Remove PromScrapeFailed alert.
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
6 years ago
Tom Wilkie
e8a8ce5654
Basic Prometheus dashboard.
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
6 years ago
Tom Wilkie
ee1427faad
Prometheus monitoring mixin for Prometheus itself.
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
6 years ago