Commit Graph

43 Commits (cc390aab64badd803ce68c6d7fd4299592120799)

Author SHA1 Message Date
machine424 f9ca6c4ae6 chore: add an alert based on the metric prometheus_sd_kubernetes_failures_total
5 months ago
Will Bollock 839b9e5b53
fix: PrometheusNotIngestingSamples label matching
10 months ago
Leo Q 4268feb9d7
add alert for sd refresh failure (#12410)
1 year ago
Iain Lane e5cd5a33d0
PrometheusHighQueryLoad alert: use configured selector
2 years ago
Haoyu Sun 26a7f80aa1 add alert PrometheusHighQueryLoad.
2 years ago
fpetkovski 501a8a7865
Address code review comments
3 years ago
fpetkovski 877320784b
Add alert in mixin for exceeded sample limit
3 years ago
Haoyu Sun 3c903af474
Add Alert PrometheusScrapeBodySizeLimitHit
3 years ago
Niko Smeds 53ca693f9e Be specific
3 years ago
Niko Smeds 0bc2cbdd7d Leave time range for clean restarts as-is
3 years ago
Niko Smeds fdcd423dfe Increase time range for PrometheusHAGroupCrashlooping alert
3 years ago
Julien Duchesne 8855c2e626
Add `prometheus_tsdb_clean_start` metric (#8824)
3 years ago
Levi Harrison 2826fbeeb7
SD: Add target creation failure counter and change failure handling (#8786)
4 years ago
Damien Grisonnet b50f9c1c84
Add label scrape limits (#8777)
4 years ago
ravilr adc8807851
Update remote-write alert rules mixin (#8423)
4 years ago
beorn7 553f904f2d mixin: Add a capability to exclude non-prod AM instances
4 years ago
beorn7 638e99c814 prometheus-mixin: Make PrometheusRemoteWriteBehind more generic
4 years ago
beorn7 371ca9ff46 prometheus-mixin: add HA-group aware alerts
4 years ago
Simon Pasquier f381d8a9bd documentation/prometheus-mixin: improve PrometheusNotIngestingSamples
4 years ago
Julien Pivotto f482c7bdd7
Add per scrape-config targets limit (#7554)
4 years ago
Callum Styan 5400e71b91 Update mixin dashboards and alerts for new remote write label names.
5 years ago
Marco Pracucci 1e1785690a
Fix queue in alerts annotation
5 years ago
beorn7 9c8f9bfa63 Fix the description template for PrometheusRemoteWriteDesiredShards
5 years ago
beorn7 61617eb2d9 Fix PrometheusRemoteWriteDesiredShards
5 years ago
Simon Pasquier e36ab7e192
prometheus-mixin: improve description of sample alerts (#6050)
5 years ago
Björn Rabenstein 3b3eaf3496
Merge pull request #5787 from cstyan/reshard-max-logging
5 years ago
Callum Styan a98599bea8 Update remote write max shards alert; properly template/query for max
5 years ago
Callum Styan 3b75614892 Add a warning alert, since the remote write behind alert will probably
5 years ago
Simon Pasquier dd174963a2 prometheus-mixin: remove PrometheusTSDBWALCorruptions
5 years ago
beorn7 4825585834 Tweak tenses
5 years ago
beorn7 9a2177949d Protect gauge-based alerts against failed scrapes
5 years ago
beorn7 7a25a2586d Sync with alerts from kube-prometheus
5 years ago
beorn7 1336a28848 Use a config variable for the Prometheus name
5 years ago
beorn7 e34af6d4d3 Address various comments from the review
5 years ago
beorn7 23c03207e9 Fixed indentation
5 years ago
Tom Wilkie 38a9bbbec2 Loosen off PrometheusRemoteWriteBehind alert.
6 years ago
Tom Wilkie b615069289 Update metric names.
6 years ago
Tom Wilkie e248ffb220 Add alert for WAL remote write falling behind.
6 years ago
Tom Wilkie 638204c775 Typo
6 years ago
Tom Wilkie 8f42192e52 Add Prometheus alerts from kube-prometheus, remove the alertmanager alerts.
6 years ago
Tom Wilkie 50861d586a Alert if more than 1% of alerts fail for a given integration.
6 years ago
Tom Wilkie 266ba185fe Remove PromScrapeFailed alert.
6 years ago
Tom Wilkie ee1427faad Prometheus monitoring mixin for Prometheus itself.
6 years ago