Paweł Krupa (paulfantom)
8571536327
docs/node-mixin: add missing selectors
...
Signed-off-by: Paweł Krupa (paulfantom) <pawel@krupa.net.pl>
2 years ago
Daniel Lenar
0b50eb7294
Reverse fsSpaceAvailableCriticalThreshold and fsSpaceAvailableWarningThreshold
...
Currently critical alert for space available alerts on warning and
warning alert for space available alerts on critical.
Signed-off-by: Daniel Lenar <dlenar@vailsys.com>
3 years ago
Vitaly Zhuravlev
8823605f12
Fix NodeFileDescriptorLimit alerts
...
Signed-off-by: Vitaly Zhuravlev <zhuravlev.vitaly@gmail.com>
3 years ago
paulfantom
832909dd25
docs/node-mixin/alerts: make NodeFilesystemAlmostOutOfSpace fire earlier
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
3 years ago
Loïc Blot
55ffe57cbc
feat(rules): add NodeFileDescriptorLimit kernel exhaustion alert
...
Add a new alert when fs.file-nr is close to fs.file-max
Signed-off-by: Loic Blot <loic.blot@unix-experience.fr>
4 years ago
Ben Kochie
eefb18db02
Merge pull request #1764 from dhoppe/patch-1
...
Use description instead of message as field for annotations
4 years ago
Ben Kochie
4b68aeb80a
Merge pull request #1862 from fsschmitt/fix/alerts-label-naming
...
fix: node_md_disks state label from fail to failed
4 years ago
Björn Rabenstein
9c9c636305
Merge pull request #1861 from paulfantom/network-alerts
...
docs/node-mixin/alerts: use ratio for network alerts
4 years ago
paulfantom
f81747e608
docs/node-mixin/alerts: add max error condition to alert about desynchronized clock
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
4 years ago
fsschmitt
effa4da989
fix: node_md_disks state label as failed
...
Signed-off-by: fsschmitt <492108+fsschmitt@users.noreply.github.com>
4 years ago
paulfantom
d7cbe85d22
docs/node-mixin/alerts: use a rate for network alerts
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
4 years ago
Nicolas Lamirault
ff2ff3410f
Configure 2 thresholds for NodeFilesystemAlmostOutOfSpace alert ( #1835 )
...
* Add: configure 2 thresholds for NodeFilesystemAlmostOutOfSpace alert
Signed-off-by: Nicolas Lamirault <nicolas.lamirault@gmail.com>
4 years ago
Rajat Vig
7dd8adf7ed
Fix NodeRAIDDegraded to not use a string rule expressions
...
Signed-off-by: Rajat Vig <rvig@etsy.com>
4 years ago
Simon Pasquier
02212dd2c6
Run jsonnetfmt
...
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
4 years ago
Hao Ke
9b7a0d06a1
Fix syntax error
...
Signed-off-by: Hao Ke <hao.ke@auryc.com>
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
4 years ago
paulfantom
e4ec8e04c5
docs/node-mixin: add alerts about failing RAID array
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
4 years ago
Dennis Hoppe
fc64b70386
Use description instead of message as field for annotations
...
Signed-off-by: Dennis Hoppe <github@debian-solutions.de>
4 years ago
Frederic Branczyk
b42819b69d
Merge pull request #1657 from povilasv/NodeTextFileCollectorScrapeError
...
Add NodeTextFileCollectorScrapeError alert to mixin
5 years ago
Povilas Versockas
bd3e6d224c
Add NodeTextFileCollectorScrapeError alert to mixin
...
Signed-off-by: Povilas Versockas <p.versockas@gmail.com>
5 years ago
beorn7
8b00b22904
Fix sign error in `NodeClockSkewDetected`
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
paulfantom
820f8d595e
docs/node-mixin: alert on desynchronised clock
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
5 years ago
Neraud
1006a2c4bb
Add missing coma
...
Signed-off-by: Neraud <neraud.login@gmail.com>
5 years ago
Povilas Versockas
48bb6f670c
Add NodeHighNumberConntrackEntriesUsed
...
Signed-off-by: Povilas Versockas <p.versockas@gmail.com>
5 years ago
iuri aranda
0107bc7942
Make FS space alerts thresholds configurable ( #1624 )
...
* Make FS space alerts thresholds configurable (#1 )
This makes it possible to tweak the thresholds for
the NodeFilesystemSpaceFillingUp alerts. Which
might be necessary in systems like Kubernetes,
where the image garbage collector runs at 85%,
so it's not a problem that the disk reaches that usage %.
Signed-off-by: iuri aranda <iuri@skyscrapers.eu>
5 years ago
Leo
dfeec07f2f
Fix node-mixin prometheus alert rules to use percentage
...
Signed-off-by: Leo <leonardjonathanoh@live.com>
5 years ago
beorn7
97ef113762
Make the severity of "critical" alerts configurable
...
This addresses the blissful scenario where single-node failures are
unproblematic. No reason to wake somebody up if a node is about to
screw itself up by filling the disk.
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
3a770a0b1d
Convert annotations from message to summary/description
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
a92d1d7889
Address review comments, batch 2
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
b3b47f2d07
Make selector naming consistent
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
dec5b5b053
Fix indentation
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
beorn7
2df034c055
Move node-mixin into docs directory
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago