Björn Rabenstein
b5a2ad46e3
Merge pull request #2351 from grafana/vzhuravlev/macos
...
Add darwin dashboard
3 years ago
Daniel Lenar
0b50eb7294
Reverse fsSpaceAvailableCriticalThreshold and fsSpaceAvailableWarningThreshold
...
Currently the critical alert for available space fires at the warning
threshold, and the warning alert fires at the critical threshold.
Signed-off-by: Daniel Lenar <dlenar@vailsys.com>
3 years ago
Gabriel Amaral Antunes
410e069471
Add darwin dashboard to mixin
...
Signed-off-by: Vitaly Zhuravlev <zhuravlev.vitaly@gmail.com>
3 years ago
Vitaly Zhuravlev
8823605f12
Fix NodeFileDescriptorLimit alerts
...
Signed-off-by: Vitaly Zhuravlev <zhuravlev.vitaly@gmail.com>
3 years ago
Severyn Lisovskyi
7b86b7cb29
[node-mixin] change current datasource to grafana's default
...
Signed-off-by: Severyn Lisovskyi <993215+sev3ryn@users.noreply.github.com>
3 years ago
Julian Wiedmann
3e6f4ce627
mixin: exclude iowait and steal from CPU Utilisation (#2194)
...
'iowait' and 'steal' indicate specific idle/wait states, which shouldn't
be counted into CPU Utilisation. Also see
https://github.com/prometheus-operator/kube-prometheus/pull/796 and
https://github.com/kubernetes-monitoring/kubernetes-mixin/pull/667.
Per the iostat man page:
    %idle
        Show the percentage of time that the CPU or CPUs were idle and the
        system did not have an outstanding disk I/O request.
    %iowait
        Show the percentage of time that the CPU or CPUs were idle during
        which the system had an outstanding disk I/O request.
    %steal
        Show the percentage of time spent in involuntary wait by the
        virtual CPU or CPUs while the hypervisor was servicing another
        virtual processor.
Signed-off-by: Julian Wiedmann <jwi@linux.ibm.com>
3 years ago
Ben Kochie
421fc429f3
Replace deprecated linter (#2176)
...
Upstream is replacing `golint` with `revive`.
* Cleanup unused mixin go files.
Signed-off-by: Ben Kochie <superq@gmail.com>
3 years ago
ngc104
4bc1c02000
fix bug in #2130 (#2170)
...
Signed-off-by: Yves Mettier <yves.mettier@orange.com>
Co-authored-by: Yves Mettier <yves.mettier@orange.com>
3 years ago
Tom Wilkie
9bc184d236
Datasource template variable should be labelled 'Data Source'
...
Signed-off-by: Tom Wilkie <tom@grafana.com>
3 years ago
Ben Kochie
5a38949451
Fix up mixin tests (#2167)
...
Use new Go install format, cleanup working dir setup.
Signed-off-by: Ben Kochie <superq@gmail.com>
3 years ago
Julien Pivotto
68a6c78c0d
Update go to 1.17 (#2159)
...
Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>
3 years ago
Michal
186e2e79c8
add yamllint config, fix yamllint errors (#2088)
...
After a recent change in prometheus/prometheus, Makefile.common now
includes a yamllint target, which currently fails. This PR adds the
missing yamllint config and fixes the yamllint errors.
Signed-off-by: Michal Wasilewski <mwasilewski@gmx.com>
3 years ago
Ben Kochie
aeef1edd62
mixin: Add fallback for MemAvailable (#2130)
...
Add a fallback to Buffers+Cached+MemFree+Slab for older Linux kernels
where the MemAvailable metric is not available for memory utilization.
Signed-off-by: Ben Kochie <superq@gmail.com>
3 years ago
Johannes 'fish' Ziemke
6f1286b314
mixin: Drop mode label for num cpu metric
...
Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>
3 years ago
Johannes 'fish' Ziemke
fa9926c4eb
mixin: Cheaper calculation for instance:node_num_cpu:sum
...
Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>
3 years ago
paulfantom
832909dd25
docs/node-mixin/alerts: make NodeFilesystemAlmostOutOfSpace fire earlier
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
3 years ago
Johannes 'fish' Ziemke
7fc5c6045a
Read config from $
...
Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>
3 years ago
ArthurSens
3731f93fd7
Refactor USE method mixin dashboards with grafonnet-lib, add multi-cluster support.
...
Aiming for cleaner code and following standards used on younger mixins.
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
3 years ago
Frederic Hemberger
5bee84f30d
docs: Replace `go get` with `go install` for command installation
...
`go get` is deprecated for installation of commands as of go v1.17
Ref: https://go.googlesource.com/go/+/ced0fdbad0655d63d535390b1a7126fd1fef8348
Signed-off-by: Frederic Hemberger <mail@frederic-hemberger.de>
3 years ago
Loïc Blot
55ffe57cbc
feat(rules): add NodeFileDescriptorLimit kernel exhaustion alert
...
Add a new alert when fs.file-nr is close to fs.file-max
Signed-off-by: Loic Blot <loic.blot@unix-experience.fr>
4 years ago
raviprasad_lr
504f9b785c
fix interval in graphs panels of node dashboard
...
Signed-off-by: raviprasad_lr <raviprasad_lr@yahoo.com>
4 years ago
Johannes 'fish' Ziemke
a5908bf82b
Make interval configurable
...
Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>
4 years ago
Johannes 'fish' Ziemke
772335caa8
Use 5m rate in mixins
...
The default scrape interval of Prometheus is 60s, so we can't use a 1m
rate.
Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>
4 years ago
Ben Kochie
eefb18db02
Merge pull request #1764 from dhoppe/patch-1
...
Use description instead of message as field for annotations
4 years ago
Ben Kochie
4b68aeb80a
Merge pull request #1862 from fsschmitt/fix/alerts-label-naming
...
fix: node_md_disks state label from fail to failed
4 years ago
Anthony D'Atri
8b466360a3
Modest doc improvements (#1876)
...
* Modest doc improvements
Signed-off-by: Anthony D'Atri <anthony.datri@gmail.com>
4 years ago
Julien Pivotto
f645d49242
Mixin: Bump jsonnet requirement to 0.16 to use go-jsonnetcmd
...
Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>
4 years ago
Matthias Loibl
77e76485c0
Use absolute jsonnet import paths
...
This should be the way forward when importing libraries in jsonnet. It's
closer to how Go imports look and makes it more obvious where packages
live.
This is not breaking anything, as the old imports were already symlinks
to the now directly used directories.
Signed-off-by: Matthias Loibl <mail@matthiasloibl.com>
4 years ago
Björn Rabenstein
9c9c636305
Merge pull request #1861 from paulfantom/network-alerts
...
docs/node-mixin/alerts: use ratio for network alerts
4 years ago
paulfantom
f81747e608
docs/node-mixin/alerts: add max error condition to alert about desynchronized clock
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
4 years ago
fsschmitt
effa4da989
fix: node_md_disks state label as failed
...
Signed-off-by: fsschmitt <492108+fsschmitt@users.noreply.github.com>
4 years ago
paulfantom
d7cbe85d22
docs/node-mixin/alerts: use a rate for network alerts
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
4 years ago
Arthur Outhenin-Chalandre
6585e43eec
Fix memory gauge in mixin with multiple pods
...
Signed-off-by: Arthur Outhenin-Chalandre <arthur@cri.epita.fr>
4 years ago
Nicolas Lamirault
ff2ff3410f
Configure 2 thresholds for NodeFilesystemAlmostOutOfSpace alert (#1835)
...
* Add: configure 2 thresholds for NodeFilesystemAlmostOutOfSpace alert
Signed-off-by: Nicolas Lamirault <nicolas.lamirault@gmail.com>
4 years ago
Rajat Vig
7dd8adf7ed
Fix NodeRAIDDegraded to not use a string rule expressions
...
Signed-off-by: Rajat Vig <rvig@etsy.com>
4 years ago
Simon Pasquier
02212dd2c6
Run jsonnetfmt
...
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
4 years ago
Hao Ke
9b7a0d06a1
Fix syntax error
...
Signed-off-by: Hao Ke <hao.ke@auryc.com>
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
4 years ago
Simon Pasquier
6d959e2e8c
*: add mixin tests to CI
...
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
4 years ago
paulfantom
e4ec8e04c5
docs/node-mixin: add alerts about failing RAID array
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
4 years ago
Dennis Hoppe
fc64b70386
Use description instead of message as field for annotations
...
Signed-off-by: Dennis Hoppe <github@debian-solutions.de>
4 years ago
Frederic Branczyk
b42819b69d
Merge pull request #1657 from povilasv/NodeTextFileCollectorScrapeError
...
Add NodeTextFileCollectorScrapeError alert to mixin
5 years ago
jangdm
d4d2e1db98
fix typo in TIME.md (#1670)
...
fix typo in TIME.md
Signed-off-by: jangdm <jamin4@naver.com>
5 years ago
WOO CHANG HO
612ea0cd12
Add more compatible rules
...
Signed-off-by: zodiac12k <zodiac12k@gmail.com>
5 years ago
Povilas Versockas
bd3e6d224c
Add NodeTextFileCollectorScrapeError alert to mixin
...
Signed-off-by: Povilas Versockas <p.versockas@gmail.com>
5 years ago
beorn7
8b00b22904
Fix sign error in `NodeClockSkewDetected`
...
Signed-off-by: beorn7 <beorn@grafana.com>
5 years ago
paulfantom
820f8d595e
docs/node-mixin: alert on desynchronised clock
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
5 years ago
Neraud
1006a2c4bb
Add missing coma
...
Signed-off-by: Neraud <neraud.login@gmail.com>
5 years ago
Povilas Versockas
48bb6f670c
Add NodeHighNumberConntrackEntriesUsed
...
Signed-off-by: Povilas Versockas <p.versockas@gmail.com>
5 years ago
iuri aranda
0107bc7942
Make FS space alerts thresholds configurable (#1624)
...
* Make FS space alerts thresholds configurable (#1)
This makes it possible to tweak the thresholds for
the NodeFilesystemSpaceFillingUp alerts, which
might be necessary in systems like Kubernetes,
where the image garbage collector runs at 85%,
so it is not a problem if the disk reaches that usage.
Signed-off-by: iuri aranda <iuri@skyscrapers.eu>
5 years ago
paulfantom
40570924b1
docs/node-mixin/dashboards: do not mix tabs and spaces
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
5 years ago