node_exporter

Commit Graph

Author	SHA1	Message	Date
Daniel Lenar	0b50eb7294	Reverse fsSpaceAvailableCriticalThreshold and fsSpaceAvailableWarningThreshold Currently critical alert for space available alerts on warning and warning alert for space available alerts on critical. Signed-off-by: Daniel Lenar <dlenar@vailsys.com>	3 years ago
Gabriel Amaral Antunes	410e069471	Add darwin dashboard to mixin Signed-off-by: Vitaly Zhuravlev <zhuravlev.vitaly@gmail.com>	3 years ago
Vitaly Zhuravlev	8823605f12	Fix NodeFileDescriptorLimit alerts Signed-off-by: Vitaly Zhuravlev <zhuravlev.vitaly@gmail.com>	3 years ago
Severyn Lisovskyi	7b86b7cb29	[node-mixin] change current datasource to grafana's default Signed-off-by: Severyn Lisovskyi <993215+sev3ryn@users.noreply.github.com>	3 years ago
Julian Wiedmann	3e6f4ce627	mixin: exclude iowait and steal from CPU Utilisation (#2194) 'iowait' and 'steal' indicate specific idle/wait states, which shouldn't be counted into CPU Utilisation. Also see https://github.com/prometheus-operator/kube-prometheus/pull/796 and https://github.com/kubernetes-monitoring/kubernetes-mixin/pull/667. Per the iostat man page: %idle Show the percentage of time that the CPU or CPUs were idle and the system did not have an outstanding disk I/O request. %iowait Show the percentage of time that the CPU or CPUs were idle during which the system had an outstanding disk I/O request. %steal Show the percentage of time spent in involuntary wait by the virtual CPU or CPUs while the hypervisor was servicing another virtual processor. Signed-off-by: Julian Wiedmann <jwi@linux.ibm.com>	3 years ago
Ben Kochie	421fc429f3	Replace deprecated linter (#2176) Upstream is replacing `golint` with `revive`. * Cleanup unused mixin go files. Signed-off-by: Ben Kochie <superq@gmail.com>	3 years ago
ngc104	4bc1c02000	fix bug in #2130 (#2170) Signed-off-by: Yves Mettier <yves.mettier@orange.com> Co-authored-by: Yves Mettier <yves.mettier@orange.com>	3 years ago
Tom Wilkie	9bc184d236	Datasource template variable should be labelled 'Data Source' Signed-off-by: Tom Wilkie <tom@grafana.com>	3 years ago
Ben Kochie	5a38949451	Fix up mixin tests (#2167) Use new Go install format, cleanup working dir setup. Signed-off-by: Ben Kochie <superq@gmail.com>	3 years ago
Julien Pivotto	68a6c78c0d	Update go to 1.17 (#2159) Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	3 years ago
Michal	186e2e79c8	add yamllint config, fix yamllint errors (#2088) After a recent change in prometheus/prometheus, Makefile.common includes now a yamllint target which currently fails. This PR adds the missing yamllint config and fixes the yamllint errors. Signed-off-by: Michal Wasilewski <mwasilewski@gmx.com>	3 years ago
Ben Kochie	aeef1edd62	mixin: Add fallback for MemAvailable (#2130) Add a fallback to Buffers+Cached+MemFree+Slab for older Linux kernels where the MemAvailable metric is not available for memory utilization. Signed-off-by: Ben Kochie <superq@gmail.com>	3 years ago
Johannes 'fish' Ziemke	6f1286b314	mixin: Drop mode label for num cpu metric Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	3 years ago
Johannes 'fish' Ziemke	fa9926c4eb	mixin: Cheaper calculation for instance:node_num_cpu:sum Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	3 years ago
paulfantom	832909dd25	docs/node-mixin/alerts: make NodeFilesystemAlmostOutOfSpace fire earlier Signed-off-by: paulfantom <pawel@krupa.net.pl>	3 years ago
Johannes 'fish' Ziemke	7fc5c6045a	Read config from $ Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	3 years ago
ArthurSens	3731f93fd7	Refactor USE method mixin dashboards with grafonnet-lib, add multi-cluster support. Aiming for cleaner code and following standards used on younger mixins. Signed-off-by: ArthurSens <arthursens2005@gmail.com>	3 years ago
Frederic Hemberger	5bee84f30d	docs: Replace `go get` with `go install` for command installation `go get` is deprecated for installation of commands as of go v1.17 Ref: https://go.googlesource.com/go/+/ced0fdbad0655d63d535390b1a7126fd1fef8348 Signed-off-by: Frederic Hemberger <mail@frederic-hemberger.de>	3 years ago
Loïc Blot	55ffe57cbc	feat(rules): add NodeFileDescriptorLimit kernel exhaustion alert Add a new alert when fs.file-nr is close to fs.file-max Signed-off-by: Loic Blot <loic.blot@unix-experience.fr>	4 years ago
raviprasad_lr	504f9b785c	fix interval in graphs panels of node dashboard Signed-off-by: raviprasad_lr <raviprasad_lr@yahoo.com>	4 years ago
Johannes 'fish' Ziemke	a5908bf82b	Make interval configurable Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	4 years ago
Johannes 'fish' Ziemke	772335caa8	Use 5m rate in mixins The default scrape interval of Prometheus is 60s, so we can't use a 1m rate. Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	4 years ago
Anthony D'Atri	8b466360a3	Modest doc improvements (#1876) * Modest doc improvements Signed-off-by: Anthony D'Atri <anthony.datri@gmail.com>	4 years ago
Julien Pivotto	f645d49242	Mixin: Bump jsonnet requirement to 0.16 to use go-jsonnetcmd Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	4 years ago
Matthias Loibl	77e76485c0	Use absolute jsonnet import paths This should be the way forward when importing libraries in jsonnet. It's closer to how Go imports look and makes it more obvious where packages live. This is not breaking anything, as the old imports were already symlinks to the now directly used directories. Signed-off-by: Matthias Loibl <mail@matthiasloibl.com>	4 years ago
paulfantom	f81747e608	docs/node-mixin/alerts: add max error condition to alert about desynchronized clock Signed-off-by: paulfantom <pawel@krupa.net.pl>	4 years ago
fsschmitt	effa4da989	fix: node_md_disks state label as failed Signed-off-by: fsschmitt <492108+fsschmitt@users.noreply.github.com>	4 years ago
paulfantom	d7cbe85d22	docs/node-mixin/alerts: use a rate for network alerts Signed-off-by: paulfantom <pawel@krupa.net.pl>	4 years ago
Arthur Outhenin-Chalandre	6585e43eec	Fix memory gauge in mixin with multiple pods Signed-off-by: Arthur Outhenin-Chalandre <arthur@cri.epita.fr>	4 years ago
Nicolas Lamirault	ff2ff3410f	Configure 2 thresholds for NodeFilesystemAlmostOutOfSpace alert (#1835) * Add: configure 2 thresholds for NodeFilesystemAlmostOutOfSpace alert Signed-off-by: Nicolas Lamirault <nicolas.lamirault@gmail.com>	4 years ago
Rajat Vig	7dd8adf7ed	Fix NodeRAIDDegraded to not use a string rule expressions Signed-off-by: Rajat Vig <rvig@etsy.com>	4 years ago
Simon Pasquier	02212dd2c6	Run jsonnetfmt Signed-off-by: Simon Pasquier <spasquie@redhat.com>	4 years ago
Hao Ke	9b7a0d06a1	Fix syntax error Signed-off-by: Hao Ke <hao.ke@auryc.com> Signed-off-by: Simon Pasquier <spasquie@redhat.com>	4 years ago
Simon Pasquier	6d959e2e8c	*: add mixin tests to CI Signed-off-by: Simon Pasquier <spasquie@redhat.com>	4 years ago
paulfantom	e4ec8e04c5	docs/node-mixin: add alerts about failing RAID array Signed-off-by: paulfantom <pawel@krupa.net.pl>	4 years ago
Dennis Hoppe	fc64b70386	Use description instead of message as field for annotations Signed-off-by: Dennis Hoppe <github@debian-solutions.de>	4 years ago
jangdm	d4d2e1db98	fix typo in TIME.md (#1670) fix typo in TIME.md Signed-off-by: jangdm <jamin4@naver.com>	5 years ago
WOO CHANG HO	612ea0cd12	Add more compatible rules Signed-off-by: zodiac12k <zodiac12k@gmail.com>	5 years ago
Povilas Versockas	bd3e6d224c	Add NodeTextFileCollectorScrapeError alert to mixin Signed-off-by: Povilas Versockas <p.versockas@gmail.com>	5 years ago
beorn7	8b00b22904	Fix sign error in `NodeClockSkewDetected` Signed-off-by: beorn7 <beorn@grafana.com>	5 years ago
paulfantom	820f8d595e	docs/node-mixin: alert on desynchronised clock Signed-off-by: paulfantom <pawel@krupa.net.pl>	5 years ago
Neraud	1006a2c4bb	Add missing coma Signed-off-by: Neraud <neraud.login@gmail.com>	5 years ago
Povilas Versockas	48bb6f670c	Add NodeHighNumberConntrackEntriesUsed Signed-off-by: Povilas Versockas <p.versockas@gmail.com>	5 years ago
iuri aranda	0107bc7942	Make FS space alerts thresholds configurable (#1624) * Make FS space alerts thresholds configurable (#1) This makes it possible to tweak the thresholds for the NodeFilesystemSpaceFillingUp alerts. Which might be necessary in systems like Kubernetes, where the image garbage collector runs at 85%, so it's not a problem that the disk reaches that usage %. Signed-off-by: iuri aranda <iuri@skyscrapers.eu>	5 years ago
paulfantom	40570924b1	docs/node-mixin/dashboards: do not mix tabs and spaces Signed-off-by: paulfantom <pawel@krupa.net.pl>	5 years ago
beorn7	c6914477f5	Fix the normalization for the cluster-wide dashboards We actually have to count or sum, respectively, _all_ the selected metrics for the cluster-wide view. Which means it's easiest to use the `scalar` approach after all (but only in the cluster dashboard). This still propagates all the labels. I have extended the comment for the `nodeExporterSelector` to note that the cluster dashboard only makes sense if all the selected node exporter actually belong to the same cluster. Since this is jsonnet, users can easily disable the cluster dashboard. Or even create multiple instances of the dashboards with different `nodeExporterSelector`s for different clusters. Signed-off-by: beorn7 <beorn@grafana.com>	5 years ago
Benoît Knecht	5a7b85876d	docs/node-mixin: Improve memory pressure rule The `instance:node_memory_swap_io_pages:rate1m` rule was intended to measure the amount of memory pressure a system is under, but its name is a bit misleading (it specifically refers to swap), and the rate of `node_vmstat_pgmajfault` is a better metric for memory pressure (see #1524). This commit renames `instance:node_memory_swap_io_pages:rate1m` to `instance:node_vmstat_pgmajfault:rate1m`, and defines it as `rate(node_vmstat_pgmajfault{%(nodeExporterSelector)s}[1m])`. The dashboards are updated accordingly. Signed-off-by: Benoît Knecht <benoit.knecht@fsfe.org>	5 years ago
Scott Brenner	813a4bdf8b	Two quick typo fixes Signed-off-by: Scott Brenner <scott@scottbrenner.me>	5 years ago
Sergiusz Urbaniak	f4417b209a	node-mixin: fix configuration for unset fsSelector/diskDeviceSelector As per https://github.com/prometheus/node_exporter/pull/1429#discussion_r304210103 we want to fetch all devices and all fs types. Currently, this is done by setting empty string which breaks most queries which rely on it. This fixes it by setting the appropriate selector instead of empty string. Signed-off-by: Sergiusz Urbaniak <sergiusz.urbaniak@gmail.com>	5 years ago
Sergiusz Urbaniak	ed78237036	node-mixin: fix query in Disk Space Utilisation dashboard Signed-off-by: Sergiusz Urbaniak <sergiusz.urbaniak@gmail.com>	5 years ago

95 Commits (1b6aaeb2e876cc58991b3859910527caddec0e19)