Commit Graph

133 Commits (f63b64ec63f32dc72b1480b82167612766650388)

Author SHA1 Message Date
Ben Kochie aeef1edd62
mixin: Add fallback for MemAvailable (#2130)
3 years ago
Johannes 'fish' Ziemke 6f1286b314 mixin: Drop mode label for num cpu metric
3 years ago
Johannes 'fish' Ziemke fa9926c4eb mixin: Cheaper calculation for instance:node_num_cpu:sum
3 years ago
paulfantom 832909dd25 docs/node-mixin/alerts: make NodeFilesystemAlmostOutOfSpace fire earlier
3 years ago
Johannes 'fish' Ziemke 7fc5c6045a Read config from $
3 years ago
ArthurSens 3731f93fd7 Refactor USE method mixin dashboards with grafonnet-lib, add multi-cluster support.
3 years ago
Frederic Hemberger 5bee84f30d docs: Replace `go get` with `go install` for command installation
3 years ago
Loïc Blot 55ffe57cbc
feat(rules): add NodeFileDescriptorLimit kernel exhaustion alert
4 years ago
raviprasad_lr 504f9b785c fix interval in graphs panels of node dashboard
4 years ago
Johannes 'fish' Ziemke a5908bf82b Make interval configurable
4 years ago
Johannes 'fish' Ziemke 772335caa8 Use 5m rate in mixins
4 years ago
Ben Kochie eefb18db02
Merge pull request #1764 from dhoppe/patch-1
4 years ago
Ben Kochie 4b68aeb80a
Merge pull request #1862 from fsschmitt/fix/alerts-label-naming
4 years ago
Anthony D'Atri 8b466360a3
Modest doc improvements (#1876)
4 years ago
Julien Pivotto f645d49242 Mixin: Bump jsonnet requirement to 0.16 to use go-jsonnetcmd
4 years ago
Matthias Loibl 77e76485c0
Use absolute jsonnet import paths
4 years ago
Björn Rabenstein 9c9c636305
Merge pull request #1861 from paulfantom/network-alerts
4 years ago
paulfantom f81747e608 docs/node-mixin/alerts: add max error condition to alert about desynchronized clock
4 years ago
fsschmitt effa4da989 fix: node_md_disks state label as failed
4 years ago
paulfantom d7cbe85d22
docs/node-mixin/alerts: use a rate for network alerts
4 years ago
Arthur Outhenin-Chalandre 6585e43eec Fix memory gauge in mixin with multiple pods
4 years ago
Nicolas Lamirault ff2ff3410f
Configure 2 thresholds for NodeFilesystemAlmostOutOfSpace alert (#1835)
4 years ago
Rajat Vig 7dd8adf7ed
Fix NodeRAIDDegraded to not use a string rule expressions
4 years ago
Simon Pasquier 02212dd2c6 Run jsonnetfmt
4 years ago
Hao Ke 9b7a0d06a1 Fix syntax error
4 years ago
Simon Pasquier 6d959e2e8c *: add mixin tests to CI
4 years ago
paulfantom e4ec8e04c5 docs/node-mixin: add alerts about failing RAID array
4 years ago
Dennis Hoppe fc64b70386
Use description instead of message as field for annotations
4 years ago
Frederic Branczyk b42819b69d
Merge pull request #1657 from povilasv/NodeTextFileCollectorScrapeError
5 years ago
jangdm d4d2e1db98
fix typo in TIME.md (#1670)
5 years ago
WOO CHANG HO 612ea0cd12 Add more compatible rules
5 years ago
Povilas Versockas bd3e6d224c
Add NodeTextFileCollectorScrapeError alert to mixin
5 years ago
beorn7 8b00b22904 Fix sign error in `NodeClockSkewDetected`
5 years ago
paulfantom 820f8d595e
docs/node-mixin: alert on desynchronised clock
5 years ago
Neraud 1006a2c4bb Add missing coma
5 years ago
Povilas Versockas 48bb6f670c Add NodeHighNumberConntrackEntriesUsed
5 years ago
iuri aranda 0107bc7942
Make FS space alerts thresholds configurable (#1624)
5 years ago
paulfantom 40570924b1
docs/node-mixin/dashboards: do not mix tabs and spaces
5 years ago
beorn7 c6914477f5 Fix the normalization for the cluster-wide dashboards
5 years ago
Benoît Knecht 5a7b85876d docs/node-mixin: Improve memory pressure rule
5 years ago
Scott Brenner 813a4bdf8b Two quick typo fixes
5 years ago
Björn Rabenstein 855a1f1d18
Merge pull request #1482 from leojonathanoh/fix-node-mixin-prometheus-alert-rules-to-use-percentage
5 years ago
Sergiusz Urbaniak f4417b209a node-mixin: fix configuration for unset fsSelector/diskDeviceSelector
5 years ago
Sergiusz Urbaniak ed78237036 node-mixin: fix query in Disk Space Utilisation dashboard
5 years ago
Leo dfeec07f2f Fix node-mixin prometheus alert rules to use percentage
5 years ago
Björn Rabenstein ab8cf1f718 Node mixin: Clarify dashboard dependency on rules (#1475)
5 years ago
beorn7 76ff263ca6 Update legendLink
5 years ago
Björn Rabenstein 0f38d680b4
Merge pull request #1449 from prometheus/beorn7/mixin3
5 years ago
beorn7 44e5731de7 Add line for number of cores to load graph
5 years ago
beorn7 024d5ed55e Fix title of CPU panel to usage
5 years ago
beorn7 a016d9cd6f node-mixin: Improve disk usage panel
5 years ago
Björn Rabenstein 7ef6f2576d
node-mxin: Improve nodes dashboard (#1448)
5 years ago
beorn7 97ef113762 Make the severity of "critical" alerts configurable
5 years ago
beorn7 f350aaf87e node-mixin: Fix various straight-forward issues in the USE dashboards
5 years ago
paulfantom c41826274d
docs/node-mixin: move fsSelector and diskDeviceSelector to the end of query
5 years ago
beorn7 79f0357e38 Added `_excluding_lo` to name of network rules that exclude lo
5 years ago
beorn7 36dc7451c9 Improvement of comments and panel titles
5 years ago
beorn7 e01d9f9e78 Break out device in disk IO rules/dashboard
5 years ago
beorn7 b8c4b0cb29 Removed unneeded `sum_` and `avg_` from rule names
5 years ago
beorn7 706511a495 Responses to review comments, round 3
5 years ago
beorn7 3a770a0b1d Convert annotations from message to summary/description
5 years ago
beorn7 a92d1d7889 Address review comments, batch 2
5 years ago
beorn7 3ab1f41d12 Make more use of config.libsonnet
5 years ago
beorn7 2180c2f3bf Address first batch of old review comments
5 years ago
beorn7 b3b47f2d07 Make selector naming consistent
5 years ago
beorn7 dec5b5b053 Fix indentation
5 years ago
beorn7 9d7045e483 (Re-)adjust to Grafana gauge expecting percentage 0-100 (rather than 1-0)
5 years ago
beorn7 f331b308f3 Use promgrafonnet as a vendored library from its source
5 years ago
beorn7 e5266c242e Add README.md
5 years ago
beorn7 f2891703a5 Add Makefile to easily make output files and lint sources
5 years ago
beorn7 f17829c48b Create jsonnet files to create output files
5 years ago
beorn7 cd2981f1b8 Update vendoring to current location of jsonnet-libs
5 years ago
beorn7 2df034c055 Move node-mixin into docs directory
5 years ago
Cougar 764da30556 Add compat rules for node_time, node_memory_ShmemHugePages and node_memory_ShmemPmdMapped (#1138)
6 years ago
Ben Kochie 5d23ad0ca7
Fix supervisord collector (#978)
6 years ago
Rene Treffer 80a5712b97 Fix sample rules for migration (#1022)
6 years ago
Ivan Kiselev ae90bac5b8 Add example of translating new metrics to old format in case of migration to 1.16 version (#982)
6 years ago
Roman Vynar 55c32fcf02 Add compat rules for filesystem collector. (#973)
7 years ago
Nicholas Capo 09d11817d0 docs: Add example recording rule for node_memory_MemAvailable
7 years ago
Ben Kochie c5a74ce1a1
Add label mangling.
7 years ago
Ben Kochie dc1972e9e3
Document upgrade options for v0.16.0
7 years ago
Brian Brazil 52c031890e
Add _seconds suffix to node_time. (#823)
7 years ago
Leonid Evdokimov c169b4b1c5 Add metrics from SNTPv4 packet to ntp collector & add ntpd sanity check (#655)
7 years ago