Running "go test" in the collector directory, without the fixtures
available, results in multiple panics, including `SIGSEGV`. Most of
these are due to incorrect error handling. This cleans them up.
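A minimal sketch of the pattern applied here (the fixture path and test name are illustrative): check the error before touching the result, so a missing fixture fails the test cleanly instead of segfaulting deeper in the collector.
```
package collector

import (
	"os"
	"testing"
)

// TestFixturePresent sketches the fix: check the error before using the
// handle, so a missing fixture fails the test instead of causing a
// nil-pointer SIGSEGV later on.
func TestFixturePresent(t *testing.T) {
	f, err := os.Open("fixtures/proc/stat") // illustrative fixture path
	if err != nil {
		t.Fatalf("fixture not available: %v", err)
	}
	defer f.Close()
}
```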
Signed-off-by: Benny Siegert <bsiegert@gmail.com>
Check that the PSI metrics are actually returned, in order to avoid a nil
pointer dereference.
* Update fixture to match real-world samples.
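A sketch of the guard, assuming procfs's PSIStatsForResource API: skip the resource when the expected record is missing instead of dereferencing a nil pointer.
```
package main

import (
	"fmt"
	"log"

	"github.com/prometheus/procfs"
)

func main() {
	fs, err := procfs.NewFS("/proc")
	if err != nil {
		log.Fatal(err)
	}
	stats, err := fs.PSIStatsForResource("cpu")
	if err != nil {
		log.Fatal(err)
	}
	// Guard against kernels or fixtures that omit the record entirely.
	if stats.Some == nil {
		fmt.Println("no 'some' PSI record for cpu, skipping")
		return
	}
	fmt.Printf("cpu some total: %d\n", stats.Some.Total)
}
```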
Fixes: https://github.com/prometheus/node_exporter/issues/3015
Signed-off-by: Ben Kochie <superq@gmail.com>
Replace all cpu_ticks_* with cpu_nsec_*, since the former was off by a
factor of 10e6 and showed incorrect values for
node_cpu_seconds_total.
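For reference, a minimal sketch of the unit handling, with illustrative values: the cpu_nsec_* kstats are plain nanosecond counters, so the conversion for node_cpu_seconds_total is a straight division by 1e9.
```
package main

import "fmt"

// kstat's cpu_nsec_* counters are nanoseconds; converting them to
// seconds is a division by 1e9. The old cpu_ticks_* counters used a
// different unit, which is where the scale error came from.
func nsecToSeconds(nsec uint64) float64 {
	return float64(nsec) / 1e9
}

func main() {
	fmt.Println(nsecToSeconds(1_500_000_000)) // 1.5 seconds
}
```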
Fixes: #1837
Signed-off-by: Pranshu Srivastava <rexagod@gmail.com>
* Update Go to 1.22.
* Update Go modules.
* Use new version collector.
* Use standard library slices package.
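As a small illustration of the slices change (Go 1.21+ standard library), sorting and membership checks no longer need sort or hand-rolled loops:
```
package main

import (
	"fmt"
	"slices"
)

func main() {
	collectors := []string{"meminfo", "cpu", "arp"}
	slices.Sort(collectors)
	fmt.Println(collectors)                         // [arp cpu meminfo]
	fmt.Println(slices.Contains(collectors, "cpu")) // true
}
```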
Signed-off-by: Ben Kochie <superq@gmail.com>
When the zfs collector fails on FreeBSD, it doesn't log which `mib` triggered the issue, which makes diagnostics hard.
Incompatibilities in the list of supported mibs are not uncommon with major OS updates. With this change, it will be easier for users to report the specific mib that triggers the failure.
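A minimal sketch of the kind of error wrapping this enables; getSysctl and the mib value are illustrative stand-ins for the real sysctl call:
```
package main

import (
	"errors"
	"fmt"
)

// getSysctl stands in for the real sysctl lookup; name is illustrative.
func getSysctl(mib string) (uint64, error) {
	return 0, errors.New("no such OID")
}

func main() {
	mib := "kstat.zfs.misc.arcstats.hits"
	if _, err := getSysctl(mib); err != nil {
		// Wrapping with the mib makes the failing OID visible to users.
		fmt.Println(fmt.Errorf("couldn't get sysctl %q: %w", mib, err))
	}
}
```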
Related to #2847
Signed-off-by: Daniel Kimsey <90741+dekimsey@users.noreply.github.com>
Avoid metrics with inconsistent help texts. The earlier behaviour is
preserved in the sense that the first encountered instance is still used
to generate metrics, while subsequent inconsistent ones are ignored. A
few peripheral changes are included as well.
```
# HELP node_scrape_collector_duration_seconds node_exporter: Duration of a collector scrape.
# TYPE node_scrape_collector_duration_seconds gauge
node_scrape_collector_duration_seconds{collector="textfile"} 0.0004005
# HELP node_scrape_collector_success node_exporter: Whether a collector succeeded.
# TYPE node_scrape_collector_success gauge
node_scrape_collector_success{collector="textfile"} 1
# HELP node_textfile_mtime_seconds Unixtime mtime of textfiles successfully read.
# TYPE node_textfile_mtime_seconds gauge
node_textfile_mtime_seconds{file="/Users/rexagod/repositories/misc/node_exporter/ne-bar.prom"} 1.710812009e+09
node_textfile_mtime_seconds{file="/Users/rexagod/repositories/misc/node_exporter/ne-foo.prom"} 1.710811982e+09
# HELP node_textfile_scrape_error 1 if there was an error opening or reading a file, 0 otherwise
# TYPE node_textfile_scrape_error gauge
node_textfile_scrape_error 1
# HELP promhttp_metric_handler_errors_total Total number of internal errors encountered by the promhttp metric handler.
# TYPE promhttp_metric_handler_errors_total counter
promhttp_metric_handler_errors_total{cause="encoding"} 0
promhttp_metric_handler_errors_total{cause="gathering"} 0
# HELP promhttp_metric_handler_requests_in_flight Current number of scrapes being served.
# TYPE promhttp_metric_handler_requests_in_flight gauge
promhttp_metric_handler_requests_in_flight 1
# HELP promhttp_metric_handler_requests_total Total number of scrapes by HTTP status code.
# TYPE promhttp_metric_handler_requests_total counter
promhttp_metric_handler_requests_total{code="200"} 0
promhttp_metric_handler_requests_total{code="500"} 0
promhttp_metric_handler_requests_total{code="503"} 0
# HELP tau_infrastructure_performing_maintenance_task At what timestamp a given task started or stopped, the last time it was run.
# TYPE tau_infrastructure_performing_maintenance_task gauge
tau_infrastructure_performing_maintenance_task{main_task="nightly",start_or_stop="start",sub_task="main"} 1.64728080198446e+09
```
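A minimal sketch of the de-duplication rule, with illustrative types: the first help text seen for a metric name wins, and later families with a conflicting help text are ignored.
```
package main

import "fmt"

type family struct {
	name, help string
}

// dedupe keeps the first help text seen per metric name and ignores
// subsequent families with a conflicting help text.
func dedupe(families []family) []family {
	seen := map[string]string{}
	var out []family
	for _, f := range families {
		if help, ok := seen[f.name]; ok && help != f.help {
			fmt.Printf("ignoring %s: inconsistent help text\n", f.name)
			continue
		}
		seen[f.name] = f.help
		out = append(out, f)
	}
	return out
}

func main() {
	fmt.Println(dedupe([]family{
		{"node_foo_total", "Foo count."},
		{"node_foo_total", "Different help."},
	}))
}
```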
Fixes: #2317
Signed-off-by: Pranshu Srivastava <rexagod@gmail.com>
Apply the same metric name sanitization to the keys as to the metric
names. This avoids conflicting help strings in the metric registry.
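A sketch of the sanitization, assuming the usual Prometheus metric-name alphabet; applying the same function to keys keeps them from diverging from the metric names.
```
package main

import (
	"fmt"
	"regexp"
)

var invalidChars = regexp.MustCompile(`[^a-zA-Z0-9:_]`)

// sanitize maps anything outside the Prometheus metric-name alphabet
// to an underscore, so keys and metric names can't diverge.
func sanitize(s string) string {
	return invalidChars.ReplaceAllString(s, "_")
}

func main() {
	fmt.Println(sanitize("power1_average (W)")) // power1_average__W_
}
```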
Fixes: https://github.com/prometheus/node_exporter/issues/2893
Signed-off-by: Ben Kochie <superq@gmail.com>
Fix the golangci-lint "ineffectual assignment" warning by correctly
capturing any errors within the hwmon gathering loop.
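The gist of the fix, sketched with illustrative names: assign to an error variable that is actually read after the loop, instead of overwriting one that never gets checked.
```
package main

import (
	"errors"
	"fmt"
)

// readSensor is an illustrative stand-in for a hwmon read.
func readSensor(name string) error {
	if name == "temp2" {
		return errors.New("unreadable")
	}
	return nil
}

func main() {
	var lastErr error
	for _, s := range []string{"temp1", "temp2", "temp3"} {
		if err := readSensor(s); err != nil {
			// Capture the error so the loop's failures are reported,
			// rather than assigning to a variable that is never read.
			lastErr = fmt.Errorf("sensor %s: %w", s, err)
		}
	}
	fmt.Println(lastErr)
}
```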
Signed-off-by: Ben Kochie <superq@gmail.com>
While the CPU vulnerabilities collector was added in https://github.com/prometheus/node_exporter/pull/2721, it currently does not include information about the mitigation strategy used for a given vulnerability.
This information can be quite valuable, as different mitigation strategies often come with different performance impacts.
This commit adds a third label to the cpu_vulnerabilities_info metric to include the "mitigation" used for a given vulnerability; if a vulnerability does not affect a node, or the node is still vulnerable, the mitigation is expected to be empty.
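A hedged sketch of the resulting metric using the prometheus client library, assuming the existing codename/state labels from #2721; the label values shown are illustrative.
```
package main

import (
	"fmt"

	"github.com/prometheus/client_golang/prometheus"
)

var vulnDesc = prometheus.NewDesc(
	"node_cpu_vulnerabilities_info",
	"Details of each CPU vulnerability reported by sysfs.",
	[]string{"codename", "state", "mitigation"}, nil,
)

func main() {
	// mitigation is empty when the node is not affected or still vulnerable.
	m := prometheus.MustNewConstMetric(vulnDesc, prometheus.GaugeValue, 1,
		"spectre_v2", "Mitigation", "Retpolines")
	fmt.Println(m.Desc())
}
```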
Signed-off-by: João Lima <jlima@cloudflare.com>
Adds a count for TCP packets received out of order. This can be an
indication that there is packet loss on the path packets take towards
this server. In that case, the sender will retransmit (and we can
already monitor Tcp_RetransSegs there), but we have no way to monitor
packet loss on the receiver side. When a packet is received and the
receiver detects that a previous one is missing, it increases the
TCPOFOQueue counter and replies with a selective ACK to the sender, both
possible indications of packet loss. Packet loss can be confirmed by
taking packet captures, ignoring Wireshark's analysis, and carefully
looking at the data being retransmitted based on the TCP sequence
numbers.
Just like RetransSegs, TCPOFOQueue should be interesting for any
deployment as a means to detect packet loss, so this change adds it to
the default list.
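For context, TCPOFOQueue lives in the TcpExt section of /proc/net/netstat; a standalone sketch of pulling the counter out (the collector itself goes through procfs, this is only illustrative):
```
package main

import (
	"bufio"
	"fmt"
	"os"
	"strings"
)

// Reads the TcpExt header/value line pair from /proc/net/netstat and
// prints the TCPOFOQueue counter.
func main() {
	f, err := os.Open("/proc/net/netstat")
	if err != nil {
		fmt.Println(err)
		return
	}
	defer f.Close()

	sc := bufio.NewScanner(f)
	for sc.Scan() {
		header := strings.Fields(sc.Text())
		// Sections come in header/value pairs; consume the value line too.
		if !sc.Scan() || len(header) == 0 || header[0] != "TcpExt:" {
			continue
		}
		values := strings.Fields(sc.Text())
		for i, name := range header {
			if name == "TCPOFOQueue" && i < len(values) {
				fmt.Println("TCPOFOQueue:", values[i])
			}
		}
	}
}
```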
Signed-off-by: François Rigault <frigo@amadeus.com>
Co-authored-by: François Rigault <frigo@amadeus.com>
This attribute was introduced in v6.6-rc1.
The relevant changes in procfs were merged here:
https://github.com/prometheus/procfs/pull/574
and are part of procfs v0.11.2
I have also figured out that the stat should be part of the v4 ops
counters struct, but that will need changes to both procfs and this
code. Since people are already using 6.6-rc1, I think it's better to get
the code out there --- even if they don't care about wdeleg_getattr,
currently they get _no_ nfsd stats with 6.6-rc1.
I will make two follow-up PRs to clean this up in the next releases of
procfs and node-exporter.
Signed-off-by: Tobias Klausmann <klausman@schwarzvogel.de>
* Rename parsePoolObjsetFile to parseLinuxPoolObjsetFile to better reflect
its scope
* Create a new parseFreeBSDPoolObjsetStats function, to generate a list
of per pool metrics to be queried via sysctl
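A rough sketch of the per-pool query path on FreeBSD; the mib string is an assumption about the kstat sysctl layout, not a verified OID, and the real code discovers the OIDs at runtime.
```
//go:build freebsd

package main

import (
	"fmt"

	"golang.org/x/sys/unix"
)

func main() {
	// Assumed mib layout for per-pool objset stats; illustrative only.
	mib := "kstat.zfs.zroot.dataset.objset-0x36.nread"
	v, err := unix.SysctlUint64(mib)
	if err != nil {
		fmt.Printf("couldn't read %s: %v\n", mib, err)
		return
	}
	fmt.Printf("%s = %d\n", mib, v)
}
```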
---------
Signed-off-by: Conall O'Brien <conall@conall.net>
* Optionally fetch ARP stats via rtnetlink instead of procfs
Implement collection of ARP stats via rtnetlink to work around
shortcomings in the output of /proc/net/arp, which truncates InfiniBand
link-layer addresses.
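A sketch of the rtnetlink path, assuming the jsimonetti/rtnetlink API: neighbour attributes return the full link-layer address rather than the truncated /proc/net/arp column.
```
package main

import (
	"fmt"
	"log"

	"github.com/jsimonetti/rtnetlink"
)

func main() {
	conn, err := rtnetlink.Dial(nil)
	if err != nil {
		log.Fatal(err)
	}
	defer conn.Close()

	neighbours, err := conn.Neigh.List()
	if err != nil {
		log.Fatal(err)
	}
	for _, n := range neighbours {
		if n.Attributes == nil {
			continue
		}
		// LLAddress is not truncated, unlike the fixed-width
		// /proc/net/arp output for InfiniBand hardware addresses.
		fmt.Printf("if=%d ip=%s lladdr=%s\n",
			n.Index, n.Attributes.Address, n.Attributes.LLAddress)
	}
}
```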
Fixes: #2776
---------
Signed-off-by: Daniel Swarbrick <daniel.swarbrick@gmail.com>
Co-authored-by: Ben Kochie <superq@gmail.com>
The btrfs collector would occasionally (< 10% in my testing) leave stale
FDs referring to btrfs mountpoints, preventing the filesystems from
being unmounted.
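The underlying pattern of the fix, sketched generically rather than against the btrfs library: every handle opened on a mountpoint must be closed on all return paths, otherwise the filesystem stays busy and umount fails.
```
package main

import (
	"fmt"
	"os"
)

// statMountpoint opens a directory handle and guarantees it is closed
// on every return path; a missed Close is what keeps the filesystem
// busy and blocks umount.
func statMountpoint(path string) error {
	f, err := os.Open(path)
	if err != nil {
		return err
	}
	defer f.Close()

	if _, err := f.Stat(); err != nil {
		return fmt.Errorf("stat %s: %w", path, err)
	}
	return nil
}

func main() {
	fmt.Println(statMountpoint("/"))
}
```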
Fixes: #2772.
Signed-off-by: Daniel Swarbrick <daniel.swarbrick@gmail.com>
Revert changes to node_cpu_info and add new node_cpu_frequency_hertz
metric for measuring CPU frequency from /proc/cpuinfo
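A sketch of where the value comes from, assuming procfs's CPUInfo helper: the "cpu MHz" field is scaled to hertz for node_cpu_frequency_hertz.
```
package main

import (
	"fmt"
	"log"

	"github.com/prometheus/procfs"
)

func main() {
	fs, err := procfs.NewFS("/proc")
	if err != nil {
		log.Fatal(err)
	}
	infos, err := fs.CPUInfo()
	if err != nil {
		log.Fatal(err)
	}
	for _, ci := range infos {
		// "cpu MHz" from /proc/cpuinfo, scaled to hertz.
		fmt.Printf("cpu%d: %.0f Hz\n", ci.Processor, ci.CPUMHz*1e6)
	}
}
```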
Signed-off-by: John Kordich <jkordich@gmail.com>
For CPUs which don't have an available (or insertable) cpufreq driver,
the /proc/cpuinfo file can sometimes have accurate CPU core frequency
measurements. This change replaces the constant value of "1" for the
"node_cpu_info" metric with the parsed CPU MHz value from
/proc/cpuinfo for each core.
Signed-off-by: John Kordich <jkordich@gmail.com>
Ensure that unwanted tests are correctly excluded when various build
tags are specified, i.e. when the code that they test would be excluded
from compilation.
Signed-off-by: Daniel Swarbrick <daniel.swarbrick@gmail.com>
Drop redundant GOOS build tags at the start of a file if the constraint
is already specified by the filename, e.g. foo_GOOS.go or
foo_GOOS_GOARCH.go, avoiding potential confusion in the future.
cf. https://pkg.go.dev/cmd/go#hdr-Build_constraints
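For illustration (hypothetical file names): the filename suffix alone carries the GOOS constraint, so an in-file tag is only needed when it expresses something the filename cannot.
```
// collector/foo_linux.go
// The _linux filename suffix already constrains this file to GOOS=linux,
// so no //go:build line is needed here.

package collector

// By contrast, a constraint the filename cannot express still needs an
// explicit tag in the file, e.g.:
//
//	//go:build linux && !nofoo
```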
Signed-off-by: Daniel Swarbrick <daniel.swarbrick@gmail.com>