prometheus

Commit Graph

Author	SHA1	Message	Date
Bryan Boreham	1ed94142fc	remote-write: slow down retries to avoid DDOS (#9634 ) * remote-write: slow down retries to avoid DDOS Increase the default max retry time from 100ms to 5 seconds. Remote write calls are retried after a recoverable error such as the back-end returning 500. Prometheus waits the minimum time and retries, then doubles the wait on each subsequent retry until the maximum is reached. If some data is still getting through, remote-write will also increase shards, and the default maximum is 200. 200 shards sending every 100ms is 20 calls per second, to a back-end that is already in trouble. 5 seconds was chosen to match the default BatchSendDeadline: if we can afford to wait that long for no response, then we can wait the same time to retry. We will reach 5 seconds after 9 successive failures. Signed-off-by: Bryan Boreham <bjboreham@gmail.com> * Update config doc for max_backoff change Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	3 years ago
Julien Pivotto	807f46a1ed	Gate agent behind a feature flag, valide mode flags (#9620 ) Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	3 years ago
Furkan Türkal	a6e6011d55	Add scrape_body_size_bytes metric (#9569 ) Fixes #9520 Signed-off-by: Furkan <furkan.turkal@trendyol.com>	3 years ago
Levi Harrison	d81bbe154d	Rule alerts/series limit updates (#9541 ) * Add docs and do not limit inactive alerts. Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Julian Wiedmann	18886c33c2	docs/operators: fix a typo (#9559 ) s/are preserved the output/are preserved in the output Signed-off-by: Julian Wiedmann <jwi@linux.ibm.com>	3 years ago
Julien Pivotto	77f411b2ec	Enable tls_config in oauth2 (#9550 ) * Enable tls_config in oauth2 Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	3 years ago
Levi Harrison	89a6ebd799	Add common HTTP client to Azure SD (#9267 ) * Add `proxy_url` option to Azure SD Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Julien Pivotto	432005826d	Add a feature flag to enable the new discovery manager (#9537 ) * Add a feature flag to enable the new manager This PR creates a copy of the legacy manager and uses it by default. It is a companion PR to #9349. With this PR, users can enable the new discovery manager and provide us with any feedback / side effects that the new behaviour might have. Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	3 years ago
Julien Pivotto	df1bae0514	Add support for security-related HTTP headers (#9546 ) Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	3 years ago
Witek Bedyk	cda2dbbef6	Add Uyuni service discovery (#8190 ) * Add Uyuni service discovery Signed-off-by: Witek Bedyk <witold.bedyk@suse.com> Co-authored-by: Joao Cavalheiro <jcavalheiro@suse.de> Co-authored-by: Marcelo Chiaradia <mchiaradia@suse.com> Co-authored-by: Stefano Torresi <stefano@torresi.io> Co-authored-by: Julien Pivotto <roidelapluie@gmail.com>	3 years ago
la3mmchen	6d3a4ed711	fix/9269 add documentation for endpointslice This commits add a documentation for the kubernetes_sd_configs: endpointslice feature. Signed-off-by: la3mmchen <alex@k3wl.net>	3 years ago
Ivana Huckova	a069e7ec80	Update api.md (#9429 ) Fix timestamp in example in exemplars query. The year was `020` instead of `2020`. Signed-off-by: Ivana <ivana.huckova@gmail.com>	3 years ago
Levi Harrison	a7de0cf276	Remove example and experimental Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Levi Harrison	a16a4cad2d	Add example Credit goes to @leonerd for the original example Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Levi Harrison	2f896c98ff	Address review comments Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Levi Harrison	a283b52e8c	Added unit Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Levi Harrison	be6ce7bcc2	Add docs Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
sennatitcomb	9e7ae7b9ac	Typo fixes Signed-off-by: sennatitcomb <senna.titcomb@intel.com>	3 years ago
Julien Pivotto	8920024323	Add PuppetDB service discovery We have been Puppet user for 10 years and we are users of https://github.com/camptocamp/prometheus-puppetdb-sd However, that file_sd implementation contains business logic and assumptions around e.g. the modules which you are using. This pull request adds a simple PuppetDB service discovery, which will enable more use cases than the upstream sd. Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	3 years ago
Levi Harrison	6faca22eec	Add inverse hyperbolic functions Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Levi Harrison	74faea64dd	Removed specification of pi digits Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Levi Harrison	53d88fd147	Added hyperbolic trig functions Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Levi Harrison	51bb3d4a27	Changed radian wording Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Levi Harrison	535d8904f7	Link to docs for special cases Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Levi Harrison	e5a44964ff	Added docs Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Levi Harrison	dc2f1993d8	Limit number of alerts or series produced by a rule (#9260 ) * Add limit to rules Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Dean Meehan	5751bfb61f	Removed Duplication Typo (to to) Signed-off-by: Dean Meehan <dean@dean.technology>	3 years ago
Łukasz Mierzwa	f0a26266c0	Add scrape_sample_limit metric This adds a new metric exposing per target scrape sample_limit value. Metrics are only exposed if extra-scrape-metrics feature flag is enabled. scrape_sample_limit will make it easy to monitor and alert on targets getting close to configured sample_limit, which is important given than exceeding sample_limit results in the entire scrape results being rejected. Signed-off-by: Łukasz Mierzwa <l.mierzwa@gmail.com>	3 years ago
SuperQ	31f4108758	Add scrape_timeout_seconds metric Add a new built-in metric `scrape_timeout_seconds` to allow monitoring of the ratio of scrape duration to the scrape timeout. Hide behind a feature flag to avoid additional cardinality by default. Signed-off-by: SuperQ <superq@gmail.com>	3 years ago
Levi Harrison	70f597b033	Configure Scrape Interval and Timeout Via Relabeling (#8911 ) * Configure scrape interval and timeout with labels Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Ganesh Vernekar	3d4c5f890d	Clarify in docs about the min disk space requirement when using size retention (#9245 ) Signed-off-by: Ganesh Vernekar <ganeshvern@gmail.com>	3 years ago
Julien Pivotto	cab96a06ef	Merge release 2.29 in main (#9196 ) * PromQL: Fix start and end keywords masking label and metric names This commit fixes an issue with the "at modifier" that introduced two new keywords: `start` and `end`. In grouping options and in metric names, these keywords took precedence over metric or label names, so that those metrics and labels could no longer be referenced. Signed-off-by: Clayton Peters <clayton.peters@man.com> * Add in additional tests for metrics and/or labels called start/end. Signed-off-by: Clayton Peters <clayton.peters@man.com> * : Cut 2.29.0-rc.0 Signed-off-by: Frederic Branczyk <fbranczyk@gmail.com> VERSION: bump to 2.29.0-rc.0 Signed-off-by: Frederic Branczyk <fbranczyk@gmail.com> * Remove experimental wording on size-based retention Followup of #9004 Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * Fix PR reference in changelog Signed-off-by: George Brighton <george@gebn.co.uk> * Describe EC2 availability zone IDs at most once per refresh (#9142) Signed-off-by: George Brighton <george@gebn.co.uk> * Describe EC2 availability zones at most once per SD load Closes #9142. Signed-off-by: George Brighton <george@gebn.co.uk> * Incorporate feedback Signed-off-by: George Brighton <george@gebn.co.uk> * Integrate feedback Signed-off-by: George Brighton <george@gebn.co.uk> * Add a compatibility note for macOS users. Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * : Cut v2.29.0-rc.1 Signed-off-by: Frederic Branczyk <fbranczyk@gmail.com> Fix `kuma_sd` targetgroup reporting (#9157) * Bundle all xDS targets into a single group Signed-off-by: austin ce <austin.cawley@gmail.com> * : cut v2.29.0-rc.2 Signed-off-by: Frederic Branczyk <fbranczyk@gmail.com> Rename links Signed-off-by: Levi Harrison <git@leviharrison.dev> * bump codemirror-promql to 0.17.0 Signed-off-by: Augustin Husson <husson.augustin@gmail.com> * : cut v2.29.0 Signed-off-by: Frederic Branczyk <fbranczyk@gmail.com> tsdb: align atomically accessed int64 (#9192) This prevents a panic in 32-bit archs: https://pkg.go.dev/sync/atomic#pkg-note-BUG Fixed #9190 Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * Release 2.29.1 (#9193) Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> Co-authored-by: Clayton Peters <clayton.peters@man.com> Co-authored-by: Frederic Branczyk <fbranczyk@gmail.com> Co-authored-by: George Brighton <george@gebn.co.uk> Co-authored-by: Austin Cawley-Edwards <austin.cawley@gmail.com> Co-authored-by: Levi Harrison <git@leviharrison.dev> Co-authored-by: Augustin Husson <husson.augustin@gmail.com>	3 years ago
Levi Harrison	0d7eb73d92	Rename links Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Levi Harrison	b9b5adfe62	Rename links Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Ganesh Vernekar	ee7e0071d1	Snapshot in-memory chunks on shutdown for faster restarts (#7229 ) Signed-off-by: Ganesh Vernekar <ganeshvern@gmail.com>	3 years ago
TJ Hoplock	7baf084092	optimize Linode SD by polling for event changes during refresh (#8980 ) * optimize Linode SD by polling for event changes during refresh Most accounts are fairly "static", in the sense that they're not cycling through instances constantly. So rather than do a full refresh every interval and potentially make several behind-the-scenes paginated API calls, this will now poll the `/account/events/` endpoint every minute with a list of events that we care about. If a matching event is found, we then do a full refresh. Co-authored-by: William Smith <wsmith@linode.com> Signed-off-by: TJ Hoplock <t.hoplock@gmail.com> Signed-off-by: William Smith <wsmith@linode.com>	3 years ago
Levi Harrison	c1b1b826ce	HostNetworkHost -> HostNetworkingHost Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
George Brighton	1f752b6910	Incorporate feedback Signed-off-by: George Brighton <george@gebn.co.uk>	3 years ago
Darshan Chaudhary	c4f2e9eec5	Add present_over_time (#9097 ) * Add present_over_time Signed-off-by: darshanime <deathbullet@gmail.com> * Add tests for present_over_time Signed-off-by: darshanime <deathbullet@gmail.com> * Address PR comments Signed-off-by: darshanime <deathbullet@gmail.com> * Add documentation for present_over_time Signed-off-by: darshanime <deathbullet@gmail.com> * Update documentation Signed-off-by: darshanime <deathbullet@gmail.com> * Update documentation comment Signed-off-by: darshanime <deathbullet@gmail.com>	3 years ago
Richard Hartmann	d68d3983d8	Make clear that start/end are inclusive Fixes: https://github.com/prometheus/prometheus/issues/9100 Signed-off-by: Richard Hartmann <richih@richih.org>	3 years ago
Levi Harrison	3556302c76	Added docs Signed-off-by: Levi Harrison <git@leviharrison.dev>	3 years ago
Levi Harrison	a8850a0819	Add note to docs Signed-off-by: Levi Harrison <git@leviharrison.dev> Co-authored-by: Julien Pivotto <roidelapluie@inuits.eu>	3 years ago
darshanime	c8a2ffdb72	Add computer name to azure sd Signed-off-by: darshanime <deathbullet@gmail.com>	3 years ago
George Brighton	bc0e76c8a3	Add AZ ID label to discovered EC2 targets (#8896 ) * Add AZ ID to EC2 SD Signed-off-by: George Brighton <george@gebn.co.uk>	3 years ago
austin ce	3593b20cdb	Add documentation for kuma_sd configuration Signed-off-by: austin ce <austin.cawley@gmail.com>	3 years ago
Arunprasad Rajkumar	83a56e22ab	docs: update unit_testing_rules to cover missing and stale samples (#9065 ) Signed-off-by: Arunprasad Rajkumar <arajkuma@redhat.com>	3 years ago
Ben Kochie	e98b639ac7	Rename "Disabled Features" docs page (#9073 ) Make the feature flags page more discoverable by naming it what it is. Signed-off-by: SuperQ <superq@gmail.com>	3 years ago
Lukas Kämmerling	263847e64a	hcloud discovery: Add new labelpresent label (#9028 ) * Add new labelpresent label Signed-off-by: Lukas Kämmerling <lukas.kaemmerling@hetzner-cloud.de>	3 years ago
Ankit Goel	d437cee73a	Move storage.tsdb.retention.size out of experimental #8728 (#9004 ) * Move storage.tsdb.retention.size out of experimental #8728 Signed-off-by: Ankit Goel <ankit.goel@deliveryhero.com>	3 years ago
Joey Freeland	8017dd7242	chore: always append interface ipv4 with api interface name Signed-off-by: Joey Freeland <joey@free.land>	3 years ago

1 2 3 4 5 ...

438 Commits (7400e07fa90d0cbce2c63415bf941676e8ffc909)