github/k3s - k3s - https://git.xinac.net

Commit Graph

Author	SHA1	Message	Date
Brad Davidson	8d47645312	Consistently set snapshotFile timestamp Attempt to use timestamp from creation or filename instead of file/object modification times Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	1 year ago
Brad Davidson	f1afe153a3	Tidy s3 upload functions Consistently refer to object keys as such, simplify error handling. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	1 year ago
Brad Davidson	2b0e2e8ada	Elide old snapshot data when apiserver rejects configmap with ErrRequestEntityTooLarge Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	1 year ago
Brad Davidson	676b00aa0e	Move etcd snapshot code into separate file Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	1 year ago
Brad Davidson	8705a88bf4	Clear remove annotations on cluster reset; refuse to delete last member from cluster Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	1 year ago
Brad Davidson	002e6c43ee	Reorganize Driver interface and etcd driver to avoid passing context and config into most calls Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	1 year ago
Brad Davidson	890645924f	Don't export functions not needed outside the etcd package Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	1 year ago
Brad Davidson	8c73fd670b	Disable HTTP on main etcd client port Fixes performance issue under load, ref: https://github.com/etcd-io/etcd/issues/15402 and https://github.com/kubernetes/kubernetes/pull/118460 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	1 year ago
Vitor Savian	e83b1ba4aa	Fixed the etcd retention to delete orphaned snapshots based on the date (#8177 ) * Fix retention using name instead of date Signed-off-by: Vitor <vitor.savian@suse.com>	1 year ago
Ian Cardoso	e551308db8	fix for etcd-snapshot delete with --etcd-s3 flag (#8110 ) k3s etcd-snapshot save --etcd-s3 ... is creating a local snapshot and uploading it to s3 while k3s etcd-snapshot delete --etcd-s3 ... was deleting the snapshot only on s3 buckets, this commit change the behavior of delete to do it locally and on s3 Signed-off-by: Ian Cardoso <osodracnai@gmail.com>	1 year ago
Vitor Savian	ca7aeed090	Etcd snapshots retention when node name changes (#8099 ) Fixed the etcd retention to delete orphaned snapshots Signed-off-by: Vitor <vitor.savian@suse.com>	1 year ago
Brad Davidson	aa76942d0f	Add FilterCN function to prevent SAN Stuffing Wire up a node watch to collect addresses of server nodes, to prevent adding unauthorized SANs to the dynamiclistener cert. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	1 year ago
Brad Davidson	e61fde93c1	Fix MemberList error handling and incorrect etcd-arg passthrough Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2 years ago
Brad Davidson	91afb38799	Retry cluster join on "too many learners" error Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2 years ago
Brad Davidson	d95980bba3	Lock bootstrap data with empty key to prevent conflicts Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2 years ago
Brad Davidson	b010db0cff	Ensure that loopback is used for the advertised address when resetting Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2 years ago
Brad Davidson	0c302f4341	Fix etcd member deletion Turns out etcd-only nodes were never running any of the controllers, so allowing multiple controllers didn't really fix things. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2 years ago
Brad Davidson	3d146d2f1b	Allow for multiple sets of leader-elected controllers Addresses an issue where etcd controllers did not run on etcd-only nodes Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2 years ago
Brad Davidson	a298bfdb18	Add jitter to scheduled snapshots and retry harder on conflicts Also ensure that the snapshot job does not attempt to trigger multiple concurrent runs, as this is not supported. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2 years ago
Derek Nola	06d81cb936	Replace deprecated ioutil package (#6230 ) * Replace ioutil package * check integration test null pointer * Remove rotate retries Signed-off-by: Derek Nola <derek.nola@suse.com>	2 years ago
Derek Nola	4c0bc8c046	Update etcd error to match correct url (#5909 ) Signed-off-by: Derek Nola <derek.nola@suse.com>	2 years ago
Brad Davidson	5eaa0a9422	Replace getLocalhostIP with Loopback helper method Requires tweaking existing method signature to allow specifying whether or not IPv6 addresses should be return URL-safe. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2 years ago
Brad Davidson	1674b9d640	Raise etcd connection test timeout to 30 seconds Addressess issue where the compact may take more than 10 seconds on slower disks. These disks probably aren't really suitable for etcd, but apparently run fine otherwise. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2 years ago
Brad Davidson	ffe72eecc4	Address issues with etcd snapshots * Increase the default snapshot timeout. The timeout is not currently configurable from Rancher, and larger clusters are frequently seeing uploads fail at 30 seconds. * Enable compression for scheduled snapshots if enabled on the command-line. The CLI flag was not being passed into the etcd config. * Only set the S3 content-type to application/zip if the file is zipped. * Don't run more than one snapshot at once, to prevent misconfigured etcd snapshot cron schedules from stacking up. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2 years ago
Brad Davidson	6fad63583b	Only listen on loopback when resetting Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2 years ago
Brad Davidson	fb0a342a20	Sanitize filenames for use in configmap keys If the user points S3 backups at a bucket containing other files, those file names may not be valid configmap keys. For example, RKE1 generates backup files with names like `s3-c-zrjnb-rs-6hxpk_2022-05-05T12:05:15Z.zip`; the semicolons in the timestamp portion of the name are not allowed for use in configmap keys. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2 years ago
Brad Davidson	ce5b9347c9	Replace DefaultProxyDialerFn dialer injection with EgressSelector support Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Brad Davidson	418c3fa858	Fix issue with datastore corruption on cluster-reset (#5515 ) * Bump etcd to v3.5.4-k3s1 * Fix issue with datastore corruption on cluster-reset * Disable unnecessary components during cluster reset Disable control-plane components and the tunnel setup during cluster-reset, even when not doing a restore. This reduces the amount of log clutter during cluster reset/restore, making any errors encountered more obvious. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Brad Davidson	f2ceeb01d9	Fix issue with long-running apiserver endpoints watch (#5478 ) Use ListWatch helpers to retry when the watch channel is closed. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Brad Davidson	7760e2177a	Bump etcd to 3.5.3-k3s1 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Brad Davidson	b12cd62935	Move IPv4/v6 selection into helpers Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Roberto Bonafiglia	9c9adda61b	Added default endpoint for IPv6 Signed-off-by: Roberto Bonafiglia <roberto.bonafiglia@suse.com>	3 years ago
Brad Davidson	f37e7565b8	Move the apiserver addresses controller into the etcd package This controller only needs to run when using managed etcd, so move it in with the rest of the etcd stuff. This change also modifies the controller to only watch the Kubernetes service endpoint, instead of watching all endpoints in the entire cluster. Fixes an error message revealed by use of a newer grpc client in Kubernetes 1.24, which logs an error when the Put to etcd failed because kine doesn't support the etcd Put operation. The controller shouldn't have been running without etcd in the first place. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Brad Davidson	2a429aac65	Fix crash on early snapshot Don't attempt to retrieve snapshot metadata configmap if the apiserver isn't available. This could be triggered if the cron expression caused a snapshot to be triggered before the apiserver is up. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Roberto Bonafiglia	4afeb9c5c7	Merge pull request #5325 from rbrtbnfgl/fix-etcd-ipv6-url Fixed etcd URL in case of IPv6 address	3 years ago
Roberto Bonafiglia	0746dde758	Fixed http URL on etcd Signed-off-by: Roberto Bonafiglia <roberto.bonafiglia@suse.com>	3 years ago
Roberto Bonafiglia	06c779c57d	Fixed loadbalancer in case of IPv6 addresses Signed-off-by: Roberto Bonafiglia <roberto.bonafiglia@suse.com>	3 years ago
Brad Davidson	62cc1ed24f	Skip setting up client tls when etcd server does not have tls enabled Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Roberto Bonafiglia	dda409b041	Updated localhost address on IPv6 only setup Signed-off-by: Roberto Bonafiglia <roberto.bonafiglia@suse.com>	3 years ago
Brad Davidson	1339626a5b	Defragment etcd datastore before clearing alarms Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Roberto Bonafiglia	2285aa699b	Fixed etcd URL in case of IPv6 address Signed-off-by: Roberto Bonafiglia <roberto.bonafiglia@suse.com>	3 years ago
Brad Davidson	078da46532	Close additional leaked GPRC clients Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Derek Nola	1f7abe5dbb	Testing directory and documentation rework. (#5256 ) * Removed vagrant folder * Fix comments around E2E ENVs * Eliminate testutil folder * Convert flock integration test to unit test * Point to other READMEs Signed-off-by: Derek Nola <derek.nola@suse.com>	3 years ago
Luther Monson	9a849b1bb7	[master] changing package to k3s-io (#4846 ) * changing package to k3s-io Signed-off-by: Luther Monson <luther.monson@gmail.com> Co-authored-by: Derek Nola <derek.nola@suse.com>	3 years ago
Brad Davidson	9a48086524	Ignore cluster membership errors when reconciling from temp etcd Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Brad Davidson	e4846c92b4	Move temporary etcd startup into etcd module Reuse the existing etcd library code to start up the temporary etcd server for bootstrap reconcile. This allows us to do proper health-checking of the datastore on startup, including handling of alarms. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Brad Davidson	555087b9b8	Add function to clear local alarms on etcd startup Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Brad Davidson	5014c9e0e8	Fix adding etcd-only node to existing cluster Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Brad Davidson	2989b8b2c5	Remove unnecessary copies of runtime struct Several types contained redundant references to ControlRuntime data. Switch to consistently accessing this via config.Runtime instead. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	3 years ago
Derek Nola	e28be2912c	Migrate Ginkgo testing framework to V2, consolidate integration tests (#5097 ) * Upgrade and convert ginkgo from v1 to v2 * Move all integration tests into integration folder * Update TESTING.md Signed-off-by: Derek Nola <derek.nola@suse.com>	3 years ago

1 2 3

131 Commits (8d4764531248f064e73944fa220ad346683b80c5)