github/k3s - k3s - https://git.xinac.net

Commit Graph

Author	SHA1	Message	Date
Brad Davidson	0a728b8ff9	Convert remaining http handlers over to use util.SendError Signed-off-by: Brad Davidson <brad.davidson@rancher.com> (cherry picked from commit `f8e0648304`) Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2024-05-31 09:16:55 -07:00
Brad Davidson	7ef30a2c60	Refactor supervisor listener startup and add metrics * Refactor agent supervisor listener startup and authn/authz to use upstream auth delegators to perform for SubjectAccessReview for access to metrics. * Convert spegel and pprof handlers over to new structure. * Promote bind-address to agent flag to allow setting supervisor bind address for both agent and server. * Promote enable-pprof to agent flag to allow profiling agents. Access to the pprof endpoint now requires client cert auth, similar to the spegel registry api endpoint. * Add prometheus metrics handler. Signed-off-by: Brad Davidson <brad.davidson@rancher.com> (cherry picked from commit `ff679fb3ab`) Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2024-05-31 09:16:55 -07:00
Brad Davidson	2b63eb4a27	Fix issue with k3s-etcd informers not starting Start shared informer caches when k3s-etcd controller wins leader election. Previously, these were only started when the main k3s apiserver controller won an election. If the leaders ended up going to different nodes, some informers wouldn't be started Signed-off-by: Brad Davidson <brad.davidson@rancher.com> (cherry picked from commit `3d14092f76`) Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2024-05-31 09:16:55 -07:00
Brad Davidson	7d9abc9f07	Improve etcd load-balancer startup behavior Prefer the address of the etcd member being joined, and seed the full address list immediately on startup. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2024-04-09 15:36:33 -07:00
Brad Davidson	fe465cc832	Move etcd snapshot management CLI to request/response Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2024-04-09 15:21:26 -07:00
Vitor Savian	5d69d6e782	Add tls for kine Signed-off-by: Vitor Savian <vitor.savian@suse.com> Bump kine Signed-off-by: Vitor Savian <vitor.savian@suse.com> Add integration tests for kine with tls Signed-off-by: Vitor Savian <vitor.savian@suse.com>	2024-03-28 11:12:07 -03:00
Brad Davidson	c51d7bfbd1	Add health-check support to loadbalancer * Adds support for health-checking loadbalancer servers. If a health-check fails when dialing, all existing connections to the server will be closed. * Wires up a remotedialer tunnel connectivity check as the health check for supervisor/apiserver connections. * Wires up a simple ping request to the supervisor port as the health check for etcd connections. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2024-03-27 16:50:27 -07:00
Brad Davidson	82e3c32c9f	Retry startup snapshot reconcile The reconcile may run before the kubelet has created the node object; retry until it succeeds Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2024-02-06 17:46:24 -08:00
Brad Davidson	7ecd5874d2	Skip initial datastore reconcile during cluster-reset Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-11-15 14:31:44 -08:00
Brad Davidson	d885162967	Add server token hash to CR and S3 This required pulling the token hash stuff out of the cluster package, into util. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-10-12 15:04:45 -07:00
Brad Davidson	7464007037	Store extra metadata and cluster ID for snapshots Write the extra metadata both locally and to S3. These files are placed such that they will not be used by older versions of K3s that do not make use of them. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-10-12 15:04:45 -07:00
Derek Nola	dface01de8	Server Token Rotation (#8265 ) * Consolidate NewCertCommands * Add support for user defined new token * Add E2E testlets Signed-off-by: Derek Nola <derek.nola@suse.com> * Ensure agent token also changes Signed-off-by: Derek Nola <derek.nola@suse.com>	2023-10-09 10:58:49 -07:00
Manuel Buil	f2c7117374	Take IPFamily precedence based on order Signed-off-by: Manuel Buil <mbuil@suse.com>	2023-09-29 11:04:15 +02:00
Brad Davidson	002e6c43ee	Reorganize Driver interface and etcd driver to avoid passing context and config into most calls Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-09-25 11:54:23 -07:00
Brad Davidson	0d23cfe038	Add RWMutex to address controller Fixes race condition when address map is updated by multiple goroutines Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-08-29 20:52:37 -07:00
Brad Davidson	cba9f0d142	Add new CLI flag to disable TLS SAN CN filtering Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-08-29 08:33:45 -07:00
Brad Davidson	66bae3e326	Bump dynamiclistener for init deadlock fix Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-08-15 16:36:12 -07:00
Brad Davidson	aa76942d0f	Add FilterCN function to prevent SAN Stuffing Wire up a node watch to collect addresses of server nodes, to prevent adding unauthorized SANs to the dynamiclistener cert. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-08-02 11:15:39 -07:00
Manuel Buil	869e030bdd	VPN PoC Signed-off-by: Manuel Buil <mbuil@suse.com>	2023-06-09 12:39:33 +02:00
Brad Davidson	cf9ebb3259	Fail to validate server tokens that use bootstrap id/secret format Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-05-05 12:24:35 -07:00
Brad Davidson	c44d33d29b	Fix race condition in tunnel server startup Several places in the code used a 5-second retry loop to wait on Runtime.Core to be set. This caused a race condition where OnChange handlers could be added after the Wrangler shared informers were already started. When this happened, the handlers were never called because the shared informers they relied upon were not started. Fix that by requiring anything that waits on Runtime.Core to run from a cluster controller startup hook that is guaranteed to be called before the shared informers are started, instead of just firing it off in a goroutine that retries until it is set. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-04-28 11:24:34 -07:00
Brad Davidson	d95980bba3	Lock bootstrap data with empty key to prevent conflicts Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-04-05 10:56:57 -07:00
Brad Davidson	0c302f4341	Fix etcd member deletion Turns out etcd-only nodes were never running any of the controllers, so allowing multiple controllers didn't really fix things. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-02-14 09:39:41 -08:00
Brad Davidson	992e64993d	Add support for kubeadm token and client certificate auth Allow bootstrapping with kubeadm bootstrap token strings or existing Kubelet certs. This allows agents to join the cluster using kubeadm bootstrap tokens, as created with the `k3s token create` command. When the token expires or is deleted, agents can successfully restart by authenticating with their kubelet certificate via node authentication. If the token is gone and the node is deleted from the cluster, node auth will fail and they will be prevented from rejoining the cluster until provided with a valid token. Servers still must be bootstrapped with the static cluster token, as they will need to know it to decrypt the bootstrap data. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2023-02-07 14:55:04 -08:00
Silvio Moioli	23c1040adb	Bugfix: do not break cert-manager when pprof is enabled (#6635 ) Signed-off-by: Silvio Moioli <silvio@moioli.net>	2023-01-13 16:09:14 -08:00
Derek Nola	13c633da12	Add Secrets Encryption to CriticalArgs (#6409 ) * Add EncryptSecrets to Critical Control Args * use deep comparison to extract differences Signed-off-by: Derek Nola <derek.nola@suse.com> Signed-off-by: Derek Nola <derek.nola@suse.com>	2022-11-04 10:35:29 -07:00
iyear	3aae7b8783	Fix incorrect defer usage Problem: Using defer inside a loop can lead to resource leaks Solution: Judge newer file in the separate function Signed-off-by: iyear <ljyngup@gmail.com>	2022-11-01 16:23:25 -07:00
Derek Nola	06d81cb936	Replace deprecated ioutil package (#6230 ) * Replace ioutil package * check integration test null pointer * Remove rotate retries Signed-off-by: Derek Nola <derek.nola@suse.com>	2022-10-07 17:36:57 -07:00
Brad Davidson	fc1c100ffd	Remove legacy bidirectional datastore sync code Since #4438 removed 2-way sync and treats any changed+newer files on disk as an error, we no longer need to determine if files are newer on disk/db or if there is a conflicting mix of both. Any changed+newer file is an error, unless we're doing a cluster reset in which case everything is unconditionally replaced. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-07-12 12:10:30 -07:00
Brad Davidson	83420ef78e	Fix fatal error when reconciling bootstrap data Properly skip restoring bootstrap data for files that don't have a path set because the feature that would set it isn't enabled. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-07-12 12:10:30 -07:00
Brad Davidson	96162c07c5	Handle egress-selector-mode change during upgrade Properly handle unset egress-selector-mode from existing servers during cluster upgrade. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-06-30 11:57:41 -07:00
Igor	2999289e68	add support for pprof server (#5527 ) Signed-off-by: igor <igor@igor.io>	2022-06-13 22:06:55 -07:00
Brad Davidson	ce5b9347c9	Replace DefaultProxyDialerFn dialer injection with EgressSelector support Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-04-29 17:54:36 -07:00
Brad Davidson	418c3fa858	Fix issue with datastore corruption on cluster-reset (#5515 ) * Bump etcd to v3.5.4-k3s1 * Fix issue with datastore corruption on cluster-reset * Disable unnecessary components during cluster reset Disable control-plane components and the tunnel setup during cluster-reset, even when not doing a restore. This reduces the amount of log clutter during cluster reset/restore, making any errors encountered more obvious. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-04-27 13:44:15 -07:00
Roberto Bonafiglia	4afeb9c5c7	Merge pull request #5325 from rbrtbnfgl/fix-etcd-ipv6-url Fixed etcd URL in case of IPv6 address	2022-04-05 09:55:42 +02:00
Roberto Bonafiglia	06c779c57d	Fixed loadbalancer in case of IPv6 addresses Signed-off-by: Roberto Bonafiglia <roberto.bonafiglia@suse.com>	2022-03-31 11:49:30 +02:00
Roberto Bonafiglia	dda409b041	Updated localhost address on IPv6 only setup Signed-off-by: Roberto Bonafiglia <roberto.bonafiglia@suse.com>	2022-03-29 09:35:54 +02:00
Brad Davidson	1339626a5b	Defragment etcd datastore before clearing alarms Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-03-28 09:27:59 -07:00
Brad Davidson	3cebde924b	Handle empty entries in bootstrap path map Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-03-17 13:42:27 -07:00
Brad Davidson	003e094b45	Populate EtcdConfig in runtime from datastore when etcd is disabled (#5222 ) Fixes issue with secrets-encrypt rotate not having any etcd endpoints available on nodes without a local etcd server. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-03-08 09:04:31 -08:00
Luther Monson	9a849b1bb7	[master] changing package to k3s-io (#4846 ) * changing package to k3s-io Signed-off-by: Luther Monson <luther.monson@gmail.com> Co-authored-by: Derek Nola <derek.nola@suse.com>	2022-03-02 15:47:27 -08:00
Brad Davidson	9a48086524	Ignore cluster membership errors when reconciling from temp etcd Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-03-01 20:25:20 -08:00
Brad Davidson	e4846c92b4	Move temporary etcd startup into etcd module Reuse the existing etcd library code to start up the temporary etcd server for bootstrap reconcile. This allows us to do proper health-checking of the datastore on startup, including handling of alarms. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-03-01 20:25:20 -08:00
Kamil Madac	333248466b	Add http/2 support to API server (#5149 ) fix issue #5148 Signed-off-by: Kamil Madac <kamil.madac@gmail.com>	2022-03-01 11:27:52 -08:00
Brad Davidson	5014c9e0e8	Fix adding etcd-only node to existing cluster Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-02-28 19:56:08 -08:00
Brad Davidson	a1b800f0bf	Remove unnecessary copies of etcdconfig struct Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-02-28 12:05:16 -08:00
Brad Davidson	2989b8b2c5	Remove unnecessary copies of runtime struct Several types contained redundant references to ControlRuntime data. Switch to consistently accessing this via config.Runtime instead. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-02-28 12:05:16 -08:00
Brad Davidson	54bb65064e	Fix cluster bootstrap test Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-02-28 12:05:16 -08:00
Brad Davidson	5ca206ad3b	Fix handling of agent-token fallback to token Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-01-07 09:56:37 -08:00
Brad Davidson	e7464a17f7	Fix use of agent creds for secrets-encrypt and config validate Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2022-01-06 12:55:18 -08:00

1 2 3

132 Commits (b2a2ac0afc63e9c86bf10ab3c36c9af92c0152a8)