github/k3s - k3s - https://git.xinac.net

Commit Graph

Author	SHA1	Message	Date
Brad Davidson	f7dcc139ff	Bump klipper-lb image for arm fix Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-11-02 18:55:09 -07:00
Deshi Xiao	f1622129e4	refactor: Use plain channel send or receive fix issue #4369 should use a simple channel send/receive instead of select with a single case Signed-off-by: Deshi Xiao <xiaods@gmail.com>	2021-11-01 15:00:49 -07:00
Brad Davidson	f9f1cabe9c	Fix log/reap reexec Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-11-01 14:24:14 -07:00
Jacob Blain Christen	702fe24afe	containerd/cri: enable the btrfs snapshotter (#4316 ) * vendor: btrfs * enable the btrfs snapshotter * testing: snapshotter/btrfs Signed-off-by: Jacob Blain Christen <jacob@rancher.com>	2021-10-29 23:31:33 -07:00
Brad Davidson	3da1bb3af2	Fix other uses of NewForConfigOrDie in contexts where we could return err Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-10-29 15:18:14 -07:00
Brad Davidson	5acd0b9008	Watch the local Node object instead of get/sleep looping Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-10-29 15:18:14 -07:00
Brad Davidson	3fe460d080	Block scheduler startup on untainted node when using embedded CCM Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-10-29 15:18:14 -07:00
Derek Nola	7c3f21e581	K3s Integration test fixes (#4341 ) * Move tests into sub folders * Updated documentation * Prevent infinite loop is user has not made k3s Signed-off-by: dereknola <derek.nola@suse.com>	2021-10-28 12:35:28 -07:00
galal-hussein	ab3d25a2c5	Update peer address when running cluster-reset Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-10-25 15:43:27 -07:00
Brian Downs	0a0b915921	reset buffer after use (#4279 )	2021-10-22 15:56:01 -07:00
Derek Nola	918945da45	Added configuration input to etcd-snapshot (#4280 ) Signed-off-by: dereknola <derek.nola@suse.com>	2021-10-22 12:03:32 -07:00
Brian Downs	e11a4bf8bb	set duration to second (#4231 )	2021-10-15 16:46:39 -07:00
Brian Downs	0452f017c1	Add etcd s3 timeout (#4207 )	2021-10-15 10:24:14 -07:00
Brian Downs	34080b23b1	Copy old bootstrap buffer data for use during migration (#4215 )	2021-10-15 10:17:29 -07:00
Manuel Buil	dbc14b8990	Fix race condition in cloud provider Signed-off-by: Manuel Buil <mbuil@suse.com>	2021-10-15 13:28:32 +02:00
Brad Davidson	5a923ab8dc	Add containerd ready channel to delay etcd node join Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-10-14 14:03:52 -07:00
Hussein Galal	b282528ee2	Display cluster tls error only in debug mode (#4124 ) * Display cluster tls error only in debug mode Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-10-13 00:00:28 +02:00
Brad Davidson	dc18ef2e51	Refactor log and reaper exec to omit MAINPID Using MAINPID breaks systemd's exit detection, as it stops watching the original pid, but is unable to watch the new pid as it is not a child of systemd itself. The best we can do is just notify when execing the child process. We also need to consolidate forking into a sigle place so that we don't end up with multiple levels of child processes if both redirecting log output and reaping child processes. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-10-12 13:35:10 -07:00
Derek Nola	feec44572d	Improve error message when using a "K10" prefixed token (#4180 ) * Add new error message with a K10 prefixed secret token Signed-off-by: dereknola <derek.nola@suse.com>	2021-10-11 10:00:22 -07:00
Brian Downs	ac7a8d89c6	Add ability to reconcile bootstrap data between datastore and disk (#3398 )	2021-10-07 12:47:00 -07:00
Derek Nola	b6919adf62	Add "etcd-" prefix to etcd-snapshot commands as aliases (#4161 ) * Add "etcd-" prefix to etcd-snapshot commands as alias Signed-off-by: dereknola <derek.nola@suse.com>	2021-10-06 14:20:22 -07:00
Manuel Buil	635f790eb4	Merge pull request #4114 from manuelbuil/lb-controller-dual-stack Dual-stack support in serviceLB controller	2021-10-06 16:08:10 +02:00
Manuel Buil	00cf4578ec	Dual-stack support LB controller Signed-off-by: Manuel Buil <mbuil@suse.com>	2021-10-06 11:06:20 +02:00
Marc Bachmann	9b35734e1a	Add topologySpreadConstraints to support scaling of coredns Signed-off-by: Marc Bachmann <marc.brookman@gmail.com>	2021-10-05 11:52:44 -07:00
Brad Davidson	12e675e2cc	Don't evacuate the root cgroup when rootless Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-10-01 16:18:12 -07:00
Brad Davidson	5d1a37ee32	Send MAINPID to systemd when reexecing for logfile output This allows the new process to notify systemd when it is ready. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-29 11:41:09 -07:00
Brad Davidson	a16105b348	Properly handle operation as init process Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-28 11:05:34 -07:00
Brian Downs	f4cea90cb9	set transport to skip verify if se skip flag passed (#4102 )	2021-09-28 10:13:50 -07:00
Manuel Buil	87524a7ac7	Enable the inheritance of settings for ipv6 Signed-off-by: Manuel Buil <mbuil@suse.com>	2021-09-28 09:42:08 +02:00
Michal Rostecki	47676eff78	Merge pull request #4080 from manuelbuil/update_klipperlb2 Use the new klipper-lb image that has newer go and Alpine versions	2021-09-27 10:11:52 +02:00
Brad Davidson	73e21e739f	Drop broken SupportNoneCgroupDriver support Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-23 16:12:51 -07:00
Manuel Buil	b99b943c17	Use the new klipper-lb image that has newer go and Alpine versions Signed-off-by: Manuel Buil <mbuil@suse.com>	2021-09-22 09:23:38 +02:00
Brad Davidson	28be0de4e8	Revert "Use the newer klipper-lb image" This reverts commit `1d21491094`.	2021-09-20 13:19:38 -07:00
Brad Davidson	64b502e92c	Disable automounting service account token in servicelb pods Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-17 15:52:44 -07:00
Hussein Galal	7826407a2e	Make sure there are no duplicates in etcd member list (#4025 ) * Make sure there are no duplicates in etcd member list Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix node names with hyphens Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * use full server name for etcd node name Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-09-18 00:51:18 +02:00
Manuel Buil	1d21491094	Use the newer klipper-lb image Signed-off-by: Manuel Buil <mbuil@suse.com>	2021-09-17 15:42:48 -07:00
Brad Davidson	753e11ee3c	Enable JobTrackingWithFinalizers FeatureGate Works around issue with Job controller not tracking job pods that are in CrashloopBackoff during upgrade from 1.21 to 1.22. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-17 11:26:45 -07:00
Derek Nola	eda65b19d9	Remove expiremental from cluster commands (#4024 ) Signed-off-by: dereknola <derek.nola@suse.com>	2021-09-15 16:41:50 -07:00
Joe Kralicky	debb508643	Nvidia container runtime discovery in containerd config template (#3890 ) * Update the default containerd config template with support for adding extra container runtimes. Add logic to discover nvidia container runtimes installed via the the gpu operator or package manager. Signed-off-by: Joe Kralicky <joe.kralicky@suse.com>	2021-09-15 14:31:11 -07:00
Brad Davidson	086ca8ba6a	Fix premature etcd shutdown when joining an existing cluster Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-15 10:35:07 -07:00
Manuel Buil	60cd86bc42	Merge pull request #3906 from manuelbuil/dual-stack Add dual-stack support on flannel	2021-09-15 18:48:10 +02:00
Brad Davidson	85e11c47d1	Add StargzSupported stub for Windows Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-15 09:45:57 -07:00
Chris Kim	acf9036b63	No-op when etcd member was already removed and use existing name for etcd controller (#4014 ) Signed-off-by: Chris Kim <oats87g@gmail.com>	2021-09-15 08:41:30 -07:00
Manuel Buil	9fcd79baae	Add tests to the dual-stack PR and enable dual-stack with flannel backend Signed-off-by: Manuel Buil <mbuil@suse.com>	2021-09-15 14:11:54 +02:00
Manuel Buil	681058bb40	Add dual-stack support Signed-off-by: Manuel Buil <mbuil@suse.com>	2021-09-15 11:44:48 +02:00
Brad Davidson	b72306ce3d	Return the error since it just gets logged and retried anyways Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-14 16:41:27 -07:00
Brad Davidson	5986898419	Use SubjectAccessReview to validate CCM RBAC Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-14 16:41:27 -07:00
Brad Davidson	dc556cbb72	Set controller authn/authz kubeconfigs Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-14 16:41:27 -07:00
Brad Davidson	199424b608	Pass context into all Executor functions Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-14 16:41:27 -07:00
Chris Kim	928b8531c3	[master] Add `etcd-member-management` controller to K3s (#4001 ) * Initial leader elected etcd member management controller * Bump etcd to v3.5.0-k3s2 Signed-off-by: Chris Kim <oats87g@gmail.com>	2021-09-14 08:20:38 -07:00
Brad Davidson	57377d2cd4	Minor cleanup on cribbed function Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-10 17:04:15 -07:00
Brad Davidson	3449d5b9f9	Wait for apiserver readyz instead of healthz Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-10 17:04:15 -07:00
Brad Davidson	b4d8c641c6	Add exposed metrics listener instead of replacing loopback listener Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-10 09:39:39 -07:00
Brad Davidson	29c8b238e5	Replace klog with non-exiting fork Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-10 09:36:16 -07:00
Brad Davidson	90960ebf4e	SupportPodPidsLimit is locked to true of 1.20, making pids cgroup support mandatory Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-09 11:49:53 -07:00
Darren Shepherd	741ba95b04	Migrate sqlite data to etcd when initializing the cluster Signed-off-by: Darren Shepherd <darren@rancher.com>	2021-09-09 10:24:02 -07:00
Devin Buhl	a1ec43e0b7	feat: add option to disable s3 over https Signed-off-by: Devin Buhl <devin.kray@gmail.com>	2021-09-05 12:03:49 -04:00
Kohei Tokunaga	8b857eef9c	Ship Stargz Snapshotter (#2936 ) * Ship Stargz Snapshotter Signed-off-by: ktock <ktokunaga.mail@gmail.com> * Bump github.com/containerd/stargz-snapshotter to v0.8.0 Signed-off-by: Kohei Tokunaga <ktokunaga.mail@gmail.com>	2021-09-01 16:27:42 -07:00
Brad Davidson	cf12a13175	Add missing node name entry to apiserver SAN list Also honor node-ip when adding the node address to the SAN list, instead of hardcoding the autodetected IP address. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-01 13:22:32 -07:00
Brad Davidson	b8add39b07	Bump kine for metrics/tls changes Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-09-01 01:51:30 -07:00
Hussein Galal	933052a02c	Fix condition for adding kubernetes endpoints (#3941 ) * Fix condition for adding kubernetes endpoints Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * Fix condition for adding kubernetes endpoints Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-08-31 00:57:17 +02:00
Derek Nola	60297a1bbe	Creation of K3s integration test Sonobuoy plugin (#3931 ) * Added test runner and build files * Changes to int test to output junit results. * Updated documentation, removed comments Signed-off-by: dereknola <derek.nola@suse.com>	2021-08-30 08:27:59 -07:00
Brad Davidson	2a68c7c8a4	Fix issue where addon checksum was never stored Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-08-27 10:26:13 -07:00
Manuel Buil	2e5c9e5cad	Merge pull request #3916 from manuelbuil/net_v6 Add functions to separate ipv4 and ipv6 CIDRs	2021-08-27 18:57:54 +02:00
Manuel Buil	96dcef478a	Add functions to separate ipv4 from ipv6 functions Signed-off-by: Manuel Buil <mbuil@suse.com>	2021-08-27 10:14:39 +02:00
Derek Nola	114b30277f	Redux: Enable K3s integration test to run on existing cluster (#3905 ) * Made it possible to run int tests on existing cluster Signed-off-by: dereknola <derek.nola@suse.com>	2021-08-26 16:26:19 -07:00
Akihiro Suda	331c6fed71	Remove runtime V1 (`containerd-shim`) Fix issue 3105 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-08-26 11:50:33 -07:00
Akihiro Suda	176451f4ea	Fix rootless regression in 1.22 (Set KubeletInUserNamespace gate) (#3901 ) Fix issue 3900 Kubernetes 1.22 requires `KuebletInUserNamespace` feature gate to be set for rootless: https://kubernetes.io/docs/tasks/administer-cluster/kubelet-in-userns/#userns-the-hard-way Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-08-24 08:27:17 -07:00
Derek Nola	66dacc6ee0	Revert "Enable K3s integration test to run on existing cluster (#3892 )" (#3899 ) This reverts commit `703b5af950`.	2021-08-24 07:26:14 -07:00
Derek Nola	703b5af950	Enable K3s integration test to run on existing cluster (#3892 ) * Made it possible to run int tests on existing cluster Signed-off-by: dereknola <derek.nola@suse.com>	2021-08-23 12:12:03 -07:00
Brad Davidson	e95b75409a	Fix lint failures Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-08-20 18:47:16 -07:00
Brad Davidson	a5355f0827	Replace dropped v1beta1 APIs with v1 Requires updating traefik as well to drop deprecated types. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-08-20 18:47:16 -07:00
Brad Davidson	dc14f370c4	Update wrangler to v0.8.5 Required to support apiextensions.v1 as v1beta1 has been deleted. Also update helm-controller and dynamiclistener to track wrangler versions. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-08-20 18:47:16 -07:00
Brad Davidson	c434db7cc6	Wrap errors in runControllers for additional context Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-08-20 18:47:16 -07:00
Brad Davidson	422d266da2	Disable deprecated insecure port Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-08-20 18:47:16 -07:00
Brad Davidson	641ab26fde	Update containerd to 1.5 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-08-20 18:47:16 -07:00
Brad Davidson	872855015c	Update etcd to v3.5.0 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-08-20 18:47:16 -07:00
Brad Davidson	e204d863a5	Update Kubernetes to v1.22.1 * Update Kubernetes to v1.22.1 * Update dependent modules to track with upstream Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-08-20 18:47:16 -07:00
Derek Nola	ed5991f13b	K3s Flock Integration Test (#3887 ) * Upgraded flock with shared and integration test. Signed-off-by: dereknola <derek.nola@suse.com> Co-authored-by: Brian Downs <brian.downs@gmail.com>	2021-08-20 12:34:22 -07:00
Hussein Galal	e322924781	Reset load balancer state during restoraion (#3877 ) * Reset load balancer state during restoraion Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * Reset load balancer state during restoraion Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-08-18 01:02:30 +02:00
Malte Starostik	b23955e835	Fix URL pruning when joining an etcd member (#3832 ) * Fix URL pruning when joining an etcd member Problem: Existing member clientURLs were checked if they contain the joining node's IP. In some edge cases this would prune valid URLs when the joining IP is a substring match of the only existing member's IP. Because of this, it was impossible to e.g. join 10.0.0.2 to an existing node that has an IP of 10.0.0.2X or 10.0.0.2XX: level=fatal msg="starting kubernetes: preparing server: start managed database: joining etcd cluster: etcdclient: no available endpoints" Solution: Fixed by properly parsing the URLs and comparing the IPs for equality instead of substring match. Signed-off-by: Malte Starostik <info@stellaware.de>	2021-08-12 15:59:04 -07:00
Derek Nola	a1e36153f9	Added locking system for integration tests (#3820 ) * Added locking system for integration tests Signed-off-by: dereknola <derek.nola@suse.com>	2021-08-10 16:22:12 -07:00
Jamie Phillips	ae909c73e5	Updated the code to use GetNetworkByName and tweaked logic. Updated the method being called and tweaked the logic. Signed-off-by: Jamie Phillips <jamie.phillips@suse.com>	2021-08-10 13:53:08 -07:00
Derek Nola	4cc781b5e3	Moved testing utils into tests directory. Improved gotests template. (#3805 ) * Moved testing utils into tests directory. Improved gotests template. * Updated cgroups2 with util folder rename Signed-off-by: dereknola <derek.nola@suse.com>	2021-08-10 11:13:26 -07:00
Brian Downs	dcf0657b20	account for an s3 folder when listing objects (#3807 ) * account for an s3 folder when listing objects	2021-08-09 16:14:41 -07:00
Derek Nola	b4eca61aeb	Prevent snapshot commands from creating empty snapshot directory (#3783 ) Signed-off-by: dereknola <derek.nola@suse.com>	2021-08-09 09:04:18 -07:00
Jiaqi Luo	3b01157a3a	Use New Image Names (#3749 ) * switch image names to the ones with the prefix mirrored * bump rancher/mirrored-coredns-coredns to 1.8.4 Signed-off-by: Jiaqi Luo <6218999+jiaqiluo@users.noreply.github.com>	2021-08-06 16:14:58 -07:00
Hussein Galal	bc96ffb5f3	Fix Node stuck at deletion (#3771 ) * fix Node stuck at deletion Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix Node stuck at deletion Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-08-05 22:32:01 +02:00
Brad Davidson	dfd4e42e57	Wrap context with lease before importing images Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-08-04 10:22:19 -07:00
Hussein Galal	2069cdf4ee	Fix initial start of etcd only nodes (#3748 ) * Fix initial start of etcd only nodes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-08-03 19:53:21 +02:00
Ryan Sanna	429af17e4d	update rancher/local-path-provisioner to v0.0.20 Signed-off-by: Ryan Sanna <ryansann@umich.edu>	2021-08-02 12:25:47 -07:00
Brad Davidson	5ab3590d9b	Improve config retrieval messages Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-07-30 12:26:50 -07:00
Brad Davidson	869b98bc4c	Sync DisableKubeProxy into control struct Sync DisableKubeProxy from cfg into control before sending control to clients, as it may have been modified by a startup hook. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-07-30 12:26:50 -07:00
Hussein Galal	b1b5f72dc3	Notify systemd for etcd only node (#3732 ) Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-07-29 23:42:19 +02:00
Jamie Phillips	7704fb6ee5	Exporting the AddFeatureGate function and adding a unit test for it. (#3661 )	2021-07-28 13:04:42 -07:00
Jamie Phillips	fc19b805d5	Added logic to strip any existing hyphens before processing the args. (#3662 ) Updated the logic to handle if extra args are passed with existing hyphens in the arg. The test was updated to add the additional case of having pre-existing hyphens. The method name was also refactored based on previous feedback.	2021-07-28 13:04:19 -07:00
Derek Nola	a1d7a62493	Fix to allow non-root users access to storage volumes. (#3714 ) * Fix to prevent non-root users from accessing storage directory, while allowing non-root users access to subdirectories. Signed-off-by: dereknola <derek.nola@suse.com> * Added integration test Signed-off-by: dereknola <derek.nola@suse.com>	2021-07-28 10:25:34 -07:00
Brad Davidson	90445bd581	Wait until server is ready before configuring kube-proxy (#3716 ) Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-07-27 14:56:05 -07:00
Derek Nola	21c8a33647	Introduction of Integration Tests (#3695 ) * Commit of new etcd snapshot integration tests. * Updated integration github action to not run on doc changes. * Update Drone runner to only run unit tests Signed-off-by: dereknola <derek.nola@suse.com>	2021-07-26 09:59:33 -07:00
galal-hussein	20a48734c2	more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-07-21 22:42:05 +02:00
galal-hussein	7ebcc4b134	more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-07-21 22:39:44 +02:00
galal-hussein	b4401296ec	replace error with warn in delete Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-07-21 22:18:56 +02:00
galal-hussein	2f82bfcf67	fix warning msg Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-07-21 22:05:43 +02:00
galal-hussein	b377839148	migrate old token key format Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-07-21 20:59:57 +02:00
galal-hussein	997ed7b9b4	simplifying the code Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-07-21 19:56:19 +02:00
galal-hussein	ad17292fa8	migrate empty string key properly Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-07-21 19:21:38 +02:00
galal-hussein	a65e5b6466	Fix multiple bootstrap keys found Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-07-21 02:50:42 +02:00
Luther Monson	37fcb61f5e	move go routines for api server ready beneath wait group Signed-off-by: Luther Monson <luther.monson@gmail.com>	2021-07-20 17:36:34 -07:00
Luther Monson	18bc98f60c	adding startup hooks args to access to Disables and Skips (#3674 ) Signed-off-by: Luther Monson <luther.monson@gmail.com>	2021-07-20 05:24:52 +02:00
Derek Nola	bba49ea447	Fix to allow prune to correctly cleanup custom named snapshots (#3649 ) Signed-off-by: dereknola <derek.nola@suse.com>	2021-07-19 14:30:57 -07:00
Jamie Phillips	aef8a6aafd	Adding support for waitgroup to the Startuphooks (#3654 ) The startup hooks where executing after the deploy controller. We needed the deploy controller to wait until the startup hooks had completed.	2021-07-15 19:28:47 -07:00
Hussein Galal	a939decf01	fix a runtime core panic (#3627 ) Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-07-13 23:33:07 +02:00
Derek Nola	55fe4ff5b0	Convert existing unit tests to standard layout (#3621 ) * Converted parser_test.go, scrypt_test.go, types_test.go, nodeconfig_test.go Signed-off-by: dereknola <derek.nola@suse.com>	2021-07-13 10:44:11 -07:00
Brian Downs	238dc2086e	prevent snapshot save when snapshots are disabled (#3475 ) * prevent snapshot save when snapshots are disabled	2021-07-09 10:22:49 -07:00
William Zhang	a4c992ce52	🐳 burp to inetaf/tcpproxy Problem: tcpproxy repository has been moved out of the github.com/google org to github.com/inetaf. Solution: Switch to the new repo. FYI: https://godoc.org/inet.af/tcpproxy/ Signed-off-by: William Zhang <warmchang@outlook.com>	2021-07-08 16:58:09 -07:00
Chris Kim	ada145641c	Update etcd snapshot error message to be more informative when etcd database is not found (#3568 ) Signed-off-by: Chris Kim <oats87g@gmail.com>	2021-07-07 16:01:50 -07:00
Jamie Phillips	a62d143936	Fixing various bugs related to windows. This changes the crictl template for issues with the socket information. It also addresses a typo in the socket address. Last it makes tweaks to configuration that aren't required or had incorrect logic. Signed-off-by: Jamie Phillips <jamie.phillips@suse.com> spelling	2021-07-07 15:50:34 -07:00
Derek Nola	73df2d806b	Update embedded kube-router (#3557 ) * Update embedded kube-router Signed-off-by: dereknola <derek.nola@suse.com>	2021-07-07 08:46:10 -07:00
Deshi Xiao	77fcf2dfc5	missing build tag for windows Signed-off-by: Deshi Xiao <xiaods@gmail.com>	2021-07-05 22:30:54 +08:00
Derek Nola	c833183517	Add unit tests for pkg/etcd (#3549 ) * Created new etcd unit tests and testing support file Signed-off-by: dereknola <derek.nola@suse.com>	2021-07-01 16:08:35 -07:00
Brad Davidson	cbfe673c43	Fix spelling to satisfy codespell check Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-07-01 13:29:03 -07:00
Brad Davidson	cbacd7107e	Allow passing targeted environment variables to containerd Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-07-01 13:29:03 -07:00
Hussein Galal	f5fbb9a9a8	Export cli server flags and etcd restoration functions (#3527 ) * Export cli server flags and etfd restoration functions Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * export S3 Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-06-30 22:29:03 +02:00
Brad Davidson	246b378a27	Bump kine to resolve race condition and unrevisioned delete Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-06-30 09:54:46 -07:00
Derek Nola	3e1693bc97	Changes local storage pods to have 700 permissions (#3537 ) * Changes local storage pods to have 700 permissions Signed-off-by: dereknola <derek.nola@suse.com>	2021-06-29 13:58:12 -07:00
Chris Kim	04398a2582	Move cloud-controller-manager into an embedded executor (#3525 ) * Move cloud-controller-manager into an embedded executor * Import K3s cloud provider and clean up imports Signed-off-by: Chris Kim <oats87g@gmail.com>	2021-06-29 07:28:38 -07:00
Joe Kralicky	a84c75af62	Adds a command-line flag '--disable-helm-controller' that will disable the server's built-in helm controller. Problem: Testing installation and uninstallation of the Helm Controller on k3s is not possible if the Helm Controller is baked into the k3s server. Solution: The Helm Controller can optionally be disabled, which will allow users to manage its installation manually. Signed-off-by: Joe Kralicky <joe.kralicky@suse.com>	2021-06-25 14:54:36 -04:00
Jamie Phillips	82394d7d36	Basic windows agent that will join a cluster without CNI. Signed-off-by: Jamie Phillips <jamie.phillips@suse.com>	2021-06-23 09:07:50 -07:00
Hussein Galal	136dddca11	Fix storing bootstrap data with empty token string (#3422 ) * Fix storing bootstrap data with empty token string Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * delete node password secret after restoration fixes to bootstrap key vendor update Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix comment Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix typo Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * typos Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * Removing dynamic listener file after restoration Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go mod tidy Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-06-22 22:42:34 +02:00
Derek Nola	4b2ab8b515	Renamed client-cloud-controller crt and key (#3470 ) Signed-off-by: dereknola <derek.nola@suse.com>	2021-06-16 13:54:35 -07:00
Derek Nola	ef23c6c548	Redux: Change containerd image leases from context lifespan to permanent (#3464 ) * Changed containerd image licenses from context lifespan to permanent. Delete any existing licenses owned by k3s on server startup Signed-off-by: dereknola <derek.nola@suse.com>	2021-06-16 12:11:10 -07:00
Derek Nola	b74c499709	Revert "Change containerd image leases from 24h to permanent (#3452 )" (#3461 ) This reverts commit `86b3ba8dba`.	2021-06-15 14:56:14 -07:00
Derek Nola	86b3ba8dba	Change containerd image leases from 24h to permanent (#3452 ) * Changed containerd image licenses from 24h to permanent. Delete any existing licenses on server startup Signed-off-by: dereknola <derek.nola@suse.com>	2021-06-15 11:42:52 -07:00
Brian Downs	88f95ec409	Send systemd notifications for both server and agent (#3430 ) * update agent to sent systemd notify after everything starts	2021-06-15 04:20:26 -07:00
Brad Davidson	a7d1159ba6	Emit events for AddOn lifecycle Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-06-11 14:00:27 -07:00
Brad Davidson	ea2cd6d727	Add comments, clean up imports and function names Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-06-11 14:00:27 -07:00
Brad Davidson	6e48ca9b53	Tidy up function calls with many args Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-06-11 14:00:27 -07:00
Brad Davidson	6ef000091a	Add nodename to UA string for deploy controller Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-06-10 17:05:52 -07:00
Brad Davidson	f6cec4e75d	Add kubernetes.default.svc to serving certs Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-06-08 12:55:20 -07:00
Manuel Buil	243fd14cf1	Change Replace with ReplaceAll function strings has a specific function to replace all matches. We should use that one instead of strings.Replace(string, old, new string, -1) Signed-off-by: Manuel Buil <mbuil@suse.com>	2021-06-07 09:52:26 +02:00
Brian Downs	afd506a595	fix possible race where bootstrap data might not save Signed-off-by: Brian Downs <brian.downs@gmail.com>	2021-06-04 15:05:47 -07:00
Brian Downs	2682183773	add log message indicating etcd snapshots are disabled Signed-off-by: Brian Downs <brian.downs@gmail.com>	2021-06-04 09:18:16 -07:00
Derek Nola	664a98919b	Fix RBAC cloud-controller-manager name 3308 (#3388 ) * Changed cloud-controller-manager user name in ccm.yaml Signed-off-by: dereknola <derek.nola@suse.com> * Changed RBAC name in server.go Signed-off-by: dereknola <derek.nola@suse.com> * Changed "k3s" string prefix to version.Program to prevent static hardcoding Signed-off-by: dereknola <derek.nola@suse.com> * Changed user in ccm.yaml to k3s-cloud-controller-manager Signed-off-by: dereknola <derek.nola@suse.com>	2021-06-02 14:50:11 -07:00
Manuel Buil	5153088286	Merge pull request #3385 from manuelbuil/wireguard-fix Move wireguard's privatekey to flannel config directory	2021-06-02 09:44:27 +02:00
Manuel Buil	1576030d6b	Add a path for wireguard's privatekey Signed-off-by: Manuel Buil <mbuil@suse.com>	2021-06-01 21:54:17 +02:00
Jamie Phillips	7345ac35ae	Initial windows support for agent (#3375 ) Signed-off-by: Jamie Phillips <jamie.phillips@suse.com>	2021-06-01 12:29:46 -07:00
Brian Downs	ecbf17e2ed	move object channel defer close to goroutine Signed-off-by: Brian Downs <brian.downs@gmail.com>	2021-05-18 19:58:30 -07:00
Brian Downs	254b52077e	add retention default and wire in s3 prune Signed-off-by: Brian Downs <brian.downs@gmail.com>	2021-05-18 13:57:40 -07:00
Brad Davidson	7e175e8ad4	Handle conntrack-related sysctls in supervisor agent setup Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-05-18 13:40:44 -07:00
Brian Downs	e8ecc00fc8	add etcd snapshot save subcommand Signed-off-by: Brian Downs <brian.downs@gmail.com>	2021-05-17 10:55:13 -07:00
Brian Downs	6ee28214fa	Add the ability to prune etcd snapshots (#3310 ) * add prune subcommand to force rentention policy enforcement	2021-05-13 13:36:33 -07:00
Brad Davidson	079620ded0	Fix passthrough of SystemDefaultRegistry from server config Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-05-13 02:18:09 -07:00
MonzElmasry	24474c5734	change --disable-apiserver flag Signed-off-by: MonzElmasry <menna.elmasry@rancher.com>	2021-05-13 00:00:11 +02:00
Brad Davidson	e10524a6b1	Add executor.Bootstrap hook for pre-execution setup Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-05-11 18:46:15 -07:00
Brian Downs	bcd8b67db4	Add the ability to list etcd snapshots (#3303 ) * add ability to list local and s3 etcd snapshots	2021-05-11 16:59:33 -07:00
Brad Davidson	02a5bee62f	Add system-default-registry support and remove shared code (#3285 ) * Move registries.yaml handling out to rancher/wharfie * Add system-default-registry support * Add CLI support for kubelet image credential providers Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-05-10 15:58:41 -07:00
Hussein Galal	948295e8e8	Fix cluster restoration in rke2 (#3295 ) Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-05-11 00:06:33 +02:00
Brad Davidson	fc037e87f8	Use config file values in node-args annotation Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-05-10 14:08:02 -07:00
Brian Downs	e998cd110d	Add the ability to delete an etcd snapshot locally or from S3 (#3277 ) * Add the ability to delete a given set of etcd snapshots from the CLI for locally stored and S3 store snapshots.	2021-05-07 16:10:04 -07:00
Siegfried Weber	e77fd18270	Sign CSRs for kubelet-serving with the server CA Problem: Only the client CA is passed to the kube-controller-manager and therefore CSRs with the signer name "kubernetes.io/kubelet-serving" are signed with the client CA. Serving certificates must be signed with the server CA otherwise e.g. "kubectl logs" fails with the error message "x509: certificate signed by unknown authority". Solution: Instead of providing only one CA via the kube-controller-manager parameter "--cluster-signing-cert-file", the corresponding CA for every signer is set with the parameters "--cluster-signing-kube-apiserver-client-cert-file", "--cluster-signing-kubelet-client-cert-file", "--cluster-signing-kubelet-serving-cert-file", and "--cluster-signing-legacy-unknown-cert-file". Signed-off-by: Siegfried Weber <mail@siegfriedweber.net>	2021-05-05 15:59:57 -07:00
Hussein Galal	f410fc7d1e	Invoke cluster reset function when only reset flag is passed (#3276 ) Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-05-05 17:40:04 +02:00
Brian Downs	beb0d8397a	reference node name when needed Signed-off-by: Brian Downs <brian.downs@gmail.com>	2021-05-04 10:03:28 -07:00
Brian Downs	c5ad71ce0b	Collect and Store etcd Snapshots and Metadata (#3239 ) * Add the ability to store local etcd snapshots and etcd snapshots stored in an S3 compatible object store in a ConfigMap.	2021-04-30 18:26:39 -07:00
Hussein Galal	2db3bf7a89	Export CriConnection function (#3225 ) Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-04-29 22:11:19 +02:00
Brad Davidson	3cb4ca4b35	Use same SANs on ServingKubeAPICert as dynamiclistener The kube-apiserver cert should have the same SANs in the same order, excluding the extra user-configured SANs since this will only be used in-cluster. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-04-28 09:58:19 -07:00
Darren Shepherd	8f1a20c0d3	Add ability to append to slice during config file merge If key ends in "+" the value of the key is appended to previous values found. If values are string instead of a slice they are automatically converted to a slice of one string. Signed-off-by: Darren Shepherd <darren@rancher.com>	2021-04-27 15:59:03 -07:00
Brad Davidson	2705431d96	Add support for dual-stack Pod/Service CIDRs and node IP addresses (#3212 ) * Add support for dual-stack cluster/service CIDRs and node addresses Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-04-21 15:56:20 -07:00
Darren Shepherd	a0a1071aa5	Support .d directory for k3s config file (#3162 ) Configuration will be loaded from config.yaml and then config.yaml.d/*.(yaml\|yml) in alphanumeric order. The merging is done by just taking the last value of a key found, so LIFO for keys. Slices are not merged but replaced. Signed-off-by: Darren Shepherd <darren@rancher.com>	2021-04-15 11:29:24 -07:00
Brad Davidson	601c4984f5	Fix service-account-issuer Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-04-14 14:51:42 -07:00
Brad Davidson	e8381db778	Update Kubernetes to v1.21.0 * Update Kubernetes to v1.21.0 * Update to golang v1.16.2 * Update dependent modules to track with upstream * Switch to upstream flannel * Track changes to upstream cloud-controller-manager and FeatureGates Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-04-14 14:51:42 -07:00
Brian Downs	66ed6efd57	Resolve local retention issue when S3 in use. Remove early return preventing local retention policy to be enforced resulting in N number of snapshots being stored. Signed-off-by: Brian Downs <brian.downs@gmail.com>	2021-04-14 10:40:08 -07:00
Brian Downs	80e4baf525	add hidden attribute to disable flags Signed-off-by: Brian Downs <brian.downs@gmail.com>	2021-04-13 14:30:47 -07:00
Brian Downs	d9381b84ad	add etcd s3 secret and access key flags and env vars to secret data Signed-off-by: Brian Downs <brian.downs@gmail.com>	2021-04-12 14:47:16 -07:00
Brian Downs	693c5290b1	Update CoreDNS to version 1.8.3. (#3168 ) * update CoreDNS to 1.8.3 Rerun go generate and update the CoreDNS RBAC	2021-04-09 16:47:16 -07:00
Brian Downs	ad4f04d2fc	Merge pull request #3155 from briandowns/rke2-issue-856 remove hidden attribute from cluster flags and related code	2021-04-09 12:55:27 -07:00
Erik Wilson	9a53fca872	Bump traefik to v2.4.8 Signed-off-by: Erik Wilson <Erik.E.Wilson@gmail.com>	2021-04-08 17:42:58 -07:00
Brad Davidson	58e93feda6	Fix CI failures non-deterministic traefik chart repackaging (#3165 ) * Fix CI failures non-deterministic traefik chart repackaging * Update generated bindata Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-04-08 15:33:15 -07:00
Brian Downs	4a49b9e40b	delete nocluster file and remove build tag Signed-off-by: Brian Downs <brian.downs@gmail.com>	2021-04-07 12:16:28 -07:00
Brian Downs	3ed9b0a997	remove hidden attribute from cluster flags and related code Signed-off-by: Brian Downs <brian.downs@gmail.com>	2021-04-07 11:36:02 -07:00
Xiao Deshi	cfe7e0c734	remove duplicated func GetAddresses refactor tunnel.go and controller.go, remove duplicated lines. Signed-off-by: Xiao Deshi <xiaods@gmail.com>	2021-03-31 14:23:05 -07:00
Akihiro Suda	cb73461a5b	AkihiroSuda/containerd-fuse-overlayfs -> containerd/fuse-overlayfs-snapshotter The repo has been moved. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-03-24 10:34:34 -07:00
Akihiro Suda	e672c988e4	rootless: allow kernel.dmesg_restrict=1 When `/dev/kmsg` is unreadable due to sysctl value `kernel.dmesg_restrict=1`, bind-mount `/dev/null` into `/dev/kmsg` Fix issue 3011 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-03-24 01:03:14 -07:00
Akihiro Suda	6e8284e3d4	rootless: enable resource limitation (requires cgroup v2, systemd) Now rootless mode can be used with cgroup v2 resource limitations. A pod is executed in a cgroup like "/user.slice/user-1001.slice/user@1001.service/k3s-rootless.service/kubepods/podd0eb6921-c81a-4214-b36c-d3b9bb212fac/63b5a253a1fd4627da16bfce9bec58d72144cf30fe833e0ca9a6d60ebf837475". This is accomplished by running `kubelet` in a cgroup namespace, and enabling `cgroupfs` driver for the cgroup hierarchy delegated by systemd. To enable cgroup v2 resource limitation, `k3s server --rootless` needs to be launched as `systemctl --user` service. Please see the comment lines in `k3s-rootless.service` for the usage. Running `k3s server --rootless` via a terminal is not supported. When it really needs to be launched via a terminal, `systemd-run --user -p Delegate --tty` needs to be prepended to create a systemd scope. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-03-24 00:37:30 -07:00
Akihiro Suda	11ef43011a	bump up RootlessKit Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-03-24 00:37:30 -07:00
Brian Downs	400a632666	put etcd bootstrap save call in goroutine and update comment Signed-off-by: Brian Downs <brian.downs@gmail.com>	2021-03-17 14:33:00 -07:00
Hussein Galal	73df65d93a	remove etcd data dir when etcd is disabled (#3059 ) * remove etcd data dir when etcd is disabled Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix comment Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * use debug instead of info logs Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-03-16 18:14:43 +02:00
Jacob Blain Christen	618b0f98bf	registry mirror repository rewrites (#3064 ) Support repository regex rewrite rules when fetching image content. Example configuration: ```yaml # /etc/rancher/k3s/registries.yaml mirrors: "docker.io": endpoint: - "https://registry-1.docker.io/v2" rewrite: "^library/alpine$": "my-org/alpine" ``` This will instruct k3s containerd to fetch content for `alpine` images from `docker.io/my-org/alpine` instead of the default `docker.io/library/alpine` locations. Signed-off-by: Jacob Blain Christen <jacob@rancher.com>	2021-03-15 16:17:27 -07:00
Brian Downs	7c99f8645d	Have Bootstrap Data Stored in etcd at Completed Start (#3038 ) * have state stored in etcd at completed start and remove unneeded code	2021-03-11 13:07:40 -07:00
Chris Kim	69f96d6225	Define a Controllers and LeaderControllers on the server config (#3043 ) Signed-off-by: Chris Kim <oats87g@gmail.com>	2021-03-11 10:39:00 -08:00
Brad Davidson	8ace8975d2	Don't start up multiple apiserver load balancers get() is called in a loop until client configuration is successfully retrieved. Each iteration will try to configure the apiserver proxy, which will in turn create a new load balancer. Skip creating a new load balancer if we already have one. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-03-08 17:05:25 -08:00
Brad Davidson	c0d129003b	Handle loadbalancer port in TIME_WAIT If the port wanted by the client load balancer is in TIME_WAIT, startup will fail. Set SO_REUSEPORT so that it can be listened on again immediately. The configurable Listen call wants a context, so plumb that through as well. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-03-08 17:05:25 -08:00
Brad Davidson	7cdfaad6ce	Always use static ports for client load-balancers (#3026 ) * Always use static ports for the load-balancers This fixes an issue where RKE2 kube-proxy daemonset pods were failing to communicate with the apiserver when RKE2 was restarted because the load-balancer used a different port every time it started up. This also changes the apiserver load-balancer port to be 1 below the supervisor port instead of 1 above it. This makes the apiserver port consistent at 6443 across servers and agents on RKE2. Additional fixes below were required to successfully test and use this change on etcd-only nodes. * Actually add lb-server-port flag to CLI * Fix nil pointer when starting server with --disable-etcd but no --server * Don't try to use full URI as initial load-balancer endpoint * Fix etcd load-balancer pool updates * Update dynamiclistener to fix cert updates on etcd-only nodes * Handle recursive initial server URL in load balancer * Don't run the deploy controller on etcd-only nodes	2021-03-06 02:29:57 -08:00
Hussein Galal	c26b737b24	Mark disable components flags as experimental (#3018 ) Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-03-05 00:05:20 +02:00
Brian Downs	4d1f9eda9d	Etcd Snapshot/Restore to/from S3 Compatible Backends (#2902 ) * Add functionality for etcd snapshot/restore to and from S3 compatible backends. * Update etcd restore functionality to extract and write certificates and configs from snapshot.	2021-03-03 11:14:12 -07:00
Hussein Galal	1bf04b6a50	Merge pull request #3003 from galal-hussein/fix_etcd_only_nodes Fix etcd only nodes	2021-03-02 02:16:02 +02:00
Brad Davidson	4fb073e799	Log clearer error on startup if NPC cannot be started Servers should always be upgraded before agents, but generally this isn't required because things are compatible between versions. In this case we're OK with failing closed if the user upgrades out of order, but we should give a clearer message about what steps are required to fix the issue. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-03-01 14:23:59 -08:00
galal-hussein	ef999f0b4f	change error to warn when removing self from etcd members Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-03-02 00:19:57 +02:00
galal-hussein	d6124981d5	remove etcd member if disable etcd is passed Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-03-01 23:50:50 +02:00
Erik Wilson	4e5218b62c	Apply suggestions from code review Logging cleanup Co-authored-by: Brad Davidson <brad@oatmail.org>	2021-03-01 10:44:24 -07:00
Erik Wilson	4aac6b6bd0	Update to Traefik 2.4.2 and combine manifests	2021-03-01 10:44:24 -07:00
Erik Wilson	54a35505f0	Remove Traefik v1 migration	2021-03-01 10:44:24 -07:00
Chin-Ya Huang	cc96f8140a	Allow download traefik static file and rename Allow writing static files regardless of the version. Signed-off-by: Chin-Ya Huang <chin-ya.huang@suse.com>	2021-03-01 10:44:24 -07:00
Chin-Ya Huang	10e0328977	Traefik v2 integration K3s upgrade via watch over file change of static file and manifest and triggers helm-controller for change. It seems reasonable to only allow upgrade traefik v1->v2 when there is no existing custom traefik HelmChartConfig in the cluster to avoid any incompatibility. Here also separate the CRDs and put them into a different chart to support CRD upgrade. Signed-off-by: Chin-Ya Huang <chin-ya.huang@suse.com>	2021-03-01 10:44:23 -07:00
Brad Davidson	f970e49b7d	Wait for apiserver to become healthy before starting agent controllers It is possible that the apiserver may serve read requests but not allow writes yet, in which case flannel will crash on startup when trying to configure the subnet manager. Fix this by waiting for the apiserver to become fully ready before starting flannel and the network policy controller. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-26 19:28:53 -08:00
Brad Davidson	9b39c1c117	Hide the airgap-extra-registry flag Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-26 16:08:49 -08:00
Brad Davidson	88dd601941	Limit zstd decoder memory Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-17 11:48:03 -08:00
Brad Davidson	ae5b93a264	Use HasSuffixI utility function Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-17 11:48:03 -08:00
Brad Davidson	ec661c67d7	Add support for retagging images on load from tarball Adds support for retagging images to appear to have been sourced from one or more additional registries as they are imported from the tarball. This is intended to support RKE2 use cases with system-default-registry where the images need to appear to have been pulled from a registry other than docker.io. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-17 11:48:03 -08:00
Hussein Galal	5749f66aa3	Add disable flags for control components (#2900 ) * Add disable flags to control components Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * golint Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fixes to disable flags Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * Add comments to functions Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * Fix joining problem Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * golint Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix ticker Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix role labels Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-02-12 17:35:57 +02:00
Brian Downs	21d1690d5d	update usage text (#2926 ) update to the --cluster-init usage flag to indicate it's for Etcd	2021-02-10 15:54:04 -07:00
Brad Davidson	6e768c301e	Use appropriate response codes for authn/authz failures Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-09 16:28:20 -08:00
Brad Davidson	374271e9a0	Collect IPs from all pods before deciding to use internal or external addresses (#2909 ) * Collect IPs from all pods before deciding to use internal or external addresses @Taloth correctly noted that the code that iterates over ServiceLB pods to collect IP addresses was failing to add additional internal IPs once the map contained ANY entry from a previous node. This may date back to when ServiceLB used a Deployment instead of a DaemonSet, so there was only ever a single pod. The new behavior is to collect all internal and external IPs, and then construct the address list of a single type - external if there are any, otherwise internal. https://github.com/k3s-io/k3s/issues/1652#issuecomment-774497788 Signed-off-by: Brad Davidson <brad.davidson@rancher.com> Co-authored-by: Brian Downs <brian.downs@gmail.com>	2021-02-09 16:26:57 -08:00
Brad Davidson	e06119729b	Improve handling of comounted cpu,cpuacct controllers (#2911 ) * Improve handling of comounted cpu,cpuacct controllers Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-09 16:12:58 -08:00
Brad Davidson	ad5e504cf0	Allow joining clusters when the server CA is trusted by the OS CA bundle (#2743 ) * Add tests to clientaccess/token * Fix issues in clientaccess/token identified by tests * Update tests to close coverage gaps * Remove redundant check turned up by code coverage reports * Add warnings if CA hash will not be validated Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-08 22:28:57 -08:00
Brad Davidson	6c472b5942	Use zstd instead of gzip for embedded tarball Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-08 21:08:35 -08:00
Brad Davidson	c5e2676d5c	Update local-path-provisioner and helper busybox (#2885 ) * Update local-path-provisioner and helper busybox Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-04 10:49:25 -08:00
Brad Davidson	65c78cc397	Replace options.KubeRouterConfig with config.Node and remove metrics/waitgroup stuff Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-03 10:41:51 -08:00
Brad Davidson	07256cf7ab	Add ServiceIPRange and ServiceNodePortRange to agent config Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-03 10:41:51 -08:00
Brad Davidson	95a1a86847	Spell check upstream code Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-03 10:41:51 -08:00
Brad Davidson	29483d0651	Initial update of netpol and utils from upstream Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-03 10:41:51 -08:00
Akihiro Suda	f3c41b7650	fix cgroup2 support Fix issue 900 cgroup2 support was introduced in PR 2584, but got broken in `f3de60ff31` It was failing with "F1210 19:13:37.305388 4955 server.go:181] cannot set feature gate SupportPodPidsLimit to false, feature is locked to true" Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-01-25 22:45:07 -08:00
Akihiro Suda	728ebcc027	rootless: remove rootful /run/{netns,containerd} symlinks Since a recent commit, rootless mode was failing with the following errors: ``` E0122 22:59:47.615567 21 kuberuntime_manager.go:755] createPodSandbox for pod "helm-install-traefik-wf8lc_kube-system(9de0a1b2-e2a2-4ea5-8fb6-22c9272a182f)" failed: rpc error: code = Unknown desc = failed to create network namespace for sandbox "285ab835609387f82d304bac1fefa5fb2a6c49a542a9921995d0c35d33c683d5": failed to setup netns: open /var/run/netns/cni-c628a228-651e-e03e-d27d-bb5e87281846: permission denied ... E0122 23:31:34.027814 21 pod_workers.go:191] Error syncing pod 1a77d21f-ff3d-4475-9749-224229ddc31a ("coredns-854c77959c-w4d7g_kube-system(1a77d21f-ff3d-4475-9749-224229ddc31a)"), skipping: failed to "CreatePodSandbox" for "coredns-854c77959c-w4d7g_kube-system(1a77d21f-ff3d-4475-9749-224229ddc31a)" with CreatePodSandboxError: "CreatePodSandbox for pod \"coredns-854c77959c-w4d7g_kube-system(1a77d21f-ff3d-4475-9749-224229ddc31a)\" failed: rpc error: code = Unknown desc = failed to create containerd task: io.containerd.runc.v2: create new shim socket: listen unix /run/containerd/s/8f0e40e11a69738407f1ebaf31ced3f08c29bb62022058813314fb004f93c422: bind: permission denied\n: exit status 1: unknown" ``` Remove symlinks to /run/{netns,containerd} so that rootless mode can create their own /run/{netns,containerd}. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-01-22 19:51:43 -08:00
Brad Davidson	071de833ae	Fix typo in field tag Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-01-22 19:38:37 -08:00
Brad Davidson	8011697175	Only container-runtime-endpoint wants RuntimeSocket path as URI Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-01-22 18:56:30 -08:00
Yuriy	06fda7accf	Add functionality to bind custom IP address for Etcd metrics endpoint (#2750 ) * Add functionality to bind custom IP address for Etcd metrics endpoint Signed-off-by: yuriydzobak <yurii.dzobak@lotusflare.com>	2021-01-22 17:40:48 -08:00
Brad Davidson	f152f656a0	Replace k3s cloud provider wrangler controller with core node informer (#2843 ) * Replace k3s cloud provider wrangler controller with core node informer Upstream k8s has exposed an interface for cloud providers to access the cloud controller manager's node cache and shared informer since Kubernetes 1.9. This is used by all the other in-tree cloud providers; we should use it too instead of running a dedicated wrangler controller. Doing so also appears to fix an intermittent issue with the uninitialized taint not getting cleared on nodes in CI. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-01-22 16:59:48 -08:00
Brian Downs	13229019f8	Add ability to perform an etcd on-demand snapshot via cli (#2819 ) * add ability to perform an etcd on-demand snapshot via cli	2021-01-21 14:09:15 -07:00
Waqar Ahmed	3ea696815b	Do not validate snapshotter argument if docker is enabled Problem: While using ZFS on debian and K3s with docker, I am unable to get k3s working as the snapshotter value is being validated and the validation fails. Solution: We should not validate snapshotter value if we are using docker as it's a no-op in that case. Signed-off-by: Waqar Ahmed <waqarahmedjoyia@live.com>	2021-01-20 12:25:28 -08:00
Erik Wilson	c71060f288	Merge pull request #2744 from erikwilson/rke2-node-password-bootstrap Bootstrap node password with local file	2021-01-11 09:51:30 -07:00
MonzElmasry	86f68d5d62	change etcd dir permission if it exists Signed-off-by: MonzElmasry <menna.elmasry@rancher.com>	2021-01-08 23:47:36 +02:00
Erik Wilson	4245fd7b67	Return http.StatusOK instead of 0 Signed-off-by: Erik Wilson <Erik.E.Wilson@gmail.com>	2020-12-23 16:55:47 -07:00
Erik Wilson	2fb411fc83	Fix spelling mistake Signed-off-by: Erik Wilson <Erik.E.Wilson@gmail.com>	2020-12-23 15:08:07 -07:00
Erik Wilson	09eb44ba53	Bootstrap node password with local file Signed-off-by: Erik Wilson <Erik.E.Wilson@gmail.com>	2020-12-23 15:08:06 -07:00
JenTing Hsiao	57041f0239	Add codespell CI test and fix codespell error (#2740 ) * Add codespell CI test * Fix codespell error	2020-12-22 12:35:58 -08:00
Brad Davidson	8936cf577f	Bump coredns to 1.8.0 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-12-17 15:20:19 -08:00
Chris Kim	332fd73d46	Add support for both config-file and data-dir at a global level in the self-extracting wrapper for K3s (#2594 ) * Add support for both config-file and data-dir at a global level in the self-extracting wrapper for K3s Signed-off-by: Chris Kim <oats87g@gmail.com>	2020-12-16 09:27:57 -08:00
Erik Wilson	1230d7b7df	Fix HA server initialization Signed-off-by: Erik Wilson <Erik.E.Wilson@gmail.com>	2020-12-15 16:08:28 -08:00
Brad Davidson	8e4d3e645b	Restore legacy master role for etcd nodes Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-12-15 15:15:46 -08:00
Chris Kim	61ef2ce95e	use version.Program Signed-off-by: Chris Kim <oats87g@gmail.com>	2020-12-09 12:34:13 -08:00
Chris Kim	48925fcb88	Simplify checkCgroups function call Co-authored-by: Brian Downs <brian.downs@gmail.com>	2020-12-09 11:59:54 -08:00
Chris Kim	a3f87a81bd	Independently set kubelet-cgroups and runtime-cgroups, and detect if we are running under a systemd scope Signed-off-by: Chris Kim <oats87g@gmail.com>	2020-12-09 11:39:33 -08:00
Brad Davidson	c5aad1b5ed	Disable the ServiceAccountIssuerDiscovery feature-gate. We're not setting ``--service-account-issuer` to a https URL, which causes an error message at startup when the feature gate is enabled. From the docs on that flag: > If this option is not a valid URI per the OpenID Discovery 1.0 spec, the > ServiceAccountIssuerDiscovery feature will remain disabled, even if the > feature gate is set to true. It is highly recommended that this value > comply with the OpenID spec: > https://openid.net/specs/openid-connect-discovery-1_0.html. In practice, > this means that service-account-issuer must be an https URL. It is also > highly recommended that this URL be capable of serving OpenID discovery > documents at {service-account-issuer}/.well-known/openid-configuration. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-12-08 22:51:34 -08:00
Brad Davidson	63f2211b31	deprecate the "node-role.kubernetes.io/master" label / taint Related to https://github.com/kubernetes/kubernetes/pull/95382 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-12-08 22:51:34 -08:00
Brad Davidson	c6950d2cb0	Update Kubernetes to v1.20.0-k3s1 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-12-08 22:51:34 -08:00
Brad Davidson	cd27c6fcbe	Bump coredns to 1.7.1 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-12-08 15:58:17 -08:00
Erik Wilson	0ae7f2d5ae	Merge pull request #2407 from erikwilson/node-passwd-cleanup Use secrets for node-passwd entries	2020-12-08 16:25:13 -07:00
Chris Kim	3d1e40eaa3	Handle the case when systemd lives under `/init.scope` Signed-off-by: Chris Kim <oats87g@gmail.com>	2020-12-08 10:26:54 -08:00
Chris Kim	e71e11fed0	Merge pull request #2642 from Oats87/issues/k3s/2548-cgroup Set a cgroup if containerized	2020-12-08 10:05:21 -08:00
Chris Kim	f3de60ff31	When there is a defined cgroup for PID 1, assume we are containerized and set a root Signed-off-by: Chris Kim <oats87g@gmail.com>	2020-12-07 13:15:15 -08:00
Hussein Galal	fadc5a8057	Add tombstone file to etcd and catch errc etcd channel (#2592 ) * Add tombstone file to embedded etcd Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go mod update Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more changes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * gofmt and goimports Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go mod update Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go lint Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go lint Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go mod tidy Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2020-12-07 22:30:44 +02:00
Chin-Ya Huang	3f0f2b342e	Show go version when executes with --version. Signed-off-by: Chin-Ya Huang <chin-ya.huang@suse.com>	2020-12-04 12:51:15 -08:00
transhapHigsn	87a43c69e1	Problem: CoreDNS getting preempted by other pods Solution: Set priorityClassName to system-node-critical of traefik, metrics-server, local storage and coredns deployment Signed-off-by: transhapHigsn <fet.prashantsingh@gmail.com>	2020-12-04 12:50:12 -08:00
Akihiro Suda	eb72d509ce	pkg/agent/config: validate containerd snapshotter value Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-12-01 11:00:00 -08:00
Akihiro Suda	05f6255437	add fuse-overlayfs snapshotter (mainly for rootless mode) Ubuntu and Debian kernels support mounting real overlayfs inside userns, but the vanilla kernel still does not allow it. OTOH fuse-overlayfs can be mounted inside userns with the vanilla kernel (>= 4.18). Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-12-01 11:00:00 -08:00
Akihiro Suda	43f7eaedf8	rootless: fix "stat /run/user/1000: no such file or directory" on `kubectl run` k3s was mounting a tmpfs on `/run` by itself, so it was hiding RootlessKit's `/run`. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-12-01 10:31:21 -08:00
Akihiro Suda	67410d2757	rootless: validate sysctl before starting up Fix #2420 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-12-01 09:21:39 -08:00
Jacob Blain Christen	3647654fe4	[migration k3s-io] update helm-controller dependency (#2569 ) rancher/helm-controller ➡️ k3s-io/helm-controller Part of https://github.com/rancher/k3s/issues/2189 Signed-off-by: Jacob Blain Christen <jacob@rancher.com>	2020-12-01 08:59:10 -07:00
Akihiro Suda	0b45e32486	Support cgroup v2 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-11-30 22:57:37 -08:00
Jacob Blain Christen	36230daa86	[migration k3s-io] update kine dependency (#2568 ) rancher/kine ➡️ k3s-io/kine Part of https://github.com/rancher/k3s/issues/2189 Signed-off-by: Jacob Blain Christen <jacob@rancher.com>	2020-11-30 16:45:22 -07:00
Brad Davidson	b873d3a03b	Explicitly set agent paths within --data-dir Removing the cfg.DataDir mutation in `3e4fd7b` did not break anything, but did change some paths in unwanted ways. Rather than mutating the user-supplied command-line flags, explicitly specify the agent subdirectory as needed. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-11 09:26:41 -08:00
Brad Davidson	58b5b21f0d	Don't pass cloud-provider flag to controller-manager As per documentation, the cloud-provider flag should not be passed to controller-manager when using cloud-controller. However, the legacy cloud-related controllers still need to be explicitly disabled to prevent errors from being logged. Fixing this also prevents controller-manager from creating the cloud-controller-manager service account that needed extra RBAC. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-09 13:55:09 -08:00
Brad Davidson	3e4fd7b41f	Respect --data-dir path for crictl.yaml Related to rancher/rke2#474 Note that anyone who customizes the data-dir path will have to set CRI_CONFIG_FILE to the correct path when using the wrapped binaries (crictl, etc). This is better than dropping files in the incorrect location. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-05 15:51:10 -08:00
Brad Davidson	f50e3140f9	Disable configure-cloud-routes and external service/route programming support when using k3s stub cloud controller Resolves warning 3 from #2471 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-05 15:51:10 -08:00
Brad Davidson	31575e407a	Add Cluster ID support to k3s stub cloud controller Resolves warning 2 from #2471. As per https://github.com/kubernetes/cloud-provider/issues/12 the ClusterID requirement was never really followed through on, so the flag is probably going to be removed in the future. One side-effect of this is that the core k8s cloud-controller-manager also wants to watch nodes, and needs RBAC to do so. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-05 15:51:10 -08:00
Brad Davidson	5b318d093f	Fix containerd sock path warning Resolves warning 1 from #2471 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-05 15:51:10 -08:00
Brad Davidson	d1424626ac	Disable containerd experimental snapshot labels Related to #2455 and containerd/containerd#4684 These were not meant to be enabled by default, break images with many layers, and will be disabled by default on the next containerd release. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-05 15:51:10 -08:00
Erik Wilson	992ca52c31	Enable go test in ci	2020-11-05 09:48:53 -07:00
Erik Wilson	92d04355f4	Use secrets for node-passwd entries and cleanup	2020-11-05 09:48:53 -07:00
Brad Davidson	3b8ec74049	Update disables list when building with no_stage The --disable/--no-deploy flags actually turn off some built-in controllers, in addition to preventing manifests from getting loaded. Make it clear which controllers can still be disabled even when the packaged components are ommited by the no_stage build tag. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-04 13:39:45 -08:00
Menna Elmasry	523ccaf3f2	Merge pull request #2448 from MonzElmasry/new_b Make etcd use node private ip	2020-10-29 00:23:56 +02:00
MonzElmasry	e8436cc76b	Make etcd use node private ip Signed-off-by: MonzElmasry <menna.elmasry@rancher.com>	2020-10-28 23:45:24 +02:00
Chris Kim	7b8a147a1b	Merge pull request #2408 from Oats87/rpm-install-selinux Add auto-install capability to install.sh for k3s-selinux	2020-10-28 14:24:09 -04:00
Hussein Galal	fcd18d1b6e	skip node delete from removed member (#2413 ) * skip node delete from removed member Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * use grpc errors Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go imports Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * exit if node is the etcd that being removed Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2020-10-28 18:32:51 +02:00
Chris Kim	96fc4c4b21	Add iptable_nat to modprobe list Signed-off-by: Chris Kim <oats87g@gmail.com>	2020-10-27 14:22:14 -04:00
Brad Davidson	de18528412	Make etcd voting members responsible for managing learners (#2399 ) * Set etcd timeouts using values from k8s instead of etcdctl Fix for one of the warnings from #2303 * Use etcd zap logger instead of deprecated capsnlog Fix for one of the warnings from #2303 * Remove member self-promotion code paths * Add learner promotion tracking code * Fix RaftAppliedIndex progress check * Remove ErrGRPCKeyNotFound check This is not used by v3 API - it just returns a response with 0 KVs. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-10-27 11:06:26 -07:00
Erik Wilson	6b11d86037	Merge pull request #2377 from erikwilson/no-proxy-fix Use no_proxy env, add .svc and cluster domains	2020-10-12 13:46:22 -07:00
Erik Wilson	56e077eb29	Use no_proxy env, add .svc and cluster domains	2020-10-12 11:02:07 -07:00
Erik Wilson	114b5ccad1	Merge pull request #2363 from erikwilson/netpol-informers Add event handlers to network policy controller	2020-10-12 08:53:39 -07:00
Erik Wilson	e26e333b7e	Add network policy controller CacheSyncOrTimeout	2020-10-07 12:35:44 -07:00
Erik Wilson	045cd49ab5	Add event handlers to network policy controller	2020-10-07 12:10:27 -07:00
Erik Wilson	ce0da0a0f4	Add file verification for data directory	2020-10-06 10:29:27 -07:00
Erik Wilson	66d29148f7	Add Release function for flock	2020-10-06 10:29:27 -07:00
Erik Wilson	360d82d20e	Add flock from k8s.io/kubernetes/pkg/util/flock	2020-10-06 10:29:26 -07:00
Brad Davidson	c3c983198f	Add temporary fix for issue with interrupted etcd promote This is a minimal fix for https://github.com/rancher/rke2/issues/392 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-30 11:45:58 -07:00
Hussein Galal	373449ec0a	Allow for multiple etcd snapshot restoration (#2307 ) * add reset tmp file Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go imports Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix multiple lines string Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix typo Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * use resetFile function Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2020-09-30 02:53:31 +02:00
Brad Davidson	8262e23169	Revert removal of EndpointName hooks (#2319 ) * Revert "Remove dead EndpointName code" This reverts commit `8025da5a8d`. * Fix docstrings based on proper understanding of use	2020-09-28 18:13:55 -07:00
Brad Davidson	360b0f1ee5	Add timeout to clientaccess http client The default http client does not have an overall request timeout, so connections to misbehaving or unavailable servers can stall for an excessive amount of time. At the moment, just attempting to join an unavailable cluster takes 2 minutes and 40 seconds to timeout. Resolve that by setting a reasonable request timeout. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:26:27 -07:00
Brad Davidson	cdfc6cfa1a	Split clientaccess token/kubeconfig code Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:26:27 -07:00
Brad Davidson	45dd4afe50	Simplify token parsing Improves readability, reduces round-trips to the join server to validate certs. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:26:24 -07:00
Brad Davidson	9074da7405	Fix misc nits and missing/unused imports Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:10:00 -07:00
Brad Davidson	703ba5cde7	Add a bunch of doc comments Also change identical error messages to clarify where problems are occurring. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:10:00 -07:00
Brad Davidson	ae916c2dec	Use const for kube-system namespace Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:10:00 -07:00
Brad Davidson	f59e8fc21b	Fix etcd directory permissions Silences warning on startup about insecure directory permissions Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:10:00 -07:00
Brad Davidson	ee99660a96	Rename etcd directory helpers to reduce confusion about which datadir we're talking about Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:10:00 -07:00
Brad Davidson	8025da5a8d	Remove dead EndpointName code According to @galal-hussein this is dead code that was probably brought over from Kine. I certainly couldn't figure out what it is supposed to be doing. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:10:00 -07:00
Brad Davidson	97eb28a01a	Remove unnecessary listener arg from managed DB setup Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:09:45 -07:00
Brad Davidson	a3bbd58f37	Fix managed etcd cold startup deadlock issue #2249 We should ignore --token and --server if the managed database is initialized, just like we ignore --cluster-init. If the user wants to join a new cluster, or rejoin a cluster after --cluster-reset, they need to delete the database. This a cleaner way to prevent deadlocking on quorum loss, and removes the requirement that the target of the --server argument must be online before already joined nodes can start. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 02:44:49 -07:00
Brad Davidson	42bba04651	Skip etcd snapshots if the local endpoint is still a learner (#2295 ) * Don't take snapshots if the local endpoint is still a learner * Configure timeouts for etcd client dialer	2020-09-21 20:23:18 -07:00
Brian Downs	ba70c41cce	Initial Logging Output Update (#2246 ) This attempts to update logging statements to make them consistent through out the code base. It also adds additional context to messages where possible, simplifies messages, and updates level where necessary.	2020-09-21 09:56:03 -07:00
Hussein Galal	46fe57d7e9	reset etcd name on cluster reset (#2284 ) * reset etcd name on cluster reset Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * gofmt Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2020-09-19 03:09:36 +02:00
Brad Davidson	8c6d3567fe	Rename k3s-controller based on the build-time program name Since we're replacing the k3s rolebindings.yaml in rke2, we should allow renaming this so that we can use the white-labeled name downstream. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-16 10:53:07 -07:00
Brad Davidson	ae5519c047	Use rancher-mirrored busybox for local-path-provisioner (#2257 ) Related to #1908 Will be fixed upstream by https://github.com/rancher/local-path-provisioner/pull/135/ but we're not going to update the LPP image right now since it's undergoing some changes that we don't want to pick up at the moment. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-15 18:02:51 -07:00
Erik Wilson	a08e998bc5	Import containerd images with all platforms Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-14 20:44:58 -07:00
Brad Davidson	fcaeebaa18	Add support for disabling all staged content This reduces the binary footprint for downstream users that won't use these files anyway. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-14 14:21:37 -07:00
Menna Elmasry	edb3e5b7a7	Add error logger to http server (#2242 ) * add error logger to http server Signed-off-by: MonzElmasry <menna.elmasry@rancher.com>	2020-09-14 23:14:30 +02:00
Brian Downs	15d7b61939	Merge remote-tracking branch 'upstream/master' into issue-112	2020-09-04 14:41:42 -07:00
Brian Downs	4c3ec907ab	remove k8s daemon config from setup hook in favor of specific fields from the config (#2206 ) Signed-off-by: Brian Downs <brian.downs@gmail.com>	2020-09-04 09:30:36 -07:00
Brian Downs	bb8e5374ea	conform to repo conventions Signed-off-by: Brian Downs <brian.downs@gmail.com>	2020-09-03 18:48:30 -07:00
Brian Downs	898cbeb9b6	Merge remote-tracking branch 'upstream/master' into issue-112	2020-09-03 17:26:48 -07:00
Darren Shepherd	289ba8df6a	All arguments should be of the form --k=v so that bool flags will work Previously a bool flag would be rendered as --flag false for `flag: false` which is invalid and results in the opposite of what you'd expect. Signed-off-by: Darren Shepherd <darren@rancher.com>	2020-09-03 16:25:35 -07:00
Darren Shepherd	64ae6affc5	Missing registering debug/config flags on server subcommand Signed-off-by: Darren Shepherd <darren@rancher.com>	2020-09-03 13:19:25 -07:00
Brian Downs	00831f9bc8	use version.Program Signed-off-by: Brian Downs <brian.downs@gmail.com>	2020-09-03 08:51:17 -07:00
Brian Downs	301fb73952	add node ip to the request header for cert gen Signed-off-by: Brian Downs <brian.downs@gmail.com>	2020-09-02 19:15:09 -07:00
Craig Jellick	53b3d0fc56	Merge pull request #2180 from ibuildthecloud/configfile Go back to urfave v1	2020-09-02 11:05:19 -07:00
Brad Davidson	a3e9d31e6c	Merge pull request #2097 from iwilltry42/registry-insecure-skip-verify Feature: add insecure_skip_verify field to registry config template	2020-09-01 15:58:26 -07:00
Darren Shepherd	551a1842ad	Update pkg/cli/cmds/config.go Co-authored-by: Jacob Blain Christen <dweomer5@gmail.com>	2020-09-01 10:43:28 -07:00
Darren Shepherd	7657ed2e13	Update pkg/cli/server/server.go Co-authored-by: Jacob Blain Christen <dweomer5@gmail.com>	2020-09-01 10:43:19 -07:00
Darren Shepherd	21d21ddd4d	Add config file support independent of CLI framework Signed-off-by: Darren Shepherd <darren@rancher.com>	2020-08-29 21:44:13 -07:00
Darren Shepherd	ae5c585050	Revert "Add config file support" This reverts commit `e1dc3451bc`. Signed-off-by: Darren Shepherd <darren@rancher.com>	2020-08-29 21:44:07 -07:00
Erik Wilson	447097a597	Merge pull request #2098 from erikwilson/k8s-1.19 Update to k8s 1.19	2020-08-28 18:22:15 -07:00
Erik Wilson	c5dc09159f	Move basic authentication to k3s	2020-08-28 17:18:34 -07:00
Erik Wilson	57fc0c9c87	Fix up authenticator	2020-08-28 17:18:34 -07:00
Erik Wilson	acc42874d8	Add k8s.io/apiserver/plugins/pkg/authenticator from release-1.18	2020-08-28 17:18:34 -07:00
Erik Wilson	837a943234	Update for k8s 1.19	2020-08-28 17:18:34 -07:00
Erik Wilson	daa4beb22c	Update go.mod for k8s 1.19	2020-08-28 17:18:31 -07:00
Erik Wilson	720197b9b1	Fix linting issues	2020-08-28 17:18:29 -07:00
Brian Downs	866dc94cea	Galal hussein etcd backup restore (#2154 ) * Add etcd snapshot and restore Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix error logs Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * goimports Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix flag describtion Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * Add disable snapshot and retention Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * use creation time for snapshot retention Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * unexport method, update var name Signed-off-by: Brian Downs <brian.downs@gmail.com> * adjust snapshot flags Signed-off-by: Brian Downs <brian.downs@gmail.com> * update var name, string concat Signed-off-by: Brian Downs <brian.downs@gmail.com> * revert previous change, create constants Signed-off-by: Brian Downs <brian.downs@gmail.com> * update Signed-off-by: Brian Downs <brian.downs@gmail.com> * updates Signed-off-by: Brian Downs <brian.downs@gmail.com> * type assertion error checking Signed-off-by: Brian Downs <brian.downs@gmail.com> * update Signed-off-by: Brian Downs <brian.downs@gmail.com> * update Signed-off-by: Brian Downs <brian.downs@gmail.com> * update Signed-off-by: Brian Downs <brian.downs@gmail.com> * pr remediation Signed-off-by: Brian Downs <brian.downs@gmail.com> * pr remediation Signed-off-by: Brian Downs <brian.downs@gmail.com> * pr remediation Signed-off-by: Brian Downs <brian.downs@gmail.com> * pr remediation Signed-off-by: Brian Downs <brian.downs@gmail.com> * pr remediation Signed-off-by: Brian Downs <brian.downs@gmail.com> * updates Signed-off-by: Brian Downs <brian.downs@gmail.com> * updates Signed-off-by: Brian Downs <brian.downs@gmail.com> * simplify logic, remove unneeded function Signed-off-by: Brian Downs <brian.downs@gmail.com> * update flags Signed-off-by: Brian Downs <brian.downs@gmail.com> * update flags Signed-off-by: Brian Downs <brian.downs@gmail.com> * add comment Signed-off-by: Brian Downs <brian.downs@gmail.com> * exit on restore completion, update flag names, move retention check Signed-off-by: Brian Downs <brian.downs@gmail.com> * exit on restore completion, update flag names, move retention check Signed-off-by: Brian Downs <brian.downs@gmail.com> * exit on restore completion, update flag names, move retention check Signed-off-by: Brian Downs <brian.downs@gmail.com> * update disable snapshots flag and field names Signed-off-by: Brian Downs <brian.downs@gmail.com> * move function Signed-off-by: Brian Downs <brian.downs@gmail.com> * update field names Signed-off-by: Brian Downs <brian.downs@gmail.com> * update var and field names Signed-off-by: Brian Downs <brian.downs@gmail.com> * update var and field names Signed-off-by: Brian Downs <brian.downs@gmail.com> * update defaultSnapshotIntervalMinutes to 12 like rke Signed-off-by: Brian Downs <brian.downs@gmail.com> * update directory perms Signed-off-by: Brian Downs <brian.downs@gmail.com> * update etc-snapshot-dir usage Signed-off-by: Brian Downs <brian.downs@gmail.com> * update interval to 12 hours Signed-off-by: Brian Downs <brian.downs@gmail.com> * fix usage typo Signed-off-by: Brian Downs <brian.downs@gmail.com> * add cron Signed-off-by: Brian Downs <brian.downs@gmail.com> * add cron Signed-off-by: Brian Downs <brian.downs@gmail.com> * add cron Signed-off-by: Brian Downs <brian.downs@gmail.com> * wire in cron Signed-off-by: Brian Downs <brian.downs@gmail.com> * wire in cron Signed-off-by: Brian Downs <brian.downs@gmail.com> * wire in cron Signed-off-by: Brian Downs <brian.downs@gmail.com> * wire in cron Signed-off-by: Brian Downs <brian.downs@gmail.com> * wire in cron Signed-off-by: Brian Downs <brian.downs@gmail.com> * wire in cron Signed-off-by: Brian Downs <brian.downs@gmail.com> * wire in cron Signed-off-by: Brian Downs <brian.downs@gmail.com> * update deps target to work, add build/data target for creation, and generate Signed-off-by: Brian Downs <brian.downs@gmail.com> * remove dead make targets Signed-off-by: Brian Downs <brian.downs@gmail.com> * error handling, cluster reset functionality Signed-off-by: Brian Downs <brian.downs@gmail.com> * error handling, cluster reset functionality Signed-off-by: Brian Downs <brian.downs@gmail.com> * update Signed-off-by: Brian Downs <brian.downs@gmail.com> * remove intermediate dapper file Signed-off-by: Brian Downs <brian.downs@gmail.com> Co-authored-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2020-08-28 16:57:40 -07:00
Frederick F. Kautz IV	cdce2b7e9a	Add support for compressed images when pre-loading images (#2165 ) * Add support for compressed images when pre-loading images Signed-off-by: Frederick F. Kautz IV <fkautz@alumni.cmu.edu> * attempting to fix vendor source being dirty Signed-off-by: Frederick F. Kautz IV <fkautz@alumni.cmu.edu> * fixing file extension for .tar.lz4 Signed-off-by: Frederick F. Kautz IV <fkautz@alumni.cmu.edu>	2020-08-28 12:27:01 -07:00
Brad Davidson	c4ac620b8b	Merge pull request #2159 from brandond/config_file_rename Rename flags.conf to config.yaml	2020-08-25 21:43:48 -07:00
Brad Davidson	b4d81a9e33	Remove lingering references to dqlite Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-08-24 17:09:19 -07:00
Brad Davidson	43fcc5ddcb	Rename flags.conf => config.yaml Related to https://github.com/rancher/rke2/issues/150 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-08-24 14:56:30 -07:00
Brad Davidson	c980fa68a0	Update helm-controller for HelmChartConfig CRD (#2114 ) * Update helm-controller for HelmChartConfig CRD Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-08-20 14:23:50 -07:00
Brian Downs	324bb55986	add ctx to hook, handle hook errors Signed-off-by: Brian Downs <brian.downs@gmail.com>	2020-08-19 16:54:58 -07:00
Brian Downs	fa2c1422b3	change name of variable Signed-off-by: Brian Downs <brian.downs@gmail.com>	2020-08-19 14:30:53 -07:00
Brian Downs	a4b2953017	add setup hook capabilities for rke2 Signed-off-by: Brian Downs <brian.downs@gmail.com>	2020-08-19 13:42:45 -07:00
Brad Davidson	79c499f0e0	Fix handling of TLS configuration args Also fixes an unrelated error formatting issue turned up while testing. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-08-18 16:44:10 -07:00
Brad Davidson	b1d017f892	Update dynamiclistener Second round of fixes for #1621 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-08-18 10:38:47 -07:00
Jacob Blain Christen	e2089bea18	cli: add --selinux flag to agent/server sub-cmds (#2111 ) * cli: add --selinux flag to agent/server sub-cmds Introduces --selinux flag to affirmatively enable SELinux in containerd. Deprecates --disable-selinux flag which now defaults to true which auto-detection of SELinux configuration for containerd is no longer supported. Specifying both --selinux and --disable-selinux will result in an error message encouraging you to pick a side. * Update pkg/agent/containerd/containerd.go update log warning message about enabled selinux host but disabled runtime Co-authored-by: Brad Davidson <brad@oatmail.org> Signed-off-by: Jacob Blain Christen <jacob@rancher.com>	2020-08-11 16:17:32 -07:00
Jacob Blain Christen	97ff5affab	Merge pull request #2065 from dweomer/containerd/v1.3.6-selinux updated containerd/cri selinux support	2020-08-07 11:09:28 -07:00
Thorsten Klein	cf8c101b70	registry template: add insecure_skip_verify field Signed-off-by: Thorsten Klein <iwilltry42@gmail.com>	2020-08-06 08:02:08 +02:00
Brad Davidson	3f2551ec05	Merge pull request #1848 from euank/insecure-on-lo Listen insecurely on localhost only	2020-08-05 10:55:09 -07:00
Euan Kemp	4808c4e7d5	Listen insecurely on localhost only Before this change, k3s configured the scheduler and controller's insecure ports to listen on 0.0.0.0. Those ports include pprof, which provides a DoS vector at the very least. These ports are only enabled for componentstatus checks in the first place, and componentstatus is hardcoded to only do the check on localhost anyway (see https://github.com/kubernetes/kubernetes/blob/v1.18.2/pkg/registry/core/rest/storage_core.go#L341-L344), so there shouldn't be any downside to switching them to listen only on localhost.	2020-08-05 10:28:11 -07:00
Akihiro Suda	a70cdac356	update rootlesskit to v0.10.0 Fix intermittent "Connection reset by peer" error during port forwarding https://github.com/rootless-containers/rootlesskit/issues/153 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-08-05 18:22:05 +09:00
Brad Davidson	3e8141dc65	Update dynamiclistener Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-08-04 13:05:37 -07:00
Hussein Galal	169ee63907	Add etcd members as learners (#2066 ) * Add etcd members as learners Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * Ignore errors in promote member Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2020-07-29 22:52:49 +02:00
Brad Davidson	1eec7348a5	Call setproctitle to conceal node args in ps output This is related to #2014. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-07-28 15:49:49 -07:00
Jacob Blain Christen	371bee82f9	containerd: bump to v1.3.6 Remove $NOTIFY_SOCKET, if present, from env when invoking containerd to prevent gratuitous notifications sent to systemd. Signed-off-by: Jacob Blain Christen <jacob@rancher.com>	2020-07-27 14:41:52 -07:00
Brad Davidson	dfd0f9d1a6	Correctly report and propagate kubeconfig write failures As seen in issues such as #15 #155 #518 #570 there are situations where k3s will fail to write the kubeconfig file, but reports that it wrote it anyway as the success message is printed unconditionally. Also, secondary actions like setting file mode and creating a symlink are also attempted even if the file was not created. This change skips attempting additional actions, and propagates the failure back upwards. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-07-24 12:07:32 -07:00
Brad Davidson	9da8dc4f61	Update coredns version to 1.6.9 for master Needed for #1844 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-07-21 11:06:44 -07:00
Brian Downs	5a81fdbdc5	update cis flag implementation to propogate the rest of the way through to kubelet Signed-off-by: Brian Downs <brian.downs@gmail.com>	2020-07-20 16:31:56 -07:00

... 5 6 7 8 9 ...

1098 Commits (0c302f43410d20a26816bad91732e1276a8ead72)