2208 Commits (28eb7439054a2f0226b96a74950b84f0b6368cad)

Author SHA1 Message Date
Brad Davidson e1883d0537 Bump klipper-lb image for arm fix
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-11-02 19:01:30 -07:00
Brad Davidson 5eb13b6ba6 Fix log/reap reexec
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-11-01 14:58:46 -07:00
Brad Davidson 259ceb452c Fix other uses of NewForConfigOrDie in contexts where we could return err
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 3da1bb3af2)
2021-10-29 16:44:38 -07:00
Brad Davidson cc23fce0a7 Watch the local Node object instead of get/sleep looping
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 5acd0b9008)
2021-10-29 16:44:38 -07:00
Brad Davidson 6349aed8e8 Block scheduler startup on untainted node when using embedded CCM
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 3fe460d080)
2021-10-29 16:44:38 -07:00
Hussein Galal db8f54e6af
Update to v1.22.3 (#4348)
* Update to v1.22.3

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* revert k8s client update

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* Ignore go.mod and go.sum for codespell

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* Update to v1.22.3

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-10-28 22:37:59 +02:00
Brad Davidson 46eea2f10a Revert "Add ability to reconcile bootstrap data between datastore and disk (#3398)"
This reverts commits
9a4ca5978b
c9f6fa0be0
07f844cf95
48355dce10

Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-10-27 14:17:12 -07:00
Brian Downs 9a4ca5978b
reset buffer after use (#4279) (#4329)
2021-10-26 17:58:25 -07:00
Brian Downs c9f6fa0be0 remove integration test
Signed-off-by: Brian Downs <brian.downs@gmail.com>
2021-10-25 16:23:31 -07:00
Brian Downs 07f844cf95 Copy old bootstrap buffer data for use during migration (#4215)
Signed-off-by: Brian Downs <brian.downs@gmail.com>
2021-10-25 16:23:31 -07:00
Brian Downs 48355dce10 Add ability to reconcile bootstrap data between datastore and disk (#3398)
Signed-off-by: Brian Downs <brian.downs@gmail.com>
2021-10-25 16:23:31 -07:00
galal-hussein 84e9b829e0 Update peer address when running cluster-reset
Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-10-25 15:43:45 -07:00
Brad Davidson 06b8639068 Bump klipper-helm version
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-10-22 14:43:27 -07:00
Derek Nola f98934980d
Added configuration input to etcd-snapshot (#4280) (#4281)
Signed-off-by: dereknola <derek.nola@suse.com>
2021-10-22 13:08:12 -07:00
Manuel Buil 61084d6120
Merge pull request #4265 from manuelbuil/update_flannel_1.22
[Release-1.22] Update to the newest flannel
2021-10-21 13:39:56 +02:00
Manuel Buil 7ede7d2e7c Update to the newest flannel
Fixes issues such as the internode connectivity

Signed-off-by: Manuel Buil <mbuil@suse.com>
2021-10-21 09:03:58 +02:00
Brad Davidson 971854c15b Refactor log and reaper exec to omit MAINPID
Using MAINPID breaks systemd's exit detection, as it stops watching the
original pid, but is unable to watch the new pid as it is not a child
of systemd itself. The best we can do is just notify when execing the child
process.

We also need to consolidate forking into a single place so that we don't
end up with multiple levels of child processes if both redirecting log
output and reaping child processes.

Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit dc18ef2e51)
2021-10-20 13:41:50 -07:00
Brad Davidson 3988edef25 Add containerd ready channel to delay etcd node join
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 88178ae65e)
2021-10-20 13:05:40 -07:00
Brad Davidson b65bcdf963 Bump klog fork version
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-10-19 10:27:11 -07:00
Manuel Buil 0d6ccf4199
Merge pull request #4222 from manuelbuil/race_fix_1.22
[release-1.22] Race fix 1.22
2021-10-19 09:14:45 +02:00
Brian Downs 7c78e1c802
[Release-1.22] - Add etcd s3 timeout (#4207) (#4230)
2021-10-18 10:45:56 -07:00
Manuel Buil c10a0a2163 Fix race condition in cloud provider
Signed-off-by: Manuel Buil <mbuil@suse.com>
2021-10-15 19:31:08 +02:00
Hussein Galal 6193b1af97
Display cluster tls error only in debug mode (#4200)
Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-10-13 01:48:00 +02:00
Brian Downs 737f722315
set transport to skip verify if se skip flag passed (#4102) (#4103)
2021-10-07 12:47:42 -07:00
Derek Nola dd4b36e034
Add "etcd-" prefix to etcd-snapshot commands as aliases (#4161) (#4170)
* Add "etcd-" prefix to etcd-snapshot commands as alias

Signed-off-by: dereknola <derek.nola@suse.com>
2021-10-07 11:04:39 -07:00
Manuel Buil 25e925002e
Merge pull request #4160 from manuelbuil/dual-stack-serviceLB
[release-1.22] Dual-stack support LB controller
2021-10-07 18:38:26 +02:00
Manuel Buil cedcece9cc Dual-stack support LB controller
Signed-off-by: Manuel Buil <mbuil@suse.com>
2021-10-06 16:10:45 +02:00
Brad Davidson 3f5774b41e Bump containerd to v1.5.7+k3s1
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-10-04 14:19:28 -07:00
Brad Davidson bdd597fb9e Don't evacuate the root cgroup when rootless
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-10-01 16:18:33 -07:00
Brad Davidson 6d282b26c2 Skip tests that violate version skew policy
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-09-30 12:26:00 -07:00
Brad Davidson a09bcba540 Properly handle operation as init process
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-09-28 11:32:16 -07:00
Manuel Buil 4900e7b9bb
Merge pull request #4101 from manuelbuil/sysctl_ipv6_inheritance_1.22
[Backport 1.22] Enable the inheritance of settings for ipv6
2021-09-28 19:37:08 +02:00
Manuel Buil 44013ae899 Enable the inheritance of settings for ipv6
Signed-off-by: Manuel Buil <mbuil@suse.com>
2021-09-28 18:28:01 +02:00
Brad Davidson 10bca343e8 Update build images to python3 for compat with recent gsutil change
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-09-23 13:34:17 -07:00
Brad Davidson 28be0de4e8 Revert "Use the newer klipper-lb image"
This reverts commit 1d21491094.
2021-09-20 13:19:38 -07:00
Brad Davidson 64b502e92c Disable automounting service account token in servicelb pods
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-09-17 15:52:44 -07:00
Hussein Galal 7826407a2e
Make sure there are no duplicates in etcd member list (#4025)
* Make sure there are no duplicates in etcd member list

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* fix node names with hyphens

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* use full server name for etcd node name

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-09-18 00:51:18 +02:00
Manuel Buil 1d21491094 Use the newer klipper-lb image
Signed-off-by: Manuel Buil <mbuil@suse.com>
2021-09-17 15:42:48 -07:00
Brad Davidson 753e11ee3c Enable JobTrackingWithFinalizers FeatureGate
Works around issue with Job controller not tracking job pods that
are in CrashLoopBackOff during upgrade from 1.21 to 1.22.

Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-09-17 11:26:45 -07:00
Rowan Thorpe dccee4e87b Fix regression from commit 137e80cd86
Problem:

A false-negative in check-config.sh for cgroups v2 systems was fixed but the
commit introduced a regression based on a small assumption that content of
/sys/fs/cgroup/cgroup.controllers would have the same format as the content
of /proc/self/cgroup. It doesn't.

Solution:

This just tweaks the regex to count occurrences of either cgroup
subsystem-names on each line (as occurs in the sysfs pseudo-file), or those
names with colons either side (as occurs in the procfs pseudo-file).

Signed-off-by: Rowan Thorpe <rowan@rowanthorpe.com>
2021-09-17 11:21:17 -07:00
Brad Davidson 7a36c3f7f2 Bump golang version
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-09-16 15:48:29 -07:00
Brad Davidson 77dfdda909 Update Kubernetes to v1.22.2-k3s1
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-09-16 15:48:29 -07:00
Derek Nola eda65b19d9
Remove experimental from cluster commands (#4024)
Signed-off-by: dereknola <derek.nola@suse.com>
2021-09-15 16:41:50 -07:00
Joe Kralicky debb508643
Nvidia container runtime discovery in containerd config template (#3890)
* Update the default containerd config template with support for adding extra container runtimes. Add logic to discover nvidia container runtimes installed via the gpu operator or package manager.

Signed-off-by: Joe Kralicky <joe.kralicky@suse.com>
2021-09-15 14:31:11 -07:00
Brad Davidson 086ca8ba6a Fix premature etcd shutdown when joining an existing cluster
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-09-15 10:35:07 -07:00
Manuel Buil 60cd86bc42
Merge pull request #3906 from manuelbuil/dual-stack
Add dual-stack support on flannel
2021-09-15 18:48:10 +02:00
Brad Davidson 85e11c47d1 Add StargzSupported stub for Windows
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-09-15 09:45:57 -07:00
Nicolas Brechet e732517712 Retrieve "CONTAINERD_" environment variables
Signed-off-by: Nicolas Brechet <nicolas.brechet@swisscom.com>
2021-09-15 09:37:34 -07:00
Chris Kim acf9036b63
No-op when etcd member was already removed and use existing name for etcd controller (#4014)
Signed-off-by: Chris Kim <oats87g@gmail.com>
2021-09-15 08:41:30 -07:00
Manuel Buil 9fcd79baae Add tests to the dual-stack PR and enable dual-stack with flannel backend
Signed-off-by: Manuel Buil <mbuil@suse.com>
2021-09-15 14:11:54 +02:00