github/k3s - k3s - https://git.xinac.net

Commit Graph

Author	SHA1	Message	Date
Marcin Owsiany	36dc1c4515	Fix typo in function name. Also remove a superfluous comment.	2017-10-17 11:31:46 +02:00
Kubernetes Submit Queue	28df7a1cae	Merge pull request #47806 from dcbw/fix-pod-ip-race Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the cherry-pick instructions (https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md). kubelet: fix inconsistent display of terminated pod IPs PLEG and kubelet race when reading and sending pod status to the apiserver. PLEG inserts status into a cache, and then signals kubelet. Kubelet then eventually reads the status out of that cache, but in the mean time the status could have been changed by PLEG. When a pod exits, pod status will no longer include the pod's IP address because the network plugin/runtime will report "" for terminated pod IPs. If this status gets inserted into the PLEG cache before kubelet gets the status out of the cache, kubelet will see a blank pod IP address. This happens in about 1/5 of cases when pods are short-lived, and somewhat less frequently for longer running pods. To ensure consistency for properties of dead pods, copy an old status update's IP address over to the new status update if (a) the new status update's IP is missing and (b) all sandboxes of the pod are dead/not-ready (eg, no possibility for a valid IP from the sandbox). Fixes: https://github.com/kubernetes/kubernetes/issues/47265 Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1449373 @eparis @freehan @kubernetes/rh-networking @kubernetes/sig-network-misc	2017-09-22 21:01:50 -07:00
Casey Davenport	be5cd7fed2	Recreate pod sandbox when the sandbox does not have an IP address.	2017-09-15 09:23:52 -07:00
Dan Williams	8c16260160	kubelet: fix inconsistent display of terminated pod IPs by using events instead PLEG and kubelet race when reading and sending pod status to the apiserver. PLEG inserts status into a cache, and then signals kubelet. Kubelet then eventually reads the status out of that cache, but in the mean time the status could have been changed by PLEG. When a pod exits, pod status will no longer include the pod's IP address because the network plugin/runtime will report "" for terminated pod IPs. If this status gets inserted into the PLEG cache before kubelet gets the status out of the cache, kubelet will see a blank pod IP address. This happens in about 1/5 of cases when pods are short-lived, and somewhat less frequently for longer running pods. To ensure consistency for properties of dead pods, copy an old status update's IP address over to the new status update if (a) the new status update's IP is missing and (b) all sandboxes of the pod are dead/not-ready (eg, no possibility for a valid IP from the sandbox). Fixes: https://github.com/kubernetes/kubernetes/issues/47265	2017-07-21 09:52:10 -05:00
Kubernetes Submit Queue	c1f8fcd9fe	Merge pull request #45496 from andyxning/fix_pleg_relist_time Automatic merge from submit-queue fix pleg relist time This PR fixes the PLEG relist time. In the current implementation, a `Healthy` method periodically checks the relist time. If the current timestamp minus the latest relist time exceeds `relistThreshold` (default 3 minutes), it should return an error to indicate a runtime failure. The `relist` method is also called periodically. If the runtime (docker) hangs, `relist` should return immediately without updating the latest relist time. If the latest relist time is updated even when the runtime (docker) hangs (default timeout is 2 minutes), the `Healthy` method will never return an error. ```release-note Kubelet PLEG updates the relist timestamp only after successfully relisting. ``` /cc @yujuhong @Random-Liu @dchen1107	2017-05-21 04:17:14 -07:00
Clayton Coleman	3e095d12b4	Refactor move of client-go/util/clock to apimachinery	2017-05-20 14:19:48 -04:00
Andy Xie	af6c040630	fix pleg relist time	2017-05-18 11:40:04 +08:00
deads2k	5a8f075197	move authoritative client-go utils out of pkg	2017-01-24 08:59:18 -05:00
deads2k	c47717134b	move utils used in restclient to client-go	2017-01-19 07:55:14 -05:00
Kubernetes Submit Queue	9a88687e24	Merge pull request #37865 from yujuhong/decouple_lifecycle Automatic merge from submit-queue kubelet: remove the pleg health check from healthz This prevents kubelet from being killed when docker hangs. Also, kubelet will report node not ready if PLEG hangs (`docker ps` + `docker inspect`).	2017-01-12 19:10:14 -08:00
deads2k	6a4d5cd7cc	start the apimachinery repo	2017-01-11 09:09:48 -05:00
Yu-Ju Hong	ec0e99c2ed	Check the health of PLEG when updating the node status	2017-01-10 16:34:00 -08:00
Kubernetes Submit Queue	b2d02bd1ab	Merge pull request #31395 from yujuhong/getpods Automatic merge from submit-queue Instruct PLEG to detect pod sandbox state changes This PR adds a Sandboxes list in `kubecontainer.Pod`, so that PLEG can check sandbox changes using `GetPods()` . The sandboxes are treated as regular containers (type `kubecontainer.Container`) for now to avoid additional changes in PLEG. /cc @feiskyer @yifan-gu @euank	2016-09-08 05:41:16 -07:00
Yu-Ju Hong	a49d28710a	Extend PLEG to handle pod sandboxes PLEG will treat them as if they are regular containers and detect changes the same manner. Note that this makes an assumption that container IDs will not collide with the podsandbox IDs.	2016-08-30 09:54:24 -07:00
Pengfei Ni	1c62d2c368	Kubelet: implement PodStatus for new runtime API	2016-08-25 09:36:00 +08:00
Andrey Kurilin	9f1c3a4c56	Fix various typos in kubelet	2016-08-03 01:14:44 +03:00
Michal Rostecki	59ca5986dd	Print/log pointers of structs with %#v instead of %+v There are many places in k8s where %+v is used to format a pointer to struct, which isn't working as expected. Fixes #26591	2016-08-01 22:27:56 +02:00
Harry Zhang	cb14b35bde	Refactor util clock into its own pkg	2016-07-28 02:29:04 -04:00
Ron Lai	a58c774c08	Including ContainerRemoved in PLEG event reporting	2016-07-14 16:39:03 -07:00
David McMahon	ef0c9f0c5b	Remove "All rights reserved" from all the headers.	2016-06-29 17:47:36 -07:00
Tim Hockin	817abc3213	Kill our atomic pkg, now that 1.6 is req'd	2016-05-08 20:30:37 -07:00
Andy Goldstein	3a87bfb6f7	PLEG: reinspect pods that failed prior inspections Fix the following sequence of events: 1. relist call 1 successfully inspects a pod (just has infra container) 2. relist call 2 gets an error inspecting the same pod (has infra container and a transient container that failed to create) and doesn't update the old/new pod records 3. relist calls 3+ don't inspect the pod any more (just has infra container so it doesn't look like anything changed) This change adds a new list that keeps track of pods that failed inspection and retries them the next time relist is called. Without this change, a pod in this state would never be inspected again, its entry in the status cache would never be updated, and the pod worker would never call syncPod again because the most recent entry in the status cache has an error associated with it. Without this change, pods in this state would be stuck Terminating forever, unless the user issued a deletion with a grace period value of 0.	2016-05-03 11:06:35 -04:00
goltermann	34d4eaea08	Fixing several (but not all) go vet errors. Most are around string formatting, or unreachable code.	2016-03-22 17:26:50 -07:00
Yu-Ju Hong	4846c1e1b2	pleg: add an internal clock for testability Also add tests for the health check.	2016-03-01 17:53:03 -08:00
Yu-Ju Hong	94368df91a	kubelet: monitor the health of pleg PLEG is responsible for listing the pods running on the node. If it's hung due to non-responsive container runtime or internal bugs, we should restart kubelet.	2016-03-01 17:24:27 -08:00
Random-Liu	96eeb812ff	kubelet: clear current pod records before relist	2016-02-28 13:19:47 -08:00
Yu-Ju Hong	388689238b	pleg: ensure the cache is updated whenever containers are removed Even though we don't rely on the cache for garbage collection yet, we should keep it up-to-date.	2016-02-28 13:16:34 -08:00
Yu-Ju Hong	f9880d4a3a	kubelet: lower the verbosity level of some logging messages	2016-02-24 18:42:26 -08:00
laushinka	7ef585be22	Spelling fixes inspired by github.com/client9/misspell	2016-02-18 06:58:05 +07:00
Jan Chaloupka	4389b3f0d6	Rewrite util.* -> wait.* wherever reasonable	2016-02-07 12:02:20 +01:00
Yu-Ju Hong	b56ed1a8c2	Support populating the runtime cache in PLEG This change does not turn on this feature (cache) for kubelet.	2016-01-13 10:19:47 -08:00
Yu-Ju Hong	73a4f8225c	PLEG should report events if a container is removed Currently, pleg would report an event if a container transitions from running to exited between relisting. However, it would not report any event if a container gets stopped and removed between relisting. This event will eventually be handled when the pod syncs periodically, but this is undesirable. This change ensures that we detect all such events.	2016-01-12 16:25:19 -08:00
Yu-Ju Hong	7d180b337b	Record pleg pod relist interval and latency Relisting latency/interval affects how quick kubelet discovers changes. Record the metrics in Prometheus to surface such information.	2016-01-04 10:56:38 -08:00
Random-Liu	3cbdf79f8c	Change original PodStatus to APIPodStatus, and start using kubelet internal PodStatus in dockertools	2015-12-04 17:37:39 -08:00
Yu-Ju Hong	bc6414a873	kubelet: add a generic pod lifecycle event generator This change introduces pod lifecycle event generator (PLEG), and adds a generic PLEG. The generic PLEG relies on relisting to discover container events, and is container-runtime-agnostic. Both docker and rkt are changed to use generic PLEG.	2015-11-13 09:55:36 -08:00

35 Commits (13b12e89408869c5b560a81e95bca33267bdb8e1)