github/k3s - k3s - https://git.xinac.net

Commit Graph

Author	SHA1	Message	Date
Kubernetes Submit Queue	755ab974e1	Merge pull request #58835 from ravisantoshgudimetla/critical-pod-with-priority Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Critical pod priorityClass addition What this PR does / why we need it: @bsalamat - Apologies for the delay. This PR is to ensure that all pods with priorityClassName `system-node-critical` and `system-cluster-critical` will be critical pods while preserving backwards compatibility. Special notes for your reviewer: - Moved some constants and other data structures to scheduler/api/types.go where other constants are present. - An automatic assignment of critical priorities to pods based on critical pod annotation for backwards compatibility including some unit tests. xref: https://github.com/kubernetes/kubernetes/issues/57471 Release note: ```release-note Critical pods to use priorityClasses. ```	2018-02-23 11:22:31 -08:00
Shea Levy	48af739893	dockershim: Return Labels as Info in ImageStatus. `c6ddc749e8` added an Info field to ImageStatusResponse when Verbose is true. This makes the image's Labels available in that field, rather than unconditionally returning an empty map.	2018-02-23 07:47:55 -05:00
Kubernetes Submit Queue	d5aba0c6ca	Merge pull request #59088 from YuxiJin-tobeyjin/codeClean-merge-logfAndFailnow-to-fatalf Automatic merge from submit-queue (batch tested with PRs 60106, 59510, 60263, 60063, 59088). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. CodeClean, merge Logf And FailNow to Fatalf What this PR does / why we need it: Trivial changes to clean code, merge Logf And FailNow to Fatalf. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note "NONE" ```	2018-02-23 02:59:55 -08:00
Kubernetes Submit Queue	f59515ca99	Merge pull request #60063 from mtaufen/fix-configok-overlay Automatic merge from submit-queue (batch tested with PRs 60106, 59510, 60263, 60063, 59088). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. clean up KubeletConfigOk condition construction This PR cleans up the construction of the node condition and also fixes a small bug where the last transition time could be updated incorrectly when the sync failure overlay was present. ```release-note NONE ```	2018-02-23 02:59:51 -08:00
Kubernetes Submit Queue	6af0768768	Merge pull request #60106 from dashpole/cadvisor_godep Automatic merge from submit-queue (batch tested with PRs 60106, 59510, 60263, 60063, 59088). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Update cadvisor godeps to v0.29.0 and ignore per-cpu metrics What this PR does / why we need it: Updates the cAdvisor dependency to the cAdvisor release associated with the kubernetes 1.10 release. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #60052 Special notes for your reviewer: This PR also adds per-cpu metrics to the ignoreMetrics list. This is a new metric that can be ignored in the most recent cAdvisor release. The reason for not collecting per-cpu metrics is that it can cause severe scalability issues. For example, if using a 128 core machine, and running 100 containers, we have 12800 different streams of metrics just for per-cpu metrics which cAdvisor needs to process and transmit. Additionally, per-cpu metrics are not used by any kubernetes components, and if a user needs these metrics, they can run cAdvisor as a daemonset. Release note: ```release-note Disable per-cpu metrics by default for scalability. Fix inaccurate disk usage monitoring of overlayFs. Retry docker connection on startup timeout to avoid permanent loss of metrics. ``` /assign @dchen1107	2018-02-23 02:59:38 -08:00
Pingan2017	9f37b5fe52	fix freespace for image GC	2018-02-23 17:25:54 +08:00
Cao Shufeng	530c459ff2	clean up sysctl code	2018-02-23 16:41:53 +08:00
Kubernetes Submit Queue	fe0e80e8da	Merge pull request #60181 from verb/pid-enable Automatic merge from submit-queue (batch tested with PRs 59463, 59719, 60181, 58283, 59966). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Set shared PID namespace mode based on PodSpec What this PR does / why we need it: This PR enables pod process namespace sharing as an alpha feature, as described in [Shared PID Namespace Proposal](https://github.com/kubernetes/community/blob/master/contributors/design-proposals/node/pod-pid-namespace.md). Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): WIP #1615 Special notes for your reviewer: /assign @dchen1107 Release note: ```release-note When the `PodShareProcessNamespace` alpha feature is enabled, setting `pod.Spec.ShareProcessNamespace` to `true` will cause a single process namespace to be shared between all containers in a pod. ```	2018-02-23 00:34:26 -08:00
Kubernetes Submit Queue	ec77ddfe19	Merge pull request #59463 from dixudx/add_verify_spelling Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. add spelling checking script What this PR does / why we need it: Add a spell-checking script to avoid introducing typos. Currently many small PRs fix these typos one at a time, which is time-consuming and inefficient. We should add such a preflight check before a PR gets merged. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: /sig testing /area test-infra /sig release /cc @ixdy /assign @liggitt @smarterclayton Release note: ```release-note add spelling checking script ```	2018-02-22 23:46:15 -08:00
Pavithra Ramesh	098a4467fe	Remove conntrack entry on udp rule add. Moved conntrack util outside of proxy pkg Added warning message if conntrack binary is not found Addressed review comments. ran gofmt	2018-02-22 23:34:42 -08:00
Kubernetes Submit Queue	f05a065738	Merge pull request #59713 from hanxiaoshuai/fix0211 Automatic merge from submit-queue (batch tested with PRs 60208, 60084, 60183, 59713, 60096). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Use SeekStart, SeekCurrent, and SeekEnd to replace deprecated constants What this PR does / why we need it: Use SeekStart, SeekCurrent, and SeekEnd to replace the deprecated constants. ''' // Deprecated: Use io.SeekStart, io.SeekCurrent, and io.SeekEnd. const ( SEEK_SET int = 0 // seek relative to the origin of the file SEEK_CUR int = 1 // seek relative to the current offset SEEK_END int = 2 // seek relative to the end ) ''' Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note NONE ```	2018-02-22 23:17:38 -08:00
Zihong Zheng	9e5e0c6a59	More unit test for configurable pod resolv.conf	2018-02-22 23:17:13 -08:00
Pengfei Ni	2d942dab68	Disable mount propagation for windows containers	2018-02-23 13:14:26 +08:00
Lantao Liu	313e8717f6	Generated code	2018-02-23 01:42:35 +00:00
Lantao Liu	d7b21a3358	Use container log manager in kubelet	2018-02-23 01:42:35 +00:00
Lantao Liu	ebb4865479	Add kubelet container log manager	2018-02-23 01:41:34 +00:00
Di Xu	271ae45901	fix new typos when rebasing	2018-02-23 09:33:14 +08:00
Michael Taufen	1d59190d3e	clean up KubeletConfigOk condition construction This PR cleans up the construction of the node condition and also fixes a small bug where the last transition time could be updated incorrectly when the sync failure overlay was present.	2018-02-22 14:43:19 -08:00
Michael Taufen	7290313dfd	backoff runtime errors in kubelet sync loop The runtime health check can race with PLEG's first relist, and this often results in an unnecessary 5 second wait during Kubelet bootstrap. This change aims to improve the performance.	2018-02-22 11:54:31 -08:00
David Ashpole	65394fe18c	update cadvisor godeps and ignore per-cpu metrics	2018-02-22 09:17:02 -08:00
Kubernetes Submit Queue	742c9b158d	Merge pull request #59906 from abhi/log_stats Automatic merge from submit-queue (batch tested with PRs 54191, 59374, 59824, 55032, 59906). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Adding per container stats for CRI runtimes What this PR does / why we need it: This commit aims to collect per container log stats. The change was proposed as a part of #55905. The change includes changing the log path from /var/pod/<pod uid>/containername_attempt.log to /var/pod/<pod uid>/containername/containername_attempt.log. The logs are collected by reusing volume package to collect metrics from the log path. Fixes #55905 Special notes for your reviewer: cc @Random-Liu Release note: ``` Adding container log stats for CRI runtimes. ```	2018-02-21 19:40:42 -08:00
Pengfei Ni	d8703eede3	Get dirFsInfo from docker image filesystem	2018-02-22 11:09:22 +08:00
Pengfei Ni	b1361037ff	Set FsId and usedBytes for windows image file system	2018-02-22 11:09:22 +08:00
Pengfei Ni	cac0263c12	Add GetDiskFreeSpaceEx and export winstats.StatsClient	2018-02-22 11:09:22 +08:00
Lee Verberne	b9e8a8a6de	Set shared PID namespace mode based on PodSpec	2018-02-22 03:51:35 +01:00
Kubernetes Submit Queue	30a7bad884	Merge pull request #59125 from verb/pid-annotation Automatic merge from submit-queue (batch tested with PRs 60148, 60022, 59125, 60068, 60154). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Adding support for per-pod process namespace sharing in kubelet What this PR does / why we need it: This enables process namespace sharing between containers in a pod as described in the [Shared PID Namespace](https://github.com/kubernetes/community/blob/master/contributors/design-proposals/node/pod-pid-namespace.md#container-runtime-interface-changes) proposal but leaves it disconnected pending merge of #58716. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): WIP #1615 Special notes for your reviewer: Release note: ```release-note NONE ```	2018-02-21 18:09:43 -08:00
ravisantoshgudimetla	7da5a2e4dd	Build files generated	2018-02-21 20:53:25 -05:00
ravisantoshgudimetla	68c20ad770	Critical pods priorityClass addition	2018-02-21 20:53:21 -05:00
Kubernetes Submit Queue	2bbaf430d8	Merge pull request #59316 from smarterclayton/terminate_early Automatic merge from submit-queue (batch tested with PRs 58716, 59977, 59316, 59884, 60117). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Cap how long the kubelet waits when it has no client cert If we go a certain amount of time without being able to create a client cert and we have no current client cert from the store, exit. This prevents a corrupted local copy of the cert from leaving the Kubelet in a zombie state forever. Exiting allows a config loop outside the Kubelet to clean up the file or the bootstrap client cert to get another client cert. Five minutes is a totally arbitrary timeout, judged to give enough time for really slow static pods to boot. @mikedanese ```release-note Set an upper bound (5 minutes) on how long the Kubelet will wait before exiting when the client cert from disk is missing or invalid. This prevents the Kubelet from waiting forever without attempting to bootstrap new client credentials. ```	2018-02-21 15:40:41 -08:00
Kubernetes Submit Queue	e8dd75f37d	Merge pull request #58282 from vikaschoudhary16/per-container-allocate Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Invoke preStart RPC call before container start, if desired by plugin What this PR does / why we need it: 1. Adds a new RPC `preStart` to device plugin API 2. Update `Register` RPC handling to receive a flag from the Device plugins as an indicator if kubelet should invoke `preStart` RPC before starting container. 3. Changes in device manager to invoke `preStart` before container start 4. Test case updates Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #56943 #56307 Special notes for your reviewer: Release note: ```release-note None ``` /sig node /area hw-accelerators /cc @jiayingz @RenaudWasTaken @vishh @ScorpioCPH @sjenning @derekwaynecarr @jeremyeder @lichuqiang @tengqm	2018-02-21 13:07:26 -08:00
abhi	ad6bf35c18	Test cases to verify container log stats The commit contains test case modifications to test and verify changes for container log stats feature. Signed-off-by: abhi <abhi@docker.com>	2018-02-21 13:01:49 -08:00
Kubernetes Submit Queue	4bfc29916b	Merge pull request #59901 from NickrenREN/rename-storageobjinuseprotection Automatic merge from submit-queue (batch tested with PRs 59901, 59302, 59928). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Rename StorageProtection to StorageObjectInUseProtection Rename StorageProtection to StorageObjectInUseProtection Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #59639 Special notes for your reviewer: Release note: ```release-note Rename StorageProtection to StorageObjectInUseProtection ```	2018-02-21 07:02:32 -08:00
Kubernetes Submit Queue	6e6c4ce1f2	Merge pull request #60091 from ravisantoshgudimetla/monitor-kubepods Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Bump runc to latest and modify test cases for linux cgroup manager. What this PR does / why we need it: This PR has 2 commits - Bumps runc to latest and fixes trailing "/" problem in ExpandSlice of runc - Fixes the cgroup_manager_linux_tests.go test cases to have "/" as prefix. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #59993 Special notes for your reviewer: cc @sjenning @derekwaynecarr Release note: ```release-note NONE ```	2018-02-20 23:58:53 -08:00
vikaschoudhary16	e64517cd74	Migrate deviceplugin api from v1alpha to v1beta1	2018-02-21 01:26:20 -05:00
vikaschoudhary16	defcab81d5	Invoke PreStart RPC call before container start, if desired by plugin Signed-off-by: vikaschoudhary16 <vichoudh@redhat.com>	2018-02-21 01:25:24 -05:00
abhi	6649d38c96	Adding per container stats for CRI runtimes This commit aims to collect per container log stats. The change was proposed as a part of #55905. The change includes changing the log path from /var/pod/<pod uid>/containername_attempt.log to /var/pod/<pod uid>/containername/containername_attempt.log. The logs are collected by reusing volume package to collect metrics from the log path. Signed-off-by: abhi <abhi@docker.com>	2018-02-20 19:50:47 -08:00
NickrenREN	dad0fa07b7	rename StorageProtection to StorageObjectInUseProtection	2018-02-21 10:48:56 +08:00
David Ashpole	a55119820e	fix running with no eviction thresholds	2018-02-20 13:49:14 -08:00
ravisantoshgudimetla	a9a724d500	Test cases fix after path expansion	2018-02-20 14:23:09 -05:00
yue9944882	9ecc0b2bd2	fixes document grammar	2018-02-20 10:38:41 -05:00
Kubernetes Submit Queue	96ec318718	Merge pull request #59842 from ixdy/update-rules_go-02-2018 Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Update bazelbuild/rules_go, kubernetes/repo-infra, and gazelle dependencies What this PR does / why we need it: updates our bazelbuild/rules_go dependency in order to bump everything to go1.9.4. I'm separating this effort into two separate PRs, since updating rules_go requires a large cleanup, removing an attribute from most build rules. Release note: ```release-note NONE ```	2018-02-19 22:23:05 -08:00
jiaxuanzhou	039b695e29	Disable image GC when high threshold is set to 100	2018-02-20 14:07:19 +08:00
jiaxuanzhou	17fff2d4ba	Disable ImageGC when high threshold is set to 0	2018-02-20 13:49:12 +08:00
ohmystack	ecc13c8d86	dockertools: disable MemorySwap on Linux According to docker docs, setting MemorySwap equals to Memory can prevent docker containers from using any swap, instead of setting MemorySwap to zero.	2018-02-18 20:38:44 +08:00
David Ashpole	960856f4e8	collect metrics on the /kubepods cgroup on-demand	2018-02-17 12:32:40 -08:00
Kubernetes Submit Queue	270ed995f4	Merge pull request #59841 from dashpole/metrics_after_reclaim Automatic merge from submit-queue (batch tested with PRs 59683, 59964, 59841, 59936, 59686). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Reevaluate eviction thresholds after reclaim functions What this PR does / why we need it: When the node comes under `DiskPressure` due to inodes or disk space, the eviction manager runs garbage collection functions to clean up dead containers and unused images. Currently, we use the strategy of trying to measure the disk space and inodes freed by garbage collection. However, as #46789 and #56573 point out, there are gaps in the implementation that can cause extra evictions even when they are not required. Furthermore, for nodes which frequently cycle through images, it results in a large number of evictions, as running out of inodes always causes an eviction. This PR changes this strategy to call the garbage collection functions and ignore the results. Then, it triggers another collection of node-level metrics, and sees if the node is still under DiskPressure. This way, we can simply observe the decrease in disk or inode usage, rather than trying to measure how much is freed. Which issue(s) this PR fixes: Fixes #46789 Fixes #56573 Related PR #56575 Special notes for your reviewer: This will look cleaner after #57802 removes arguments from [makeSignalObservations](https://github.com/kubernetes/kubernetes/pull/57802/files#diff-9e5246d8c78d50ce4ba440f98663f3e9R719). Release note: ```release-note NONE ``` /sig node /kind bug /priority important-soon cc @kubernetes/sig-node-pr-reviews	2018-02-16 16:31:33 -08:00
Jeff Grafton	ef56a8d6bb	Autogenerated: hack/update-bazel.sh	2018-02-16 13:43:01 -08:00
Kubernetes Submit Queue	930f86574f	Merge pull request #57885 from cimomo/kubelet-fixes Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Improve comments for kubelet What this PR does / why we need it: Improve comments and fix typos for kubelet. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note NONE ```	2018-02-16 13:38:49 -08:00
Kubernetes Submit Queue	244549f02a	Merge pull request #59769 from dashpole/capacity_ephemeral_storage Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Collect ephemeral storage capacity on initialization What this PR does / why we need it: We have had some node e2e flakes where a pod can be rejected if it requests ephemeral storage. This is because we don't set capacity and allocatable for ephemeral storage on initialization. This PR causes cAdvisor to do one round of stats collection during initialization, which will allow it to get the disk capacity when it first sets the node status. It also sets the node to NotReady if capacities have not been initialized yet. Special notes for your reviewer: Release note: ```release-note NONE ``` /assign @jingxu97 @Random-Liu /sig node /kind bug /priority important-soon	2018-02-16 11:17:02 -08:00
Kubernetes Submit Queue	eac5bc0035	Merge pull request #57136 from k82cn/k8s_54313 Automatic merge from submit-queue (batch tested with PRs 57136, 59920). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Updated PID pressure node condition. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): part of #54313 Release note: ```release-note Updated PID pressure node condition ```	2018-02-16 10:35:33 -08:00
David Ashpole	e0830d0b71	reevaluate eviction thresholds after reclaim functions	2018-02-16 08:35:24 -08:00
Kubernetes Submit Queue	c105796e4b	Merge pull request #59953 from Random-Liu/fix-pod-scheduled Automatic merge from submit-queue (batch tested with PRs 59873, 59933, 59923, 59944, 59953). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix pod scheduled. Fix `PodScheduled` condition. The test `[k8s.io] EquivalenceCache [Serial] validates pod affinity works properly when new replica pod is scheduled` for cri-containerd is flaky. The reason is that it assumes all existing pods have the `PodScheduled` condition, but that is not the case: ``` Feb 15 15:31:01.359: INFO: with-label-390d246e-1265-11e8-beb8-0a580a3c7b55 bootstrap-e2e-minion-group-l6qw Running [{Initialized True 0001-01-01 00:00:00 +0000 UTC 2018-02-15 15:30:59 +0000 UTC } {Ready True 0001-01-01 00:00:00 +0000 UTC 2018-02-15 15:31:00 +0000 UTC } {PodScheduled True 0001-01-01 00:00:00 +0000 UTC 2018-02-15 15:30:59 +0000 UTC }] Feb 15 15:31:01.359: INFO: calico-node-7mzxc bootstrap-e2e-minion-group-hztx Running [{Initialized True 0001-01-01 00:00:00 +0000 UTC 2018-02-15 14:17:05 +0000 UTC } {Ready True 0001-01-01 00:00:00 +0000 UTC 2018-02-15 14:17:59 +0000 UTC }] Feb 15 15:31:01.359: INFO: calico-node-kvrsx bootstrap-e2e-minion-group-l6qw Running [{Initialized True 0001-01-01 00:00:00 +0000 UTC 2018-02-15 15:24:54 +0000 UTC } {Ready True 0001-01-01 00:00:00 +0000 UTC 2018-02-15 15:25:20 +0000 UTC }] Feb 15 15:31:01.359: INFO: calico-node-llwjh ``` I'm not sure why this doesn't happen with docker. One theory is that we don't prepull images in cri-containerd, and we start pods a bit faster for cri-containerd, which exposes the race condition. /cc @kubernetes/sig-node-bugs Signed-off-by: Lantao Liu <lantaol@google.com> What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note none ```	2018-02-15 20:16:44 -08:00
Kubernetes Submit Queue	99c87cf679	Merge pull request #59923 from jsafrane/volumemanager-logs Automatic merge from submit-queue (batch tested with PRs 59873, 59933, 59923, 59944, 59953). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Rework volume manager log levels - all normal logs to go to level 4 - too frequent / duplicate logs go to level 5 (e.g. when something else logged similar message not too far away). I checked that there is no excessive spam in the log - reconciler runs every 100ms, but it does not log anything if there is nothing to do. What this PR does / why we need it: This will help us debug flakes. E2e tests do not log levels 10-12 used in volume manager Release note: ```release-note NONE ``` /sig storage /sig node cc: @jingxu97 @sjenning	2018-02-15 20:16:38 -08:00
Kubernetes Submit Queue	c7c5d89e32	Merge pull request #59873 from jsafrane/fix-downward-flake Automatic merge from submit-queue (batch tested with PRs 59873, 59933, 59923, 59944, 59953). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix DownwardAPI refresh race. WaitForAttachAndMount should mark only pod in DesiredStateOfWorldPopulator (DSWP) and DSWP should mark the volume to be remounted only when the new pod has been processed. Otherwise DSWP and reconciler race who gets the new pod first. If it's reconciler, then DownwardAPI and Projected volumes of the pod are not refreshed with new content and they are updated after the next periodic sync (60-90 seconds). Fixes #59813 /assign @jingxu97 @saad-ali /sig storage /sig node ```release-note None ```	2018-02-15 20:16:32 -08:00
Kubernetes Submit Queue	bfdd94c6a0	Merge pull request #59170 from cofyc/fix_kubelet_volume_metrics Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix kubelet PVC stale metrics What this PR does / why we need it: Volumes on each node change, so we should not only add PVC metrics into a gauge vector. It's better to use a collector to collect metrics from internal stats. Currently, if a PV (bound to a PVC `testpv`) is attached and used by node A, then migrated to node B or just deleted from node A later, `testpvc` metrics will not disappear from kubelet on node A. After running for a long time, the `kubelet` process will keep a lot of stale volume metrics in memory. For these dynamic metrics, it's better to use a collector to collect metrics from a data source (`StatsProvider` here), like [kube-state-metrics](https://github.com/kubernetes/kube-state-metrics) scraping metrics from kube-apiserver. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes https://github.com/kubernetes/kubernetes/issues/57686 Special notes for your reviewer: Release note: ```release-note Fix kubelet PVC stale metrics ```	2018-02-15 18:44:08 -08:00
David Ashpole	b259543985	collect ephemeral storage capacity on initialization	2018-02-15 17:33:22 -08:00
Lantao Liu	f69b4e9262	Fix pod scheduled. Signed-off-by: Lantao Liu <lantaol@google.com>	2018-02-16 00:51:20 +00:00
JulienBalestra	493f335830	kubelet: revert the get pod status	2018-02-15 22:24:35 +01:00
Kubernetes Submit Queue	c03edcc58e	Merge pull request #53833 from mtaufen/kubeletconfig-to-beta Automatic merge from submit-queue (batch tested with PRs 59353, 59905, 53833). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Graduate kubeletconfig API group to beta Regarding https://github.com/kubernetes/features/issues/281, this PR moves the kubeletconfig API group to beta. After #53088, the KubeletConfiguration type should not contain any deprecated or experimental fields, and we should not have to remove any more fields from the type before graduating it to beta. We need the community to double check for two things, however: 1. Are there any fields currently in the KubeletConfiguration type that you were going to mark deprecated this quarter, but haven't yet? 2. Are there any fields currently in the KubeletConfiguration type that are experimental or alpha, but were not explicitly denoted as such? Please comment on this PR if you can answer "yes" to either of those two questions. Please cc anyone with a stake in the kubeletconfig API, so we get as much coverage as possible. /cc @thockin @dchen1107 @Random-Liu @yujuhong @dashpole @tallclair @vishh @abw @freehan @dnardo @bowei @MrHohn @luxas @liggitt @ncdc @derekwaynecarr @mikedanese @kubernetes/sig-network-pr-reviews, @kubernetes/sig-node-pr-reviews ```release-note action required: The `kubeletconfig` API group has graduated from alpha to beta, and the name has changed to `kubelet.config.k8s.io`. Please use `kubelet.config.k8s.io/v1beta1`, as `kubeletconfig/v1alpha1` is no longer available. ``` TODO: - [x] Move experimental/non-gated-alpha/soon-to-be-deprecated fields to `KubeletFlags` - [x] #53088 - [x] #54154 - [x] #54160 - [x] #55562 - [x] #55983 - [x] #57851 - [x] Lift embedded structure out of strings - [x] #53025 - [x] #54643 - [x] #54823 - [x] #55254 - [x] Resolve relative paths against the location config files are loaded from - [x] #55648 - [x] Rename to `kubelet.config.k8s.io` - [x] Comments - [x] Make sure existing comments at least read sensibly. - [x] Note default values in comments on the versioned struct. - [x] Remove any reference to default values in comments on the internal struct. - [x] Most fields should be `+optional` and `omitempty`. Add where necessary. ~Where omitted, explicitly comment.~ Edit: We should not distinguish between nil and empty, see below items. - [x] Ensure defaults are specified via `pkg/kubelet/apis/kubelet.config.k8s.io/v1beta1/defaults.go`, not `cmd/kubelet/app/options/options.go`. - [x] #57770 - [x] Ensure kubeadm does not persist v1alpha1 KubeletConfiguration objects (or feature-gates this functionality) - [x] Don't make a distinction between empty and nil, because of #43203. - [x] #59515 - [x] #59681 - [x] Take the opportunity to fix insecure Kubelet defaults @tallclair - [x] #59666 - [x] Remove CAdvisorPort from KubeletConfiguration wrt #56523. - [x] #59580 - [x] Hide `ConfigTrialDuration` until we're more sure what to do with it. - [x] #59628 - [x] Fix `// default: x` comments after rebasing on recent changes.	2018-02-15 11:06:40 -08:00
Kubernetes Submit Queue	b099e91920	Merge pull request #59905 from mtaufen/dkcfg-config-ok-kubelet-config-ok Automatic merge from submit-queue (batch tested with PRs 59353, 59905, 53833). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Rename ConfigOK to KubeletConfigOk This is a more accurate name for the condition, as it describes the status of the Kubelet's configuration. Also cleans up capitalization of internal names. ```release-note The ConfigOK node condition has been renamed to KubeletConfigOk. ```	2018-02-15 11:06:36 -08:00
Jan Safranek	e260096392	Rework volume manager log levels - all normal logs go to level 4 - too frequent / duplicate logs go to level 5 (e.g. when something else logged a similar message not too far away).	2018-02-15 16:33:17 +01:00
Kubernetes Submit Queue	7377c5911a	Merge pull request #59892 from JulienBalestra/revert-host-ip Automatic merge from submit-queue (batch tested with PRs 59877, 59886, 59892). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. kubelet: revert the status HostIP behavior What this PR does / why we need it: This PR partially reverts #57106 to fix #59889. PR #57106 changed the behavior of `generateAPIPodStatus` when the kubeClient is nil. Release note: ```release-note NONE ```	2018-02-15 00:01:35 -08:00
Kubernetes Submit Queue	4b147e0361	Merge pull request #59588 from jiayingz/v1beta1 Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Create pkg/kubelet/apis/deviceplugin/v1beta1 directory. The proto stays the same as v1alpha. Only changes Version in constants.go to "v1beta1" and the BUILD file to pick up the new dir. ```release-note Adding pkg/kubelet/apis/deviceplugin/v1beta1 API. ```	2018-02-14 22:51:32 -08:00
Michael Taufen	d8cc440dd6	Rename ConfigOK to KubeletConfigOk This is a more accurate name for the condition, as it describes the status of the Kubelet's configuration. Also cleans up capitalization of internal names.	2018-02-14 19:36:52 -08:00
Michael Taufen	9ebaf5e7d2	Move the kubeletconfig v1alpha1 API to beta, rename to kubelet.config.k8s.io	2018-02-14 17:30:22 -08:00
yue9944882	a331f818ff	Force the node name in generated static pod names to lowercase	2018-02-14 17:46:51 -06:00
JulienBalestra	2130f5bc55	kubelet: revert the status HostIP behavior	2018-02-14 23:38:09 +01:00
Kai Chen	9ca0d32fbb	Improve comments for kubelet	2018-02-14 12:03:46 -08:00
Kubernetes Submit Queue	63380d12db	Merge pull request #59666 from mtaufen/kc-secure-componentconfig-defaults Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Secure Kubelet's componentconfig defaults while maintaining CLI compatibility This updates the Kubelet's componentconfig defaults, while applying the legacy defaults to values from options.NewKubeletConfiguration(). This keeps defaults the same for the command line and improves the security of defaults when you load config from a file. See: https://github.com/kubernetes/kubernetes/issues/53618 See: https://github.com/kubernetes/kubernetes/pull/53833#discussion_r166669931 Also moves EnableServer to KubeletFlags, per @tallclair's comments on #53833. We should find way of generating documentation for config file defaults, so that people can easily look up what's different from flags. ```release-note Action required: Default values differ between the Kubelet's componentconfig (config file) API and the Kubelet's command line. Be sure to review the default values when migrating to using a config file. ```	2018-02-14 10:09:13 -08:00
Kubernetes Submit Queue	79b1589657	Merge pull request #59788 from Lihua93/fix/typos Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix typos What this PR does / why we need it: To fix some typos Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note NONE ```	2018-02-14 08:51:31 -08:00
Jan Safranek	c232a0165a	Fix DownwardAPI refresh race. WaitForAttachAndMount should only mark the pod in DesiredStateOfWorldPopulator (DSWP), and DSWP should mark the volume to be remounted only when the new pod has been processed. Otherwise DSWP and the reconciler race to get the new pod first. If the reconciler wins, the DownwardAPI and Projected volumes of the pod are not refreshed with new content and are only updated after the next periodic sync (60-90 seconds).	2018-02-14 16:54:25 +01:00
Kubernetes Submit Queue	a129c0f984	Merge pull request #59832 from shyamjvs/fix-fake-docker-client-ip-collision Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fake docker-client assigns random IPs to containers Fixes https://github.com/kubernetes/kubernetes/issues/59823 /cc @wojtek-t @Random-Liu	2018-02-14 07:08:24 -08:00
Shyam Jeedigunta	517301df21	Fake docker-client assigns random IPs to containers	2018-02-14 14:28:52 +01:00
Kubernetes Submit Queue	58dcf3c533	Merge pull request #59489 from pohly/master-tmpdir Automatic merge from submit-queue (batch tested with PRs 59489, 59716). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. devicemanager testing: dynamically choose tmp dir This avoids the test issue #59488 that I was running into. I believe I have a reasonable explanation for the race condition in that issue (TLDR: it's probably part of the gRPC API and k8s can only avoid the issue until a proper solution gets worked out together with gRPC), therefore I suggest merging this PR now, both because it avoids the issue and because using fixed tmp directories is something that should be avoided anyway. /assign @jiayingz	2018-02-14 00:14:31 -08:00
Michael Taufen	c1e34bc725	Secure Kubelet's componentconfig defaults while maintaining CLI compatibility This updates the Kubelet's componentconfig defaults, while applying the legacy defaults to values from options.NewKubeletConfiguration(). This keeps defaults the same for the command line and improves the security of defaults when you load config from a file. See: https://github.com/kubernetes/kubernetes/issues/53618 See: https://github.com/kubernetes/kubernetes/pull/53833#discussion_r166669931	2018-02-13 18:10:15 -08:00
Kubernetes Submit Queue	7b678dc403	Merge pull request #57106 from JulienBalestra/kubelet-update-local-pods Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Kubelet status manager sync the status of local Pods What this PR does / why we need it: In the kubelet, when using `--pod-manifest-path` the kubelet creates static pods but doesn't update the status accordingly in the `PodList`. This PR fixes the incorrect status of each Pod in the kubelet's `PodList`. This is the setup used to reproduce the issue: manifest: ```bash cat ~/kube/staticpod.yaml ``` ```yaml apiVersion: v1 kind: Pod metadata: labels: app: nginx name: nginx namespace: default spec: hostNetwork: true containers: - name: nginx image: nginx:latest imagePullPolicy: IfNotPresent volumeMounts: - name: os-release mountPath: /usr/share/nginx/html/index.html readOnly: true volumes: - name: os-release hostPath: path: /etc/os-release ``` kubelet: ```bash ~/go/src/k8s.io/kubernetes/_output/bin/kubelet --pod-manifest-path ~/kube/ --cloud-provider="" --register-node --kubeconfig kubeconfig.yaml ``` You can observe this by querying the kubelet API `/pods`: ```bash curl -s 127.0.0.1:10255/pods | jq .
``` ```json { "kind": "PodList", "apiVersion": "v1", "metadata": {}, "items": [ { "metadata": { "name": "nginx-nodeName", "namespace": "default", "selfLink": "/api/v1/namespaces/default/pods/nginx-nodeName", "uid": "0fdfa64c73d9de39a9e5c05ef7967e72", "creationTimestamp": null, "labels": { "app": "nginx" }, "annotations": { "kubernetes.io/config.hash": "0fdfa64c73d9de39a9e5c05ef7967e72", "kubernetes.io/config.seen": "2017-12-12T18:42:46.088157195+01:00", "kubernetes.io/config.source": "file" } }, "spec": { "volumes": [ { "name": "os-release", "hostPath": { "path": "/etc/os-release", "type": "" } } ], "containers": [ { "name": "nginx", "image": "nginx:latest", "resources": {}, "volumeMounts": [ { "name": "os-release", "readOnly": true, "mountPath": "/usr/share/nginx/html/index.html" } ], "terminationMessagePath": "/dev/termination-log", "terminationMessagePolicy": "File", "imagePullPolicy": "IfNotPresent" } ], "restartPolicy": "Always", "terminationGracePeriodSeconds": 30, "dnsPolicy": "ClusterFirst", "nodeName": "nodeName", "hostNetwork": true, "securityContext": {}, "schedulerName": "default-scheduler", "tolerations": [ { "operator": "Exists", "effect": "NoExecute" } ] }, "status": { "phase": "Pending", "conditions": [ { "type": "PodScheduled", "status": "True", "lastProbeTime": null, "lastTransitionTime": "2017-12-12T17:42:51Z" } ] } } ] } ``` The status of the nginx `Pod` remains in the Pending phase. ```bash curl -I 127.0.0.1 HTTP/1.1 200 OK ``` It's reported as expected on the apiserver side: ```bash kubectl get po --all-namespaces -w NAMESPACE NAME READY STATUS RESTARTS AGE default nginx-nodeName 0/1 Pending 0 0s default nginx-nodeName 1/1 Running 0 2s ``` It doesn't work with a standalone kubelet either: ```bash ~/go/src/k8s.io/kubernetes/_output/bin/kubelet --pod-manifest-path ~/kube/ --cloud-provider="" --register-node false ``` Special notes for your reviewer: Release note: ```release-note NONE ```	2018-02-13 16:46:33 -08:00
Kubernetes Submit Queue	9de5839944	Merge pull request #59681 from mtaufen/kc-empty-eviction-hard Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Ignore 0% and 100% eviction thresholds Primarily, this gives a way to explicitly disable eviction, which is necessary to use omitempty on EvictionHard. See: https://github.com/kubernetes/kubernetes/pull/53833#discussion_r166672137 As justification for this approach, neither 0% nor 100% make sense as eviction thresholds; in the "less-than" case, you can't have less than 0% of a resource and 100% perpetually evicts; in the "greater-than" case (assuming we ever add a resource with this semantic), the reasoning is the reverse (not more than 100%, 0% perpetually evicts). ```release-note Eviction thresholds set to 0% or 100% are now ignored. ```	2018-02-13 09:48:11 -08:00
Lihua Tang	cad52f6576	Fix typos	2018-02-13 16:17:37 +08:00
Kubernetes Submit Queue	4afdc33c57	Merge pull request #56454 from mtanino/volumehandler-refactor Automatic merge from submit-queue (batch tested with PRs 59767, 56454, 59237, 59730, 55479). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Block Volume: Refactor volumehandler in operationexecutor What this PR does / why we need it: Based on discussion with @saad-ali at #51494, we need to refactor volumehandler in operationexecutor for the Block Volume feature. We don't need to add volumehandler as a separate object. ``` VolumeHandler does not need to be a separate object that is constructed inline like this. You can create a new operation, e.g. UnmountOperation to which you pass the spec, and it can return either a UnmountVolume or UnmapVolume. ``` Which issue(s) this PR fixes : no related issue. Special notes for your reviewer: @saad-ali @msau42 Release note: ```release-note NONE ```	2018-02-12 15:44:33 -08:00
Michael Taufen	21dbbe14f2	Ignore 0% and 100% eviction thresholds Primarily, this gives a way to explicitly disable eviction, which is necessary to use omitempty on EvictionHard. See: https://github.com/kubernetes/kubernetes/pull/53833#discussion_r166672137 As justification for this approach, neither 0% nor 100% make sense as eviction thresholds; in the "less-than" case, you can't have less than 0% of a resource and 100% perpetually evicts; in the "greater-than" case (assuming we ever add a resource with this semantic), the reasoning is the reverse (not more than 100%, 0% perpetually evicts).	2018-02-12 14:13:00 -08:00
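The rule described in the commit above can be sketched as a small filter. This is only an illustration under stated assumptions: `activeThresholds` and `parsePercent` are hypothetical helpers, not the kubelet's eviction code, and the map of signal name to percentage string is a simplified stand-in for `EvictionHard`.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// parsePercent converts a string such as "10%" into the number 10.
// Hypothetical helper for this sketch only.
func parsePercent(s string) (float64, error) {
	if !strings.HasSuffix(s, "%") {
		return 0, fmt.Errorf("%q is not a percentage", s)
	}
	return strconv.ParseFloat(strings.TrimSuffix(s, "%"), 64)
}

// activeThresholds keeps only the thresholds that can meaningfully trigger.
// Entries set to 0% or 100% are skipped, which gives a way to disable
// eviction for a signal explicitly.
func activeThresholds(hard map[string]string) (map[string]float64, error) {
	out := make(map[string]float64)
	for signal, value := range hard {
		pct, err := parsePercent(value)
		if err != nil {
			return nil, err
		}
		if pct == 0 || pct == 100 {
			continue // ignored: never true, or perpetually true
		}
		out[signal] = pct
	}
	return out, nil
}

func main() {
	got, err := activeThresholds(map[string]string{
		"memory.available":  "0%",   // explicitly disabled
		"nodefs.available":  "10%",  // kept
		"imagefs.available": "100%", // ignored
	})
	if err != nil {
		panic(err)
	}
	fmt.Println(got)
}
```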
Seth Jennings	9ab9ddeb19	kubelet: check for illegal phase transition	2018-02-12 15:28:10 -06:00
Yecheng Fu	fecff55c59	Fix kubelet PVC metrics using a volume stats collector. The volumes on each node change, so we should not only add PVC metrics into a gauge vector. It's better to use a collector to collect metrics from stats.	2018-02-11 23:48:06 +08:00
Kubernetes Submit Queue	317853c90c	Merge pull request #59464 from dixudx/fix_all_typos Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. fix all the typos across the project What this PR does / why we need it: There are lots of typos across the project. We should avoid small PRs fixing those annoying typos, which are time-consuming and inefficient. This PR fixes all the typos currently in the project. And with #59463, typos could be avoided when a new PR gets merged. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: /sig testing /area test-infra /sig release /cc @ixdy /assign @fejta Release note: ```release-note None ```	2018-02-10 22:12:45 -08:00
Di Xu	48388fec7e	fix all the typos across the project	2018-02-11 11:04:14 +08:00
hangaoshuai	7cfb94cbc5	Use SeekStart, SeekCurrent, and SeekEnd in place of the deprecated constants	2018-02-11 11:02:23 +08:00
yangfan	8221ab4641	Fix some typos	2018-02-11 10:41:28 +08:00
Kubernetes Submit Queue	9d33926d5c	Merge pull request #59628 from mtaufen/kc-hide-trial-duration Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Bury KubeletConfiguration.ConfigTrialDuration for now Based on discussion in https://github.com/kubernetes/kubernetes/pull/53833/files#r166669046, this PR chooses not to expose a knob for the trial duration yet. It is unclear exactly which shape this functionality should take in the API. ```release-note The alpha KubeletConfiguration.ConfigTrialDuration field is no longer available. ```	2018-02-10 14:03:08 -08:00
mtanino	973583e781	Refactor volumehandler in operationexecutor	2018-02-09 15:39:55 -05:00
Patrick Ohly	0d828e061b	devicemanager testing: time out sooner Each individual step should not take longer than a second. Suggested by Vikas Choudhary (https://github.com/kubernetes/kubernetes/pull/59489#discussion_r167205672).	2018-02-09 20:51:54 +01:00
Kubernetes Submit Queue	76e6da25fa	Merge pull request #59481 from rojkov/dm-unittests Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. devicemanager: increase code coverage of endpoint's unit test Particularly cover the code path when an unhealthy device becomes healthy.	2018-02-09 10:35:22 -08:00
Patrick Ohly	1325c2f8be	devicemanager testing: dynamically choose tmp dir Hard-coding the tests to use /tmp/device_plugin for sockets is problematic because it prevents running tests in parallel on the same machine (perhaps because there are multiple developers, perhaps because testing is done independently on different code checkouts). /tmp/device_plugin also was not removed after testing. This is probably not that relevant. But more importantly, this change also fixes https://github.com/kubernetes/kubernetes/issues/59488. "make test" failed in TestDevicePluginReRegistration because something removed /tmp/device_plugin/device-plugin.sock while something else tried to connect to it: 2018/02/07 14:34:39 Starting to serve on /tmp/device_plugin/device-plugin.sock [pid 29568] connect(14, {sa_family=AF_UNIX, sun_path="/tmp/device_plugin/server.sock"}, 33) = 0 [pid 29568] unlinkat(AT_FDCWD, "/tmp/device_plugin/server.sock", 0) = 0 [pid 29568] unlinkat(AT_FDCWD, "/tmp/device_plugin/device-plugin.sock", 0) = 0 [pid 29568] --- SIGPIPE {si_signo=SIGPIPE, si_code=SI_USER, si_pid=29568, si_uid=1000} --- [pid 29568] connect(6, {sa_family=AF_UNIX, sun_path="/tmp/device_plugin/device-plugin.sock"}, 40) = -1 ENOENT (No such file or directory) E0207 14:34:39.961321 29568 endpoint.go:117] listAndWatch ended unexpectedly for device plugin mock with error rpc error: code = Unavailable desc = transport is closing strace: Process 29623 attached [pid 29574] connect(3, {sa_family=AF_UNIX, sun_path="/tmp/device_plugin/device-plugin.sock"}, 40) = -1 ENOENT (No such file or directory) [pid 29623] connect(3, {sa_family=AF_UNIX, sun_path="/tmp/device_plugin/device-plugin.sock"}, 40) = -1 ENOENT (No such file or directory) [pid 29574] connect(3, {sa_family=AF_UNIX, sun_path="/tmp/device_plugin/device-plugin.sock"}, 40) = -1 ENOENT (No such file or directory) E0207 14:34:49.961324 29568 endpoint.go:60] Can't create new endpoint with path /tmp/device_plugin/device-plugin.sock err failed to dial 
device plugin: context deadline exceeded E0207 14:34:49.961390 29568 manager.go:340] Failed to dial device plugin with request &RegisterRequest{Version:v1alpha2,Endpoint:device-plugin.sock,ResourceName:fake-domain/resource,}: failed to dial device plugin: context deadline exceeded panic: test timed out after 2m0s It's not entirely certain which code was to blame for these unlinkat() calls (perhaps some cleanup code from a previous test running in a goroutine?) but this no longer happened after switching to per-test socket directories.	2018-02-09 14:01:13 +01:00
Michael Taufen	9c8eab96d0	Bury KubeletConfiguration.ConfigTrialDuration for now	2018-02-08 21:41:21 -08:00
Kubernetes Submit Queue	b24bc2cfdc	Merge pull request #59475 from Random-Liu/use-mountpoint Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add mountpoint as CRI image filesystem storage identifier. Fixes https://github.com/kubernetes/kubernetes/issues/57356. This PR changes CRI to use mountpoint as storage identifier. See https://github.com/kubernetes/kubernetes/issues/57356#issuecomment-363608733. Note that: 1) This doesn't work with devicemapper for now. Please feel free to propose a change for devicemapper; we can discuss this further after this first version is merged. @mrunalp @runcom 2) `mountpoint` is added as a new field in `StorageIdentifier` now. After https://github.com/kubernetes/kubernetes/pull/58973 is merged, we can remove the UUID field in `v1alpha2`. /cc @yujuhong @feiskyer @yguo0905 @dashpole @mikebrow @abhi @kubernetes/sig-node-api-reviews Release note: ```release-note CRI starts using mountpoint as image filesystem identifier instead of UUID. ```	2018-02-08 19:41:57 -08:00
Kubernetes Submit Queue	d6625f857a	Merge pull request #58177 from jingxu97/Jan/reconstruct Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Redesign and implement volume reconstruction work This PR is the first part of the redesign of the volume reconstruction work. The detailed design information is at https://github.com/kubernetes/community/pull/1601 The changes include 1. Remove the dependency on the volume spec stored in actual state for the volume cleanup process (UnmountVolume and UnmountDevice). Modify the AttachedVolume struct to add DeviceMountPath so that the volume unmount operation can use this information instead of constructing it from the volume spec. 2. Modify the reconciler's volume reconstruction process (syncState). The current workflow is that when the kubelet restarts, syncState() is called only once before the reconciler starts its loop. a. If the volume plugin supports reconstruction, it will use the reconstructed volume spec information to update actual state as before. b. If the volume plugin cannot support reconstruction, it will use the scanned mount path information to clean up the mounts. In this PR, all the plugins still support reconstruction (except glusterfs), so reconstruction of some plugins will still have issues. The next PR will modify those plugins that cannot support reconstruction well. This PR addresses issue #52683	2018-02-08 18:21:34 -08:00
Kubernetes Submit Queue	98abac70ce	Merge pull request #59598 from Random-Liu/remove-unnecessary-summary-call Automatic merge from submit-queue (batch tested with PRs 59344, 59595, 59598). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Remove unnecessary summary api call. Summary API call is not as cheap as we think. Especially for CRI container runtime, it means: 1) Extra cgroups parsing (because cpu/memory are considered to be on demand); 2) Extra grpc encoding/decoding `ListPodSandboxes`, `ListContainers`, `ListContainerStats`; I don't think it is necessary to call summary twice inside the same function. /cc @kubernetes/sig-node-pr-reviews @dashpole @jingxu97 Signed-off-by: Lantao Liu <lantaol@google.com> Release note: ```release-note none ```	2018-02-08 18:06:37 -08:00
Jiaying Zhang	90c2098103	Create pkg/kubelet/apis/deviceplugin/v1beta1 directory. The proto stays the same as v1alpha. Only changes Version in constants.go to "v1beta1" and the BUILD file to pick up the new dir.	2018-02-08 17:04:43 -08:00
Lantao Liu	91cbf93b90	Remove unnecessary summary api call. Signed-off-by: Lantao Liu <lantaol@google.com>	2018-02-08 22:57:30 +00:00
Michael Taufen	5ab9ccd4fb	remove CAdvisorPort from KubeletConfiguration See: #56523, cAdvisor is becoming an implementation detail of Kubernetes, and we should not canonize its knobs on the KubeletConfiguration.	2018-02-08 13:51:41 -08:00
Kubernetes Submit Queue	6cc3641730	Merge pull request #59515 from mtaufen/kc-enforcenodeallocatable-none-option Automatic merge from submit-queue (batch tested with PRs 59054, 59515, 59577). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add 'none' option to EnforceNodeAllocatable This lets us use omitempty on `EnforceNodeAllocatable`. We shouldn't treat `nil` as different from `[]T{}`, because this can play havoc with serializers (a-la #43203). See: https://github.com/kubernetes/kubernetes/pull/53833#discussion_r166672137 ```release-note 'none' can now be specified in KubeletConfiguration.EnforceNodeAllocatable (--enforce-node-allocatable) to explicitly disable enforcement. ```	2018-02-08 12:22:32 -08:00
Michael Taufen	3553390c97	Add 'none' option to EnforceNodeAllocatable This lets us use omitempty on EnforceNodeAllocatable. We shouldn't treat `nil` as different from `[]T{}`, because this can play havoc with serializers (a-la #43203). See: https://github.com/kubernetes/kubernetes/pull/53833#discussion_r166672137	2018-02-08 10:24:39 -08:00
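The serializer concern above can be seen with Go's standard `encoding/json`. This is a minimal sketch with hypothetical struct types (`withoutOmit`, `withOmit`) and an illustrative field tag; it is not the KubeletConfiguration type itself. Without `omitempty`, a nil slice and an empty slice encode differently; with `omitempty`, both are dropped, so an explicit value such as `none` is needed to express "disabled".

```go
package main

import (
	"encoding/json"
	"fmt"
)

// withoutOmit serializes the field even when it is nil or empty.
type withoutOmit struct {
	EnforceNodeAllocatable []string `json:"enforceNodeAllocatable"`
}

// withOmit drops the field when it is nil or empty.
type withOmit struct {
	EnforceNodeAllocatable []string `json:"enforceNodeAllocatable,omitempty"`
}

// encode marshals v to a JSON string and panics on error.
func encode(v interface{}) string {
	b, err := json.Marshal(v)
	if err != nil {
		panic(err)
	}
	return string(b)
}

func main() {
	// nil and empty differ without omitempty.
	fmt.Println(encode(withoutOmit{EnforceNodeAllocatable: nil}))        // {"enforceNodeAllocatable":null}
	fmt.Println(encode(withoutOmit{EnforceNodeAllocatable: []string{}})) // {"enforceNodeAllocatable":[]}

	// nil and empty are indistinguishable with omitempty.
	fmt.Println(encode(withOmit{EnforceNodeAllocatable: nil}))        // {}
	fmt.Println(encode(withOmit{EnforceNodeAllocatable: []string{}})) // {}

	// An explicit sentinel survives serialization.
	fmt.Println(encode(withOmit{EnforceNodeAllocatable: []string{"none"}})) // {"enforceNodeAllocatable":["none"]}
}
```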
Dan Williams	60a955d414	dockershim: don't check pod IP in StopPodSandbox We're about to tear the container down, there's no point. It also suppresses an annoying error message due to kubelet stupidity that causes multiple parallel calls to StopPodSandbox for the same sandbox. docker_sandbox.go:355] failed to read pod IP from plugin/docker: NetworkPlugin cni failed on the status hook for pod "docker-registry-1-deploy_default": Unexpected command output nsenter: cannot open /proc/22646/ns/net: No such file or directory 1) A first StopPodSandbox() request triggered by SyncLoop(PLEG) for a ContainerDied event calls into TearDownPod() and thus the network plugin. Until this completes, networkReady=true for the sandbox. 2) A second StopPodSandbox() request triggered by SyncLoop(REMOVE) calls PodSandboxStatus() and calls into the network plugin to read the IP address because networkReady=true 3) The first request exits the network plugin, sets networkReady=false, and calls StopContainer() on the sandbox. This destroys the network namespace. 4) The second request finally gets around to running nsenter but the network namespace is already destroyed. It returns an error which is logged by getIP().	2018-02-08 12:22:44 -06:00
Lee Verberne	8835f54480	kubelet: add support for pod PID namespace sharing This adds the logic for sending a NamespaceMode_POD to the runtime, but leaves it disconnected pending https://issues.k8s.io/58716.	2018-02-08 16:58:07 +01:00
Kubernetes Submit Queue	fb340a4695	Merge pull request #57824 from thockin/gcr-vanity Automatic merge from submit-queue (batch tested with PRs 57824, 58806, 59410, 59280). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. 2nd try at using a vanity GCR name The 2nd commit here is the changes relative to the reverted PR. Please focus review attention on that. This is the 2nd attempt. The previous try (#57573) was reverted while we figured out the regional mirrors (oops). New plan: k8s.gcr.io is a read-only facade that auto-detects your source region (us, eu, or asia for now) and pulls from the closest. To publish an image, push to k8s-staging.gcr.io and it will be synced to the regionals automatically (similar to today). For now the staging is an alias to gcr.io/google_containers (the legacy URL). When we move off of google-owned projects (working on it), then we just do a one-time sync, and change the google-internal config, and nobody outside should notice. We can, in parallel, change the auto-sync into a manual sync - send a PR to "promote" something from staging, and a bot activates it. Nice and visible, easy to keep track of. xref https://github.com/kubernetes/release/issues/281 TL;DR: * The new `staging-k8s.gcr.io` is where we push images. It is literally an alias to `gcr.io/google_containers` (the existing repo) and is hosted in the US. * The contents of `staging-k8s.gcr.io` are automatically synced to `{asia,eu,us}-k8s.gcr.io`. * The new `k8s.gcr.io` will be a read-only alias to whichever regional repo is closest to you. * In the future, images will be promoted from `staging` to regional "prod" more explicitly and auditably. ```release-note Use "k8s.gcr.io" for pulling container images rather than "gcr.io/google_containers". Images are already synced, so this should not impact anyone materially.
Documentation and tools should all convert to the new name. Users should take note of this in case they see this new name in the system. ```	2018-02-08 03:29:32 -08:00
Tim Hockin	3586986416	Switch to k8s.gcr.io vanity domain This is the 2nd attempt. The previous was reverted while we figured out the regional mirrors (oops). New plan: k8s.gcr.io is a read-only facade that auto-detects your source region (us, eu, or asia for now) and pulls from the closest. To publish an image, push to k8s-staging.gcr.io and it will be synced to the regionals automatically (similar to today). For now the staging is an alias to gcr.io/google_containers (the legacy URL). When we move off of google-owned projects (working on it), then we just do a one-time sync, and change the google-internal config, and nobody outside should notice. We can, in parallel, change the auto-sync into a manual sync - send a PR to "promote" something from staging, and a bot activates it. Nice and visible, easy to keep track of.	2018-02-07 21:14:19 -08:00
Kubernetes Submit Queue	eff9f75f70	Merge pull request #59297 from joelsmith/master Automatic merge from submit-queue (batch tested with PRs 59010, 59212, 59281, 59014, 59297). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Improve error returned when fetching container logs during pod termination What this PR does / why we need it: This change better handles fetching of logs when a container is in a crash loop backoff state. In cases where it is unable to fetch the logs, it gives a helpful error message back to a user who has requested logs of a container from a terminated pod. Rather than attempting to get logs for a container using an empty container ID, it returns a useful error message. In cases where the container runtime gets an error, log the error but don't leak it back through the API to the user. Which issue(s) this PR fixes: Fixes #59296 Release note: ```release-note NONE ```	2018-02-07 15:27:49 -08:00
Lantao Liu	a77450ec2d	Add mountpoint as CRI image filesystem storage identifier.	2018-02-07 23:01:06 +00:00
Kubernetes Submit Queue	b40f865ae5	Merge pull request #59472 from hanxiaoshuai/fixtodo02072 Automatic merge from submit-queue (batch tested with PRs 59276, 51042, 58973, 59377, 59472). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. clean up unused function GetKubeletDockerContainers What this PR does / why we need it: Fix todo: the function GetKubeletDockerContainers is now unused; it has been migrated off in test/e2e_node/garbage_collector_test.go in [#57976](https://github.com/kubernetes/kubernetes/pull/57976/files) Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note NONE ```	2018-02-07 12:00:53 -08:00
Kubernetes Submit Queue	cf7073a831	Merge pull request #58973 from verb/cri-enum Automatic merge from submit-queue (batch tested with PRs 59276, 51042, 58973, 59377, 59472). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Update Container Runtime Interface to use enumerated namespace modes What this PR does / why we need it: This updates the CRI as described in the [Shared PID Namespace](https://github.com/kubernetes/community/blob/master/contributors/design-proposals/node/pod-pid-namespace.md#container-runtime-interface-changes) proposal. This change to the alpha API is not backwards compatible: implementations of the CRI will need to update to the new API version. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): WIP #1615 Special notes for your reviewer: /assign @yujuhong Release note: ```release-note [action-required] The Container Runtime Interface (CRI) version has increased from v1alpha1 to v1alpha2. Runtimes implementing the CRI will need to update to the new version, which configures container namespaces using an enumeration rather than booleans. ```	2018-02-07 12:00:47 -08:00
Kubernetes Submit Queue	475457537b	Merge pull request #59276 from roboll/roboll/kubelet-fix Automatic merge from submit-queue (batch tested with PRs 59276, 51042, 58973, 59377, 59472). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. kubelet: only register api source when connecting What this PR does / why we need it: Before this change, an api source was always registered, even when there was no kubeclient. This led to some operations blocking while waiting for podConfig.SeenAllSources to pass, which it never would. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #59275 Special notes for your reviewer: Release note: ```release-note NONE ```	2018-02-07 12:00:40 -08:00
Joel Smith	749980b726	Handle fetch of container logs of error containers during pod termination * improve error returned when failing to fetch container logs * handle cases where logs are requested for containers without the container ID	2018-02-07 12:23:56 -07:00
hangaoshuai	e416f3746e	clean up unused function GetKubeletDockerContainers	2018-02-07 19:21:05 +08:00
Dmitry Rozhkov	3175a687a0	devicemanager: increase code coverage of endpoint's unit test Particularly cover the code path when an unhealthy device becomes healthy.	2018-02-07 12:29:48 +02:00
Lee Verberne	e10042d22f	Increment CRI version from v1alpha1 to v1alpha2 This also incorporates the version string into the package name so that incompatible versions will fail to connect. Arbitrary choices: - The proto3 package name is runtime.v1alpha2. The proto compiler normally translates this to a Go package of "runtime_v1alpha2", but I renamed it to "v1alpha2" for consistency with existing packages. - kubelet/apis/cri is used as "internalapi". I left it alone and put the public "runtimeapi" in kubelet/apis/cri/runtime.	2018-02-07 09:06:26 +01:00
Lee Verberne	0f1de41790	Update kubelet for enumerated CRI namespaces This adds support to both the Generic Runtime Manager and the dockershim for the CRI's enumerated namespaces.	2018-02-07 09:06:26 +01:00
Lee Verberne	f4ab2b6331	Switch CRI NamespaceOption from bools to enums	2018-02-07 09:06:25 +01:00
Kubernetes Submit Queue	e5b6026db6	Merge pull request #59287 from cheftako/cloud-context-level Automatic merge from submit-queue (batch tested with PRs 59441, 58264, 59287, 59396, 59439). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add context to all relevant cloud APIs What this PR does / why we need it: This adds context to all the relevant cloud provider interface signatures. Callers of those APIs are currently satisfied using context.TODO(). There will be follow on PRs to push the context through the stack. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #815 Special notes for your reviewer: For an idea of the full scope of this change please look at PR #58532. Release note: ```release-note Implementers of the cloud provider interface will note the addition of a context to this interface. Trivial code modification will be necessary for a cloud provider to continue to compile. ```	2018-02-06 20:27:39 -08:00
Kubernetes Submit Queue	056e9ecc43	Merge pull request #58941 from vikaschoudhary16/test-allocate Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add unit test for endpoint allocate What this PR does / why we need it: Adds a unit test for covering `allocate` function at endpoint. Release note: ```release-note None ``` /kind testing /area hw-accelerators /cc @jiayingz @vishh @derekwaynecarr @RenaudWasTaken @resouer @ConnorDoyle	2018-02-06 17:19:41 -08:00
Kubernetes Submit Queue	8201e4ba00	Merge pull request #57124 from JiangtianLi/jiangtli-memfunc Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Use GlobalMemoryStatusEx to get total physical memory on Windows node What this PR does / why we need it: This PR fixes issue #57110 due to failure in getting total physical memory on some Windows VM such as in VMWare Fusion or Virtualbox. This change uses GlobalMemoryStatusEx instead of GetPhysicallyInstalledSystemMemory to retrieve total physical memory on Windows node. The amount obtained this way is also closer in parity with reading MemTotal from /proc/meminfo on Linux node. (thanks to @martinivanov and @marono for the help) Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #57110 Special notes for your reviewer: Release note: ```release-note ```	2018-02-06 13:53:09 -08:00
Walter Fender	e18e8ec3c0	Add context to all relevant cloud APIs This adds context to all the relevant cloud provider interface signatures. Callers of those APIs are currently satisfied using context.TODO(). There will be follow on PRs to push the context through the stack. For an idea of the full scope of this change please look at PR #58532.	2018-02-06 12:49:17 -08:00
Kubernetes Submit Queue	4bd22b5467	Merge pull request #58415 from gnufied/fix-volume-resize-messages Automatic merge from submit-queue (batch tested with PRs 52942, 58415). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Improve messaging on volume expansion - we now provide a clear message telling the user what to do when cloudprovider resizing is finished and file system resizing is needed. - add an event when resizing is successful - Use PATCH both in controller-manager and kubelet for updating PVC status - Remove code duplication between controller-manager and kubelet for updating PVC status - Only remove conditions that are managed by resize controller ```release-note Improve the messages a user gets during volume resizing and after it is done. ```	2018-02-06 07:55:32 -08:00
Lantao Liu	5cbc8cc8e0	Fix the wrong comment in cri constants. Signed-off-by: Lantao Liu <lantaol@google.com>	2018-02-05 23:53:36 +00:00
Kubernetes Submit Queue	c02b784b76	Merge pull request #58172 from NVIDIA/annotations Automatic merge from submit-queue (batch tested with PRs 58184, 59307, 58172). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add annotations to the device plugin API What this PR does / why we need it: Which issue(s) this PR fixes : Related to #56649 but does not fix it This adds the ability for the device plugins to annotate containers. Product wise, this allows the NVIDIA device plugin to support CRI-O (which allows hooks through container annotations). Special notes for your reviewer: /area hw-accelerators /cc @vishh @jiayingz @vikaschoudhary16 I'm wondering if it would make sense to fire a blank call to `newContainerAnnotations` at the start of the deviceplugin to get Annotations that are forbidden. Current behavior is that any Annotations that conflicts with Kubelet will be overwritten by Kubelet. Release note: ```release-note NONE ```	2018-02-05 13:50:35 -08:00
Jing Xu	9588d2098a	Redesign and implement volume reconstruction work This PR is the first part of redesign of volume reconstruction work. The changes include 1. Remove dependency on volume spec stored in actual state for volume cleanup process (UnmountVolume and UnmountDevice) Modify AttachedVolume struct to add DeviceMountPath so that volume unmount operation can use this information instead of constructing from volume spec 2. Modify reconciler's volume reconstruction process (syncState). Currently workflow is when kubelet restarts, syncState() is only called once before reconciler starts its loop. a. If volume plugin supports reconstruction, it will use the reconstructed volume spec information to update actual state as before. b. If volume plugin cannot support reconstruction, it will use the scanned mount path information to clean up the mounts. In this PR, all the plugins still support reconstruction (except glusterfs), so reconstruction of some plugins will still have issues. The next PR will modify those plugins that cannot support reconstruction well. This PR addresses issue #52683, #54108 (This PR includes the changes to update devicePath after local attach finishes)	2018-02-05 13:14:09 -08:00
Derek Carr	4afc0c8052	kubelet ignores hugepages if hugetlb is not enabled	2018-02-05 13:07:59 -05:00
vikaschoudhary16	abfb99645b	Add unit test for endpoint allocate	2018-02-05 00:53:07 -05:00
Clayton Coleman	0346145615	Cap how long the kubelet waits when it has no client cert If we go a certain amount of time without being able to create a client cert and we have no current client cert from the store, exit. This prevents a corrupted local copy of the cert from leaving the Kubelet in a zombie state forever. Exiting allows a config loop outside the Kubelet to clean up the file or the bootstrap client cert to get another client cert.	2018-02-03 23:18:53 -05:00
Renaud Gaubert	db537e5954	Add Annotations from the deviceplugin to the runtime	2018-02-03 19:53:20 +01:00
Renaud Gaubert	eb5035b08d	Regenerate the deviceplugin protobuf file	2018-02-03 19:53:20 +01:00
Renaud Gaubert	ece4bf4f7f	Add annotations to the deviceplugin API	2018-02-03 19:53:20 +01:00
Kubernetes Submit Queue	f02e37b6ac	Merge pull request #57076 from feiskyer/win-resources Automatic merge from submit-queue (batch tested with PRs 59097, 57076, 59295). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add windows config to Kubelet CRI What this PR does / why we need it: Currently Container Runtime Interface (CRI) only supports LinuxContainerConfig and therefore LinuxContainerResources in ContainerConfig. Windows resource config is different from Linux's, although it shares some common properties. This PR adds windows config to CRI. Add newly added WindowsContainerResources is original from OCI spec (see https://github.com/opencontainers/runtime-spec/blob/master/specs-go/config.go#L437). Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): First part of #56734. A further PR is needed to fill the values after we have agreement on the spec. Special notes for your reviewer: Release note: ```release-note Add windows config to Kubelet CRI ``` /assign @yujuhong @brendandburns /cc @taylorb-microsoft @JiangtianLi @dchen1107	2018-02-02 19:37:38 -08:00
Kubernetes Submit Queue	8c6be65f4c	Merge pull request #58720 from joelsmith/ro-vol Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Ensure that the runtime mounts RO volumes read-only What this PR does / why we need it: This change makes it so that containers cannot write to secret, configMap, downwardAPI and projected volumes since the runtime will now mount them read-only. This change makes things less confusing for a user since any attempt to update a secret volume will result in an error rather than a successful change followed by a revert by the kubelet when the volume next syncs. It also adds a feature gate `ReadOnlyAPIDataVolumes` to provide a way to disable the new behavior in 1.10, but for 1.11, the new behavior will become non-optional. Also, E2E tests for downwardAPI and projected volumes are updated to mount the volumes somewhere other than /etc. Which issue(s) this PR fixes Fixes #58719 Release note: ```release-note Containers now mount secret, configMap, downwardAPI and projected volumes read-only. Previously, container modifications to files in these types of volumes were temporary and reverted by the kubelet during volume sync. Until version 1.11, setting the feature gate ReadOnlyAPIDataVolumes=false will preserve the old behavior. ```	2018-02-02 06:42:12 -08:00
Kubernetes Submit Queue	d3b783d5ec	Merge pull request #58743 from NickrenREN/pv-protection Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Postpone PV deletion with finalizer when it is being used Postpone PV deletion if it is bound to a PVC xref: https://github.com/kubernetes/community/pull/1608 Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #33355 Special notes for your reviewer: Release note: ```release-note Postpone PV deletion when it is being bound to a PVC ``` WIP, assign to myself first /assign @NickrenREN	2018-02-01 19:39:52 -08:00
rob boll	7da7b750fd	kubelet: only register api source when connecting Before this change, an API source was always registered, even when there was no kubeclient. This led to some operations blocking while waiting for podConfig.SeenAllSources to pass, which it never would.	2018-02-01 15:28:02 -05:00
Kubernetes Submit Queue	06472a054a	Merge pull request #58930 from smarterclayton/background_rotate Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Only rotate certificates in the background Change the Kubelet to not block until the first certs have rotated (we didn't act on it anyway) and fall back to the bootstrap cert if the most recent rotated cert is expired on startup. The certificate manager originally had a "block on startup" rotation behavior to ensure at least one rotation happened on startup. However, since rotation may not succeed within the first time window the code was changed to simply print the error rather than return it. This meant that the blocking rotation has no purpose - it cannot cause the kubelet to fail, and it does block the kubelet from starting static pods before the api server becomes available. The current block behavior causes a bootstrapped kubelet that is also set to run static pods to wait several minutes before actually launching the static pods, which means self-hosted masters using static pods have a pointless delay on startup. Since blocking rotation has no benefit and can't actually fail startup, this commit removes the blocking behavior and simplifies the code at the same time. The goroutine for rotation now completely owns the deadline, the shouldRotate() method is removed, and the method that sets rotationDeadline now returns it. We also explicitly guard against a negative sleep interval and omit the message. Should have no impact on bootstrapping except the removal of a long delay on startup before static pods start. The other change is that an expired certificate from the cert manager is not considered a valid cert, which triggers an immediate rotation. 
This causes the cert manager to fall back to the original bootstrap certificate until a new certificate is issued. This allows the bootstrap certificate on masters to be "higher powered" and allow the node to function prior to initial approval, which means someone configuring the masters with a pre-generated client cert can be guaranteed that the kubelet will be able to communicate to report self-hosted static pod status, even if the first client rotation hasn't happened. This makes master self-hosting more predictable for static configuration environments. ```release-note When using client or server certificate rotation, the Kubelet will no longer wait until the initial rotation succeeds or fails before starting static pods. This makes running self-hosted masters with rotation more predictable. ```	2018-02-01 12:05:15 -08:00
Joel Smith	66b061dad2	Ensure that the runtime mounts RO volumes read-only Add a feature gate ReadOnlyAPIDataVolumes to provide a way to disable the new behavior in 1.10, but for 1.11, the new behavior will become non-optional. Also, update E2E tests for downwardAPI and projected volumes to mount the volumes somewhere other than /etc.	2018-02-01 10:02:29 -07:00
Kubernetes Submit Queue	0d900769d6	Merge pull request #59126 from filbranden/ipcs3 Automatic merge from submit-queue (batch tested with PRs 59106, 58985, 59068, 59120, 59126). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix cross-build breakage after #58174 What this PR does / why we need it: Fix cross-build breakage after #58174 @cblecker Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #59121 Special notes for your reviewer: Release note: ```release-note NONE ```	2018-02-01 05:53:45 -08:00
Kubernetes Submit Queue	f96ac05774	Merge pull request #59062 from mtaufen/fix-pod-pids-limit Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix PodPidsLimit and ConfigTrialDuration on internal KubeletConfig type They should both follow the convention of not being a pointer on the internal type. This required adding a conversion function between `int64` and `*int64`. A side effect is this removes a warning in the generated code for the apps API group. @dims ```release-note NONE ```	2018-02-01 01:45:55 -08:00
Kubernetes Submit Queue	a644e611dd	Merge pull request #58751 from feiskyer/hyperv Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add support of hyperv isolation for windows containers What this PR does / why we need it: Add support of hyperv isolation for windows containers. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #58750 Special notes for your reviewer: Only one container per pod is supported yet. Release note: ```release-note Windows containers now support experimental Hyper-V isolation by setting annotation `experimental.windows.kubernetes.io/isolation-type=hyperv` and feature gates HyperVContainer. Only one container per pod is supported yet. ```	2018-01-31 21:10:17 -08:00
Kubernetes Submit Queue	465e925564	Merge pull request #58994 from RobertKrawitz/fake-runtime-start-race-condition-branch Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Race condition between listener and client in remote_runtime_test Fix race condition in remote_runtime_test. Fixes #58993	2018-01-31 20:31:50 -08:00
Filipe Brandenburger	2f2d886734	Fix cross-build breakage after #58174	2018-01-31 09:46:36 -08:00
NickrenREN	2a2f88b939	Rename PVCProtection feature gate so that PV protection can share the feature gate with PVC protection	2018-01-31 20:02:01 +08:00
Kubernetes Submit Queue	c817765b0e	Merge pull request #58445 from hanxiaoshuai/typo Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. fix some typos in comments What this PR does / why we need it: Fixes # fix some typos in comments	2018-01-30 19:44:44 -08:00
YuxiJin-tobeyjin	af6b4e39c2	codeClean-merge-logfAndFailnow-to-fatalf	2018-01-31 11:39:31 +08:00
Kubernetes Submit Queue	84408378f9	Merge pull request #58174 from filbranden/ipcs1 Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fixes for HostIPC tests to work when Docker has SELinux support enabled. What this PR does / why we need it: Fixes for HostIPC tests to work when Docker has SELinux support enabled. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): N/A Special notes for your reviewer: The core of the matter is to use `ipcs` from util-linux rather than the one from busybox. The typical SELinux policy has enough to allow Docker containers (running under svirt_lxc_net_t SELinux type) to access IPC information by reading the contents of the files under /proc/sysvipc/, but not by using the shmctl etc. syscalls. The `ipcs` implementation in busybox will use `shmctl(0, SHM_INFO, ...)` to detect whether it can read IPC info (see source code [here](https://git.busybox.net/busybox/tree/util-linux/ipcs.c?h=1_28_0#n138)), while the one in util-linux will prefer to read from the /proc files directly if they are available (see source code [here](https://github.com/karelzak/util-linux/blob/v2.27.1/sys-utils/ipcutils.c#L108)). It turns out the SELinux policy doesn't allow the shmctl syscalls in an unprivileged container, while access to it through the /proc interface is fine. (One could argue this is a bug in the SELinux policy, but getting it fixed on stable OSs is hard, and it's not that hard for us to test it with an util-linux `ipcs`, so I propose we do so.) 
This PR also contains a refactor of the code setting IpcMode, since setting it in the "common options" function is misleading, as on containers other than the sandbox, it ends up always getting overwritten, so let's only set it to "host" in the Sandbox. It also has a minor fix for the `ipcmk` call, since support for size suffix was only introduced in recent versions of it. Release note: ```release-note NONE ```	2018-01-30 17:18:52 -08:00
Kubernetes Submit Queue	a18f086220	Merge pull request #59020 from brendandburns/kubelet-hang Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Remove setInitError. What this PR does / why we need it: Removes setInitError, it's not sure it was ever really used, and it causes the kubelet to hang and get wedged. Which issue(s) this PR fixes Fixes #46086 Special notes for your reviewer: If `initializeModules()` in `kubelet.go` encounters an error, it calls `runtimeState.setInitError(...)` `47d61ef472/pkg/kubelet/kubelet.go (L1339)` The trouble with this is that `initError` is never cleared, which means that `runtimeState.runtimeErrors()` always returns this `initError`, and thus pods never start sync-ing. In normal operation, this is expected and desired because eventually the runtime is expected to become healthy, but in this case, `initError` is never updated, and so the system just gets wedged. `47d61ef472/pkg/kubelet/kubelet.go (L1751)` We could add some retry to `initializeModules()` but that seems unnecessary, as eventually we'd want to just die anyway. Instead, just log fatal and die, a supervisor will restart us. Note, I'm happy to add some retry here too, if that makes reviewers happier. Release note: ```release-note Prevent kubelet from getting wedged if initialization of modules returns an error. ``` @feiskyer @dchen1107 @janetkuo @kubernetes/sig-node-bugs	2018-01-30 14:56:28 -08:00
Kubernetes Submit Queue	c244994af7	Merge pull request #58997 from Random-Liu/eviction-manager-use-cri Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Make eviction manager work with CRI container runtime. Previously, eviction manager uses a function `HasDedicatedImageFs` in `pkg/kubelet/cadvisor` to detect whether image fs and root fs are on the same device. However, it doesn't work with CRI container runtime which provides container/image stats through CRI. Thus all eviction tests for containerd are failing now. https://k8s-testgrid.appspot.com/sig-node-containerd#node-e2e-flaky This PR makes it work with CRI container runtime. @kubernetes/sig-node-pr-reviews @yujuhong @yguo0905 @feiskyer @mrunalp @abhi @dashpole Signed-off-by: Lantao Liu <lantaol@google.com> What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note none ```	2018-01-30 12:43:30 -08:00
Michael Taufen	da41a6e793	Fix PodPidsLimit and ConfigTrialDuration on internal KubeletConfig type They should both follow the convention of not being a pointer on the internal type. This required adding a conversion function between `int64` and `*int64`. A side effect is this removes a warning in the generated code for the apps API group.	2018-01-30 11:43:41 -08:00
Lantao Liu	68dadcfd15	Make eviction manager work with CRI container runtime. Signed-off-by: Lantao Liu <lantaol@google.com>	2018-01-30 17:57:46 +00:00
Robert Krawitz	2d050b8549	Fix race condition in fake runtime test.	2018-01-30 08:09:01 -05:00
Peng Gao	ac86428d59	Add detailed err in ensure docker process error Signed-off-by: Peng Gao <peng.gao.dut@gmail.com>	2018-01-30 15:02:22 +08:00


6106 Commits (6cca687bd8404832b5bb7a8f75528be760a3e10e)