github/k3s - k3s - https://git.xinac.net

Commit Graph

Author	SHA1	Message	Date
hzxuzhonghu	8cce8bdc85	make kube-apiserver ServerRunOptions setdefault and Validate before use	2018-04-04 11:19:55 +08:00
Kubernetes Submit Queue	043204b1e5	Merge pull request #61498 from mindprince/delete-in-tree-gpu Automatic merge from submit-queue (batch tested with PRs 61498, 62030). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Delete in-tree support for NVIDIA GPUs. This removes the alpha Accelerators feature gate which was deprecated in 1.10 (#57384). The alternative feature DevicePlugins went beta in 1.10 (#60170). Fixes #54012 ```release-note Support for "alpha.kubernetes.io/nvidia-gpu" resource which was deprecated in 1.10 is removed. Please use the resource exposed by DevicePlugins instead ("nvidia.com/gpu"). ```	2018-04-03 02:02:04 -07:00
Rohit Agarwal	87dda3375b	Delete in-tree support for NVIDIA GPUs. This removes the alpha Accelerators feature gate which was deprecated in 1.10. The alternative feature DevicePlugins went beta in 1.10.	2018-04-02 20:17:01 -07:00
Christoph Blecker	710c8563b4	Fix go vet errors	2018-04-02 17:57:44 -07:00
Kubernetes Submit Queue	99fd98a893	Merge pull request #61740 from filbranden/nodetest1 Automatic merge from submit-queue (batch tested with PRs 61482, 61740). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Make systemd service name for kubelet use a timestamp in e2e-node tests. What this PR does / why we need it: This makes it easier to figure out which execution was last when looking at the output of `systemd list-units kubelet-.service`. We try to find the name of the /tmp/node-e2e- directory and use the same timestamp if we can. Otherwise, we just call Now() again, which isn't as nice (as the unit name and directory name will not match) but will still produce unit names that will be ordered when launching multiple subsequent executions on the same host. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): N/A Special notes for your reviewer: Tested using `make test-e2e-node REMOTE=true` and then checking `systemctl list-units kubelet-.service` on the target host. ``` $ systemctl list-units kubelet-.service kubelet-20180326T142016.service loaded active exited /tmp/node-e2e-20180326T142016/kubelet --kubeconfig /tmp/node-e2e-20180326T142016/kubeconfig --root-dir /var/lib/kubelet ... kubelet-20180326T143550.service loaded active exited /tmp/node-e2e-20180326T143550/kubelet --kubeconfig /tmp/node-e2e-20180326T143550/kubeconfig --root-dir /var/lib/kubelet ... ``` The units are sorted in the order they were launched. Release note: ```release-note NONE ```	2018-03-29 21:10:03 -07:00
Filipe Brandenburger	b8c39b7055	In summary_test, make Docker cpu/memory checks optional if unavailable. The numbers will only be available when docker.service has its own memory and cpu cgroups, which doesn't necessarily happen unless the unit has Delegate=yes configured. Let's work around that by checking the status of Delegate, in the case where we are: * running Docker * running Systemd * able to check the status through systemctl * the status is explicitly Delegate=no (the default) If all of those are true, let's make CPU and Memory expectations optional. Tested: make test-e2e-node REMOTE=true HOSTS=centos-e2e-node FOCUS="Summary API"	2018-03-29 18:12:30 -07:00
Filipe Brandenburger	351a70b60e	In summary_test, create a file outside the test volume too. This is necessary to show any RootFs usage on systems where the backing filesystem of overlay2 is xfs. The current test only created directories (for mount points) in the upper layer of the overlay. Outside of the mount namespace, only the directories are visible. When running `du` on those, usually filesystems will show some usage, but not xfs, which shows a disk usage of 0 for directories. Fix this by creating a file in the root directory, outside the volumes, in order to trigger some disk usage that can be measured by `du`. Tested: make test-e2e-node REMOTE=true HOSTS=centos-e2e-node FOCUS="Summary API"	2018-03-29 18:12:29 -07:00
Kubernetes Submit Queue	5ae7bba496	Merge pull request #60100 from mtaufen/node-authz-nodeconfigsource Automatic merge from submit-queue (batch tested with PRs 61829, 61908, 61307, 61872, 60100). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. node authorizer sets up access rules for dynamic config This PR makes the node authorizer automatically set up access rules for dynamic Kubelet config. I also added some validation to the node strategy, which I discovered we were missing while writing this. This PR is based on another WIP from @liggitt. ```release-note The node authorizer now automatically sets up rules for Node.Spec.ConfigSource when the DynamicKubeletConfig feature gate is enabled. ```	2018-03-29 17:37:18 -07:00
Filipe Brandenburger	76ef9c9074	Make systemd service name for kubelet use a timestamp in e2e-node tests. This makes it easier to figure out which execution was last when looking at the output of `systemd list-units kubelet-.service`. We try to find the name of the /tmp/node-e2e- directory and use the same timestamp if we can. Otherwise, we just call Now() again, which isn't as nice (as the unit name and directory name will not match) but will still produce unit names that will be ordered when launching multiple subsequent executions on the same host.	2018-03-29 11:17:42 -07:00
Filipe Brandenburger	451faff4ef	Use curl instead of wget to fetch the CNI tarball in e2e-node test Curl is more ubiquitous than wget. For instance, the GCE centos-7 and rhel-7 image families ship curl by default, but not wget. Looking at the shell scripts under cluster/, they tend to use curl more than wget. (The ones that use wget, such as get-kube.sh, try curl first and only fallback to wget if it's not available.) Tested: by running node-e2e-test on Ubuntu, COS and CentOS.	2018-03-27 09:41:09 -07:00
Michael Taufen	ab8dc12333	node authorizer sets up access rules for dynamic config This PR makes the node authorizer automatically set up access rules for dynamic Kubelet config. I also added some validation to the node strategy, which I discovered we were missing while writing this.	2018-03-27 08:49:45 -07:00
Kubernetes Submit Queue	915798d229	Merge pull request #60563 from hzxuzhonghu/replace-context Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Replace package "golang.org/x/net/context" with "context" What this PR does / why we need it: Replace package "golang.org/x/net/context" with "context" Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #60560 Special notes for your reviewer: As of Go 1.7 this package(golang.org/x/net/context) is available in the standard library under the name context. see (https://godoc.org/golang.org/x/net/context) It is almost machinery replace. Release note: ```release-note NONE ```	2018-03-23 16:34:23 -07:00
Kubernetes Submit Queue	1b6b2ee790	Merge pull request #61478 from shyamjvs/capture-pod-startup-phases-as-metrics Automatic merge from submit-queue (batch tested with PRs 61378, 60915, 61499, 61507, 61478). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Capture pod startup phases as metrics Learning from https://github.com/kubernetes/kubernetes/issues/60589, we should also start collecting and graphing sub-parts of pod-startup latency. /sig scalability /kind feature /priority important-soon /cc @wojtek-t ```release-note NONE ```	2018-03-22 07:15:33 -07:00
hzxuzhonghu	70e45eccf2	Replace "golang.org/x/net/context" with "context"	2018-03-22 20:57:14 +08:00
Shyam Jeedigunta	0f0c754eb4	Get rid of duplicate VerifyPodStartupLatency util in node density tests	2018-03-21 16:58:31 +01:00
Shyam Jeedigunta	b0dd166fa3	Capture different parts of pod-startup latency as metrics	2018-03-21 16:58:25 +01:00
Lantao Liu	9fc2795d55	Change pods memory boundary. Signed-off-by: Lantao Liu <lantaol@google.com>	2018-03-20 23:24:16 +00:00
Kubernetes Submit Queue	c64f19dd1b	Merge pull request #59728 from wgliang/master.append Automatic merge from submit-queue (batch tested with PRs 59740, 59728, 60080, 60086, 58714). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. more concise to merge the slice What this PR does / why we need it: more concise to merge the slice Special notes for your reviewer:	2018-03-19 21:34:30 -07:00
Kubernetes Submit Queue	a3f40dd8df	Merge pull request #60856 from jiayingz/race-fix Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fixes the races around devicemanager Allocate() and endpoint deletion. There is a race in predicateAdmitHandler Admit() that getNodeAnyWayFunc() could get Node with non-zero deviceplugin resource allocatable for a non-existing endpoint. That race can happen when a device plugin fails, but is more likely when kubelet restarts as with the current registration model, there is a time gap between kubelet restart and device plugin re-registration. During this time window, even though devicemanager could have removed the resource initially during GetCapacity() call, Kubelet may overwrite the device plugin resource capacity/allocatable with the old value when node update from the API server comes in later. This could cause a pod to be started without proper device runtime config set. To solve this problem, introduce endpointStopGracePeriod. When a device plugin fails, don't immediately remove the endpoint but set stopTime in its endpoint. During kubelet restart, create endpoints with stopTime set for any checkpointed registered resource. The endpoint is considered to be in stopGracePeriod if its stoptime is set. This allows us to track what resources should be handled by devicemanager during the time gap. When an endpoint's stopGracePeriod expires, we remove the endpoint and its resource. This allows the resource to be exported through other channels (e.g., by directly updating node status through API server) if there is such use case. Currently endpointStopGracePeriod is set as 5 minutes. Given that an endpoint is no longer immediately removed upon disconnection, mark all its devices unhealthy so that we can signal the resource allocatable change to the scheduler to avoid scheduling more pods to the node. When a device plugin endpoint is in stopGracePeriod, pods requesting the corresponding resource will fail admission handler. Tested: Ran GPUDevicePlugin e2e_node test 100 times and all passed now. What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes https://github.com/kubernetes/kubernetes/issues/60176 Special notes for your reviewer: Release note: ```release-note Fixes the races around devicemanager Allocate() and endpoint deletion. ```	2018-03-12 02:50:13 -07:00
Jiaying Zhang	5514a1f4dd	Fixes the races around devicemanager Allocate() and endpoint deletion. There is a race in predicateAdmitHandler Admit() that getNodeAnyWayFunc() could get Node with non-zero deviceplugin resource allocatable for a non-existing endpoint. That race can happen when a device plugin fails, but is more likely when kubelet restarts as with the current registration model, there is a time gap between kubelet restart and device plugin re-registration. During this time window, even though devicemanager could have removed the resource initially during GetCapacity() call, Kubelet may overwrite the device plugin resource capacity/allocatable with the old value when node update from the API server comes in later. This could cause a pod to be started without proper device runtime config set. To solve this problem, introduce endpointStopGracePeriod. When a device plugin fails, don't immediately remove the endpoint but set stopTime in its endpoint. During kubelet restart, create endpoints with stopTime set for any checkpointed registered resource. The endpoint is considered to be in stopGracePeriod if its stoptime is set. This allows us to track what resources should be handled by devicemanager during the time gap. When an endpoint's stopGracePeriod expires, we remove the endpoint and its resource. This allows the resource to be exported through other channels (e.g., by directly updating node status through API server) if there is such use case. Currently endpointStopGracePeriod is set as 5 minutes. Given that an endpoint is no longer immediately removed upon disconnection, mark all its devices unhealthy so that we can signal the resource allocatable change to the scheduler to avoid scheduling more pods to the node. When a device plugin endpoint is in stopGracePeriod, pods requesting the corresponding resource will fail admission handler.	2018-03-09 17:00:57 -08:00
Kubernetes Submit Queue	ae7be34c32	Merge pull request #60509 from verb/pid-e2e Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add node-e2e test for ShareProcessNamespace What this PR does / why we need it: Adds a node-e2e test for kubernetes/features#495 Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #59554 Special notes for your reviewer: This requires a feature gate to be enabled in both the kubelet and API server. I'm not sure which jenkins configs need to be updated (or if these are even still used) so I just updated a pile of them. opened kubernetes/test-infra#7030 for https://github.com/kubernetes/test-infra/blob/master/jobs/config.json Release note: ```release-note NONE ```	2018-03-05 14:20:14 -08:00
David Ashpole	395bea9d83	increase amount of memory filled by memory allocatable eviction test	2018-03-02 10:00:03 -08:00
Jiaying Zhang	6d7e6599f1	I forgot the fact that the DevicePlugin test itself restarts Kubelet for testing purpose. Move that test back to Serial but constructs a smaller test without kubelet restart that we may run during presubmit.	2018-03-01 14:02:09 -08:00
Kubernetes Submit Queue	5d26ef96a8	Merge pull request #59345 from hanxiaoshuai/fixtodo02051 Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. fix todo:Move function readinessCheck to util What this PR does / why we need it: fix todo:Move function readinessCheck to util in test/e2e_node/services Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note NONE ```	2018-02-28 08:06:34 -08:00
Kubernetes Submit Queue	5be121aca7	Merge pull request #60376 from mikedanese/fixup Automatic merge from submit-queue (batch tested with PRs 60376, 55584, 60358, 54631, 60291). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. remove gcloud docker -- since it's deprecated docker handles this now and it raises an error. try 3 ```release-note NONE ```	2018-02-28 03:37:21 -08:00
Kubernetes Submit Queue	2023c019eb	Merge pull request #60451 from jiayingz/e2e_node_enable Automatic merge from submit-queue (batch tested with PRs 60236, 60332, 57375, 60451, 57408). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Update device plugin e2e_node test to not changing Kubelet config as DevicePlugins feature is enabled by default now. What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note ```	2018-02-28 01:12:32 -08:00
Mike Danese	c0b7364563	remove gcloud docker -- since it's deprecated	2018-02-28 00:24:27 -08:00
Lee Verberne	b02f1f2ce3	Add node-e2e test for ShareProcessNamespace	2018-02-28 09:15:56 +01:00
Ryan Hitchman	8aa3ca3cbb	Add a few "+build linux" tags where appropriate.	2018-02-27 13:53:32 -08:00
Ryan Hitchman	e04b91facf	Remove unused variables (only assigned to) from test code. This is revealed by the go/types package, which is stricter than the Go compiler about unused variables. See also: golang/go#8560	2018-02-27 13:45:31 -08:00
Jiaying Zhang	fee083feac	Update device plugin e2e_node test to not changing Kubelet config as DevicePlugins feature is enabled by default now.	2018-02-26 22:45:44 -08:00
Kubernetes Submit Queue	e31c8a2252	Merge pull request #60318 from jiayingz/api-change Automatic merge from submit-queue (batch tested with PRs 59159, 60318, 60079, 59371, 57415). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Made a couple API changes to deviceplugin/v1beta1 to avoid future incompatible API changes: - Add GetDevicePluginOptions rpc call. This is needed when we switch from Registration service to probe-based plugin watcher. - Change AllocateRequest and AllocateResponse to allow device requests from multiple containers in a pod. Currently only made mechanical change on the devicemanager and test code to cope with the API but still issues an Allocate call per container. We can modify the devicemanager in 1.11 to issue a single Allocate call per pod. The change will also facilitate incremental API change to communicate pod level information through Allocate rpc if there is such future need. What this PR does / why we need it: Made a couple API changes to deviceplugin/v1beta1 to avoid future incompatible API changes. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes https://github.com/kubernetes/kubernetes/issues/59370 Special notes for your reviewer: Release note: ```release-note ```	2018-02-24 21:19:33 -08:00
Kubernetes Submit Queue	720c29b3e8	Merge pull request #60314 from mtaufen/kubelet-manifest-is-oldspeak Automatic merge from submit-queue (batch tested with PRs 60324, 60269, 59771, 60314, 59941). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. expunge the word 'manifest' from Kubelet's config API The word 'manifest' technically refers to a container-group specification that predated the Pod abstraction. We should avoid using this legacy terminology where possible. Fortunately, the Kubelet's config API will be beta in 1.10 for the first time, so we still had the chance to make this change. I left the flags alone, since they're deprecated anyway. I changed a few var names in files I touched too, but this PR is the just the first shot, not the whole campaign (`git grep -i manifest \| wc -l -> 1248`). ```release-note Some field names in the Kubelet's now v1beta1 config API differ from the v1alpha1 API: PodManifestPath is renamed to PodPath, ManifestURL is renamed to PodURL, ManifestURLHeader is renamed to PodURLHeader. ```	2018-02-24 20:01:46 -08:00
Kubernetes Submit Queue	829ada8e30	Merge pull request #57965 from xiangpengzhao/cleanup-feature-gates Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Update test framework featuregates type What this PR does / why we need it: A cleanup following #53025 and #57962. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): ref: #53025 and #57962. Special notes for your reviewer: but yeah, not sure if it's worthy to do this :) Release note: ```release-note NONE ```	2018-02-24 07:34:19 -08:00
Jiaying Zhang	07beac6004	Made a couple API changes to deviceplugin/v1beta1 to avoid future incompatible changes: - Add GetDevicePluginOptions rpc call. This is needed when we switch from Registration service to probe-based plugin watcher. - Change AllocateRequest and AllocateResponse to allow device requests from multiple containers in a pod. Currently only made mechanical change on the devicemanager and test code to cope with the API but still issues an Allocate call per container. We can modify the devicemanager in 1.11 to issue a single Allocate call per pod. The change will also facilitate incremental API change to communicate pod level information through Allocate rpc if there is such future need.	2018-02-23 16:15:09 -08:00
Michael Taufen	b4bddcc998	expunge the word 'manifest' from Kubelet's config API The word 'manifest' technically refers to a container-group specification that predated the Pod abstraction. We should avoid using this legacy terminology where possible. Fortunately, the Kubelet's config API will be beta in 1.10 for the first time, so we still had the chance to make this change. I left the flags alone, since they're deprecated anyway. I changed a few var names in files I touched too, but this PR is the just the first shot, not the whole campaign (`git grep -i manifest \| wc -l -> 1248`).	2018-02-23 11:44:06 -08:00
Lantao Liu	faa581c5cb	Add node e2e test for log rotation.	2018-02-23 01:42:35 +00:00
Lantao Liu	313e8717f6	Generated code	2018-02-23 01:42:35 +00:00
Kubernetes Submit Queue	270148d7d9	Merge pull request #58684 from hzxuzhonghu/default-enabled-admission Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. set default enabled admission plugins by official document What this PR does / why we need it: https://kubernetes.io/docs/admin/admission-controllers/#is-there-a-recommended-set-of-admission-controllers-to-use recommend running the following set of admission controllers ``` If you previously had not set the `--admission-control` flag, your cluster behavior may change (to be more standard). See [https://kubernetes.io/docs/admin/admission-controllers/] for explanation of admission control. ``` Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note Set default enabled admission plugins `NamespaceLifecycle,LimitRanger,ServiceAccount,PersistentVolumeLabel,DefaultStorageClass,DefaultTolerationSeconds,MutatingAdmissionWebhook,ValidatingAdmissionWebhook,ResourceQuota` ```	2018-02-22 05:24:44 -08:00
Kubernetes Submit Queue	714b19ee75	Merge pull request #57583 from MorrisLaw/bugfix/logf-newline Automatic merge from submit-queue (batch tested with PRs 60158, 60156, 58111, 57583, 60055). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Bugfix/logf newline What this PR does / why we need it: Removes all redundant new lines being passed into the `Logf()` function. This involved going through code in both `test/e2e` and `test/e2e_node`, finding the newline redundancies in calls to `Logf()` and removing them. Which issue(s) this PR fixes: Fixes [#57102](https://github.com/kubernetes/kubernetes/issues/57102) Release note: ```release-note NONE ```	2018-02-21 22:10:34 -08:00
hzxuzhonghu	27f3fd2d79	set default enabled admission plugins by official document	2018-02-22 11:02:02 +08:00
Kubernetes Submit Queue	687c651dfd	Merge pull request #59884 from mikedanese/remove-deprecated-proxy Automatic merge from submit-queue (batch tested with PRs 58716, 59977, 59316, 59884, 60117). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. remove deprecated /proxy paths These were deprecated in v1.2. ref https://github.com/kubernetes/kubernetes/issues/59885 ```release-note kube-apiserver: the root /proxy paths have been removed (deprecated since v1.2). Use the /proxy subresources on objects that support HTTP proxying. ``` @kubernetes/sig-api-machinery-api-reviews	2018-02-21 15:40:45 -08:00
vikaschoudhary16	e64517cd74	Migrate deviceplugin api from v1alpha to v1beta1	2018-02-21 01:26:20 -05:00
vikaschoudhary16	defcab81d5	Invoke PreStart RPC call before container start, if desired by plugin Signed-off-by: vikaschoudhary16 <vichoudh@redhat.com>	2018-02-21 01:25:24 -05:00
Mike Danese	7b4722964d	remove deprecated /proxy paths These were deprecated in v1.2.	2018-02-20 14:42:19 -08:00
Kubernetes Submit Queue	96ec318718	Merge pull request #59842 from ixdy/update-rules_go-02-2018 Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Update bazelbuild/rules_go, kubernetes/repo-infra, and gazelle dependencies What this PR does / why we need it: updates our bazelbuild/rules_go dependency in order to bump everything to go1.9.4. I'm separating this effort into two separate PRs, since updating rules_go requires a large cleanup, removing an attribute from most build rules. Release note: ```release-note NONE ```	2018-02-19 22:23:05 -08:00
Jeremy L. Morris	e724886ad5	Removed newlines from e2e log statements.	2018-02-17 22:25:38 -05:00
David Ashpole	960856f4e8	collect metrics on the /kubepods cgroup on-demand	2018-02-17 12:32:40 -08:00
Kubernetes Submit Queue	1e5a58416b	Merge pull request #59989 from mtaufen/fix-e2e-node-tests Automatic merge from submit-queue (batch tested with PRs 59927, 59989, 59950). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix e2e node setKubeletConfiguration helper The helper should have been using `apiequality.Semantic.DeepEqual`, instead of `reflect.DeepEqual`. Previously, nil vs empty containers were treated as not equal, but they should be considered equal for objects managed by Kubernetes API machinery, like KubeletConfiguration. This should fix the failing eviction tests. ```release-note NONE ```	2018-02-16 17:42:33 -08:00
Kubernetes Submit Queue	270ed995f4	Merge pull request #59841 from dashpole/metrics_after_reclaim Automatic merge from submit-queue (batch tested with PRs 59683, 59964, 59841, 59936, 59686). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Reevaluate eviction thresholds after reclaim functions What this PR does / why we need it: When the node comes under `DiskPressure` due to inodes or disk space, the eviction manager runs garbage collection functions to clean up dead containers and unused images. Currently, we use the strategy of trying to measure the disk space and inodes freed by garbage collection. However, as #46789 and #56573 point out, there are gaps in the implementation that can cause extra evictions even when they are not required. Furthermore, for nodes which frequently cycle through images, it results in a large number of evictions, as running out of inodes always causes an eviction. This PR changes this strategy to call the garbage collection functions and ignore the results. Then, it triggers another collection of node-level metrics, and sees if the node is still under DiskPressure. This way, we can simply observe the decrease in disk or inode usage, rather than trying to measure how much is freed. Which issue(s) this PR fixes: Fixes #46789 Fixes #56573 Related PR #56575 Special notes for your reviewer: This will look cleaner after #57802 removes arguments from [makeSignalObservations](https://github.com/kubernetes/kubernetes/pull/57802/files#diff-9e5246d8c78d50ce4ba440f98663f3e9R719). Release note: ```release-note NONE ``` /sig node /kind bug /priority important-soon cc @kubernetes/sig-node-pr-reviews	2018-02-16 16:31:33 -08:00


1280 Commits (07d1a8cb0c89d70572560611ad37e8986e9b6ead)