github/k3s - k3s - https://git.xinac.net

Commit Graph

Author	SHA1	Message	Date
k8s-ci-robot	8b36038b41	Merge pull request #68483 from pohly/e2e-refactor-pr e2e refactor	2018-10-19 12:32:01 -07:00
Mikhail Mazurskiy	55bc668f8d	Seed math/rand in TestMain before tests are executed	2018-10-11 22:07:45 +11:00
Mikhail Mazurskiy	3a243090a5	Simplify random seed initialization There is no need to set the time zone as the result does not depend on it	2018-10-11 21:01:15 +11:00
Patrick Ohly	8b17db7e0c	e2e: modular framework Not all users of the E2E framework want to run cloud-provider specific tests. By splitting out the code it becomes possible to decide in a E2E test suite which providers are supported. This is achieved in two ways: - the framework calls certain functions through a provider interface instead of calling specific cloud provider functions directly - tests that are cloud-provider specific directly import the new provider packages The ingress test utilities are only needed by a few tests. Splitting them out into a separate package makes the framework simpler for test suites not using those tests. Fixes: #66649	2018-10-11 11:16:11 +02:00
mooncake	4894f5583d	Remove the duplicated words in test files Signed-off-by: mooncake <xcoder@tenxcloud.com>	2018-10-05 22:55:16 +08:00
Jordan Liggitt	ad46728158	Switch e2e_node to etcd3	2018-10-04 11:41:16 -04:00
SataQiu	94a653f100	fix typo	2018-09-28 23:41:24 +08:00
Manjunath A Kumatagi	7b9833ce56	Update authenticated-image-pulling with fat manifest image	2018-09-27 17:43:15 +05:30
k8s-ci-robot	db322a4944	Merge pull request #67841 from jiayingz/fix-e2e-node Updates test/e2e_node/device_plugin.go to cope with recent device	2018-09-25 01:27:22 -07:00
Mayank Gaikwad	8f557da3c8	Port kubelet e2e_node tests to e2e	2018-09-17 11:33:30 +05:30
k8s-ci-robot	25cbd1c753	Merge pull request #67781 from dashpole/fix_priority_tests Fix priority tests	2018-09-10 12:48:05 -07:00
Kubernetes Submit Queue	c9de610897	Merge pull request #65250 from balajismaniam/node-perf-testing-framework Automatic merge from submit-queue (batch tested with PRs 65250, 68241). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Initial node performance testing framework. This PR adds a framework for node performance testing. Partially fixes: https://github.com/kubernetes/kubernetes/issues/65249. Use the following command to run this test: ```sh make test-e2e-node FOCUS="Node Performance Testing" SKIP="" PARALLELISM=1 ``` It has been tested in the following environment: - n1-standard-16 - Ubuntu 16.04 - docker 17.03.2 Note to reviewers: This PR won't pass node e2e since the docker images in https://github.com/kubernetes/kubernetes/pull/65251 are required for this to function. The node e2e will fail when trying to pull the required images for testing.	2018-09-08 16:09:30 -07:00
David Ashpole	90f58c1157	critical pod test should not rely on feature gate set in framework; non-critical pods are always preemptable	2018-09-07 17:43:42 -07:00
David Ashpole	4a2ef941b8	fix eviction test panics	2018-09-07 13:35:13 -07:00
Adelina Tuvenie	8aa33c6201	Replaced hardcoded busybox image in e2e tests.	2018-09-06 10:55:17 +03:00
Balaji Subramaniam	7c4411eb28	Initial node performance testing framework.	2018-09-05 11:24:44 -07:00
Kubernetes Submit Queue	d8365a9ca7	Merge pull request #68123 from mgdevstack/master-securitycontext-67032 Automatic merge from submit-queue (batch tested with PRs 67736, 68123, 68138). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Port security context NodeConformance e2e_node tests to e2e What this PR does / why we need it: Port all [NodeConformance] SecurityContext e2e_node tests to e2e/common. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #67032 Special notes for your reviewer: - This PR is a continuing effort to close #67032. - Removed ContainerRuntime constraint [as discussed](https://github.com/kubernetes/kubernetes/pull/67032#discussion_r214201870). - Porting all [NodeConformance] tests to e2e/common which do not have node dependencies. - Does it make sense to port [privileged test](https://github.com/kubernetes/kubernetes/blob/master/test/e2e_node/security_context_test.go#L558) to e2e/common and remove [NodeFeature:HostAccess] label from test name? Release note: ```release-note NONE ``` /area conformance @kubernetes/sig-node-pr-reviews	2018-09-04 12:51:35 -07:00
Lucas Käldström	8b6a7ee075	autogenerated go code, godeps, bazel and gofmt	2018-09-02 14:38:59 +03:00
Lucas Käldström	0707b1274f	Automated package reference rename	2018-09-02 14:15:38 +03:00
Kubernetes Submit Queue	8ba06eff79	Merge pull request #67571 from mgdevstack/master-commit-runtime Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Transitioning container-runtime e2e_node test to e2e What this PR does / why we need it: This is a continuation of an existing PR #67258 to transition [few runtime NodeConformance tests](https://github.com/kubernetes/kubernetes/issues/67103#issuecomment-411483640) from e2e_node to e2e (e2e/common). Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #67103 Special notes for your reviewer: In order to make simple clear naming of test, they are updated to > "_Container Runtime blackbox test when starting a container that exits should run with the expected status [NodeConformance]_" >"~~_Container Runtime Conformance Test container runtime conformance blackbox test when starting a container that exits it should run with the expected status [NodeConformance]_~~" which requires updation of test names in test/test_owners.csv and test_owners.json file. Do we have any automated script to update these test_owners file or do we need to update them manually in both files? Please feel free to comment incase we don't want to change test name. Newly updated codebase includes following changes accomplishing all previously [mentioned](https://github.com/kubernetes/kubernetes/pull/67258#pullrequestreview-147294021) requested changes(reviews) - [Test name](https://github.com/kubernetes/kubernetes/pull/67258/files#diff-0dc16dc0a015699e53bda03495adc49eR36) change. - Container's [image name](https://github.com/kubernetes/kubernetes/pull/67258/files#diff-0dc16dc0a015699e53bda03495adc49eR144) - [By()](https://github.com/kubernetes/kubernetes/pull/67258/files#diff-0dc16dc0a015699e53bda03495adc49eR109) statement - [Removed test](https://github.com/kubernetes/kubernetes/pull/67258/files#diff-178a0a673bda44ea7a86bd94070df78cR137) from conformance golden list This would close existing PR #67258 Release note: ```release-note NONE ``` /area conformance @kubernetes/sig-node-pr-reviews	2018-08-31 20:37:27 -07:00
Mayank Gaikwad	c2683eafd2	Port security context NodeConformance e2e_node tests to e2e	2018-08-31 14:11:01 +05:30
Lucas Käldström	844487aea4	autogenerated	2018-08-29 20:21:17 +03:00
Lucas Käldström	994ac98586	Update api violations, golint failures and gofmt	2018-08-29 20:21:09 +03:00
Lucas Käldström	7a840cb4c8	automated: Rename all package references	2018-08-29 19:07:52 +03:00
Kubernetes Submit Queue	ad26ca1e69	Merge pull request #67805 from linyouchong/pr-0824 Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix error link in comment What this PR does / why we need it: Fix error link in comment Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: NONE Release note: ```release-note NONE ``` /sig node	2018-08-26 14:07:28 -07:00
Jiaying Zhang	55f36946c8	Updates test/e2e_node/device_plugin.go to cope with recent device manager change in commit `7b1ae66`. Also changes the test to make sure node is indeed ready after Kubelet restart. The previous readiness check may use old API state but didn't run into the issue due to the delay of waiting for pod restart.	2018-08-24 10:29:10 -07:00
linyouchong	075f37f853	Fix error link in comment	2018-08-24 17:07:43 +08:00
Mayank Gaikwad	74bc8a3211	transitioning container-runtime from e2e_node to e2e/common	2018-08-24 07:54:19 +05:30
David Ashpole	a0c071e06e	remove feature gates from eviction tests that are enabled by default	2018-08-22 10:49:13 -07:00
Kubernetes Submit Queue	949199e6ae	Merge pull request #67426 from yanxuean/check-both-err Automatic merge from submit-queue (batch tested with PRs 67100, 67426). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. should check all error in ResourceCollector.Start() Signed-off-by: yanxuean <yan.xuean@zte.com.cn> What this PR does / why we need it: 1. We should check both errors. test/e2e_node/resource_collector.go ``` func (r ResourceCollector) Start() { // Get the cgroup container names for kubelet and runtime kubeletContainer, err := getContainerNameForProcess(kubeletProcessName, "") runtimeContainer, err := getContainerNameForProcess(framework.TestContext.ContainerRuntimeProcessName, framework.TestContext.ContainerRuntimePidFile) if err == nil { systemContainers = map[string]string{ stats.SystemContainerKubelet: kubeletContainer, stats.SystemContainerRuntime: runtimeContainer, } } ``` 2. redundant compare The Timestamp.Equal is unlikely to occur, because we have met Timestamp.Before. ``` if oldStats, ok := oldStatsMap[name]; ok && oldStats.Timestamp.Before(newStats.Timestamp) { if oldStats.Timestamp.Equal(newStats.Timestamp) { continue } r.buffers[name] = append(r.buffers[name], computeContainerResourceUsage(name, oldStats, newStats)) } ``` Which issue(s) this PR fixes* (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note NONE ``` /sig-node	2018-08-16 11:57:32 -07:00
Kubernetes Submit Queue	99fab84c7a	Merge pull request #67100 from mkurylec/promotion-lifecycle-hook-to-conformance Automatic merge from submit-queue (batch tested with PRs 67100, 67426). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. porting e2e_node lifecycle testcases into e2e folder under common a) Shifted (and renamed) file existing in e2e_node to e2e/common. b) Added these tests to the conformance suite: - "should execute poststart exec hook properly" - "should execute prestop exec hook properly" - "should execute poststart http hook properly" - "should execute prestop http hook properly" [reference issue](https://github.com/kubernetes/kubernetes/issues/67086) explaining the effort.	2018-08-16 11:57:28 -07:00
yanxuean	ea9376a18b	redundant equal compare Signed-off-by: yanxuean <yan.xuean@zte.com.cn>	2018-08-15 17:09:45 +08:00
yanxuean	2d6ee874a5	should check all error Signed-off-by: yanxuean <yan.xuean@zte.com.cn>	2018-08-15 17:00:12 +08:00
Kubernetes Submit Queue	b6f0aed056	Merge pull request #66906 from tnozicka/rename-until Automatic merge from submit-queue (batch tested with PRs 67071, 66906, 66722, 67276, 67039). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. #50102 Task 1: Move apimachinery/pkg/watch.Until into client-go/tools/watch.UntilWithoutRetry What this PR does / why we need it: This is a split off from https://github.com/kubernetes/kubernetes/pull/50102 to go in smaller pieces. Moves `apimachinery/pkg/watch.Until` into `client-go/tools/watch.UntilWithoutRetry` and adds context so it is cancelable. Release note: ```release-note NONE ``` Dev release note: ```dev-release-note `apimachinery/pkg/watch.Until` has been moved to `client-go/tools/watch.UntilWithoutRetry`. While switching please consider using the new `client-go/tools/watch.UntilWithSync` or `client-go/tools/watch.Until`. ``` /cc @smarterclayton @kubernetes/sig-api-machinery-pr-reviews /milestone v1.12 /priority important-soon /kind bug (bug after the main PR which is this split from)	2018-08-14 22:43:19 -07:00
Kubernetes Submit Queue	ad1483b58d	Merge pull request #66369 from wackxu/fixe2e Automatic merge from submit-queue (batch tested with PRs 61212, 66369, 66446, 66895, 66969). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. fix e2e tests which set PodPriority are failing fix https://github.com/kubernetes/kubernetes/issues/66357 ```release-note NONE ```	2018-08-14 21:18:08 -07:00
Tomas Nozicka	4d7747a5a3	Update Bazel	2018-08-10 09:55:41 +02:00
Tomas Nozicka	3d4a02abb5	Rename Until to UntilWithoutRetry and move to using context so it's cancelable	2018-08-10 09:55:41 +02:00
Davanum Srinivas	789800d298	Remove ARCH specific image consideration from e2e tests All e2e test images are now using multi-arch manifests so we should stop looking up and using images that are specific to runtime.GOARCH Change-Id: I5f3fd6e9a42b9fb88891c19e28a2dfcf7a14be82	2018-08-09 13:40:19 -04:00
Maria Alejandra Kurylec	21c0cae4e9	a) fixing dependencies.	2018-08-08 09:44:05 -03:00
Maria Alejandra Kurylec	f79d5a19d4	a) porting e2e_node lifecycle testcases into e2e folder, under common. b) placing them under conformance golden list.	2018-08-08 09:44:05 -03:00
Kubernetes Submit Queue	81c6b735fa	Merge pull request #67084 from spiffxp/rm-conformance-from-e2e_node Automatic merge from submit-queue (batch tested with PRs 63572, 67084). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Remove [Conformance] from tests in test/e2e_node Conformance tests live inside of test/e2e, none of the tests currently tagged as `[Conformance]` in test/e2e_node actually get run when you run the conformance tests with e2e.test (either directly or indirectly with sonobuoy) If these tests make sense as both `[NodeConformance]` and `[Conformance]` tests, they should be ported to test/e2e/common /kind cleanup /area conformance /sig architecture /cc @bgrant0607 @smarterclayton @timothysc /sig node /cc @yujuhong ref: https://github.com/kubernetes/kubernetes/issues/66875 ```release-note NONE ```	2018-08-07 21:06:04 -07:00
Aaron Crickenberger	d724e979cd	Remove [Conformance] from tests in e2e_node None of these tests actually run as part of e2e testing, which is the only way conformance tests are kicked off. They should not be included as part of the conformance suite unless they live in test/e2e/common	2018-08-07 10:43:59 -07:00
Davanum Srinivas	6cd8bd62fe	e2e test harness - use busybox from dockerhub Use the same pattern everywhere in the e2e test harness, use busybox (from dockerhub) instead of using the one from k8s.gcr.io registry. Change-Id: I57c3b867408c1f9478a8909c26744ea0368ff003	2018-08-07 11:22:16 -04:00
Manjunath A Kumatagi	1f7f33aaa4	Update the nginx image from hub.docker.com	2018-08-04 05:19:53 +05:30
Srini Brahmaroutu	dcb7bc313f	Adding details to Conformance Tests using RFC 2119 standards.	2018-07-31 17:21:18 -07:00
Kubernetes Submit Queue	32e38b6659	Merge pull request #58755 from vikaschoudhary16/probing-mode Automatic merge from submit-queue (batch tested with PRs 58755, 66414). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Use probe based plugin watcher mechanism in Device Manager What this PR does / why we need it: Uses this probe based utility in the device plugin manager. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #56944 Notes For Reviewers: Changes are backward compatible and existing device plugins will continue to work. At the same time, any new plugins that has required support for probing model (Identity service implementation), will also work. Release note ```release-note Add support kubelet plugin watcher in device manager. ``` /sig node /area hw-accelerators /cc /cc @jiayingz @RenaudWasTaken @vishh @ScorpioCPH @sjenning @derekwaynecarr @jeremyeder @lichuqiang @tengqm @saad-ali @chakri-nelluri @ConnorDoyle	2018-07-27 15:20:06 -07:00
wackxu	f3823cc2cf	fix e2e tests which set PodPriority are failing	2018-07-23 09:31:26 +08:00
vikaschoudhary16	a5842503eb	Use probe based plugin discovery mechanism in device manager	2018-07-17 04:02:31 -04:00
Yuanbin.Chen	f2eee3fe2a	Fix kubeadm checks import error kubeadm checks package import path exist "kubernetes/test", So change the import path. * move "k8s.io/kubernetes/test/e2e_node/system" directory file to "k8s.io/kubernetes/cmd/kubeadm/app/util/system" * change system package import path * remove "k8s.io/kubernetes/test/e2e_node/system" directory Issues report link: https://github.com/kubernetes/kubeadm/issues/976 Signed-off-by: Yuanbin.Chen <cybing4@gmail.com>	2018-07-13 14:27:46 +08:00
Kubernetes Submit Queue	097f300a4d	Merge pull request #65707 from dims/remove-deprecated-cadvisor-port Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Remove --cadvisor-port - has been deprecated since v1.10 What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #56523 Special notes for your reviewer: - Deprecated in https://github.com/kubernetes/kubernetes/pull/59827 (v1.10) - Disabled in https://github.com/kubernetes/kubernetes/pull/63881 (v1.11) Release note: ```release-note [action required] The formerly publicly-available cAdvisor web UI that the kubelet started using `--cadvisor-port` is now entirely removed in 1.12. The recommended way to run cAdvisor if you still need it, is via a DaemonSet. ```	2018-07-07 05:28:13 -07:00
Kubernetes Submit Queue	5d87a70370	Merge pull request #65635 from neolit123/zfs-fix Automatic merge from submit-queue (batch tested with PRs 65348, 65599, 65635, 65688, 65691). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. test/e2e_node/system/types_unix: support ZFS What this PR does / why we need it: Docker validation tests in the case of ZFS used as the graph driver fail due to "zfs" not being present in the default Docker specification. Add "zfs" in the GraphDriver slice. kubeadm relies on the `DockerValidator` and pre-flight checks would fail if the user is using ZFS. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Updates kubernetes/kubeadm#930 Special notes for your reviewer: NONE /cc @kubernetes/sig-node-pr-reviews /cc @kubernetes/sig-cluster-lifecycle-pr-reviews /cc @kvaps (reported by) /area node-e2e /area kubeadm Release note: ```release-note Unix: support ZFS as a valid graph driver for Docker ```	2018-07-02 16:52:12 -07:00
Davanum Srinivas	5feab86329	Remove --cadvisor-port - has been deprecated since v1.10 Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2018-07-02 08:54:14 -04:00
Lubomir I. Ivanov	945e3b3ee1	test/e2e_node/system/types_unix: support ZFS Docker validation tests in the case of ZFS used as the graph driver fail due to "zfs" not being present in the default Docker specification. Add "zfs" in the GraphDriver slice.	2018-06-29 16:53:15 +03:00
Jiaying Zhang	265f3a48d3	Increase certain waiting time windows in gpu_device_plugin e2e_node test. Kubelet restart process seems to have gotten a bit slower recently. From running the gpu_device_plugin e2e_node test on GCE, I saw it took ~37 seconds for kubelet to start CM DeviceManager after it restarts, and then took ~12 seconds for the gpu device plugin to re-register. As a result, this e2e_node test fails because the current 10 sec waiting time is too small. Restarting a container also seems to have gotten slower, so that it sometimes exceeds the current 2 min waiting time in ensurePodContainerRestart(). This change increases both waiting times to 5 min to leave enough room on slower machines.	2018-06-27 11:00:36 -07:00
Kubernetes Submit Queue	76b4699c69	Merge pull request #49410 from jasonbrooks/patch-1 Automatic merge from submit-queue (batch tested with PRs 65449, 65373, 49410). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. add kernel config locations for fedora and atomic What this PR does / why we need it: * Fedora stores its kernel configs in /usr/lib/modules/$(uname -r)/config * Fedora/CentOS/RHEL atomic hosts use /usr/lib/ostree-boot/$(uname -r), though this location is deprecated * The lack of these locations in the validator is causing kubeadm to hang on "failed to parse kernel config" in its preflight checking on fedora and atomic host Special notes for your reviewer: Release note: ```release-note ```	2018-06-26 02:52:11 -07:00
Jeff Grafton	23ceebac22	Run hack/update-bazel.sh	2018-06-22 16:22:57 -07:00
Jeff Grafton	a725660640	Update to gazelle 0.12.0 and run hack/update-bazel.sh	2018-06-22 16:22:18 -07:00
Jan Chaloupka	0d4a5b4cbd	Have the /rootfs rw for containerized node e2e	2018-06-19 22:28:05 +02:00
Michael Taufen	0a6db6b194	Fix test tag on dynamic config tests The test accidentally got turned off when the NodeAlphaFeature tag was added in #64125. This PR updates the tag to turn it back on.	2018-06-04 11:03:30 -07:00
Jan Chaloupka	ee83021182	Mount the kubeletConfigPath rw when running containerized node e2e tests The kubelet needs to create the dynamic-kubelet-config directory under the kubeletConfigPath when initializing the dynamic config directory.	2018-05-31 14:45:20 +02:00
Kubernetes Submit Queue	15cd355281	Merge pull request #64213 from dashpole/eviction_event_annotation Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add metadata to kubelet eviction event annotations What this PR does / why we need it: Add annotations to kubelet eviction events. Annotations include "offending_containers" : comma-separated list of containers. "offending_containers_usage": comma-separated list of usage. "starved_resource": v1.ResourceName of the starved resource Special notes for your reviewer: Adding annotations to events required changing the `EventRecorder` interface to add an `AnnotatedEventf` function, which can add annotations to an event. Release note: ```release-note NONE ``` /assign @dchen1107 cc @mwielgus @schylek @kgrygiel	2018-05-29 23:37:47 -07:00
Kubernetes Submit Queue	57b8fda91b	Merge pull request #64472 from yujuhong/tag-pod-cgroup-test Automatic merge from submit-queue (batch tested with PRs 64456, 64457, 64472). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. e2e node: mark pod cgroup test as [NodeConformance] What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note NONE ```	2018-05-29 19:49:11 -07:00
Kubernetes Submit Queue	ea82e932b4	Merge pull request #64457 from dashpole/node_e2e_dynamic Automatic merge from submit-queue (batch tested with PRs 64456, 64457, 64472). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix dynamic kubelet config tests What this PR does / why we need it: The default for dynamic kubelet config is now true, so if it is unset, assume it is enabled for testing. Release note: ```release-note NONE ``` /sig node /kind bug /priority critical-urgent /assign @mtaufen	2018-05-29 19:49:08 -07:00
Yu-Ju Hong	a48008f5ad	e2e node: mark pod cgroup test as [NodeConformance]	2018-05-29 12:56:37 -07:00
David Ashpole	668e127a1e	fix dynamic kubelet config tests	2018-05-29 09:34:40 -07:00
Yu-Ju Hong	2d97f8ea3a	node e2e: fix the missing square brackets Also tag the inode eviction tests with [NodeFeature:Eviction].	2018-05-29 09:24:45 -07:00
Kubernetes Submit Queue	8395b7c12e	Merge pull request #64175 from mtaufen/test-name-cleanup Automatic merge from submit-queue (batch tested with PRs 64175, 63893). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. add colon separators to improve readability of test names ```release-note NONE ```	2018-05-25 03:50:09 -07:00
Kubernetes Submit Queue	690e42b734	Merge pull request #64125 from yujuhong/add-node-e2e-tags Automatic merge from submit-queue (batch tested with PRs 61963, 64279, 64130, 64125, 64049). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add node-exclusive tags to tests in test/e2e_node Original issue: #59001 Depends on: #64128 Add node-exclusive tags based on the [proposal](https://docs.google.com/document/d/1BdNVUGtYO6NDx10x_fueRh_DLT-SVdlPC_SsXjYCHOE/edit#) Follow-up PRs will: - Tag the tests in `test/e2e/common` - Change the test job configurations to use the new tests - Remove the unused, non-node-exclusive tags in `test/e2e_node` Release note: ```release-note NONE ```	2018-05-25 01:09:24 -07:00
Kubernetes Submit Queue	731eaecfd1	Merge pull request #57527 from mtaufen/kc-metric Automatic merge from submit-queue (batch tested with PRs 64013, 63896, 64139, 57527, 62102). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. add dynamic config metrics This PR exports config-related metrics from the Kubelet. The Gauges for active, assigned, and last-known-good config can be used to identify config versions and produce aggregate counts across several nodes. The error-reporting Gauge can be used to determine whether a node is experiencing a config-related error, and to produce an aggregate count of nodes in an error state. https://github.com/kubernetes/features/issues/281 ```release-note The Kubelet now exports metrics that report the assigned (node_config_assigned), last-known-good (node_config_last_known_good), and active (node_config_active) config sources, and a metric indicating whether the node is experiencing a config-related error (node_config_error). The config source metrics always report the value 1, and carry the node_config_name, node_config_uid, node_config_resource_version, and node_config_kubelet_key labels, which identify the config version. The error metric reports 1 if there is an error, 0 otherwise. ```	2018-05-23 19:44:21 -07:00
Kubernetes Submit Queue	10377f6593	Merge pull request #63896 from mtaufen/refactor-test-metrics Automatic merge from submit-queue (batch tested with PRs 64013, 63896, 64139, 57527, 62102). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Refactor test utils that deal with Kubelet metrics for clarity I found these functions hard to understand, because the names did not accurately reflect their behavior. For example, GetKubeletMetrics assumed that all of the metrics passed in were measuring latency. The caller of GetKubeletMetrics was implicitly making this assumption, but it was not obvious at the call site. ```release-note NONE ```	2018-05-23 19:44:15 -07:00
David Ashpole	fd1f19fc42	add metadata to kubelet eviction event annotations	2018-05-23 16:12:54 -07:00
Michael Taufen	0188e8c2b5	add colon separators to improve readability of test names	2018-05-22 17:33:18 -07:00
Michael Taufen	0868db5bf1	fix the e2e node helpers that let tests reconfigure Kubelet The dynamic config tests were updated with the validation change, but the tests that try to use dynamic config via this helper were not.	2018-05-22 17:20:51 -07:00
Michael Taufen	fd3432ef05	add dynamic config metrics This PR exports config-related metrics from the Kubelet. The Gauges for active, assigned, and last-known-good config can be used to identify config versions and produce aggregate counts across several nodes. The error-reporting Gauge can be used to determine whether a node is experiencing a config-related error, and to produce an aggregate count of nodes in an error state.	2018-05-22 14:08:55 -07:00
Yu-Ju Hong	90750c77c3	test/e2e_node: Add NodeFeature tags to non-conformance tests Serial tests are not considered for conformance tests.	2018-05-21 17:52:36 -07:00
Yu-Ju Hong	ff62f037b8	Re-tag benchmark tests	2018-05-21 17:52:36 -07:00
Yu-Ju Hong	5802f18283	test/e2e_node: mark more tests with [NodeConformance]	2018-05-21 17:52:36 -07:00
Yu-Ju Hong	7cbd897e3e	test/e2e_node: Add Node-exclusive feature tags to existing tests	2018-05-21 17:52:36 -07:00
Yu-Ju Hong	4ad9aedb04	test/e2e_node: Add [NodeConformance] to tests tagged [Conformance] This has no effect yet until test configurations are updated.	2018-05-21 17:51:49 -07:00
Kubernetes Submit Queue	2a989c60ff	Merge pull request #63221 from mtaufen/dkcfg-live-configmap Automatic merge from submit-queue (batch tested with PRs 63881, 64046, 63409, 63402, 63221). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Kubelet responds to ConfigMap mutations for dynamic Kubelet config This PR makes dynamic Kubelet config easier to reason about by leaving less room for silent skew scenarios. The new behavior is as follows: - ConfigMap does not exist: Kubelet reports error status due to missing source - ConfigMap is created: Kubelet starts using it - ConfigMap is updated: Kubelet respects the update (but we discourage this pattern, in favor of incrementally migrating to a new ConfigMap) - ConfigMap is deleted: Kubelet keeps using the config (non-disruptive), but reports error status due to missing source - ConfigMap is recreated: Kubelet respects any updates (but, again, we discourage this pattern) This PR also makes a small change to the config checkpoint file tree structure, because ResourceVersion is now taken into account when saving checkpoints. The new structure is as follows: ``` - dir named by --dynamic-config-dir (root for managing dynamic config) | - meta | - assigned (encoded kubeletconfig/v1beta1.SerializedNodeConfigSource object, indicating the assigned config) | - last-known-good (encoded kubeletconfig/v1beta1.SerializedNodeConfigSource object, indicating the last-known-good config) | - checkpoints | - uid1 (dir for versions of object identified by uid1) | - resourceVersion1 (dir for unpacked files from resourceVersion1) | - ... | - ... ``` fixes: #61643 ```release-note The dynamic Kubelet config feature will now update config in the event of a ConfigMap mutation, which reduces the chance for silent config skew. Only name, namespace, and kubeletConfigKey may now be set in Node.Spec.ConfigSource.ConfigMap. The least disruptive pattern for config management is still to create a new ConfigMap and incrementally roll out a new Node.Spec.ConfigSource. ```	2018-05-21 17:05:42 -07:00
Michael Taufen	b5648c3f61	dynamic Kubelet config reconciles ConfigMap updates	2018-05-21 09:03:58 -07:00
Michael Taufen	83509a092f	Refactor test utils that deal with Kubelet metrics for clarity I found these functions hard to understand, because the names did not accurately reflect their behavior. For example, GetKubeletMetrics assumed that all of the metrics passed in were measuring latency. The caller of GetKubeletMetrics was implicitly making this assumption, but it was not obvious at the call site.	2018-05-18 11:32:29 -07:00
Kubernetes Submit Queue	2accf11f1a	Merge pull request #57849 from dashpole/eviction_test_event Automatic merge from submit-queue (batch tested with PRs 63865, 57849, 63932, 63930, 63936). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Eviction Node e2e test checks for eviction reason What this PR does / why we need it: Currently, the eviction test simply ensures that pods are marked `Failed`. However, this could occur because of an OOM, rather than an eviction. To ensure that pods are actually being evicted, check for the Reason in the pod status to ensure it is evicted. Release note: ```release-note NONE ``` cc @kubernetes/sig-node-pr-reviews	2018-05-17 00:28:19 -07:00
Michael Taufen	fcc1f8e7b6	Move to a structured status for dynamic Kubelet config Updates dynamic Kubelet config to use a structured status, rather than a node condition. This makes the status machine-readable, and thus more useful for config orchestration. Fixes: #56896	2018-05-15 11:25:12 -07:00
Kubernetes Submit Queue	b2fe2a0a6d	Merge pull request #59847 from mtaufen/dkcfg-explicit-keys Automatic merge from submit-queue (batch tested with PRs 63624, 59847). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. explicit kubelet config key in Node.Spec.ConfigSource.ConfigMap This makes the Kubelet config key in the ConfigMap an explicit part of the API, so we can stop using magic key names. As part of this change, we are retiring ConfigMapRef for ConfigMap. ```release-note You must now specify Node.Spec.ConfigSource.ConfigMap.KubeletConfigKey when using dynamic Kubelet config to tell the Kubelet which key of the ConfigMap identifies its config file. ```	2018-05-09 17:55:13 -07:00
Filipe Brandenburger	48d052fae4	Fix cgroup names in node_container_manager_test. The names were made invalid for the CgroupName refactor in #62541, so update them here. Furthermore, as the new names are now compatible with what EnforceNodeAllocatable wants, reuse the constants there as well. Tested: $ make test-e2e-node REMOTE=true HOSTS=test-cos-beta-67-10575-27-0 FOCUS='Validate Node Allocatable' SKIP='' TEST_ARGS='--feature-gates=DynamicKubeletConfig=true' • [SLOW TEST:39.488 seconds] [k8s.io] Node Container Manager [Serial] Validate Node Allocatable set's up the node and runs the test Ran 1 of 261 Specs in 57.348 seconds SUCCESS! -- 1 Passed | 0 Failed | 0 Pending | 260 Skipped	2018-05-08 16:15:26 -07:00
David Ashpole	a5df208866	eviction test ensures failed pods are evicted	2018-05-08 16:08:35 -07:00
Michael Taufen	c41cf55a2c	explicit kubelet config key in Node.Spec.ConfigSource.ConfigMap This makes the Kubelet config key in the ConfigMap an explicit part of the API, so we can stop using magic key names. As part of this change, we are retiring ConfigMapRef for ConfigMap.	2018-05-08 15:37:26 -07:00
Kubernetes Submit Queue	a244d8a48f	Merge pull request #63130 from vikaschoudhary16/dp_e2e_alloc Automatic merge from submit-queue (batch tested with PRs 61455, 63346, 63130, 63404). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. [Device-Plugin]: Extend e2e test to cover node allocatables What this PR does / why we need it: Extends device plugin e2e to cover node allocatable Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note None ``` /sig node /area hw-accelerators /cc @jiayingz @vishh @RenaudWasTaken	2018-05-03 14:24:10 -07:00
vikaschoudhary16	b953f852f5	[Device-Plugin]: Extend e2e test to cover node allocatables	2018-05-03 14:19:29 -04:00
Kubernetes Submit Queue	592c39bccc	Merge pull request #62541 from filbranden/cgroupname1 Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Use a []string for CgroupName, which is a more accurate internal representation What this PR does / why we need it: This is purely a refactoring and should bring no essential change in behavior. It does clarify the cgroup handling code quite a bit. It is preparation for further changes we might want to do in the cgroup hierarchy. (But it's useful on its own, so even if we don't do any, it should still be considered.) Special notes for your reviewer: The slice of strings more precisely captures the hierarchic nature of the cgroup paths we use to represent pods and their groupings. It also ensures we're reducing the chances of passing an incorrect path format to a cgroup driver that requires a different path naming, since now explicit conversions are always needed. The new constructor `NewCgroupName` starts from an existing `CgroupName`, which enforces a hierarchy where a root is always needed. It also performs checking on the component names to ensure invalid characters ("/" and "_") are not in use. A `RootCgroupName` for the top of the cgroup hierarchy tree is introduced. This refactor results in a net reduction of around 30 lines of code, mainly with the demise of ConvertCgroupNameToSystemd which had fairly complicated logic in it and was doing just too many things. There's a small TODO in a helper `updateSystemdCgroupInfo` that was introduced to make this commit possible. That logic really belongs in libcontainer, I'm planning to send a PR there to include it there. (The API already takes a field with that information, only that field is only processed in cgroupfs and not systemd driver, we should fix that.) 
Tested: By running the e2e-node tests on both Ubuntu 16.04 (with cgroupfs driver) and CentOS 7 (with systemd driver.) NOTE: I only tested this with dockershim, we should double-check that this works with the CRI endpoints too, both in cgroupfs and systemd modes. /assign @derekwaynecarr /assign @dashpole /assign @Random-Liu Release note: ```release-note NONE ```	2018-05-03 08:16:45 -07:00
Kubernetes Submit Queue	b5f61ac129	Merge pull request #62657 from matthyx/master Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Update all script shebangs to use /usr/bin/env interpreter instead of /bin/interpreter This is required to support systems where bash doesn't reside in /bin (such as NixOS, or the *BSD family) and allow users to specify a different interpreter version through $PATH manipulation. https://www.cyberciti.biz/tips/finding-bash-perl-python-portably-using-env.html ```release-note Use /usr/bin/env in all script shebangs to increase portability. ```	2018-05-02 19:44:32 -07:00
Filipe Brandenburger	b230fb8ac4	Use a []string for CgroupName, which is a more accurate internal representation The slice of strings more precisely captures the hierarchic nature of the cgroup paths we use to represent pods and their groupings. It also ensures we're reducing the chances of passing an incorrect path format to a cgroup driver that requires a different path naming, since now explicit conversions are always needed. The new constructor NewCgroupName starts from an existing CgroupName, which enforces a hierarchy where a root is always needed. It also performs checking on the component names to ensure invalid characters ("/" and "_") are not in use. A RootCgroupName for the top of the cgroup hierarchy tree is introduced. This refactor results in a net reduction of around 30 lines of code, mainly with the demise of ConvertCgroupNameToSystemd which had fairly complicated logic in it and was doing just too many things. There's a small TODO in a helper updateSystemdCgroupInfo that was introduced to make this commit possible. That logic really belongs in libcontainer, I'm planning to send a PR there to include it there. (The API already takes a field with that information, only that field is only processed in cgroupfs and not systemd driver, we should fix that.) Tested by running the e2e-node tests on both Ubuntu 16.04 (with cgroupfs driver) and CentOS 7 (with systemd driver.)	2018-05-01 08:29:06 -07:00
Kubernetes Submit Queue	452b8c9e0d	Merge pull request #62101 from bart0sh/PR0010-e2e_node-kubelet-command-line-fix Automatic merge from submit-queue (batch tested with PRs 58474, 60034, 62101, 63198). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Fix wrong usage of kubelet option What this PR does / why we need it: "--allow-privileged true" is incorrect usage of a boolean option. It means setting '--allow-privileged' to its default value plus a non-existing subcommand 'true'. "--allow-privileged false" is even more confusing, as it sets the allow-privileged flag to its default value 'true'. This is true for any boolean command line option. Fixed this by using the correct syntax --allow-privileged=true Special notes for your reviewer: This is a show-stopper for PR #61833 Release note: ```release-note NONE ```	2018-04-30 13:24:12 -07:00
Kubernetes Submit Queue	e01858c595	Merge pull request #63252 from liztio/e2e_node_utils Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. E2e path utils What this PR does / why we need it: A bunch of useful methods for getting k8s paths and stuff are secreted away in `e2e_node`. This PR pulls them out so they can be used in other E2E tests. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Special notes for your reviewer: This is motivated by the upcoming kubeadm-specific E2E tests. Those tests will be added in a follow-up to this PR. Release note: ```release-note NONE ```	2018-04-27 11:43:15 -07:00
liz	1ec02b1cd5	Move path management from e2e_node to common test/utils directory enables reuse of these methods for other e2e tests	2018-04-27 11:12:10 -04:00
liz	432b542218	Generated artefacts	2018-04-27 11:11:45 -04:00
Jordan Liggitt	1bddcdcf44	Bump QPS on namespace controller https://github.com/kubernetes/kubernetes/pull/62913 switched from using a client pool, where each groupVersionResource got its own rest client, to a single client. This increases the QPS to account for increased requests using a single rest client rate limiter.	2018-04-27 10:11:14 -04:00
David Eads	3632037e60	add easy to use dynamic client	2018-04-25 08:55:26 -04:00
Kubernetes Submit Queue	44b57338d5	Merge pull request #59692 from mtaufen/dkcfg-unpack-configmaps Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. unpack dynamic kubelet config payloads to files This PR unpacks the downloaded ConfigMap to a set of files on the node. This enables other config files to ride alongside the KubeletConfiguration, and the KubeletConfiguration to refer to these cohabitants with relative paths. This PR also stops storing dynamic config metadata (e.g. current, last-known-good config records) in the same directory as config checkpoints. Instead, it splits the storage into `meta` and `checkpoints` dirs. The current store dir structure is as follows: ``` - dir named by --dynamic-config-dir (root for managing dynamic config) | - meta (dir for metadata, e.g. which config source is currently assigned, last-known-good) | - current (a serialized v1 NodeConfigSource object, indicating the assigned config) | - last-known-good (a serialized v1 NodeConfigSource object, indicating the last-known-good config) | - checkpoints (dir for config checkpoints) | - uid1 (dir for unpacked config, identified by uid1) | - file1 | - file2 | - ... | - uid2 | - ... ``` There are some likely changes to the above structure before dynamic config goes beta, such as renaming "current" to "assigned" for clarity, and extending the checkpoint identifier to include a resource version, as part of resolving #61643. ```release-note NONE ``` /cc @luxas @smarterclayton	2018-04-24 12:01:37 -07:00
Michael Taufen	c9d398d01e	unpack dynamic kubelet config payloads to files This PR unpacks the downloaded ConfigMap to a set of files on the node. This enables other config files to ride alongside the KubeletConfiguration, and the KubeletConfiguration to refer to these cohabitants with relative paths. This PR also stops storing dynamic config metadata (e.g. current, last-known-good config records) in the same directory as config checkpoints. Instead, it splits the storage into `meta` and `checkpoints` dirs.	2018-04-19 09:18:53 -07:00
Matthias Bertschy	9b15af19b2	Update all scripts to use /usr/bin/env bash in shebang	2018-04-19 13:20:13 +02:00
Kubernetes Submit Queue	dd8f8819e4	Merge pull request #62768 from krzyzacy/clean-up-jenkins Automatic merge from submit-queue (batch tested with PRs 62445, 62768, 60633). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. clean up *.properties files ref https://github.com/kubernetes/kubernetes/issues/62754 To double check: are any of the node config yaml files still being used outside of CI? I'll make a follow-up one to clean them up as well. /assign @Random-Liu @mindprince @yujuhong	2018-04-18 12:25:08 -07:00
Kubernetes Submit Queue	1ddb0e05e5	Merge pull request #62761 from Random-Liu/lower-usage-nano-cores-in-summary Automatic merge from submit-queue (batch tested with PRs 62761, 62715). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Lower UsageNanoCores boundary in summary api test. We recently switched to use `p2p` instead of `bridge` in containerd https://github.com/containerd/cri/pull/742. However, after that switch, the `UsageNanoCores` becomes lower, and constantly fails the test. An example failure: * https://k8s-gubernator.appspot.com/build/kubernetes-jenkins/pr-logs/pull/containerd_cri/740/pull-cri-containerd-node-e2e/690/ This is probably because: 1) The test container used in summary test does `ping`. https://github.com/kubernetes/kubernetes/blob/master/test/e2e_node/summary_test.go#L352 2) `p2p` is simpler than `bridge`, "Maybe cycles are saved from waiving Mac learning" - @jingax10. This PR lowers the boundary by one order of magnitude. Signed-off-by: Lantao Liu <lantaol@google.com> Release note: ```release-note none ```	2018-04-17 22:38:10 -07:00
Sen Lu	854132fdcc	clean up *.properties files	2018-04-17 21:44:32 -07:00
Lantao Liu	002483fe72	Lower UsageNanoCores boundary in summary api test. Signed-off-by: Lantao Liu <lantaol@google.com>	2018-04-17 18:37:51 -07:00
Lantao Liu	c86e85c420	Fix extra-log flag for node e2e. Signed-off-by: Lantao Liu <lantaol@google.com>	2018-04-17 21:48:26 +00:00
Lantao Liu	27105c90ec	Fix kubelet flags. Signed-off-by: Lantao Liu <lantaol@google.com>	2018-04-16 20:42:40 +00:00
Yu-Ju Hong	9a47bd0b67	Node E2E: Remove the simple mount test There are EmptyDir volume tests in test/e2e/common already. The test does not add any more coverage.	2018-04-12 17:05:28 -07:00
Ed Bartosh	7e3d28b30f	Fix wrong usage of kubelet options "--allow-privileged true" is incorrect usage of a boolean option. It means setting '--allow-privileged' to its default value plus a non-existing subcommand 'true'. "--allow-privileged false" is even more confusing, as it sets the allow-privileged flag to its default value 'true'. This is true for any boolean command line option. Fixed this by using the correct syntax --allow-privileged=true Fixed generating of kubelet command line in addKubeletConfigFlags function.	2018-04-12 15:19:49 +03:00
Kubernetes Submit Queue	1dc6e87f57	Merge pull request #62206 from yujuhong/rm-rkt-refs Automatic merge from submit-queue (batch tested with PRs 62192, 61866, 62206, 62360). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Remove rkt references in the codebase ```release-note None ```	2018-04-10 23:52:21 -07:00
Kubernetes Submit Queue	3bc1a0a1d0	Merge pull request #60900 from dashpole/eviction_test_no_pressure Automatic merge from submit-queue (batch tested with PRs 60900, 62215, 62196). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. [Flaky test fix] Use memory.force_empty before and after eviction tests What this PR does / why we need it: (copied from https://github.com/kubernetes/kubernetes/pull/60720): MemoryAllocatableEviction tests have been somewhat flaky: https://k8s-testgrid.appspot.com/sig-node-kubelet#kubelet-serial-gce-e2e&include-filter-by-regex=MemoryAllocatable The failure on the flakes is ["Pod ran to completion"](https://k8s-gubernator.appspot.com/build/kubernetes-jenkins/logs/ci-kubernetes-node-kubelet-serial/3785#k8sio-memoryallocatableeviction-slow-serial-disruptive-when-we-run-containers-that-should-cause-memorypressure-should-eventually-evict-all-of-the-correct-pods). Looking at [an example log](https://storage.googleapis.com/kubernetes-jenkins/logs/ci-kubernetes-node-kubelet-serial/3785/artifacts/tmp-node-e2e-6070a774-cos-stable-63-10032-71-0/kubelet.log) (and search for memory-hog-pod), we can see that this pod fails admission because the allocatable memory threshold has already been crossed. `eviction manager: thresholds - ignoring grace period: threshold [signal=allocatableMemory.available, quantity=250Mi] observed 242404Ki` https://github.com/kubernetes/kubernetes/pull/60720 wasn't effective. To clean up after each eviction test, and prepare for the next, use memory.force_empty to make the kernel reclaim memory in the allocatable cgroup before and after eviction tests. Special notes for your reviewer: I tested to make sure this doesn't break Cgroup Manager tests. It should work on both cgroupfs and systemd based systems, although I have only tested it on cgroupfs. 
Release note: ```release-note NONE ``` /assign @yujuhong @Random-Liu /sig node /priority important-soon /kind bug It's getting a little late in the release cycle, so we can probably wait until after code freeze is lifted for this.	2018-04-06 21:30:06 -07:00
Kubernetes Submit Queue	1e767ddf60	Merge pull request #62135 from jiayingz/kubelet-restart-fix Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Fixes restartKubelet in test/e2e_node failure. Looks like there is some recent change on how we start kubelet service in test_e2e_node. Fixes restartKubelet() to get right kubelet service name to cope with the change. What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes kubelet-serial-gce-e2e test failure: https://k8s-testgrid.appspot.com/wg-resource-management#kubelet-serial-gce-e2e Thanks a lot to @mindprince for noticing this! Special notes for your reviewer: Release note: ```release-note ```	2018-04-06 15:46:19 -07:00
David Ashpole	3254bdc1a4	use memory.force_empty before and after eviction tests	2018-04-06 14:01:11 -07:00
Yu-Ju Hong	59741bdfbd	Remove rkt references in the codebase	2018-04-06 12:02:11 -07:00
Manjunath A Kumatagi	1bb810e749	Use pause manifest image	2018-04-06 11:00:50 +05:30
Jiaying Zhang	0138007bdd	Fixes restartKubelet in test/e2e_node failure. Looks like there is some recent change on how we start kubelet service in test_e2e_node. Fixes restartKubelet() to get right kubelet service name to cope with the change.	2018-04-04 13:18:08 -07:00
hzxuzhonghu	8cce8bdc85	make kube-apiserver ServerRunOptions setdefault and Validate before use	2018-04-04 11:19:55 +08:00
Kubernetes Submit Queue	043204b1e5	Merge pull request #61498 from mindprince/delete-in-tree-gpu Automatic merge from submit-queue (batch tested with PRs 61498, 62030). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Delete in-tree support for NVIDIA GPUs. This removes the alpha Accelerators feature gate which was deprecated in 1.10 (#57384). The alternative feature DevicePlugins went beta in 1.10 (#60170). Fixes #54012 ```release-note Support for "alpha.kubernetes.io/nvidia-gpu" resource which was deprecated in 1.10 is removed. Please use the resource exposed by DevicePlugins instead ("nvidia.com/gpu"). ```	2018-04-03 02:02:04 -07:00
Rohit Agarwal	87dda3375b	Delete in-tree support for NVIDIA GPUs. This removes the alpha Accelerators feature gate which was deprecated in 1.10. The alternative feature DevicePlugins went beta in 1.10.	2018-04-02 20:17:01 -07:00
Christoph Blecker	710c8563b4	Fix go vet errors	2018-04-02 17:57:44 -07:00
Kubernetes Submit Queue	99fd98a893	Merge pull request #61740 from filbranden/nodetest1 Automatic merge from submit-queue (batch tested with PRs 61482, 61740). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Make systemd service name for kubelet use a timestamp in e2e-node tests. What this PR does / why we need it: This makes it easier to figure out which execution was last when looking at the output of `systemctl list-units kubelet-.service`. We try to find the name of the /tmp/node-e2e- directory and use the same timestamp if we can. Otherwise, we just call Now() again, which isn't as nice (as the unit name and directory name will not match) but will still produce unit names that will be ordered when launching multiple subsequent executions on the same host. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): N/A Special notes for your reviewer: Tested using `make test-e2e-node REMOTE=true` and then checking `systemctl list-units kubelet-.service` on the target host. ``` $ systemctl list-units kubelet-.service kubelet-20180326T142016.service loaded active exited /tmp/node-e2e-20180326T142016/kubelet --kubeconfig /tmp/node-e2e-20180326T142016/kubeconfig --root-dir /var/lib/kubelet ... kubelet-20180326T143550.service loaded active exited /tmp/node-e2e-20180326T143550/kubelet --kubeconfig /tmp/node-e2e-20180326T143550/kubeconfig --root-dir /var/lib/kubelet ... ``` The units are sorted in the order they were launched. Release note: ```release-note NONE ```	2018-03-29 21:10:03 -07:00
Filipe Brandenburger	b8c39b7055	In summary_test, make Docker cpu/memory checks optional if unavailable. The numbers will only be available when docker.service has its own memory and cpu cgroups, which doesn't necessarily happen unless the unit has Delegate=yes configured. Let's work around that by checking the status of Delegate, in the case where we are: * running Docker * running Systemd * able to check the status through systemctl * the status is explicitly Delegate=no (the default) If all of those are true, let's make CPU and Memory expectations optional. Tested: make test-e2e-node REMOTE=true HOSTS=centos-e2e-node FOCUS="Summary API"	2018-03-29 18:12:30 -07:00
Filipe Brandenburger	351a70b60e	In summary_test, create a file outside the test volume too. This is necessary to show any RootFs usage on systems where the backing filesystem of overlay2 is xfs. The current test only created directories (for mount points) in the upper layer of the overlay. Outside of the mount namespace, only the directories are visible. When running `du` on those, usually filesystems will show some usage, but not xfs, which shows a disk usage of 0 for directories. Fix this by creating a file in the root directory, outside the volumes, in order to trigger some disk usage that can be measured by `du`. Tested: make test-e2e-node REMOTE=true HOSTS=centos-e2e-node FOCUS="Summary API"	2018-03-29 18:12:29 -07:00
Kubernetes Submit Queue	5ae7bba496	Merge pull request #60100 from mtaufen/node-authz-nodeconfigsource Automatic merge from submit-queue (batch tested with PRs 61829, 61908, 61307, 61872, 60100). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. node authorizer sets up access rules for dynamic config This PR makes the node authorizer automatically set up access rules for dynamic Kubelet config. I also added some validation to the node strategy, which I discovered we were missing while writing this. This PR is based on another WIP from @liggitt. ```release-note The node authorizer now automatically sets up rules for Node.Spec.ConfigSource when the DynamicKubeletConfig feature gate is enabled. ```	2018-03-29 17:37:18 -07:00
Filipe Brandenburger	76ef9c9074	Make systemd service name for kubelet use a timestamp in e2e-node tests. This makes it easier to figure out which execution was last when looking at the output of `systemctl list-units kubelet-.service`. We try to find the name of the /tmp/node-e2e- directory and use the same timestamp if we can. Otherwise, we just call Now() again, which isn't as nice (as the unit name and directory name will not match) but will still produce unit names that will be ordered when launching multiple subsequent executions on the same host.	2018-03-29 11:17:42 -07:00
Filipe Brandenburger	451faff4ef	Use curl instead of wget to fetch the CNI tarball in e2e-node test Curl is more ubiquitous than wget. For instance, the GCE centos-7 and rhel-7 image families ship curl by default, but not wget. Looking at the shell scripts under cluster/, they tend to use curl more than wget. (The ones that use wget, such as get-kube.sh, try curl first and only fall back to wget if it's not available.) Tested: by running node-e2e-test on Ubuntu, COS and CentOS.	2018-03-27 09:41:09 -07:00
Michael Taufen	ab8dc12333	node authorizer sets up access rules for dynamic config This PR makes the node authorizer automatically set up access rules for dynamic Kubelet config. I also added some validation to the node strategy, which I discovered we were missing while writing this.	2018-03-27 08:49:45 -07:00
Kubernetes Submit Queue	915798d229	Merge pull request #60563 from hzxuzhonghu/replace-context Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Replace package "golang.org/x/net/context" with "context" What this PR does / why we need it: Replace package "golang.org/x/net/context" with "context" Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #60560 Special notes for your reviewer: As of Go 1.7 this package (golang.org/x/net/context) is available in the standard library under the name context; see https://godoc.org/golang.org/x/net/context. It is almost a mechanical replacement. Release note: ```release-note NONE ```	2018-03-23 16:34:23 -07:00
Kubernetes Submit Queue	1b6b2ee790	Merge pull request #61478 from shyamjvs/capture-pod-startup-phases-as-metrics Automatic merge from submit-queue (batch tested with PRs 61378, 60915, 61499, 61507, 61478). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Capture pod startup phases as metrics Learning from https://github.com/kubernetes/kubernetes/issues/60589, we should also start collecting and graphing sub-parts of pod-startup latency. /sig scalability /kind feature /priority important-soon /cc @wojtek-t ```release-note NONE ```	2018-03-22 07:15:33 -07:00
hzxuzhonghu	70e45eccf2	Replace "golang.org/x/net/context" with "context"	2018-03-22 20:57:14 +08:00
Shyam Jeedigunta	0f0c754eb4	Get rid of duplicate VerifyPodStartupLatency util in node density tests	2018-03-21 16:58:31 +01:00
Shyam Jeedigunta	b0dd166fa3	Capture different parts of pod-startup latency as metrics	2018-03-21 16:58:25 +01:00
Lantao Liu	9fc2795d55	Change pods memory boundary. Signed-off-by: Lantao Liu <lantaol@google.com>	2018-03-20 23:24:16 +00:00
Kubernetes Submit Queue	c64f19dd1b	Merge pull request #59728 from wgliang/master.append Automatic merge from submit-queue (batch tested with PRs 59740, 59728, 60080, 60086, 58714). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. more concise to merge the slice What this PR does / why we need it: more concise to merge the slice Special notes for your reviewer:	2018-03-19 21:34:30 -07:00
Kubernetes Submit Queue	a3f40dd8df	Merge pull request #60856 from jiayingz/race-fix Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Fixes the races around devicemanager Allocate() and endpoint deletion. There is a race in predicateAdmitHandler Admit() that getNodeAnyWayFunc() could get Node with non-zero deviceplugin resource allocatable for a non-existing endpoint. That race can happen when a device plugin fails, but is more likely when kubelet restarts as with the current registration model, there is a time gap between kubelet restart and device plugin re-registration. During this time window, even though devicemanager could have removed the resource initially during GetCapacity() call, Kubelet may overwrite the device plugin resource capacity/allocatable with the old value when node update from the API server comes in later. This could cause a pod to be started without proper device runtime config set. To solve this problem, introduce endpointStopGracePeriod. When a device plugin fails, don't immediately remove the endpoint but set stopTime in its endpoint. During kubelet restart, create endpoints with stopTime set for any checkpointed registered resource. The endpoint is considered to be in stopGracePeriod if its stoptime is set. This allows us to track what resources should be handled by devicemanager during the time gap. When an endpoint's stopGracePeriod expires, we remove the endpoint and its resource. This allows the resource to be exported through other channels (e.g., by directly updating node status through API server) if there is such use case. Currently endpointStopGracePeriod is set as 5 minutes. 
Given that an endpoint is no longer immediately removed upon disconnection, mark all its devices unhealthy so that we can signal the resource allocatable change to the scheduler to avoid scheduling more pods to the node. When a device plugin endpoint is in stopGracePeriod, pods requesting the corresponding resource will fail admission handler. Tested: Ran GPUDevicePlugin e2e_node test 100 times and all passed now. What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes https://github.com/kubernetes/kubernetes/issues/60176 Special notes for your reviewer: Release note: ```release-note Fixes the races around devicemanager Allocate() and endpoint deletion. ```	2018-03-12 02:50:13 -07:00
Jiaying Zhang	5514a1f4dd	Fixes the races around devicemanager Allocate() and endpoint deletion. There is a race in predicateAdmitHandler Admit() that getNodeAnyWayFunc() could get Node with non-zero deviceplugin resource allocatable for a non-existing endpoint. That race can happen when a device plugin fails, but is more likely when kubelet restarts as with the current registration model, there is a time gap between kubelet restart and device plugin re-registration. During this time window, even though devicemanager could have removed the resource initially during GetCapacity() call, Kubelet may overwrite the device plugin resource capacity/allocatable with the old value when node update from the API server comes in later. This could cause a pod to be started without proper device runtime config set. To solve this problem, introduce endpointStopGracePeriod. When a device plugin fails, don't immediately remove the endpoint but set stopTime in its endpoint. During kubelet restart, create endpoints with stopTime set for any checkpointed registered resource. The endpoint is considered to be in stopGracePeriod if its stoptime is set. This allows us to track what resources should be handled by devicemanager during the time gap. When an endpoint's stopGracePeriod expires, we remove the endpoint and its resource. This allows the resource to be exported through other channels (e.g., by directly updating node status through API server) if there is such use case. Currently endpointStopGracePeriod is set as 5 minutes. Given that an endpoint is no longer immediately removed upon disconnection, mark all its devices unhealthy so that we can signal the resource allocatable change to the scheduler to avoid scheduling more pods to the node. When a device plugin endpoint is in stopGracePeriod, pods requesting the corresponding resource will fail admission handler.	2018-03-09 17:00:57 -08:00
Kubernetes Submit Queue	ae7be34c32	Merge pull request #60509 from verb/pid-e2e Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Add node-e2e test for ShareProcessNamespace What this PR does / why we need it: Adds a node-e2e test for kubernetes/features#495 Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #59554 Special notes for your reviewer: This requires a feature gate to be enabled in both the kubelet and API server. I'm not sure which jenkins configs need to be updated (or if these are even still used) so I just updated a pile of them. opened kubernetes/test-infra#7030 for https://github.com/kubernetes/test-infra/blob/master/jobs/config.json Release note: ```release-note NONE ```	2018-03-05 14:20:14 -08:00
David Ashpole	395bea9d83	increase amount of memory filled by memory allocatable eviction test	2018-03-02 10:00:03 -08:00
Jiaying Zhang	6d7e6599f1	I forgot the fact that the DevicePlugin test itself restarts Kubelet for testing purpose. Move that test back to Serial but constructs a smaller test without kubelet restart that we may run during presubmit.	2018-03-01 14:02:09 -08:00
Kubernetes Submit Queue	5d26ef96a8	Merge pull request #59345 from hanxiaoshuai/fixtodo02051 Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. fix todo:Move function readinessCheck to util What this PR does / why we need it: fix todo:Move function readinessCheck to util in test/e2e_node/services Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note NONE ```	2018-02-28 08:06:34 -08:00
Kubernetes Submit Queue	5be121aca7	Merge pull request #60376 from mikedanese/fixup Automatic merge from submit-queue (batch tested with PRs 60376, 55584, 60358, 54631, 60291). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. remove gcloud docker -- since it's deprecated docker handles this now and it raises an error. try 3 ```release-note NONE ```	2018-02-28 03:37:21 -08:00
Kubernetes Submit Queue	2023c019eb	Merge pull request #60451 from jiayingz/e2e_node_enable Automatic merge from submit-queue (batch tested with PRs 60236, 60332, 57375, 60451, 57408). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Update device plugin e2e_node test to not changing Kubelet config as DevicePlugins feature is enabled by default now. What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note ```	2018-02-28 01:12:32 -08:00
Mike Danese	c0b7364563	remove gcloud docker -- since it's deprecated	2018-02-28 00:24:27 -08:00
Lee Verberne	b02f1f2ce3	Add node-e2e test for ShareProcessNamespace	2018-02-28 09:15:56 +01:00
Ryan Hitchman	8aa3ca3cbb	Add a few "+build linux" tags where appropriate.	2018-02-27 13:53:32 -08:00
Ryan Hitchman	e04b91facf	Remove unused variables (only assigned to) from test code. This is revealed by the go/types package, which is stricter than the Go compiler about unused variables. See also: golang/go#8560	2018-02-27 13:45:31 -08:00
Jiaying Zhang	fee083feac	Update device plugin e2e_node test to not changing Kubelet config as DevicePlugins feature is enabled by default now.	2018-02-26 22:45:44 -08:00
Kubernetes Submit Queue	e31c8a2252	Merge pull request #60318 from jiayingz/api-change Automatic merge from submit-queue (batch tested with PRs 59159, 60318, 60079, 59371, 57415). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Made a couple API changes to deviceplugin/v1beta1 to avoid future incompatible API changes: - Add GetDevicePluginOptions rpc call. This is needed when we switch from Registration service to probe-based plugin watcher. - Change AllocateRequest and AllocateResponse to allow device requests from multiple containers in a pod. Currently only made mechanical change on the devicemanager and test code to cope with the API but still issues an Allocate call per container. We can modify the devicemanager in 1.11 to issue a single Allocate call per pod. The change will also facilitate incremental API change to communicate pod level information through Allocate rpc if there is such future need. What this PR does / why we need it: Made a couple API changes to deviceplugin/v1beta1 to avoid future incompatible API changes. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes https://github.com/kubernetes/kubernetes/issues/59370 Special notes for your reviewer: Release note: ```release-note ```	2018-02-24 21:19:33 -08:00
Kubernetes Submit Queue	720c29b3e8	Merge pull request #60314 from mtaufen/kubelet-manifest-is-oldspeak Automatic merge from submit-queue (batch tested with PRs 60324, 60269, 59771, 60314, 59941). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. expunge the word 'manifest' from Kubelet's config API The word 'manifest' technically refers to a container-group specification that predated the Pod abstraction. We should avoid using this legacy terminology where possible. Fortunately, the Kubelet's config API will be beta in 1.10 for the first time, so we still had the chance to make this change. I left the flags alone, since they're deprecated anyway. I changed a few var names in files I touched too, but this PR is the just the first shot, not the whole campaign (`git grep -i manifest \| wc -l -> 1248`). ```release-note Some field names in the Kubelet's now v1beta1 config API differ from the v1alpha1 API: PodManifestPath is renamed to PodPath, ManifestURL is renamed to PodURL, ManifestURLHeader is renamed to PodURLHeader. ```	2018-02-24 20:01:46 -08:00

1498 Commits (k3s-v1.15.3)