github/k3s - k3s - https://git.xinac.net

Commit Graph

Author	SHA1	Message	Date
Derek Carr	5c8b957779	Fix faulty assumptions in summary API testing	2017-03-20 14:56:11 -04:00
Kubernetes Submit Queue	5e0f0047dd	Merge pull request #43242 from timstclair/summary-test Automatic merge from submit-queue Relax 'misc' container memory constraints Fixes https://github.com/kubernetes/kubernetes/issues/40607 /cc @dchen1107	2017-03-16 15:25:23 -07:00
Tim St. Clair	827dd340d4	Relax 'misc' container memory constraints	2017-03-16 12:08:22 -07:00
Kubernetes Submit Queue	6656ffc300	Merge pull request #43165 from Random-Liu/update-npd Automatic merge from submit-queue Update npd to the official v0.3.0 release. Update npd to the official release v0.3.0. This also fixes a npd bug https://github.com/kubernetes/node-problem-detector/pull/98. @dchen1107 @kubernetes/node-problem-detector-reviewers	2017-03-16 11:23:43 -07:00
Random-Liu	c4b3fd4e63	Update npd to the official v0.3.0 release.	2017-03-15 14:26:12 -07:00
Tim St. Clair	9a1236ae20	Add process debug information to summary test	2017-03-14 17:45:12 -07:00
Vishnu Kannan	8ed9bff073	handle container restarts for GPUs Signed-off-by: Vishnu Kannan <vishnuk@google.com>	2017-03-13 10:58:26 -07:00
Random-Liu	f81460e35d	Change the junit file name format to `junit_image-name_id.xml`, and make the gci image name shorter.	2017-03-09 16:47:48 -08:00
Kubernetes Submit Queue	7c08e817a5	Merge pull request #42734 from dashpole/deletion_timeout Automatic merge from submit-queue (batch tested with PRs 42734, 42745, 42758, 42814, 42694) Create DefaultPodDeletionTimeout for e2e tests In our e2e and e2e_node tests, we had a number of different timeouts for deletion. Recent changes to the way deletion works (#41644, #41456) have resulted in some timeouts in e2e tests. #42661 was the most recent fix for this. Most of these tests are not meant to test pod deletion latency, but rather just to clean up pods after a test is finished. For this reason, we should change all these tests to use a standard, fairly high timeout for deletion. cc @vishh @Random-Liu	2017-03-09 15:06:53 -08:00
Kubernetes Submit Queue	eefa2ef1bb	Merge pull request #42425 from apprenda/kubeadm_189_docker_version Automatic merge from submit-queue (batch tested with PRs 42762, 42739, 42425, 42778) kubeadm: update docker version for CE and EE What this PR does / why we need it: Update regex for docker version to also capture new CE and EE versions. Which issue this PR fixes: fixes #https://github.com/kubernetes/kubeadm/issues/189 Special notes for your reviewer: /cc @jbeda @luxas Release note: ```release-note NONE ```	2017-03-09 02:51:40 -08:00
David Ashpole	3806d386df	use default timeout for deletion	2017-03-08 14:40:19 -08:00
Derek McQuay	35f07095d8	kubeadm: validators pass warnings and errors This change allows validators to pass warnings as well as errors. This was needed because of how support for docker 1.13+ and the new EE and CE versions is currently being handled.	2017-03-08 14:35:26 -08:00
Kubernetes Submit Queue	bf7f42d362	Merge pull request #42499 from dashpole/memcg_test_suite Automatic merge from submit-queue New e2e node test suite with memcg turned on The flag --experimental-kernal-memcg-notification was initially added to allow disabling an eviction feature which used memcg notifications to make memory evictions more reactive. As documented in #37853, memcg notifications increased the likelihood of encountering soft lockups, especially on CVM. This feature would valuable to turn on, at least for GCI, since soft lockup issues were less prevalent on GCI and appeared (at the time) to be unrelated to memcg notifications. In the interest of caution, I would like to monitor serial tests on GCI with --experimental-kernal-memcg-notification=true. cc @vishh @Random-Liu @dchen1107 @kubernetes/sig-node-pr-reviews	2017-03-07 18:47:40 -08:00
Kubernetes Submit Queue	0d60fc4013	Merge pull request #42687 from dashpole/flaky_to_serial Automatic merge from submit-queue (batch tested with PRs 42664, 42687) [Fix Flaky Tests] E2e Node Flaky test suite runs serially The [e2e Node Flaky Test Suite](https://k8s-testgrid.appspot.com/google-node#kubelet-flaky-gce-e2e&width=20) has been failing with strange errors. This is because the tests in that suite are meant to be run serially, but are running in parallel, since that was left out of the config. This PR fixes this by changing the Flaky test suite to serial cc @Random-Liu	2017-03-07 17:51:17 -08:00
David Ashpole	b0d138692e	make the flaky suite run serially. Should prevent all the dynamic config errors	2017-03-07 15:12:17 -08:00
David Ashpole	0e20caf3fb	new suite with memcg turned on	2017-03-07 14:14:08 -08:00
David Ashpole	6a0d5506c2	use default timeout	2017-03-07 11:45:59 -08:00
Derek McQuay	eeefd2ca87	kubeadm: fail on docker version 1.13+, CE, and EE	2017-03-07 10:20:32 -08:00
Derek Carr	48d822eafe	cgroup names created by kubelet should be lowercased	2017-03-06 11:19:21 -05:00
Kubernetes Submit Queue	cb0728c50f	Merge pull request #42457 from yujuhong/do_not_panic Automatic merge from submit-queue (batch tested with PRs 42456, 42457, 42414, 42480, 42370) node e2e: apparmor test should fail instead of panicking This doesn't fix #42420, but at least stop the test from panicking.	2017-03-04 00:17:42 -08:00
Kubernetes Submit Queue	f9ccee7714	Merge pull request #42435 from dashpole/timestamps_for_fsstats Automatic merge from submit-queue (batch tested with PRs 42369, 42375, 42397, 42435, 42455) [Bug Fix]: Avoid evicting more pods than necessary by adding Timestamps for fsstats and ignoring stale stats Continuation of #33121. Credit for most of this goes to @sjenning. I added volume fs timestamps. why is this a bug This PR attempts to fix part of https://github.com/kubernetes/kubernetes/issues/31362 which results in multiple pods getting evicted unnecessarily whenever the node runs into resource pressure. This PR reduces the chances of such disruptions by avoiding reacting to old/stale metrics. Without this PR, kubernetes nodes under resource pressure will cause unnecessary disruptions to user workloads. This PR will also help deflake a node e2e test suite. The eviction manager currently avoids evicting pods if metrics are old. However, timestamp data is not available for filesystem data, and this causes lots of extra evictions. See the [inode eviction test flakes](https://k8s-testgrid.appspot.com/google-node#kubelet-flaky-gce-e2e) for examples. This should probably be treated as a bugfix, as it should help mitigate extra evictions. cc: @kubernetes/sig-storage-pr-reviews @kubernetes/sig-node-pr-reviews @vishh @derekwaynecarr @sjenning	2017-03-03 23:21:48 -08:00
Kubernetes Submit Queue	2d319bd406	Merge pull request #42204 from dashpole/allocatable_eviction Automatic merge from submit-queue Eviction Manager Enforces Allocatable Thresholds This PR modifies the eviction manager to enforce node allocatable thresholds for memory as described in kubernetes/community#348. This PR should be merged after #41234. cc @kubernetes/sig-node-pr-reviews @kubernetes/sig-node-feature-requests @vishh Why is this a bug/regression Kubelet uses `oom_score_adj` to enforce QoS policies. But the `oom_score_adj` is based on overall memory requested, which means that a Burstable pod that requested a lot of memory can lead to OOM kills for Guaranteed pods, which violates QoS. Even worse, we have observed system daemons like kubelet or kube-proxy being killed by the OOM killer. Without this PR, v1.6 will have node stability issues and regressions in an existing GA feature `out of Resource` handling.	2017-03-03 20:20:12 -08:00
Kubernetes Submit Queue	67500b3947	Merge pull request #42443 from Random-Liu/fix-node-e2e-npd Automatic merge from submit-queue (batch tested with PRs 42443, 38924, 42367, 42391, 42310) Cast system uptime to time.Duration to fix cross build. Fixes https://github.com/kubernetes/kubernetes/issues/42441. Cast system uptime to `time.Duration` to avoid different behavior on different architectures. @sjenning @ixdy @ncdc	2017-03-03 18:08:38 -08:00
Kubernetes Submit Queue	98eae9b222	Merge pull request #42341 from dashpole/critial_pod_test Automatic merge from submit-queue Critial pod test uses allocatable instead of capacity This solves #42239. When this test was first introduced, pods could request up to the capacity of the node. With the addition of allocatable introduced in #41234, this is no longer the case, and pods can only use up to allocatable. This should be included in 1.6, as it is a bug related to a 1.6 feature. cc @vish @yujuhong	2017-03-03 14:34:37 -08:00
Yu-Ju Hong	1d907dbf4f	node e2e: apparmor test should fail instead of panicking	2017-03-02 16:36:52 -08:00
David Ashpole	a90c7951d4	add volume timestamps	2017-03-02 15:01:59 -08:00
Random-Liu	d41c2503e7	Cast system uptime to time.Duration to fix cross build.	2017-03-02 14:48:09 -08:00
Kubernetes Submit Queue	1d97472361	Merge pull request #41928 from Random-Liu/move-npd-test-to-node-e2e Automatic merge from submit-queue (batch tested with PRs 41984, 41682, 41924, 41928) Move node problem detector test into node e2e. Move current NPD e2e test into node e2e. In fact, current NPD e2e test is only a functionality test for NPD. It creates test NPD pod, sets test configuration, generates test logs and verifies test result. It doesn't actually test the NPD really deployed in the cluster. So it doesn't actually need to run in cluster e2e. Running it in node e2e will: 1) Make it easier to run the test. 2) Make it more light weight to introduce this as a pre/post submit test in NPD repo in the future. Except this, I'm working on a cluster e2e to run some basic functionality test and benchmark test against the real NPD deployed in the cluster. Will send the PR later. /cc @dchen1107 @kubernetes/node-problem-detector-reviewers	2017-03-02 10:51:18 -08:00
David Ashpole	ac612eab8e	eviction manager changes for allocatable	2017-03-02 07:36:24 -08:00
David Ashpole	5fa6515509	critial pod test uses allocatable instead of capacity	2017-03-01 09:57:17 -08:00
Kubernetes Submit Queue	ed479163fa	Merge pull request #42116 from vishh/gpu-experimental-support Automatic merge from submit-queue Extend experimental support to multiple Nvidia GPUs Extended from #28216 ```release-note `--experimental-nvidia-gpus` flag is replaced by `Accelerators` alpha feature gate along with support for multiple Nvidia GPUs. To use GPUs, pass `Accelerators=true` as part of `--feature-gates` flag. Works only with Docker runtime. ``` 1. Automated testing for this PR is not possible since creation of clusters with GPUs isn't supported yet in GCP. 1. To test this PR locally, use the node e2e. ```shell TEST_ARGS='--feature-gates=DynamicKubeletConfig=true' FOCUS=GPU SKIP="" make test-e2e-node ``` TODO: - [x] Run manual tests - [x] Add node e2e - [x] Add unit tests for GPU manager (< 100% coverage) - [ ] Add unit tests in kubelet package	2017-03-01 04:52:50 -08:00
Kubernetes Submit Queue	cda109d224	Merge pull request #36828 from mtaufen/eviction-test-thresholds Automatic merge from submit-queue (batch tested with PRs 42216, 42136, 42183, 42149, 36828) Set custom threshold for memory eviction test I am hoping this helps with memory eviction flakes, e.g. https://github.com/kubernetes/kubernetes/issues/32433 and https://github.com/kubernetes/kubernetes/issues/31676 /cc @derekwaynecarr @calebamiles @dchen1107	2017-02-28 21:17:05 -08:00
Vishnu kannan	318f4e102a	adding an e2e for GPUs Signed-off-by: Vishnu kannan <vishnuk@google.com>	2017-02-28 13:42:08 -08:00
Kubernetes Submit Queue	81d01a84e0	Merge pull request #41944 from jingxu97/Feb/mounter Automatic merge from submit-queue (batch tested with PRs 35094, 42095, 42059, 42143, 41944) Use chroot for containerized mounts This PR is to modify the containerized mounter script to use chroot instead of rkt fly. This will avoid the problem of possible large number of mounts caused by rkt containers if they are not cleaned up.	2017-02-28 09:20:21 -08:00
Vishnu kannan	9a65640789	fix go vet issues Signed-off-by: Vishnu kannan <vishnuk@google.com>	2017-02-27 21:24:45 -08:00
Vishnu Kannan	cc5f5474d5	add support for node allocatable phase 2 to kubelet Signed-off-by: Vishnu Kannan <vishnuk@google.com>	2017-02-27 21:24:44 -08:00
timchenxiaoyu	fb213582e6	fix typo retries	2017-02-27 11:52:01 +08:00
Kubernetes Submit Queue	16f87fe7d8	Merge pull request #40952 from dashpole/premption Automatic merge from submit-queue (batch tested with PRs 41994, 41969, 41997, 40952, 40576) Guaranteed admission for Critical Pods This is the first step in implementing node-level preemption for critical pods. It defines the AdmissionFailureHandler interface, which allows callers, like the kubelet, to define how failed predicates are handled, and take steps to correct failures if necessary. In the kubelet's implementation, it triggers preemption if the pod being admitted is critical, and if the only failed predicates are InsufficientResourceErrors, then it prempts (not yet implemented) other other pods to allow admission of the critical pod. cc: @vishh	2017-02-26 12:57:59 -08:00
Jing Xu	ac22416835	Use chroot for containerized mounts This PR is to modify the containerized mounter script to use chroot instead of rkt fly. This will avoid the problem of possible large number of mounts caused by rkt containers if they are not cleaned up.	2017-02-24 13:46:26 -08:00
David Ashpole	b798df8c44	check that innocent pod survives after evictions	2017-02-23 11:52:25 -08:00
David Ashpole	c58970e47c	critical pods can preempt other pods to be admitted	2017-02-23 10:31:20 -08:00
Kubernetes Submit Queue	bfdeaf302c	Merge pull request #41652 from ncdc/shared-informers-13-namespace Automatic merge from submit-queue (batch tested with PRs 39855, 41433, 41567, 41887, 41652) Switch namespace controller to shared informer @smarterclayton @derekwaynecarr @gmarek @wojtek-t @deads2k @sttts @liggitt @kubernetes/sig-scalability-pr-reviews	2017-02-23 09:36:38 -08:00
Kubernetes Submit Queue	59f4c5911a	Merge pull request #41819 from dchen1107/master Automatic merge from submit-queue (batch tested with PRs 38957, 41819, 41851, 40667, 41373) Bump GCI to gci-stable-56-9000-84-2 Changelogs since gci-beta-56-9000-80-0: - Fixed google-accounts-daemon breaks on GCI when network is unavailable. - Fixed iptables-restore performance regression. cc/ @adityakali @Random-Liu @fabioy	2017-02-22 19:59:33 -08:00
Random-Liu	1c8e127973	Move node problem detector test into node e2e.	2017-02-22 14:35:46 -08:00
Derek Carr	43ae6f49ad	Enable per pod cgroups, fix defaulting of cgroup-root when not specified	2017-02-21 16:34:22 -05:00
Dawn Chen	57fe26111e	Update node-e2e to gci-stable-56-9000-84-2	2017-02-21 10:05:44 -08:00
Andy Goldstein	99313cc394	Switch namespace controller to shared informer	2017-02-17 12:34:27 -05:00
Yu-Ju Hong	0189da49ce	Add non-cri configurations for node e2e tests	2017-02-15 11:02:53 -08:00
Kubernetes Submit Queue	4ac7fd9d19	Merge pull request #40934 from dashpole/density_test_cadvisor Automatic merge from submit-queue delete cadvisor pod after test tracing looks at events for pod deletion and volume teardown. SInce the cadvisor pod has more than 1 volume, this can make results harder to analyze. This PR moves the deletion of the cadvisor pod to after the logPodCreateThroughput call, since that marks the "end" of the test. cc: @dchen1107 @Random-Liu	2017-02-14 17:40:32 -08:00
Random-Liu	1226c5794a	Print running containers in infra container oom score test.	2017-02-10 17:45:21 -08:00

1 2 3 4 5 ...

794 Commits (da74b86b9959d3083143fd11002589fb50f863c9)