Commit Graph

3379 Commits (39e3c986cc448b42892b6d66dca0cdae9909f455)

Author SHA1 Message Date
Wojciech Tyczynski 67e4fa5b6e Reduce maximum number of net-proxy pods to make test work in large clusters 2016-09-15 10:45:03 +02:00
Kubernetes Submit Queue 4387dcbdd5 Merge pull request #32659 from wojtek-t/fix_gke_cluster
Automatic merge from submit-queue

Make networking tests work reasonably fast in large clusters

@bprashanth - FYI
2016-09-14 06:22:52 -07:00
Kubernetes Submit Queue b77e2728d8 Merge pull request #32560 from dshulyak/enable_signer_for_vagrant
Automatic merge from submit-queue

Allow to use GetSigner with vagrant provider

In order to run tests that require ssh access to a node on vagrant
we need to provide path to private ssh key.

Now it will be possible to do using VAGRANT_SSH_KEY environment variable
2016-09-14 05:35:55 -07:00
Wojciech Tyczynski b86ba6eab0 Make networking tests work reasonably fast in large clusters 2016-09-14 12:27:20 +02:00
Kubernetes Submit Queue e03d35cfd2 Merge pull request #32173 from bprashanth/nginx_slow
Automatic merge from submit-queue

Move nginx ingress e2e to slow

Normal GCE L7 e2e takes ~15m and runs in a feature private suite. This e2e ensure that the api isn't broken, by creating an nginx controller. I plan to write a really slimmed down version for presubmit, but I need to shave off a minute to get it below 5m. 

Fixes https://github.com/kubernetes/kubernetes/issues/23416
2016-09-14 01:13:40 -07:00
Kubernetes Submit Queue 1443fffc5d Merge pull request #30701 from ardnaxelarak/28695_suppress_noisy_output
Automatic merge from submit-queue

Add test for --quiet flag for kubectl run

This adds a test for the changes introduced in #30247 and #28801.

Ref #28695
2016-09-13 23:28:47 -07:00
Kubernetes Submit Queue 61ba82c637 Merge pull request #32632 from jlowdermilk/petset-e2e
Automatic merge from submit-queue

Only skip petset test if resource is missing

**What this PR does / why we need it**:
Unblock testing petset on other providers.

cc @pwittrock. Would like to cherrypick onto 1.4 but this is test code only, so it can wait til after release cut.
2016-09-13 22:09:40 -07:00
Jeff Lowdermilk 1631c6dd24 update comment, only skip on missing resource if provider is not gce 2016-09-13 19:30:00 -07:00
Jeff Lowdermilk 915e735d21 Only skip petset test if resource is missing 2016-09-13 18:50:36 -07:00
Janet Kuo 79373b698e Skip kubectl tests (create quota and exit code) on not supported versions 2016-09-13 17:25:19 -07:00
Kubernetes Submit Queue a26da7a1c4 Merge pull request #32160 from mml/evict.test.extra
Automatic merge from submit-queue

Refactor disruption e2e test and add some eviction test cases.
2016-09-13 17:23:02 -07:00
Kubernetes Submit Queue 51c4170c69 Merge pull request #32588 from maisem/upgrade_images
Automatic merge from submit-queue

Adding support for upgrading testing across image types.

Adds support for upgrade testing across image types.

@spxtr @vishh @ixdy @pwittrock

This change only affects upgrade testing. This does not touch production code and hence should be safe for cherrypicks into the 1.4 release branch.
2016-09-13 15:59:08 -07:00
Matt Liggett 138647e329 Refactor disruption e2e test and add some eviction test cases. 2016-09-13 15:31:49 -07:00
Kubernetes Submit Queue 4d0d9fa8f3 Merge pull request #32545 from nikhiljindal/fixFedSecTest
Automatic merge from submit-queue

Fixing federation secret and replicaset e2e tests

Verified that both of these pass once https://github.com/kubernetes/kubernetes/pull/32438 is merged.

cc @kubernetes/sig-cluster-federation 

Ref https://github.com/kubernetes/kubernetes/pull/31655#discussion_r78432043
2016-09-13 15:17:15 -07:00
Maisem Ali 19ca81cd36 Adding support for upgrading testing across image types. 2016-09-13 13:46:20 -07:00
nikhiljindal d19cfc0dcc Fixing federation secret and replicaset e2e tests 2016-09-13 12:20:46 -07:00
Kubernetes Submit Queue 3206be19bd Merge pull request #32539 from quinton-hoole/2016-09-12-enable-remaining-federated-ingress-e2e-tests
Automatic merge from submit-queue

Re-enable Federated Ingress e2e test to check connectivity to global load balancer

...Now that it works properly.

Should not merge before #31600, as it will fail until then.
2016-09-13 11:07:56 -07:00
Piotr Szczesniak 66c5d72ae0 Enabled Rescheduler e2e for GKE 2016-09-13 13:34:28 +02:00
Dmitry Shulyak 615d1d6336 Allow to use GetSigner with vagrant provider
In order to run tests that require ssh access to a node on vagrant
we need to provide path to private ssh key.

Now it will be possible to do using VAGRANT_SSH_KEY environment variable

Change-Id: Ic5fe0037edd46d0db3b8036ad7fc03cf1ea07574
2016-09-13 11:23:55 +03:00
Kubernetes Submit Queue 8f2f836e4e Merge pull request #32106 from jellonek/fd_leaks
Automatic merge from submit-queue

Ensure that we are closing files.

**What this PR does / why we need it**: In several places we are leaking file descriptors. This could be problematic on systems with low ulimits for them.

**Release note**:
```release-note
```
2016-09-12 23:43:38 -07:00
Kubernetes Submit Queue a44d2f8009 Merge pull request #32243 from caesarxuchao/generate-1-5-clientset
Automatic merge from submit-queue

Generate 1 5 clientset

Generate the 1.5 clientset. Stop updating 1.4 clientset. Remove 1.2 clientset.

@nikhiljindal @lavalamp 

I will rebase #31994 atop of this one.
2016-09-12 23:04:50 -07:00
Kubernetes Submit Queue c39f5681b7 Merge pull request #32316 from deads2k/client-05-remaining-clients
Automatic merge from submit-queue

remove the rest of the non-generated clients from the kubectl code

Die `Client` Die!

It's always bigger than you think.  Last bit @kargakis after this, it's gone.
2016-09-12 21:45:16 -07:00
Kubernetes Submit Queue 523fa24659 Merge pull request #31102 from sttts/sttts-podclient-doc
Automatic merge from submit-queue

Add some docs about the missing node e2e scheduling
2016-09-12 18:26:49 -07:00
Quinton Hoole a4696bb8c8 Re-enable e2e test to check connectivity to global load balancer, now that it works properly. 2016-09-12 17:34:55 -07:00
Kubernetes Submit Queue 6a739aae29 Merge pull request #32515 from nikhiljindal/fixFedRSTest
Automatic merge from submit-queue

federation e2e: Initialising cluster var before using it

Fixing https://github.com/kubernetes/kubernetes/pull/31904#discussion_r78433657

cc @jianhuiz @kubernetes/sig-cluster-federation
2016-09-12 14:04:46 -07:00
Kubernetes Submit Queue 74ad07de86 Merge pull request #32500 from kubernetes/revert-32196-rescheduler-e2e-gke
Automatic merge from submit-queue

Revert "Enabled Rescheduler e2e for gke"

Reverts kubernetes/kubernetes#32196

This broke gke-serial.

fix #31710
2016-09-12 12:49:53 -07:00
nikhiljindal 4c22c3fce5 Initialising cluster var before using it 2016-09-12 12:04:06 -07:00
Saad Ali 9ac08144d1 Revert "Remove long sleep in provisioning e2e tests." 2016-09-12 11:16:34 -07:00
Piotr Szczesniak 70755448b0 Revert "Enabled Rescheduler e2e for gke" 2016-09-12 17:25:13 +02:00
deads2k 519c13745f convert rest of kubectl to generated clients 2016-09-12 08:22:34 -04:00
Kubernetes Submit Queue 148628cbba Merge pull request #32279 from jsafrane/remove-deletion-sleep
Automatic merge from submit-queue

Remove long sleep in provisioning e2e tests.

PV controller sync is now 15 seconds, i.e. the controller re-tries to delete a PV four times in a minute until it succeeds. There is no need to wait for three minutes.

@kubernetes/sig-storage
2016-09-12 04:39:48 -07:00
Kubernetes Submit Queue fc30bf7e8b Merge pull request #32374 from soltysh/e2e_check_resource
Automatic merge from submit-queue

Provide an e2e skip helper checking for available resource

@janetkuo @dims this is the promised util function, but unfortunately I just learned that dynamic client suffers from the problem I've fixed in the manually written one (https://github.com/kubernetes/kubernetes/pull/29187) I need to look into the dynamic client in that case :/
2016-09-12 03:59:30 -07:00
gmarek a6e8789194 Relax the CPU constraint for L7 lb 2016-09-12 10:42:44 +02:00
Kubernetes Submit Queue af325ee7bf Merge pull request #31797 from aveshagarwal/master-dapi-volume-tests-image-update
Automatic merge from submit-queue

Update container image version for downward api volume tests

Some tests were using 0.7, and some were using 0.6, so updating all to 0.7.
@kubernetes/rh-cluster-infra
2016-09-12 01:22:27 -07:00
Piotr Skamruk dc3674c8d0 Ensure that we are closing files. 2016-09-12 09:33:24 +02:00
Kubernetes Submit Queue 64e914b8da Merge pull request #32365 from lojies/addporttokubectl
Automatic merge from submit-queue

add targetport to framework.Failf()

add target port value to framework.Failf()
2016-09-12 00:08:19 -07:00
Kubernetes Submit Queue 4b7c44b7c8 Merge pull request #32135 from rootfs/fix-29225
Automatic merge from submit-queue

fix #29225: wait slow during image pull

fix #29225
@kubernetes/sig-testing @krousey
2016-09-12 00:08:14 -07:00
Kubernetes Submit Queue acb63f07e0 Merge pull request #32145 from jfrazelle/31855-fix-flake
Automatic merge from submit-queue

test/e2e: up the timeout on AllNodesReady

<!--  Thanks for sending a pull request!  Here are some tips for you:
1. If this is your first time, read our contributor guidelines https://github.com/kubernetes/kubernetes/blob/master/CONTRIBUTING.md and developer guide https://github.com/kubernetes/kubernetes/blob/master/docs/devel/development.md
2. If you want *faster* PR reviews, read how: https://github.com/kubernetes/kubernetes/blob/master/docs/devel/faster_reviews.md
3. Follow the instructions for writing a release note: https://github.com/kubernetes/kubernetes/blob/master/docs/devel/pull-requests.md#release-notes
-->

**What this PR does / why we need it**: help with flake issue #31855 

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:
<!--  Steps to write your release note:
1. Use the release-note-* labels to set the release note state (if you have access) 
2. Enter your extended release note in the below block; leaving it blank means using the PR title as the release note. If no release note is required, just write `NONE`. 
-->
```release-note
NONE
```

This is not the most glamorous fix, but...
2016-09-11 23:28:03 -07:00
Kubernetes Submit Queue 2f45b4f8c2 Merge pull request #32254 from kubernetes/revert-32233-scheduledjob-e2e
Automatic merge from submit-queue

Revert "tag scheduledjob e2e as [Feature:ScheduledJob]"

Reverts kubernetes/kubernetes#32233

The way the e2e jobs are configured, `[Feature:...]` tests can't easily be run in jenkins-pr or any of  submit-queue blocking jobs.
2016-09-11 22:11:22 -07:00
Kubernetes Submit Queue ce1c113442 Merge pull request #32046 from guangxuli/close_readBytesFromFile
Automatic merge from submit-queue

close the open file

the open file should be close.
2016-09-11 10:21:13 -07:00
Kubernetes Submit Queue 3f5132aa79 Merge pull request #32170 from bprashanth/ingress_leak
Automatic merge from submit-queue

Replace gcloud shelling out with cloudprovider calls.

gcloud flakes a lot leading to resource leak. Also fixes https://github.com/kubernetes/kubernetes/issues/16636 by verifying instance-groups, ssl-certs and firewall-rules and cleaned up.
2016-09-11 07:39:38 -07:00
Kubernetes Submit Queue f2951a54f9 Merge pull request #30674 from ivan4th/add-e2e-tests-for-wrapped-volume-race
Automatic merge from submit-queue

Add e2e tests that check for wrapped volume race

This PR adds two new e2e tests that reproduce the race condition fixed in #29641 (see e.g. #29297)

In order to observe the race, you need to revert the PR that fixes it, via e.g.
```
git revert -n df1e925143
```
or
```
curl -sL https://github.com/kubernetes/kubernetes/pull/29641.patch | patch -p1 -R
```

The tests are `[Slow]` because they need to run several passes that involve creating pods with many volumes. They also are `[Serial]` because the load on the cluster may affect reproducibility of the race. They take about ~450s each when they fail on standard GCE cluster created by `go run hack/e2e.go -v --up`. `git_repo` test takes about 66s to run when it succeeds (fix PR not reverted) and `configmap` test takes about 546s in this case because configmap mounting is slower and still requires 3 passes x 5 pods x 50 configmap volumes to fail constantly with fix PR reverted. Probably these times can be reduced but frankly I've already spent quite a bit of time on tuning the numbers to find a balance between reproducibility and speed.

Managed to reproduce the problem in more or less reliable way for `configMap` and `gitRepo` volumes. Tried to reproduce it for `secret` volumes too but without success so far because they use tmpfs-based `emptyDir` variety. For `downwardAPI` volumes I expect the same problems with race reproducibility as with `secret` volumes, although I think some e2e races were caused by the bug, e.g. #29633.

The tests operate by creating several pods (via an RC) with many volumes and waiting for them to become Running. It sets node affinity for pods so that they all get created on a single node (the first one in the node list). The race condition leads to volume mount failures with slow retries, thus causing the test to time out.

The test failures look like this:

configmap:
```
• Failure [435.547 seconds]
[k8s.io] Wrapped EmptyDir volumes
/go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/framework/framework.go:709
  should not cause race condition when used for configmaps [Serial] [Slow] [It]
  /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/wrapped_empty_dir.go:170

  Failed waiting for pod wrapped-volume-race-8c097734-6376-11e6-9ffa-5254003793ad-acbtt to enter running state
  Expected error:
      <*errors.errorString | 0xc8201758d0>: {
          s: "timed out waiting for the condition",
      }
      timed out waiting for the condition
  not to have occurred

  /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/wrapped_empty_dir.go:395
```
You'll see errors like this in kubelet log on the first node in the cluster:
```
E0816 00:27:23.319431    3510 configmap.go:174] Error creating atomic writer: stat /var/lib/kubelet/pods/e5986355-6347-11e6-a5d7-42010af00002/volumes/kubernetes.io~configmap/racey-configmap-14: no such file or directory
E0816 00:27:23.319478    3510 nestedpendingoperations.go:232] Operation for "\"kubernetes.io/configmap/e5986355-6347-11e6-a5d7-42010af00002-racey-configmap-14\" (\"e5986355-6347-11e6-a5d7-42010af00002\")" failed. No retries permitted until 2016-08-16 00:28:27.319450118 +0000 UTC (durationBeforeRetry 1m4s). Error: MountVolume.SetUp failed for volume "kubernetes.io/configmap/e5986355-6347-11e6-a5d7-42010af00002-racey-configmap-14" (spec.Name: "racey-configmap-14") pod "e5986355-6347-11e6-a5d7-42010af00002" (UID: "e5986355-6347-11e6-a5d7-42010af00002") with: stat /var/lib/kubelet/pods/e5986355-6347-11e6-a5d7-42010af00002/volumes/kubernetes.io~configmap/racey-configmap-14: no such file or directory
```

git_repo:
```
• Failure [455.035 seconds]                                                                                                                                                                                                                           [0/1882]
[k8s.io] Wrapped EmptyDir volumes
/go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/framework/framework.go:709
  should not cause race condition when used for git_repo [Serial] [Slow] [It]
  /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/wrapped_empty_dir.go:179

  Failed waiting for pod wrapped-volume-race-71b12b3d-6375-11e6-9ffa-5254003793ad-b0slz to enter running state
  Expected error:
      <*errors.errorString | 0xc8201758d0>: {
          s: "timed out waiting for the condition",
      }
      timed out waiting for the condition
  not to have occurred

  /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/wrapped_empty_dir.go:395
```
Errors in kubelet log:
```
E0815 23:41:08.670203    3510 nestedpendingoperations.go:232] Operation for "\"kubernetes.io/git-repo/97636bd8-6341-11e6-a5d7-42010af00002-racey-git-repo-8\" (\"97636bd8-6341-11e6-a5d7-42010af00002\")" failed. No retries permitted until 2016-08-15 23:42:12.670181604 +0000 UTC (durationBeforeRetry 1m4s). Error: MountVolume.SetUp failed for volume "kubernetes.io/git-repo/97636bd8-6341-11e6-a5d7-42010af00002-racey-git-repo-8" (spec.Name: "racey-git-repo-8") pod "97636bd8-6341-11e6-a5d7-42010af00002" (UID: "97636bd8-6341-11e6-a5d7-42010af00002") with: failed to exec 'git clone http://10.0.68.35:2345 test': : chdir /var/lib/kubelet/pods/97636bd8-6341-11e6-a5d7-42010af00002/volumes/kubernetes.io~git-repo/racey-git-repo-8: no such file or directory
```

Generally, the races cause unexpected "no such directory" errors in kubelet logs with subsequent volume mount failures.

I've added race tests to e2e test `empty_dir_wrapper.go` ("EmptyDir wrapper volumes"). This test was added in #18445, the same PR that introduced the race bug. The original purpose of the test was making sure that no conflicts occur between different wrapped emptyDir volumes, so I've replaced "should becomes" with "should not conflict" in the first `It(...)`.
2016-09-11 03:39:21 -07:00
Kubernetes Submit Queue dd9d5aa27d Merge pull request #32378 from kevin-wangzefeng/update-taints-e2e
Automatic merge from submit-queue

update taints e2e, restrict taints operation with key, effect

Since taints are now unique by key, effect on a node, this PR is to restrict existing taints adding/removing/updating operations in taints e2e.
Also fixes https://github.com/kubernetes/kubernetes/issues/31066#issuecomment-242870101
Related prior Issue/PR #29362 and #30590
2016-09-10 13:20:51 -07:00
Kubernetes Submit Queue 09efe0457d Merge pull request #32163 from mtaufen/more-eviction-logging
Automatic merge from submit-queue

Log pressure condition, memory usage, events in memory eviction test

I want to log this to help us debug some of the latest memory eviction test flakes, where we are seeing burstable "fail" before the besteffort. I saw (in the logs) attempts by the eviction manager to evict besteffort a while before burstable phase changed to "Failed", but the besteffort's phase appeared to remain "Running". I want to see the pressure condition interleaved with the pod phases to get a sense of the eviction manager's knowledge vs. pod phase.
2016-09-09 18:37:55 -07:00
Michael Taufen b9f0bd959e Log the following items in memory eviction test:
- memory working set
- pressure condition
- events for the default and test namespaces, after the test completes
2016-09-09 13:42:26 -07:00
Kubernetes Submit Queue e317af87cc Merge pull request #31819 from mtaufen/plumb-feature-gates
Automatic merge from submit-queue

Plumb --feature-gates from TEST_ARGS to components in node e2e tests

This means you can set `TEST_ARGS` on the command line, in a `.properties` config for a Jenkins job, etc, to toggle gated features. For example:

`TEST_ARGS='--feature-gates=DynamicKubeletConfig=true'`

/cc @vishh @jlowdermilk
2016-09-09 12:31:00 -07:00
Kubernetes Submit Queue cd30526b44 Merge pull request #32379 from wojtek-t/allow_for_notready_nodes_in_scalability
Automatic merge from submit-queue

Allow for not-ready nodes in WaitForAllNodesSchedulable

Ref #31215
2016-09-09 06:47:36 -07:00
Wojciech Tyczynski dc3dcc27d2 Allow for not-ready nodes in WaitForAllNodesSchedulable 2016-09-09 14:56:43 +02:00
deads2k 786879aa9b skip dynamic provisision test on gke until gke enables it 2016-09-09 07:45:40 -04:00
Maciej Szulik 7ee8f867d9 Provide an e2e skip helper checking for available resource. 2016-09-09 13:38:32 +02:00