Commit Graph

3492 Commits (c8ca49ec886cc86a96cd3ddf1e2a1eecaeade138)

Author SHA1 Message Date
k8s-merge-robot 3d332a047b Merge pull request #26961 from sttts/sttts-flake-26210-decouple-port-80
Automatic merge from submit-queue

Flake 26210: decouple explicit access from port 80

Flake #26210 only happens for port 80. To decouple the possible causes, all
tests with explicit port 80 are moved to port 1080 (these were 80% of the flakes).
The urls without a specified port (which map to port 80 though) are left untouched.

If port 1080 does not show up as flake now, there is really a connection to the
actual port number.
2016-06-08 11:16:13 -07:00
k8s-merge-robot 7695ae6334 Merge pull request #27038 from wojtek-t/reduce_number_of_logs
Automatic merge from submit-queue

Reduce huge amount of logs in large cluster tests

When running tests in 2000-node clusters, I got more than 100.000 lines like this:
```
Jun  8 01:03:11.850: INFO: Condition NetworkUnavailable of node gke-gke-large-cluster-default-pool-1-03ee5a12-knrw is true instead of false. Reason: NoRouteCreated, message: Node created w       ithout a route
```
that doesn't give much value.

This is PR is reducing the number of logs.
2016-06-08 10:29:24 -07:00
k8s-merge-robot 46f167188a Merge pull request #27006 from caesarxuchao/shorten-gc-stress-test
Automatic merge from submit-queue

shorten gc stress test

But the stress test is still run twice, because the `KUBE_TEST_API_VERSIONS` env var in test-integration.sh doesn't have any effect on the testapi because of the [defaulting](https://github.com/kubernetes/kubernetes/blob/master/pkg/api/testapi/testapi.go#L164).

cc @lavalamp
2016-06-08 08:50:17 -07:00
Wojciech Tyczynski 51f433d9a6 Reduce huge amount of logs in large cluster tests 2016-06-08 13:36:46 +02:00
Jerzy Szczepkowski 95a59734de Added extra logging to cluster autoscaler e2e tests.
Added extra logging to cluster autoscaler e2e tests to track flakiness.
2016-06-08 09:55:49 +02:00
Wojciech Tyczynski 878c8d41f1 Merge pull request #27008 from zmerlynn/e2e-routes
Networking e2es: Wait for all nodes to be schedulable in kubeproxy and networking tests
2016-06-08 09:02:30 +02:00
Zach Loafman 936297c64c Networking e2es: Wait for all nodes to be schedulable in kubeproxy and networking tests
Now that GCE routes take an extremely long time to come up and there's
a variance in "Ready" and "Schedulable", start cherry-picking tests
where we really want to have all nodes routable/schedulable for
testing. Adding logging. This will increase test times on large
clusters but should have 0 impact on normal testing.
2016-06-07 16:38:03 -07:00
k8s-merge-robot 457d42e70e Merge pull request #26959 from wojtek-t/fix_performance_flakes
Automatic merge from submit-queue

Fix scalability flakes in small clusters

Fix #26185

@xiang90 @hongchaodeng @gmarek
2016-06-07 16:13:10 -07:00
Chao Xu 965aadac39 shorten gc stress test 2016-06-07 15:58:30 -07:00
Dawn Chen 0d3be6a316 Merge pull request #26735 from timstclair/local-e2e
Fixes for running node e2es
2016-06-07 14:51:18 -07:00
Vishnu kannan 2494c77972 Disable CoreOS image from node e2e testing.
Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-06-07 10:45:37 -07:00
Dr. Stefan Schimanski 17e6797d72 Decouple explicit access from port 80
Flake #26210 only happens for port 80. To decouple the possible causes, all
tests with explicit port 80 are moved to port 1080 (these were 80% of the flakes).
The urls without a specified port (which map to port 80 though) are left untouched.

If port 1080 does not show up as flake now, there is really a connection to the
actual port number.
2016-06-07 15:50:12 +02:00
Wojciech Tyczynski 940882cb44 Fix scalability flakes in small clusters 2016-06-07 14:44:31 +02:00
Tim St. Clair 66d74f969d More reliable teardown of node e2e services 2016-06-06 15:30:23 -07:00
k8s-merge-robot 015bc3d60d Merge pull request #26811 from vishh/26809
Automatic merge from submit-queue

Mark runtime conformance tests as flaky and not run them in jenkins CI.

For #26809

@pwittrock As discussed offline, marking runtime tests as flaky for now. I'm not sure if those tests are required. Testing docker in every Kubernetes PR is un-necessary. 
These tests can be run periodically in a separate CI. AFAIK, these tests don't seem to exercise any kube features.
2016-06-06 11:14:44 -07:00
Marcin Wielgus 2b30eda8e2 Cluster autoscaling e2e tests cleanup 2016-06-06 16:53:09 +02:00
k8s-merge-robot 6ecfb4c8fa Merge pull request #26881 from jszczepkowski/kac-tests-cleanup
Automatic merge from submit-queue

Fixed cleaun-up after cluster autoscaling e2e tests.

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()

Fixed cleaun-up after cluster autoscaling e2e tests.
2016-06-06 05:59:24 -07:00
Jerzy Szczepkowski 3506280df6 Fixed cleaun-up after cluster autoscaling e2e tests.
Fixed cleaun-up after cluster autoscaling e2e tests.
2016-06-06 11:34:04 +02:00
Marcin Wielgus 95eaaeb883 Increase cluster size by 2 in scale down test to bypass Heapster nanny issue 2016-06-06 10:37:11 +02:00
k8s-merge-robot 6c5f5c5e36 Merge pull request #26571 from pwittrock/node-e2e-godeps
Automatic merge from submit-queue

Node e2e use vendored testing packages.


[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()
2016-06-04 14:54:25 -07:00
k8s-merge-robot a2d79d0ef6 Merge pull request #26825 from timstclair/node-e2e-pull
Automatic merge from submit-queue

Fix node e2e image pulling retry logic
2016-06-04 11:36:41 -07:00
k8s-merge-robot 8ba9bd3dcb Merge pull request #26714 from jszczepkowski/kac-event-test
Automatic merge from submit-queue

Improved check in shouldn't increase cluster size e2e test.

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()

Improved check in shouldn't increase cluster size e2e test.
2016-06-04 05:58:32 -07:00
k8s-merge-robot 5c8dd576e2 Merge pull request #26592 from vishh/26289
Automatic merge from submit-queue

Update Node e2e Core OS image to run systemd with CPU & Memory accounting enabled by default

cc @derekwaynecarr 

For #26289
2016-06-04 03:32:10 -07:00
k8s-merge-robot 22d405477b Merge pull request #26568 from derekwaynecarr/kubectl_qos
Automatic merge from submit-queue

Update kubectl describe pod to use pod level qos

Fixes https://github.com/kubernetes/kubernetes/issues/26374

/cc @vishh
2016-06-04 02:40:47 -07:00
k8s-merge-robot 707cc2bbb8 Merge pull request #26493 from caesarxuchao/fix-gc-flake
Automatic merge from submit-queue

Fixes 25890 flake. Let GC convert ListOptions to v1 before passing it to the dynamic client

GC's ListWatcher directly passed the api.ListOptions to the dynamic client, but the parameter codec of dynamic client converts the options to queries based on the tags in the struct, which are not present in api.ListOptions, so the queries are not sent to the server. As a result, the Watch request was sent without a resourceVersion, causing missed events. Flake #25890 is caused by the missed deletion events.

This PR converts the api.ListOptions to v1.ListOptions before the GC passes it to the dynamic codec. The flaky test has successfully passed 79 times ([log](https://00e9e64bacd064560a027fbee9c5a373a1614f3a56e652ae40-apidata.googleusercontent.com/download/storage/v1_internal/b/kubernetes-jenkins/o/pr-logs%2Fpull%2F25923%2Fkubernetes-pull-test-unit-integration%2F28364%2Fbuild-log.txt?qk=AD5uMEv72OjSUqDyk5i-ZLurcmM4i7gket1c7WaqR7yuIYz7WhPYT7ewVBafijV0ymnPTYqxRYt1kp6S9YQv7chPwC-3UtrKetKfhYnvAFrPGXAIBxHytTmpFohRAYgsARN1B6j1f9vyK5lM-8jyzRGhCK3sCRsAPnbDBWIWFlbH4b1n3vUET3P71QamHrF5itYyaqRU5pMZV3Cwwr81X8q7h5hCzm3Ip78RpMzfjEqTG0RcM2TLGccUrlkWVBLh4hn0NFpUIkzVFugFA5ooJffo-0AdJnO3mGWEOnXNVFWftJbK8cKnTns0DISrYFOyH_PlOe_YHCxgIXIT-dW8G-nbqoUjn5SBqunr36rcpaYCIwe2va4W_AcLCT43xiEAezRER_U9AuIqi_22KMd6SuHTyljhmWFPvPk8-gpjthLWXhcE7LPO5dV41hnZHnbI4n_9eI1nSVm7q9XdSvX1sWKV1GCwn8oj017AnxVvl9bScultko_0dTC747UqJ6UTFakLuFcHFe-F5Tz7ItDWlBVPoXeC7gTpyuicFKLsdqGlW9F5X6kIwNrBRj9uRsS-QuzSER-fVkQCn4dUTcokttRH_0bYvyfr9oqiDXmywMgOp-L0sKayk8JOVynh2q0Tju9sdkvFr0PxoAjhofomfIC1SZ_JkOzwAT1TUW8dLjPHluMct34xW_-qna1AmkoxM4bZQLhllap96NTC-0IdtzeKDrTul8p7u3WXSJjjEMSijibTNMlnkB0AluT1_RNO94OnzuFv4YlcV24FPhJzchhbyKREkOb_wzgcnSbRwGHjIcfRgkX-IzoXHVBcMYFUrPmsXrnRcfad4XwjkUOgvivkURW2_EwnzgrLDh-IKek51_0FpT1MnFCSG0gQbVSs_iMVPr6UXNAw62LGbKVtl3ZMXyapEpcO8azNbn6Wvd550R704JXxYlU)).

@lavalamp @krousey @smarterclayton
2016-06-04 01:52:31 -07:00
k8s-merge-robot 0418a2c0ad Merge pull request #26153 from jsafrane/intergation-binder-stress
Automatic merge from submit-queue

volume controller: add configurable integration test to stress the binder

The test tries to bind configured nr. of PVs to the same nr. of PVCs. '100' is used by default, which should take ~1-3 seconds (depends on log level). Periodic sync is needed in rare cases, which may add another 10 seconds. - cache from #25881 will help here and sync should not be needed at all.

The test is configurable and may be reused to measure binder performance. Set KUBE_INTEGRATION_PERSISTENTVOLUME_* env. variables as described in persistent_volume_test.go and run the tests:

```
# compile
$ cd test/integration
$ godep go test -tags 'integration no-docker' -c

# run the tests
$ KUBE_INTEGRATION_PERSISTENTVOLUME_SYNC_PERIOD=10s KUBE_INTEGRATION_PERSISTENTVOLUME_OBJECTS=1000 time ./integration.test -test.run TestPersistentVolumeMultiPVsPVCs -v 2
```

Log level '2' is useful to get timestamps of various events like 'TestPersistentVolumeMultiPVsPVCs: start' and 'TestPersistentVolumeMultiPVsPVCs: claims are bound'.
2016-06-04 00:13:29 -07:00
k8s-merge-robot 7d852febc9 Merge pull request #26794 from yujuhong/wait_before_test
Automatic merge from submit-queue

kubelet e2e: enforce that image prepulling must finish before the test

The image prepulling pod calls docker directly to pull images. If the pod
hasn't finished before running the resource usage tracking test, there'd be a
cpu spike in docker. We'd rather wait and fail if this is the case, before
running the test.
2016-06-03 19:42:42 -07:00
k8s-merge-robot 3f1960b73e Merge pull request #26314 from vishh/all-logs
Automatic merge from submit-queue

Link kernel log files to have them be copied over as part of node e2e.

Fixes #25993

cc @pwittrock
2016-06-03 18:49:25 -07:00
pwittrock c74997508e Node e2e use vendored testing packages. 2016-06-03 17:01:50 -07:00
Tim St. Clair 7153981bfe Fix node e2e image pulling retry logic 2016-06-03 16:37:58 -07:00
Vishnu kannan 82718c186c update coreos node e2e images to include updated systemd config
Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-06-03 16:14:26 -07:00
Vishnu kannan 7665120818 update attributes on node e2e copy image script
Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-06-03 16:14:26 -07:00
Vish Kannan 4e1596e61e Remove debug log line 2016-06-03 14:41:05 -07:00
Vishnu kannan 4e3bce8cb7 Mark runtime conformance tests as flaky and not run them in jenkins CI.
Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-06-03 13:57:44 -07:00
k8s-merge-robot 69e45dc30d Merge pull request #26671 from vishh/node-e2e-images
Automatic merge from submit-queue

Support images and instances to exist in different GCP projects for node

Forked off from #26314
2016-06-03 13:38:02 -07:00
derekwaynecarr 89ff3b2530 Update kubectl describe pod to use pod level qos 2016-06-03 15:32:32 -04:00
Yu-Ju Hong 02d7af28e0 kubelet e2e: enforce that image prepulling must finish before the test
The image prepulling pod calls docker directly to pull images. If the pod
hasn't finished before running the resource usage tracking test, there'd be a
cpu spike in docker. We'd rather wait and fail if this is the case, before
running the test.
2016-06-03 12:25:25 -07:00
Chao Xu 06f49f7ca7 Let the dynamic client take a customized parameter codec for List, Watch, and DeleteCollection.
Let the gc's ListWatcher use api.ParameterCodec. Fixes 25890.
2016-06-03 11:22:51 -07:00
Vishnu kannan 34a828d07e Link kernel log files to have them be copied over as part of node e2e.
Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-06-03 11:22:25 -07:00
Jerzy Szczepkowski a9dc78c6f6 Implemented e2e test: cluster autoscaler with node selector.
Implemented e2e test: cluster autoscaler with node selector.
2016-06-03 16:38:45 +02:00
Jerzy Szczepkowski 07d9e2dcb0 Implemented support for multiple MIGs in e2e cluster autoscaling tests.
Improved check in shouldn't increase cluster size e2e test.
2016-06-03 16:31:56 +02:00
k8s-merge-robot 81c5cfc139 Merge pull request #26773 from mwielgus/gke-autoscaling
Automatic merge from submit-queue

Setup for cluster autoscaling tests in GKE

cc: @piosz @jszczepkowski @fgrzadkowski
2016-06-03 07:31:14 -07:00
Marcin Wielgus ab56306bec Setup for cluster autoscaling tests in GKE 2016-06-03 14:45:08 +02:00
Jan Safranek 1cfc984b70 volume controller: add configurable integration test to stress the binder
The test tries to bind configured nr. of PVs to the same nr. of PVCs.
'100' is used by default, which should take ~1-3 seconds (depends on log level).
Periodic sync is needed in rare cases, which may add another 10 seconds. - cache
from #25881 will help here and sync should not be needed at all.

The test is configurable and may be reused to measure binder performance.
Set KUBE_INTEGRATION_PV_* env. variables as described in
persistent_volume_test.go and run the tests:

# compile
$ cd test/integration
$ godep go test -tags 'integration no-docker' -c

# run the tests
$ KUBE_INTEGRATION_PV_SYNC_PERIOD=10s KUBE_INTEGRATION_PV_OBJECTS=1000 time ./integration.test -test.run TestPersistentVolumeMultiPVsPVCs -v 2

Log level '2' is useful to get timestamps of various events like
'TestPersistentVolumeMultiPVsPVCs: start' and 'TestPersistentVolumeMultiPVsPVCs:
claims are bound'.
2016-06-03 12:52:32 +02:00
k8s-merge-robot 45aa24b6b7 Merge pull request #26629 from jsafrane/stabilize
Automatic merge from submit-queue

Stabilize persistent volume integration tests

- add more logs
- wait both for volume and claim to get bound

When binding volumes to claims the controller saves PV first and PVC right
after that. In theory, this saved PV could cause waitForPersistentVolumePhase
to finish and PVC could be checked in the test before the controller saves it.
So, wait for both PVC and PV to get bound and check the results only after
that. This is only a theory, there are no usable logs in integration tests.

Fixes #26499 (at least I hope so...)
2016-06-03 03:10:00 -07:00
k8s-merge-robot 4adc00eceb Merge pull request #26454 from thockin/licenses_again
Automatic merge from submit-queue

Don't allow deps with no discernible license

This updates the few deps we had with no LICENSE file to current versions that do have that file.  It also disallows new deps without obvious licenses.
2016-06-03 01:18:14 -07:00
k8s-merge-robot 693dae70a8 Merge pull request #26226 from ArtfulCoder/reversedns
Automatic merge from submit-queue

Added DNS Reverse Record logic for service IPs
2016-06-02 18:43:39 -07:00
Tim Hockin 3c6ce53b7f Fix call-sites after dep updates 2016-06-02 16:57:28 -07:00
k8s-merge-robot c114f32998 Merge pull request #26289 from derekwaynecarr/enable_accounting
Automatic merge from submit-queue

Enable node e2e accounting on systemd

Updated the e2e setup.sh script to enable cpu and memory accounting.

Related to https://github.com/kubernetes/kubernetes/issues/26198

/cc @pwittrock
2016-06-02 13:42:58 -07:00
k8s-merge-robot b6a7be6f66 Merge pull request #26533 from gmarek/retries
Automatic merge from submit-queue

Add retries for make in start-kubemark

Fix. #25872
2016-06-02 10:49:28 -07:00