Commit Graph

1279 Commits (01189549f3e27d8a42d32d0b2b36bda65eeb02ac)

Author SHA1 Message Date
Jeff Grafton 6b65d4a390 Don't fail teardown if we can't delete the network 2016-10-13 12:24:22 -07:00
Kubernetes Submit Queue 721f4be5b2 Merge pull request #34577 from ixdy/cleanup-network
Automatic merge from submit-queue

Delete all firewall rules (and optionally network) on GCE/GKE cluster teardown

Not entirely ready for review yet; I want to see what Jenkins thinks of this.
2016-10-13 10:48:34 -07:00
Jeff Grafton ee6e92473d gcloud compute instances list --zone -> --zones 2016-10-12 15:46:13 -07:00
Jeff Grafton 661d6bde6f Delete all firewall rules and network on GCE/GKE test cluster teardown 2016-10-12 15:46:06 -07:00
Mike Danese 51ec7c2845 retry salt-call in configure-vm.sh 2016-10-12 10:25:32 -07:00
Kubernetes Submit Queue 12b133577e Merge pull request #33278 from Crassirostris/gcl-e2e-test
Automatic merge from submit-queue

Add gcl cluster logging test

This PR changes default logging destination for tests to gcp and adds test for cluster logging using google cloud logging

Fix #20760
2016-10-12 06:54:25 -07:00
Michael Taufen 263c54c424 Fix upgrade.sh image setup
The original fix (#33147) sourced the correct `node-helper.sh` but set
`node_os_distribution` instead of `NODE_OS_DISTRIBUTION`. The
`set-node-image` function is imported indirectly via `source
"${KUBE_ROOT}/cluster/kube-util.sh"`, which in turn (in the GCE case)
sources `cluster/gce/util.sh`. Since the `set-node-image` function
relies on the `NODE_OS_DISTRIBUTION` variable, the original fix
did not have the entire intended effect.

I have confirmed that cherry-picking #33147 into the `release-1.4`
branch and layering this commit on top of it make for a successful
upgrade from a GCI based K8s 1.3 cluster to a GCI based K8s 1.4 cluster.
2016-10-10 10:54:58 -07:00
Kubernetes Submit Queue 8bcb85685e Merge pull request #34156 from adityakali/gci
Automatic merge from submit-queue

Update GCI_VERSION to gci-dev-55-8866-0-0

Update GCI base image:

Change log:
* Built-in kubernetes updated to v1.4.0
* Enabled VXLAN and IP_SET config options in kernel to support some networking tools
* OpenSSL CVE fixes

```release-note
Update GCI base image:
* Enabled VXLAN and IP_SET config options in kernel to support some networking tools (ebtools)
* OpenSSL CVE fixes
```

cc/ @kubernetes/goog-image cc/ @dchen1107
2016-10-07 16:35:20 -07:00
Kubernetes Submit Queue 86c322e014 Merge pull request #33520 from vishh/33444
Automatic merge from submit-queue

Nodefs becomes imagefs on GCI

Kubelet cannot identify rootfs correctly
For #33444

```release-note
Enforce Disk based pod eviction with GCI base image in Kubelet
```

Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-10-06 17:37:33 -07:00
Aditya Kali 11397e0f6d Update GCI_VERSION to gci-dev-55-8866-0-0
Changelog:
* Built-in kubernetes updated to v1.4.0
* Enabled VXLAN and IP_SET config options in kernel to support some networking tools
* OpenSSL CVE fixes
2016-10-06 15:43:29 -07:00
Kubernetes Submit Queue cccd3b3234 Merge pull request #33147 from vishh/fix-upgrade-sh
Automatic merge from submit-queue

Fix base image pinning during upgrades via cluster/gce/upgrade.sh

Fixes #33056

cc @timstclair @maisem
2016-10-05 01:19:56 -07:00
Lucas Käldström 950e23740b Remove the old flannel network mode in kube-up 2016-10-04 12:02:25 +03:00
Jeff Grafton ca42d7f016 Fix condition for deleting master etcd firewall rule 2016-10-03 12:40:47 -07:00
Kubernetes Submit Queue 8d9642e2a2 Merge pull request #33094 from jszczepkowski/ha-firewall2
Automatic merge from submit-queue

HA master: add firewall rule for etcd cluster.
2016-10-03 03:17:20 -07:00
Michael Taufen edcf97db1d Change --bind to -B for consistency 2016-09-30 10:59:47 -07:00
Michael Taufen 73b64e6da5 Correct env var name in configure-helper
The variable should be KUBEPROXY_TEST_ARGS, not KUBE_PROXY_TEST_ARGS.
2016-09-30 10:56:08 -07:00
Jerzy Szczepkowski b8cbac6753 HA master: add firewall rule for etcd cluster.
HA master: add firewall rule for accessing & securing etcd cluster.
2016-09-30 14:37:08 +02:00
Kubernetes Submit Queue b71def750f Merge pull request #33774 from MrHohn/kubedns-graceful
Automatic merge from submit-queue

Bump up addon kube-dns to v20 for graceful termination

Below images are built and pushed:
- gcr.io/google_containers/kubedns-amd64:1.8
- gcr.io/google_containers/kubedns-arm:1.8
- gcr.io/google_containers/kubedns-arm64:1.8
- gcr.io/google_containers/kubedns-ppc64le:1.8

Both kubedns and dnsmasq are bumped up in the manifest files.

@thockin @bprashanth
2016-09-30 00:15:13 -07:00
Kubernetes Submit Queue f595eede2d Merge pull request #33802 from euank/the-number-of-the-spaces-shall-be-2
Automatic merge from submit-queue

cluster/gci: Minor spacing tweak

Two shall be the number thou shalt indent, and the level of the indent
shall be two. Three shalt thou not indent, neither indent thou once,
excepting that thou then proceed to two. Five is right out.

/cc @andyzheng0831 @jlowdermilk
2016-09-29 23:37:23 -07:00
MrHohn 3e55d4d6bd Bump up addon kube-dns to v20 for graceful termination 2016-09-29 18:09:20 -07:00
Euan Kemp 527932e6ad cluster/gci: Minor spacing tweak
Two shall be the number thou shalt indent, and the level of the indent
shall be two. Three shalt thou not indent, neither indent thou once,
excepting that thou then proceed to two. Five is right out.
2016-09-29 16:27:35 -07:00
Euan Kemp 22b6fd893a gce/coreos: Fix dnsmasq image name
This bug was inadvertently introduced in #32406.

The longer term plan (shouldn't be too much longer) is to remove this
file entirely and rely on the `gci-trusty` version of it, but to stop
some bleeding and allow our jenkins using kube-up + coreos to work, we
should merge this fix until we have the more complete solution.
2016-09-29 14:44:10 -07:00
Mik Vyatskov 8bc60a44e1 Switched default logging destination in tests to gcp 2016-09-29 11:15:30 +02:00
Kubernetes Submit Queue 9e4ba1866b Merge pull request #33146 from MrHohn/kubedns-readiness
Automatic merge from submit-queue

Tune down initialDelaySeconds for readinessProbe.

Fixed #33053.

Tuned down the `initialDelaySeconds`(original 30s) for readiness probe to 3 seconds and `periodSeconds`(default 10s) to 5 seconds to shorten the initial time before a dns server pod being exposed. This configuration passed DNS e2e tests and did not even hit any readiness failure(for kube-dns) with a GCE cluster with 4 nodes during the experiments.

For scaling out kube-dns servers, it took less than 10s for servers being exposed after they appeared as running, which is much faster than 30+s(the original cost).

`failureThreshold` is left as default(3) and it would not lead to restart because the status of readiness probe would only affect whether endpoints being exposed in service or not(in the dns service point of view). According to the implementation of [prober](https://github.com/kubernetes/kubernetes/blob/master/pkg/kubelet/prober/worker.go), the number of retries for readiness probe is unbounded. Hence there is no obvious effect if the readiness probe fail several times in the beginning. 

The state machine of prober could be illustrated with below figure:

![drawing](https://cloud.githubusercontent.com/assets/8681801/18693503/fb4466dc-7f56-11e6-8671-0a14c4835d24.jpeg)

I want to see the e2e result of this PR for further evaluation.

@thockin @bprashanth
2016-09-27 05:02:39 -07:00
Kubernetes Submit Queue 869af8f5a1 Merge pull request #33437 from justinsb/typo_incremeting
Automatic merge from submit-queue

Fix typo: incremeting -> incrementing
2016-09-26 22:30:22 -07:00
Vishnu kannan e7633d8328 nodefs becomes imagefs on GCI since kubelet cannot identify rootfs correctly
Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-09-26 15:27:49 -07:00
gmarek f7d0615e2b Wait until master IP is visible 2016-09-26 15:56:31 +02:00
Kubernetes Submit Queue 5fe2495588 Merge pull request #33122 from ixdy/upgrade-debugging
Automatic merge from submit-queue

Print a more helpful error message when failing to start rolling-updates

Hopefully this will help us track down where the 1.3 -> 1.4 upgrades are breaking down. We'll need to cherry-pick this into release-1.4 to have any effect, though.
2016-09-26 00:35:05 -07:00
MrHohn 55db76241c Tune down initialDelaySeconds for readinessProbe 2016-09-25 12:48:19 -07:00
Kubernetes Submit Queue b79c99da1b Merge pull request #32406 from MrHohn/kubedns-healthz
Automatic merge from submit-queue

Split dns healthcheck into two different urls

Attempt to fix #30633.

<s>This new kube-dns pod template creates two exechealthz processes listen on two different ports for kubedns and dnsmasq correspondingly. 

@thockin @girishkalele
2016-09-25 12:21:34 -07:00
Justin Santa Barbara a6dfaffe3f Fix typo: incremeting -> incrementing 2016-09-24 16:10:51 -04:00
Kubernetes Submit Queue 55830471ba Merge pull request #33353 from vishh/gci-default
Automatic merge from submit-queue

Switch k8s on GCE to use GCI by default
2016-09-23 17:25:35 -07:00
Kubernetes Submit Queue 33b5d9650a Merge pull request #33197 from adityakali/core
Automatic merge from submit-queue

Reset core_patern on GCI

The default core_pattern pipes the core dumps to /sbin/crash_reporter
which is more restrictive in saving crash dumps. So for
now, set a generic core_pattern that users can work with.

@dchen1107 @aulanov can you please review?

cc/ @kubernetes/goog-image
2016-09-23 03:50:15 -07:00
Vishnu kannan 504cf5ca3c mount kubelet root directory as executable in GCI
Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-09-22 22:01:59 -07:00
Vishnu kannan ef49584603 Switch k8s on GCE to use GCI by default
Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-09-22 21:11:13 -07:00
MrHohn d17cd1a514 Split dns healthcheck into two different urls 2016-09-22 18:02:30 -07:00
Kubernetes Submit Queue 03c698ce44 Merge pull request #33194 from dchen1107/master
Automatic merge from submit-queue

Update the containervm image to the latest one (container-v1-3-v20160…

Node e2e is running with old containervm image which only has docker 1.9.1. This pr fixed such issue.
2016-09-21 20:40:02 -07:00
Fabio Yeon 177fee1358 (GCI) Configure logrotate to rotate all .log files in /var/log. 2016-09-21 15:29:14 -07:00
Dawn Chen f1f16fe03a Update the containervm image to the latest one (container-v1-3-v20160604). 2016-09-21 10:24:22 -07:00
Aditya Kali d54db34172 Reset core_patern on GCI
The default core_pattern pipes the core dumps to /sbin/crash_reporter
which is more restrictive in saving crash dumps. So for
now, set a generic core_pattern that users can work with.
2016-09-21 10:08:23 -07:00
Kubernetes Submit Queue 01dd125b60 Merge pull request #33039 from colhom/fix-bad-var-name-gce
Automatic merge from submit-queue

gce/util: $replica-pd --> $replica_pd

\cc @quinton-hoole @madhusudancs 

fixes #32997
2016-09-20 22:22:16 -07:00
Vishnu kannan 8f7b274cf7 fix base image pinning during upgrades via cluster/gce/upgrade.sh
Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-09-20 18:31:18 -07:00
Kubernetes Submit Queue 6fd94968e1 Merge pull request #32738 from Amey-D/gci-version-v1.4
Automatic merge from submit-queue

Bump up GCI version.

```release-note
   Upgrading Container-VM base image for k8s on GCE. Brief changelog as follows:
    - Fixed performance regression in veth device driver
    - Docker and related binaries are statically linked
    - Fixed the issue of systemd being oom-killable
```

Fixes #32596

This needs a cherrypick into v1.4 release branch because it is fixing v1.4 release blocking issues. This patch is easy and safe to rollback in case of emergencies.

@vishh can you please review?

Fixes #32596 and many other issues.
cc/ @kubernetes/goog-image  FYI
2016-09-20 16:30:01 -07:00
Jeff Grafton 47e4573943 Print a more helpful error message when rolling-updates fail. 2016-09-20 15:31:57 -07:00
Amey Deshpande 5da8486758 Bump up GCI version.
Brief changelog compared to gci-dev-54-8743-3-0:
- Fixed performance regression in veth device driver
- Docker and related binaries are statically linked
- Fixed the issue of systemd being oom-killable
- Updated built-in kubelet version to 1.3.7
- add ethtool and ebtables binaries expected by kubelet

Fixes #32596
2016-09-20 13:59:31 -07:00
Colin Hom acd7f5045d gce/util: $replica-pd --> $replica_pd
fixes #32997
2016-09-19 12:00:08 -07:00
Kubernetes Submit Queue 87c2650038 Merge pull request #32873 from jszczepkowski/ha-delete-nodes2
Automatic merge from submit-queue

Implemented KUBE_DELETE_NODES flag in kube-down.

Implemented KUBE_DELETE_NODES flag in kube-down script.
It prevents removal of nodes when shutting down a HA master replica.
2016-09-19 01:08:18 -07:00
Kubernetes Submit Queue a5e35eb887 Merge pull request #32886 from freehan/bump-master-cidr
Automatic merge from submit-queue

bump master cidr range from /30 to /29

Fixes P1 item in the 1.4 milestone

ref: https://github.com/kubernetes/kubernetes/issues/32844
2016-09-17 11:27:46 -07:00
Michael Taufen 2a536bf6f5 Revert "Merge pull request #31023 from vishh/gci-default"
This reverts PR #31023, which had made GCI the default node image for
open source. This revert makes container-vm the default for open source again.
2016-09-16 15:16:53 -07:00
Minhan Xia 879a2dcdbd bump master cidr range from /30 to /29 2016-09-16 13:41:58 -07:00