Commit Graph

3938 Commits (fae7285b00a5f320b2e0fb993eee07be09c1c582)

Author SHA1 Message Date
k8s-merge-robot fae7285b00 Merge pull request #27707 from yifan-gu/stage1_name
Automatic merge from submit-queue

rkt: Map kubelet's `--stage1-image` flag to rkt's `--stage1-name` flag.

This enables rkt to use cached stage1 image instead of unpacking the stage1 image every time for every pod.

After this change, users need to preload the stage1 images in order to enable rkt to find the stage1 image with the name specified by this flag.

Also, the cloud config is modified to pre-load the stage1 images.

cc @kubernetes/sig-rktnetes @kubernetes/sig-node
2016-06-20 16:40:09 -07:00
k8s-merge-robot 5c841556b8 Merge pull request #27724 from wonderfly/fix_unbound_var
Automatic merge from submit-queue

Make GCI_DOCKER_VERSION optional

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()

fixes #26813

This env var is only set in `config-test.sh` and trying to evaluate it with
`nounset` set will end up with an "unbound variable" error.

@girishkalele @dchen1107 This is a XS fix. Can you review?
2016-06-20 15:26:13 -07:00
k8s-merge-robot a3f09700e5 Merge pull request #27409 from adityakali/logrotate.1
Automatic merge from submit-queue

add logrotate service and configuration for GCI

This change mirrors the configuration in cluster/saltbase/salt/logrotate for GCI.

On GCI we use systemd timers (https://www.freedesktop.org/software/systemd/man/systemd.timer.html) and install an hourly timer - kube-logrotate.timer. This will invoke kube-logrotate.service (which calls /usr/sbin/logrotate) once every hour to perform log rotation as per the rotation rules installed under /etc/logrotate.d/.

@kubernetes/goog-image @zmerlynn @dchen1107 @andyzheng0831
2016-06-20 13:40:57 -07:00
Daniel Wang 0acb2dc28a Make GCI_DOCKER_VERSION optional
fixes #26813

This env var is only set in `config-test.sh` and trying to evaluate it with
`nounset` set will end up with an "unbound variable" error.
2016-06-20 11:22:14 -07:00
Yifan Gu fc31c8d6f0 cluster/gce: Preload rkt stage1 images and set the default `--rkt-stage1-image`. 2016-06-20 10:39:46 -07:00
Yifan Gu 4e9298cd99 rkt: Map kubelet's `--stage1-image` flag to rkt's `--stage1-name` flag.
This enables rkt to use cached stage1 image instead of unpacking the
stage1 image every time for every pod.

After this change, users need to preload the stage1 images in order to
enable rkt to find the stage1 image with the name specified by this flag.
2016-06-20 10:39:46 -07:00
k8s-merge-robot 3b57f868d3 Merge pull request #27542 from piosz/heapster-version
Automatic merge from submit-queue

Bumped Heapster to v1.1.0

```release-note
Bumped Heapster to v1.1.0.
More details about the release https://github.com/kubernetes/heapster/releases/tag/v1.1.0
```

Version 1.1.0 doesn't differ from the previous one v1.1.0-beta2 significantly - just fixed few small issues during stabilization period.

cc @a-robinson @jszczepkowski @mwielgus 

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()
2016-06-20 04:40:27 -07:00
k8s-merge-robot b442ba72e5 Merge pull request #27522 from adityakali/gci53
Automatic merge from submit-queue

make GCI image detection robust

This change makes sure that in case we roll back a released GCI image, the image detection logic picks a correct active image. 

@kubernetes/goog-image @Amey-D @wonderfly @dchen1107
2016-06-20 04:05:45 -07:00
k8s-merge-robot 5debaf7b88 Merge pull request #27253 from luxas/latest_hyperkube_addons
Automatic merge from submit-queue

Update to dnsmasq:1.3 and make hyperkube always use the latest addons

This bumps dnsmasq to a version that works on all architectures: https://github.com/kubernetes/contrib/pull/1192 (which have to be pushed first indeed)

Also I removed the manifests in hyperkube addons in favor for machine-generated ones, which will avoid mistakes.

This one is required for `v1.3`, so it has to be cherrypicked I think...

It makes docker and docker-multinode addons work again...
(Yes, we'll probably get rid of docker in favor for minikube, but we'll have to have it in this release at least)

@girishkalele @thockin @ArtfulCoder @david-mcmahon @bgrant0607 @mikedanese
2016-06-19 22:19:09 -07:00
k8s-merge-robot 530001c1fd Merge pull request #26778 from gmarek/validate-cluster
Automatic merge from submit-queue

Validate-cluster finishes shortly after at most ALLOWED_NOTREADY_NODE…

Fixes #26691

cc @wojtek-t @lavalamp
2016-06-19 21:40:54 -07:00
k8s-merge-robot 2274359ec1 Merge pull request #27518 from justinsb/kubedown_warn_if_no_vpc
Automatic merge from submit-queue

AWS kube-down: Issue warning if VPC not found

To address issue #17219
2016-06-19 19:53:20 -07:00
k8s-merge-robot a025e20f93 Merge pull request #27302 from zmerlynn/faster-large-down
Automatic merge from submit-queue

gce/kube-down: Parallelize IGM deletion, batch more

Batch up to 200 instance / route deletions, handle IGM deletion in parallel.

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()
2016-06-19 18:04:13 -07:00
k8s-merge-robot 368704dc71 Merge pull request #27185 from gmarek/heapster-cpu
Automatic merge from submit-queue

Enable dynamic allocation of heapster/eventer cpu request/limit

cc @mwielgus @piosz @zmerlynn
2016-06-19 16:48:49 -07:00
Lucas Käldström 6c53c6a997 Make hyperkube work on all platforms, bump dnsmasq to 1.3 and addon-manager to v3, and always use the latest manifests 2016-06-19 17:15:17 +03:00
Justin Santa Barbara d62fe2a315 AWS kube-up: set net.ipv4.neigh.default.gc_thresh1=0
This works around a linux kernel bug with overly aggressive caching of
ARP entries, which was causing problems when we reused IP addresses in
VPCs, for example with an ASG in a relatively small subnet.

See #23395 for more explanation.

Fixes #23395
2016-06-19 00:20:16 -04:00
k8s-merge-robot 6bdb9b0b8d Merge pull request #27070 from girishkalele/exechealthz-quiet
Automatic merge from submit-queue

DNS healthz container - add quiet option to stop flooding logs
2016-06-18 17:54:57 -07:00
k8s-merge-robot 13cd6475c9 Merge pull request #26917 from fabioy/fix_multi_migs
Automatic merge from submit-queue

Handle multiple MIGs (single-zone) properly in GKE scripts.

Change the scripts for GKE to handle multiple node pools (MIGs) in the same zone. 

Fixes https://github.com/kubernetes/test-infra/issues/100.

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()
2016-06-18 11:36:08 -07:00
k8s-merge-robot 48b255d424 Merge pull request #26813 from wonderfly/choose_docker_version
Automatic merge from submit-queue

Prep for continuous Docker validation test

```release-note
Add a test config variable to specify desired Docker version to run on GCI.
```
We want to continuously validate Docker releases (#25215), on GCI. This change
adds a new test config variable, `KUBE_GCI_DOCKER_VERSION`, through which we can
specify which version of Docker we want to run on the master and nodes. This
change also patches the Jenkins e2e-runner with the ability to fetch the latest
Docker (pre)release, and sets the aforementioned variable accordingly.

Tested on my local Jenkins instance that was able to start a cluster with the latest Docker version (different from installed version) running on both master and nodes.

@dchen1107 Can you review?

cc/ @andyzheng0831 for changes in `cluster/gce/gci/helper.sh`, and @ixdy @spxtr for changes to the Jenkins e2e-runner

cc/ @kubernetes/goog-image
2016-06-18 11:00:55 -07:00
k8s-merge-robot db904257f9 Merge pull request #27599 from dchen1107/gci
Automatic merge from submit-queue

Fix docker log level on GCI node.

Fix #27584

cc/ @a-robinson
2016-06-17 20:21:05 -07:00
k8s-merge-robot 808f3ecbe6 Merge pull request #27220 from yifan-gu/kube-up-master
Automatic merge from submit-queue

cluster/gce/coreos: Make kube-up works for both rkt and docker on coreos on gce

With this PR, kube-up will be able to spin up a pure rkt cluster given the choice `KUBE_CONTAINER_RUNTIME=rkt`

e.g. 
```
export KUBE_GCE_ZONE=us-east1-b
export KUBE_OS_DISTRIBUTION=coreos

export KUBE_GCE_MASTER_PROJECT=coreos-cloud
export KUBE_GCE_MASTER_IMAGE=coreos-alpha-1032-0-0-v20160428

export KUBE_GCE_NODE_PROJECT=coreos-cloud
export KUBE_GCE_NODE_IMAGE=coreos-alpha-1032-0-0-v20160428


export KUBE_ENABLE_NODE_LOGGING=false
export KUBE_ENABLE_CLUSTER_MONITORING=none

export KUBE_CONTAINER_RUNTIME=rkt
export KUBE_RKT_VERSION=v1.8.0
```

```
$ cluster/kube-up.sh
...
$ kubectl cluster-info
Kubernetes master is running at https://104.196.41.124
GLBCDefaultBackend is running at https://104.196.41.124/api/v1/proxy/namespaces/kube-system/services/default-http-backend
Elasticsearch is running at https://104.196.41.124/api/v1/proxy/namespaces/kube-system/services/elasticsearch-logging
Kibana is running at https://104.196.41.124/api/v1/proxy/namespaces/kube-system/services/kibana-logging
KubeDNS is running at https://104.196.41.124/api/v1/proxy/namespaces/kube-system/services/kube-dns
kubernetes-dashboard is running at https://104.196.41.124/api/v1/proxy/namespaces/kube-system/services/kubernetes-dashboard
```

```
$ kubectl get pods --all-namespaces
NAMESPACE     NAME                                            READY     STATUS    RESTARTS   AGE
kube-system   elasticsearch-logging-v1-5zfrd                  1/1       Running   0          2m
kube-system   elasticsearch-logging-v1-83u6w                  1/1       Running   0          2m
kube-system   etcd-server-events-yifan-test-rkt-master        1/1       Running   0          2m
kube-system   etcd-server-yifan-test-rkt-master               1/1       Running   0          2m
kube-system   kibana-logging-v1-0g7yu                         1/1       Running   2          2m
kube-system   kube-addon-manager-yifan-test-rkt-master        1/1       Running   0          3m
kube-system   kube-apiserver-yifan-test-rkt-master            1/1       Running   0          2m
kube-system   kube-controller-manager-yifan-test-rkt-master   1/1       Running   0          3m
kube-system   kube-dns-v14-1mqco                              3/3       Running   0          2m
kube-system   kube-scheduler-yifan-test-rkt-master            1/1       Running   0          2m
kube-system   kubernetes-dashboard-v1.1.0-beta2-uwutn         1/1       Running   0          2m
kube-system   l7-lb-controller-v0.6.0-8pgbo                   2/2       Running   0          2m
kube-system   node-problem-detector-v0.1-7iwb2                1/1       Running   0          2m
kube-system   node-problem-detector-v0.1-k4m8o                1/1       Running   0          2m
kube-system   node-problem-detector-v0.1-rxtp8                1/1       Running   0          2m
kube-system   node-problem-detector-v0.1-wsoqd                1/1       Running   0          2m

```


Fix #24103 

cc @kubernetes/sig-node @kubernetes/rktnetes-maintainers
2016-06-17 19:39:22 -07:00
k8s-merge-robot 7ab303efbe Merge pull request #27466 from madhusudancs/fed-dns-scope
Automatic merge from submit-queue

Move GCE NODE_SCOPES to config-common.sh and enable DNS scope in GKE.

Follow up to #27357

cc @colhom @kubernetes/sig-cluster-federation 

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()
2016-06-17 17:02:11 -07:00
k8s-merge-robot 2266c4eb1d Merge pull request #27371 from andyzheng0831/network-plugin
Automatic merge from submit-queue

Revert "Revert "GCI: add support for network plugin""

PR #27027 added the network plugin support in GCI config, but later a bug in the network plugin broke e2e tests (see issue #27118). The bug was fixed by #27141 and we have been repeatedly run the serial e2e tests more than 10 times to verify the fix. Now it should be safe to put the GCI network plugin support back.

We will first merge in the master branch and monitor the Jenkins serial tests for a while and then cherry-pick it into release-1.3 branch.
2016-06-17 14:03:18 -07:00
k8s-merge-robot d4d6184931 Merge pull request #27627 from justinsb/fix_23390
Automatic merge from submit-queue

kube-up multizone: don't print scary warning

The node-count check gets confused when there are more nodes that we
launched, which is normal with KUBE_USE_EXISTING_MASTER.

This fix just suppresses the error message in that case.

Fix #23390
2016-06-17 11:24:29 -07:00
Justin Santa Barbara bab7a255b7 kube-up multizone: don't print scary warning
The node-count check gets confused when there are more nodes that we
launched, which is normal with KUBE_USE_EXISTING_MASTER.

This fix just suppresses the error message in that case.

Fix #23390
2016-06-17 13:00:50 -04:00
Aditya Kali 4a46c948f2 make GCI image detection robust 2016-06-17 09:56:53 -07:00
Marcin Wielgus 9031e4780c Bump cluster autoscaler to 0.2.0 2016-06-17 16:31:25 +02:00
Dawn Chen 58684a56be Fix docker log level on GCI node. 2016-06-16 20:54:42 -07:00
Yifan Gu d96c7edc34 cluster/saltbase: Enable rkt to load master components images. 2016-06-16 15:18:52 -07:00
Yifan Gu 16d69f9f37 cluster/gce/coreos: Update addon manifests, add node-problem-detector.
Also bumped CNI version.
2016-06-16 15:18:52 -07:00
Yifan Gu f3c5d4e62b cluster/gce/coreos: Update the cloud config and scripts to enable rkt runtime. 2016-06-16 15:18:52 -07:00
Piotr Szczesniak 8fff5319db Bumped Heapster to v1.1.0 2016-06-16 20:41:28 +02:00
Girish Kalele ee7ca66dba Add nethealth prepull container output to e2e run logs 2016-06-16 11:03:09 -07:00
Jerzy Szczepkowski 390a30b130 Updated version of resource consumer in image puller.
Updated version of resource consumer in image puller.
2016-06-16 10:57:21 +02:00
Justin Santa Barbara e58a16e95a AWS kube-down: Issue warning if VPC not found
Issue #17219
2016-06-15 23:53:57 -04:00
k8s-merge-robot 38cd0d078e Merge pull request #27311 from vmware/photon-controller-fixes
Automatic merge from submit-queue

Fixes and improvements to Photon Controller backend for kube-up

- Improve reliability of network address detection by using MAC
  address. VMware has a MAC OUI that reliably distinguishes the VM's
  NICs from the other NICs (like the CBR). This doesn't rely on the
  unreliable reporting of the portgroup.
- Persist route changes. We configure routes on the master and nodes,
  but previously we didn't persist them so they didn't last across
  reboots. This persists them in /etc/network/interfaces
- Fix regression that didn't configure auth for kube-apiserver with
  Photon Controller.
- Reliably run apt-get update: Not doing this can cause apt to fail.
- Remove unused nginx config in salt
2016-06-15 14:46:54 -07:00
Madhusudan.C.S 3f5fbee19d Move GCE NODE_SCOPES to config-common.sh and enable DNS scope in GKE. 2016-06-15 11:36:07 -07:00
gmarek 8617f70add Adjust heapster cpu request/limit 2016-06-15 13:34:58 +02:00
k8s-merge-robot 3c822c0b3c Merge pull request #27357 from madhusudancs/fed-dns-scope
Automatic merge from submit-queue

Add Google Cloud DNS auth scope to GCE VMs when they are started in a federation.

cc @colhom @kubernetes/sig-cluster-federation 

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()
2016-06-15 00:00:54 -07:00
Aditya Kali a423b83c06 add logrotate service and configuration for GCI
This change mirrors the configuration in cluster/saltbase/salt/logrotate
for GCI.
2016-06-14 19:01:58 -07:00
Alain Roy 5740ceb7f6 Fixes and improvements to Photon Controller backend for kube-up
- Improve reliability of network address detection by using MAC
  address. VMware has a MAC OUI that reliably distinguishes the VM's
  NICs from the other NICs (like the CBR). This doesn't rely on the
  unreliable reporting of the portgroup.
- Persist route changes. We configure routes on the master and nodes,
  but previously we didn't persist them so they didn't last across
  reboots. This persists them in /etc/network/interfaces
- Fix regression that didn't configure auth for kube-apiserver with
  Photon Controller.
- Reliably run apt-get update: Not doing this can cause apt to fail.
- Remove unused nginx config in salt
2016-06-14 15:40:33 -07:00
Zach Loafman 1fde6f867c GKE provider: Add a HEAPSTER_MACHINE_TYPE option
Allows heterogeneous cluster configurations for large cluster
synthetic testing.
2016-06-14 14:39:08 -07:00
Andy Zheng e6b744c85a Revert "Revert "GCI: add support for network plugin""
This reverts commit 8207eddd99.
2016-06-14 09:52:34 -07:00
Madhusudan.C.S fac6f1c379 Add Google Cloud DNS auth scope to GCE VMs when they are started in a federation. 2016-06-14 05:26:56 -07:00
k8s-merge-robot 1aece22176 Merge pull request #27187 from gmarek/heapster-nanny
Automatic merge from submit-queue

Adjust memory limit for heapster's pod nanny

cc @piosz @wojtek-t @zmerlynn
2016-06-14 04:56:46 -07:00
Wojciech Tyczynski ff1258a29e Merge pull request #27268 from wojtek-t/bump_nanny_version
Adjust Heapster requirements and bump nanny version
2016-06-14 10:50:25 +02:00
Andy Zheng 1c490c24e4 gci: fix multizone setting in issue #27150 2016-06-13 16:33:12 -07:00
Zach Loafman 2d77dc639b gce/kube-down: Parallelize IGM deletion, batch more
Batch up to 200 instance / route deletions, handle
IGM deletion in parallel.
2016-06-13 14:05:43 -07:00
k8s-merge-robot 94f8c9fbc5 Merge pull request #27292 from mikedanese/dep-notice
Automatic merge from submit-queue

add maintenance notice to cluster/README

cc @bgrant0607 @davidopp @justinsb @smarterclayton @derekwaynecarr
2016-06-13 13:54:50 -07:00
k8s-merge-robot 45bb6824ff Merge pull request #27210 from adityakali/gci53
Automatic merge from submit-queue

version bump for gci to milestone 53

Fixes #26455

GCI release 53 includes kubernetes v1.3.0-alpha.5 with docker-1.11.2.

@dchen1107 @kubernetes/goog-image @andyzheng0831
2016-06-13 13:54:46 -07:00
Mike Danese bf72d335cb add maintenance notice to cluster/README 2016-06-13 11:47:30 -07:00