Commit Graph

6246 Commits (7e5d2df2995ef402a2e4ee39524c761904bf7689)

Author SHA1 Message Date
Mik Vyatskov 46f53daef0 Refactored the fluentd-es addon files, moved the fluentd configuration to ConfigMap 2017-08-04 20:05:15 +02:00
Kubernetes Submit Queue 4a8d041c4c Merge pull request #50088 from wojtek-t/ha_etcd_migration
Automatic merge from submit-queue

Fix etcd migration for HA clusters

@jpbetz 

Fix #43600
2017-08-04 09:44:14 -07:00
Matthew Wong 55a417e84f Add basic install and mount flexvolumes e2e tests 2017-08-04 10:39:24 -04:00
Kubernetes Submit Queue 68a722155b Merge pull request #50096 from nicksardo/bump-glbc-0.9.6
Automatic merge from submit-queue (batch tested with PRs 46685, 49863, 50098, 50070, 50096)

GCE: Bump GLBC version to 0.9.6

Closes #50095

**Release note**:
```release-note
GCE: Bump GLBC version to 0.9.6
```
2017-08-04 00:21:00 -07:00
Kubernetes Submit Queue d15baf69e1 Merge pull request #48969 from ixdy/update-kazel
Automatic merge from submit-queue (batch tested with PRs 50103, 49677, 49449, 43586, 48969)

Run kazel on the entire tree

**What this PR does / why we need it**: part of #47558: auto-generate `BUILD` files on the entire tree, since this is what `gazelle` does, and it'll make subsequent reviews easier if less is changing.

**Release note**:

```release-note
NONE
```
/assign
/release-note-none
2017-08-03 16:43:41 -07:00
Bowei Du 2f8dbae9be Update OWNERS files for networking components
This will reduce the approval load for the top level tree owners
2017-08-03 11:08:54 -07:00
Nick Sardo 97114d7343 Bump GLBC version to 0.9.6 2017-08-03 08:39:12 -07:00
Wojciech Tyczynski b559a95eb2 Remove v2 data when upgrading to 3.1.* version 2017-08-03 14:49:00 +02:00
Wojciech Tyczynski e050e7a867 Fix etcd migration for HA clusters 2017-08-03 14:43:52 +02:00
Daniel Kłobuszewski b48b1b80bc Enable overriding fluentd resources in GCP 2017-08-03 11:58:43 +02:00
Kubernetes Submit Queue ae0ca36695 Merge pull request #48722 from aknuds1/upgrade-fluentd-elasticsearch
Automatic merge from submit-queue (batch tested with PRs 48365, 49902, 49808, 48722, 47045)

Upgrade fluentd-elasticsearch addon to Elasticsearch/Kibana 5.5

This is a patch to upgrade the fluentd-elasticsearch addon to Elasticsearch/Kibana 5.5. Please provide feedback!

```release-notes
* Upgrade Elasticsearch/Kibana to 5.5.1 in fluentd-elasticsearch addon
* Switch to basing our image of Elasticsearch in fluentd-elasticsearch addon off the official one
* Switch to the official image of Kibana in fluentd-elasticsearch addon
* Use StatefulSet for Elasticsearch instead of ReplicationController, with persistent volume claims
* Require authenticating towards Elasticsearch, as Elasticsearch 5.5 by default requires basic authentication
```
2017-08-02 22:32:59 -07:00
Kubernetes Submit Queue efe3951ac0 Merge pull request #48365 from ixdy/hyperkube-base-image
Automatic merge from submit-queue (batch tested with PRs 48365, 49902, 49808, 48722, 47045)

Rebase hyperkube image on debian-hyperkube-base, based on debian-base.

**What this PR does / why we need it**: saves all of the hyperkube image dependencies in a cacheable base image, rather than downloading them for every build (which is slow and flaky).

This way, at build time, we only need to pull down the hyperkube base image and add the hyperkube binary.

I've additionally based the base image on `debian-base` instead of `debian`, though we amusing end up reinstalling a bunch of the things we removed in `debian-base`.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #35058, at least partially

**Special notes for your reviewer**: I'm increasingly convinced that the hyperkube image is a bad pattern, as this image carries the superset of dependencies anyone might need, rather than the limited set of dependencies one needs. hyperkube really needs a proper owner.

**Release note**:

```release-note
```

/assign @timstclair @luxas @philips @nikhiljindal 
cc @kubernetes/sig-release-pr-reviews
2017-08-02 22:32:49 -07:00
Jeff Grafton 3afeb7cd3c Fix BUILD files 2017-08-02 18:33:32 -07:00
Jeff Grafton 3579017b86 Run hack/update-bazel.sh to generate BUILD files 2017-08-02 18:33:25 -07:00
Kubernetes Submit Queue 968ebc6603 Merge pull request #49649 from shiliangxue/master
Automatic merge from submit-queue (batch tested with PRs 49989, 49806, 49649, 49412, 49512)

Use existing k8s binaries and images on disk when they are preloaded to gce cos image.

**What this PR does / why we need it**:
This change is to accelerate K8S startup time on gce when k8s tarballs and images are already preloaded in VM image, by skipping the downloading, extracting and file transfer steps.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
NONE
```
2017-08-02 17:06:00 -07:00
Kubernetes Submit Queue 4c9e880627 Merge pull request #49866 from mmaquevice/patch-1
Automatic merge from submit-queue (batch tested with PRs 50029, 48517, 49739, 49866, 49782)

fix spelling

**What this PR does / why we need it**:

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
```
2017-08-02 12:46:55 -07:00
Arve Knudsen 0ed0f02c76 Upgrade Elasticsearch/Kibana to 5.5.1 and use official Kibana image 2017-08-02 19:40:19 +02:00
Kubernetes Submit Queue a2f2a220c5 Merge pull request #49953 from gmarek/logrotate
Automatic merge from submit-queue

Allow configuration of logrorate in GCE

Fix #49941
2017-08-02 07:53:15 -07:00
Kubernetes Submit Queue e9617b694e Merge pull request #48429 from shyamjvs/parallel-upgrade
Automatic merge from submit-queue

Add parallelism to GCE cluster upgrade

Fixes https://github.com/kubernetes/kubernetes/issues/48373
Should allow upgrading 500-node cluster (1.6->1.7) in < 1 hr. It currently takes ~1.5 day.
Though it is the duty of the upgrader to choose the right parallelism in order to avoid disrupting too many pods.

/cc @kubernetes/sig-cluster-lifecycle-pr-reviews @kubernetes/sig-scalability-misc @mikedanese @gmarek
2017-08-02 01:26:57 -07:00
Kubernetes Submit Queue 0a5ac4e787 Merge pull request #49940 from shyamjvs/logdump-rocks
Automatic merge from submit-queue

Don't stop log-dumping if logexporter fails

Just noticed.

Ref https://github.com/kubernetes/kubernetes/issues/48513
2017-08-02 00:32:28 -07:00
Kubernetes Submit Queue a422196142 Merge pull request #49888 from MrHohn/addon-manager-remove-old-codes
Automatic merge from submit-queue

[addon-manager] Remove unneeded annotation codes

**What this PR does / why we need it**:
Clean up addon-manager codes to make it less confusing. The annotation logics is only needed for 1.4->1.5 upgrade.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
NONE
```
2017-08-01 20:48:45 -07:00
Kubernetes Submit Queue 49955b1594 Merge pull request #49897 from mikedanese/atomic-append
Automatic merge from submit-queue (batch tested with PRs 49898, 49897, 49919, 48860, 49491)

gce: make append_or_replace.. atomic

Before this change,
* the final echo is not atomically written to the target file
* two concurrent callers will use the same tempfile

Helps with https://github.com/kubernetes/kubernetes/issues/49895

cc @miekg
2017-08-01 17:57:14 -07:00
Kubernetes Submit Queue d79c8fe4ef Merge pull request #49898 from mikedanese/clober-tokens
Automatic merge from submit-queue (batch tested with PRs 49898, 49897, 49919, 48860, 49491)

gce: extend CLOBBER_CONFIG to support known_tokens.csv

Helps with #49895
2017-08-01 17:57:12 -07:00
Shyam Jeedigunta 73b419447f Don't stop log-dumping if logexporter fails 2017-08-01 17:39:50 +02:00
gmarek 54f43f5f56 Allow configuration of logrorate in GCE 2017-08-01 17:07:44 +02:00
shiliang f561a299ac refactor function is-preloaded in configure.sh 2017-07-31 15:57:02 -07:00
Mike Danese a120906756 gce: extend CLOBBER_CONFIG to support known_tokens.csv 2017-07-31 13:39:18 -07:00
Kubernetes Submit Queue aba3339f85 Merge pull request #49823 from shyamjvs/optimize-logdump
Automatic merge from submit-queue

Reduce kubectl calls from O(#nodes) to O(1) in cluster logdump

Ref https://github.com/kubernetes/kubernetes/issues/48513

Each node's logexporter is made to write a file to a GCS directory on success (https://github.com/kubernetes/test-infra/pull/3782).
We now use that directory as a registry of successful nodes and get it through a single "gsutil ls" call. This:
- reduces the current waiting time for logexporter in 5k-node cluster from >1hr to <10s.
- eliminates dependency on `kubectl logs` calls which seem to be unreliable sometimes (e.g when kubelet (or apiserver) is down)

/cc @kubernetes/sig-scalability-misc @wojtek-t @gmarek @fejta
2017-07-31 13:30:37 -07:00
Mike Danese 1c2feb02fa gce: make append_or_replace.. atomic
Before this change,
* the final echo is not atomically written to the target file
* two concurrent callers will use the same tempfile
2017-07-31 13:14:51 -07:00
Zihong Zheng 1ef7854c14 [addon-manager] Remove unneeded annotation codes 2017-07-31 11:19:23 -07:00
Shyam Jeedigunta 5b5d3b9410 Add parallelism to GCE cluster upgrade 2017-07-31 13:39:01 +02:00
Shyam Jeedigunta 80084f0621 Reduce kubectl calls from O(#nodes) to O(1) in cluster logdump 2017-07-31 13:20:53 +02:00
Matthieu Maquevice 93acc4c276 fix spelling 2017-07-31 10:59:29 +02:00
zouyee a57e3eab1f update dashboard image version 2017-07-31 11:08:08 +08:00
Kubernetes Submit Queue ba8b26d47a Merge pull request #49708 from mborsz/regional
Automatic merge from submit-queue (batch tested with PRs 49538, 49708, 47665, 49750, 49528)

Add a support for GKE regional clusters in e2e tests.

**What this PR does / why we need it**:
Add a support for GKE regional clusters in e2e tests.
**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:
```release-note
NONE
```
2017-07-29 05:28:51 -07:00
Shyam Jeedigunta 6c8ae3d5d7 Fix bug and add log statements to log-dump script 2017-07-28 22:33:30 +02:00
Kubernetes Submit Queue a9a9241db2 Merge pull request #49717 from wwwtyro/rye/block-on-failed-services
Automatic merge from submit-queue (batch tested with PRs 49712, 49694, 49714, 49670, 49717)

set juju master charm state to blocked if the services appear to be failing

**What this PR does / why we need it**: set the juju master charm state to blocked if the services appear to be failing

**Release note**:
```release-note
set the juju master charm state to blocked if the services appear to be failing
```
2017-07-28 08:15:09 -07:00
Kubernetes Submit Queue a2a6fe8217 Merge pull request #49714 from juju-solutions/bug/add-maintainers
Automatic merge from submit-queue (batch tested with PRs 49712, 49694, 49714, 49670, 49717)

Adding old Juju charm maintainers

**What this PR does / why we need it**: Update email addresses of past Juju charm maintainers 

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: 

**Special notes for your reviewer**:

**Release note**:

```NONE
```
2017-07-28 08:15:04 -07:00
Kubernetes Submit Queue a2d2ddf719 Merge pull request #48533 from superbrothers/fix-dead-link
Automatic merge from submit-queue (batch tested with PRs 47738, 49196, 48907, 48533, 48822)

Fix a dead link in cluster/update-storage-objects.sh

**What this PR does / why we need it**: This PR fixes a dead link in cluster/update-storage-objects.sh.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
NONE
```
2017-07-28 03:10:35 -07:00
Maciej Borsz 7822b5d9fd Add a support for GKE regional clusters in e2e tests. 2017-07-28 11:42:54 +02:00
Bobby (Babak) Salamat de8689c99b Add priority admission controller 2017-07-27 18:44:40 -07:00
Kubernetes Submit Queue e4551d50e5 Merge pull request #49665 from shyamjvs/logdump-fix
Automatic merge from submit-queue

Fix log-dump script wrt logexporter
2017-07-27 18:02:18 -07:00
Rye Terrell 809a0abb31 set k8s master charm state to blocked if the services appear to be failing 2017-07-27 08:13:15 -05:00
Konstantinos Tsakalozos 86a036feb1 Adding old juju maintainers 2017-07-27 15:42:36 +03:00
Shyam Jeedigunta 3ffc9ca34b Fix log-dump script wrt logexporter 2017-07-27 00:09:59 +02:00
Casey Davenport a0bf3f7f90 Add OWNERS file for Calico add-on 2017-07-26 14:05:56 -07:00
Kubernetes Submit Queue 2c624e590f Merge pull request #49558 from juju-solutions/bug/maintainers
Automatic merge from submit-queue (batch tested with PRs 48360, 48469, 49576, 49516, 49558)

Update maintainers for Juju charm layers

**What this PR does / why we need it**: Update maintainers of harm layers to reflect ... reality

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: 

**Special notes for your reviewer**:

**Release note**:

```NONE
```
2017-07-26 13:02:46 -07:00
Kubernetes Submit Queue 4a65675840 Merge pull request #48469 from fasaxc/rev-typha
Automatic merge from submit-queue (batch tested with PRs 48360, 48469, 49576, 49516, 49558)

Rev Calico's Typha daemon to v0.2.3 in add-on deployment.

**What this PR does / why we need it**:

This PR revs the version of Calico's Typha daemon used in the calico-policy-controller add-on to the latest bug-fix release, which incorporates a [critical bug fix](https://github.com/projectcalico/typha/issues/28).

**Which issue this PR fixes**

fixes #49473

**Release note**:

```release-note
Rev version of Calico's Typha daemon used in add-on to v0.2.3 to pull in bug-fixes.
```
2017-07-26 13:02:37 -07:00
Kubernetes Submit Queue 762ef32acb Merge pull request #48360 from gunjan5/remove-noSnat
Automatic merge from submit-queue

Set snat to false

**What this PR does / why we need it**:
- the [version](e8bea554c5) of the portmap plugin included with calico CNI version `v1.9.1` doesn't have `noSnat` config option, it has `snat` which is not specified (which is the case without this PR), [will be set to true by default](https://github.com/containernetworking/plugins/tree/master/plugins/meta/portmap#usage) , so we need to explicitly set it to `false` 

CC @caseydavenport
2017-07-26 12:53:06 -07:00
shiliang 994275aa55 skip downloading and extracting tarballs and docker images when they are preloaded. 2017-07-25 15:40:03 -07:00
Jeff Grafton 66b9ae7cd8 Rebase hyperkube image on debian-hyperkube-base, based on debian-base. 2017-07-25 15:03:38 -07:00
Kubernetes Submit Queue 0d6d025c45 Merge pull request #45040 from aaronlevy/cephcommon
Automatic merge from submit-queue (batch tested with PRs 45040, 48960)

Add ceph-common to hyperkube image

**What this PR does / why we need it**:

Adds the ceph-common package to the hyperkube image
2017-07-25 14:40:41 -07:00
Kubernetes Submit Queue d197ff0595 Merge pull request #49426 from shyamjvs/logdump-fix
Automatic merge from submit-queue (batch tested with PRs 48976, 49474, 40050, 49426, 49430)

Fix bug in cluster/log-dump

We're breaking in case KUBECTL is set as "./cluster/kubectl.sh --match-server-version". Moreover we always are using cluster/kubectl.sh as the default and don't want to do match-server-version for the purpose of logexporter.

Also adding owners file so I'm not blocked for approves while making fixes in log-dump. Besides I'll be able to review fixes sent by others.

/cc @wojtek-t
2017-07-25 12:14:46 -07:00
Kubernetes Submit Queue 2189314895 Merge pull request #40050 from mtaufen/standalone-mode
Automatic merge from submit-queue (batch tested with PRs 48976, 49474, 40050, 49426, 49430)

Use presence of kubeconfig file to toggle standalone mode

Fixes #40049 

```release-note
The deprecated --api-servers flag has been removed. Use --kubeconfig to provide API server connection information instead. The --require-kubeconfig flag is now deprecated. The default kubeconfig path is also deprecated. Both --require-kubeconfig and the default kubeconfig path will be removed in Kubernetes v1.10.0.
```

/cc @kubernetes/sig-cluster-lifecycle-misc @kubernetes/sig-node-misc
2017-07-25 12:14:43 -07:00
Shyam Jeedigunta bf33e2ac7e Fix bug in cluster/log-dump and add OWNERS file 2017-07-25 16:11:24 +02:00
Konstantinos Tsakalozos 68b42c3b88 Update maintainers for Juju charm layers 2017-07-25 13:09:09 +03:00
Kubernetes Submit Queue 7f1d9382ec Merge pull request #48846 from dashpole/remove_ood
Automatic merge from submit-queue

Remove flags low-diskspace-threshold-mb and outofdisk-transition-frequency

issue: #48843

This removes two flags replaced by the eviction manager.  These have been depreciated for two releases, which I believe correctly follows the kubernetes depreciation guidelines.

```release-note
Remove depreciated flags: --low-diskspace-threshold-mb and --outofdisk-transition-frequency, which are replaced by --eviction-hard
```

cc @mtaufen since I am changing kubelet flags
cc @vishh @derekwaynecarr 
/sig node
2017-07-24 23:05:50 -07:00
Michael Taufen 38aee0464d Providing kubeconfig file is now the switch for standalone mode
Replaces use of --api-servers with --kubeconfig in Kubelet args across
the turnup scripts. In many cases this involves generating a kubeconfig
file for the Kubelet and placing it in the correct location on the node.
2017-07-24 11:03:00 -07:00
Wojciech Tyczynski 5740c7afbf Fix ha_master tests 2017-07-24 12:10:22 +02:00
Kubernetes Submit Queue 9712a80505 Merge pull request #49394 from wwwtyro/rye/rbd-stats-improvement
Automatic merge from submit-queue (batch tested with PRs 49326, 49394, 49346, 49379, 49399)

more robust stat handling from ceph df output in the kubernetes-master charm create-rbd-pv action

**What this PR does / why we need it**: more robust stat handling from ceph df output in the kubernetes-master charm create-rbd-pv action

**Release note**:
```release-note
more robust stat handling from ceph df output in the kubernetes-master charm create-rbd-pv action
```
2017-07-21 23:23:32 -07:00
Kubernetes Submit Queue b9e549e7cf Merge pull request #49371 from juju-solutions/bug/pv-metric
Automatic merge from submit-queue (batch tested with PRs 49420, 49296, 49299, 49371, 46514)

Fix: PV metric is not namespaced

**What this PR does / why we need it**: The PV metric of juju deployments is not namespaced. This PR fixes this bug. 

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/348

**Special notes for your reviewer**:

**Release note**:

```NONE
```
2017-07-21 22:23:21 -07:00
Kubernetes Submit Queue 0af4306695 Merge pull request #49296 from juju-solutions/feature/report-failing-services
Automatic merge from submit-queue (batch tested with PRs 49420, 49296, 49299, 49371, 46514)

Update status to show failing services.

**What this PR does / why we need it**: Report on charm status any services that are not running.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/341

**Special notes for your reviewer**:

**Release note**:

```Report failing services in Juju deployed clusters.
```
2017-07-21 22:23:17 -07:00
Rye Terrell 1c21e8ab31 more robust stat handling from ceph df output 2017-07-21 11:46:32 -05:00
Kubernetes Submit Queue ab40f526fd Merge pull request #49344 from shyamjvs/master-disk-size
Automatic merge from submit-queue

Auto-calculate master disk and root disk sizes in GCE

@gmarek PR https://github.com/kubernetes/kubernetes/pull/49282 didn't fix the issue because MASTER_DISK_SIZE was defaulting to 20GB in config-test.sh before being calculated inside get-master-disk-size() where you use pre-existing value if any.

It should be fixed by this now.
2017-07-21 06:24:39 -07:00
Konstantinos Tsakalozos 0faf5bf4b0 Fix: PV metric is not namespaced 2017-07-21 13:55:17 +03:00
Konstantinos Tsakalozos 53f00722e2 Do not try to restart services 2017-07-21 12:39:42 +03:00
Kubernetes Submit Queue ef98b80bd0 Merge pull request #49172 from ianchakeres/gce-local-ssd-fix
Automatic merge from submit-queue (batch tested with PRs 48565, 49172)

On GCE check whether NODE_LOCAL_SSDS=0 and handle this case appropriately

**What this PR does / why we need it**: Presently if you are using a mac and GCE and specify NODE_LOCAL_SSDS=0, or use the default, you end up with 2 local SSDs.

**Which issue this PR fixes** : fixes https://github.com/kubernetes/kubernetes/issues/49171

**Special notes for your reviewer**:

I've discovered that this issue is due to b353792f9c/cluster/gce/util.sh (L579)

If NODE_LOCAL_SSDS=0, this evaluates to $(seq 0)

```
$ for i in $(seq 0); do echo $i; done
1
0
```

From man seq on mac osx

```
The seq utility prints a sequence of numbers, one per line (default), from first (default 1), 
to near last as possible, in increments of incr (default 1).When first is larger than last the 
default incr is -1.
```

This was run on mac with the seq manpage indicating it comes from BSD Feb 19 2010.

**Release note**:

```release-note
NONE
```
2017-07-21 01:13:19 -07:00
Kubernetes Submit Queue 80582826aa Merge pull request #49337 from jingxu97/July/mounterfix
Automatic merge from submit-queue (batch tested with PRs 49222, 49333, 48708, 49337)

Fix issue in installing containerized mounter

Fix PR #49335

PR #49157 causes failure when installing containerized mounter. This
PR is a fix for it
2017-07-20 23:01:23 -07:00
Kubernetes Submit Queue d2c5dd64ee Merge pull request #49222 from yujuhong/fix-l7-backend
Automatic merge from submit-queue (batch tested with PRs 49222, 49333, 48708, 49337)

glbc: change the label of the l7-lb-controller pod

This ensures that the default http backend service doesn't include this
pod as its endpoint. This fixes #49159
2017-07-20 23:01:16 -07:00
Kubernetes Submit Queue 0e1744bb92 Merge pull request #49278 from juju-solutions/bug/master-msg
Automatic merge from submit-queue (batch tested with PRs 49330, 49252, 49262, 49278, 49334)

Simplify master-worker relation missing message

**What this PR does / why we need it**: Simplify messaging of missing relation in Juju deployments

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/309

**Special notes for your reviewer**:

**Release note**:

```NONE
```
2017-07-20 21:59:34 -07:00
Shyam Jeedigunta 94b3cc21f9 Auto-calculate master disk and root disk sizes in GCE 2017-07-21 03:35:37 +02:00
Jing Xu 04e5243db4 Fix issue in installing containerized mounter
PR #49157 causes failure when installing containerized mounter. THis
PR is a fix for it
2017-07-20 16:30:55 -07:00
Kubernetes Submit Queue 32bd2abd44 Merge pull request #49316 from ajitak/npd-port
Automatic merge from submit-queue

Use custom port for node-problem-detector

It fixes https://github.com/kubernetes/kubernetes/issues/49263

```release-note
Use port 20256 for node-problem-detector in standalone mode.
```
2017-07-20 16:28:57 -07:00
David Ashpole 7a23f8b018 remove deprecated flags LowDiskSpaceThresholdMB and OutOfDiskTransitionFrequency 2017-07-20 13:23:13 -07:00
Kubernetes Submit Queue d9bf88ceeb Merge pull request #49229 from shyamjvs/remove-old-logdump
Automatic merge from submit-queue

Remove old log-dump.sh script

Ref https://github.com/kubernetes/kubernetes/issues/48513

Since we moved to new kubetest image which no longer needs the old one - https://github.com/kubernetes/test-infra/pull/3585

/cc @wojtek-t @fejta
2017-07-20 12:55:44 -07:00
Ajit Kumar f45cc00440 Use custom port for node-problem-detector 2017-07-20 11:25:44 -07:00
Kubernetes Submit Queue 088141ca3a Merge pull request #49298 from shyamjvs/fix-log-dump
Automatic merge from submit-queue

Fix bug with sed in log-dump script

/cc @wojtek-t
2017-07-20 09:54:20 -07:00
Kubernetes Submit Queue c2dd291ecb Merge pull request #49265 from jianglingxia/jlx72014
Automatic merge from submit-queue

fix the typo of Kubernetes Worker

**What this PR does / why we need it**:
fix the typo of Kubernetes Worker that Kubernetes spell error
**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
```NONE
2017-07-20 09:08:01 -07:00
Kubernetes Submit Queue c35bf2aa7b Merge pull request #49282 from gmarek/disk_size
Automatic merge from submit-queue

Fix master disk size variable usage
2017-07-20 06:04:01 -07:00
Shyam Jeedigunta ba7167963d Fix bug with sed in log-dump script 2017-07-20 14:34:18 +02:00
gmarek 83d3018085 Fix master disk size variable usage 2017-07-20 14:15:06 +02:00
Konstantinos Tsakalozos 685dff99ab Update status to show failing services. 2017-07-20 14:57:02 +03:00
Kubernetes Submit Queue 35d6cf086c Merge pull request #49212 from kawych/master
Automatic merge from submit-queue

Bump rescheduler version to v0.3.1

**What this PR does / why we need it**:
Bump Rescheduler version to v0.3.1 to log to STDERR.

**Which issue this PR fixes**
Fixes https://github.com/kubernetes/contrib/issues/2518

**Release note**:
```release-note
NONE
```
2017-07-20 02:15:23 -07:00
Konstantinos Tsakalozos d5a71eb2d1 Simplify master-worker relation missing message 2017-07-20 11:47:50 +03:00
jianglingxia 5de3464755 fix the typo of Kubernetes Worker 2017-07-20 14:08:12 +08:00
Kubernetes Submit Queue 2eb6b95a5d Merge pull request #49148 from mikedanese/deprecate-cluster
Automatic merge from submit-queue (batch tested with PRs 48377, 48940, 49144, 49062, 49148)

add some more deprecation warnings to cluster

Part of https://github.com/kubernetes/kubernetes/issues/49213
@kubernetes/sig-cluster-lifecycle-misc
2017-07-19 19:10:18 -07:00
Ian Chakeres a18a18367f Check whether NODE_LOCAL_SSDS=0 and handle this case appropriately.
Some versions of seq will count down from 1 if "seq 0" is specified
2017-07-19 17:06:21 -07:00
Kubernetes Submit Queue c5b47482ae Merge pull request #49121 from mikedanese/remove-kubelet-bearer-token
Automatic merge from submit-queue (batch tested with PRs 47509, 46821, 45319, 49121, 49125)

gce: don't add kubelet bearer token to known tokens
2017-07-19 16:16:31 -07:00
Kubernetes Submit Queue 042913d898 Merge pull request #47509 from billy2180/bump-mounttest-image-version-to-0.8
Automatic merge from submit-queue

Bump e2e mounttest image version to 0.8

Reduce the number of image files required for e2e test run
```release-note
NONE
```
2017-07-19 15:53:31 -07:00
Yu-Ju Hong 780af313d6 glbc: change the label of the l7-lb-controller pod
This ensures that the default http backend service doesn't include this
pod as its endpoint.
2017-07-19 15:15:54 -07:00
Shyam Jeedigunta d2bf533938 Remove deprecated cluster/log-dump.sh 2017-07-19 21:50:09 +02:00
Kubernetes Submit Queue 7dc0322b0c Merge pull request #49182 from juju-solutions/feature/increase-cidr
Automatic merge from submit-queue (batch tested with PRs 49058, 49072, 49137, 49182, 49045)

Set default CIDR to /16 for Juju deployments

**What this PR does / why we need it**: Increase the number of IPs on a deployment

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/272

**Special notes for your reviewer**:

**Release note**:

```Set default CIDR to /16 for Juju deployments
```
2017-07-19 10:27:27 -07:00
Kubernetes Submit Queue 6af05149aa Merge pull request #49058 from shyamjvs/logexporter-support
Automatic merge from submit-queue

Pass logexporter config through e2e framework

Ref https://github.com/kubernetes/kubernetes/issues/48513

/cc @wojtek-t @fejta
2017-07-19 09:57:47 -07:00
Mike Danese 89bf771e51 add some more deprecation warnings to cluster 2017-07-19 09:43:05 -07:00
Karol Wychowaniec 3a5f4c2a78 Bump rescheduler version to v0.3.1 2017-07-19 17:51:20 +02:00
Shaun Crampton d2882a8b89 Rev Calico's Typha daemon to v0.2.3 in add-on deployment. 2017-07-19 10:22:51 +01:00
Kubernetes Submit Queue c3f4e7e59a Merge pull request #49165 from mikedanese/cleanup2
Automatic merge from submit-queue (batch tested with PRs 49120, 46755, 49157, 49165, 48950)

gce: don't print every file in mounter to stdout

This is printing ~3000 lines.
2017-07-19 00:06:27 -07:00
Kubernetes Submit Queue 5e50097c57 Merge pull request #49157 from mikedanese/cleanup1
Automatic merge from submit-queue (batch tested with PRs 49120, 46755, 49157, 49165, 48950)

gce: make some global variables local

/approve no-issue
2017-07-19 00:06:25 -07:00
Konstantinos Tsakalozos 280ea7f485 Set default CIDR to /16 2017-07-19 09:34:23 +03:00
Kubernetes Submit Queue 384c1d285d Merge pull request #48914 from MrHohn/gke-kube-down-log
Automatic merge from submit-queue (batch tested with PRs 48914, 48535, 49099, 48935, 48871)

Log error when fail to execute command in with-retry()

**What this PR does / why we need it**: Enhance gke/util.sh logging.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #48913

**Special notes for your reviewer**:
/cc @krzyzacy 

**Release note**:

```release-note
NONE
```
2017-07-18 21:04:20 -07:00
Mike Danese 7a6917f552 gce: don't print every file in mounter to stdout 2017-07-18 17:04:03 -07:00
Mike Danese 0d662e8845 gce: make some global variables local 2017-07-18 15:55:37 -07:00
Mike Danese 9ee240dfcf aggr: don't write empty CA files
write now if the metadata is not supplied, we write an empty file for
these certs. we should just fail
2017-07-18 14:40:24 -07:00
Shyam Jeedigunta 5cdedd22cf Pass logexporter config through e2e framework 2017-07-18 23:28:06 +02:00
Mike Danese 65cc002877 gce: don't add kubelet bearer token to known tokens
since it is no longer used after kubelet TLS bootstrap.
2017-07-18 08:52:08 -07:00
Kubernetes Submit Queue 013d37d881 Merge pull request #48874 from juju-solutions/feature/sync-channels
Automatic merge from submit-queue (batch tested with PRs 49019, 48919, 49040, 49018, 48874)

Set default snap channel on charms to 1.7 stable

**What this PR does / why we need it**: This PR sets the default snap channel on charms to 1.7/stable. 
This addresses problems where the the user might want to deploy the charm and get the same kubernetes version found on the bundles.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/305

**Special notes for your reviewer**:

**Release note**:

```Set default snap channel on charms to 1.7/stable
```
2017-07-17 21:25:13 -07:00
Kubernetes Submit Queue 3548031fa9 Merge pull request #49040 from shyamjvs/fix-logdump
Automatic merge from submit-queue (batch tested with PRs 49019, 48919, 49040, 49018, 48874)

Fix if condition in cluster/log-dump/log-dump.sh

Fixes https://github.com/kubernetes/kubernetes/issues/49039

/cc @yujuhong
2017-07-17 21:25:09 -07:00
Kubernetes Submit Queue 50ec4384ff Merge pull request #49033 from wwwtyro/rye/handle-prev-port-none
Automatic merge from submit-queue (batch tested with PRs 48231, 47377, 48797, 49020, 49033)

prevent unsetting of nonexistent previous port in kubeapi-load-balancer charm

**What this PR does / why we need it**: prevent unsetting of nonexistent previous port in kubeapi-load-balancer charm

**Release note**:
```release-note
prevent unsetting of nonexistent previous port in kubeapi-load-balancer charm
```
2017-07-17 19:37:48 -07:00
Shyam Jeedigunta de09f8e569 Fix if condition in cluster/log-dump/log-dump.sh 2017-07-17 20:21:43 +02:00
Rye Terrell 7c06da5b0e prevent unsetting of nonexistent previous port in kubeapi-load-balancer charm 2017-07-17 12:22:54 -05:00
gmarek af6480210e Create 64-core masters for huge clusters 2017-07-17 15:03:39 +02:00
icereval 60872db873
log node-problem-detector 2017-07-16 16:46:25 -04:00
Kubernetes Submit Queue 60c3dd1bf7 Merge pull request #48958 from wwwtyro/rye/charm-load-balancer-port
Automatic merge from submit-queue (batch tested with PRs 48578, 48895, 48958)

use port configuration

**What this PR does / why we need it**: Uses the `port` config option in the kubeapi-load-balancer charm.

**Release note**:
```release-note
Uses the port config option in the kubeapi-load-balancer charm.
```
2017-07-14 13:45:55 -07:00
Kubernetes Submit Queue 8532cdfd69 Merge pull request #48886 from mikedanese/cleanup
Automatic merge from submit-queue

remove some people from OWNERS so they don't get reviews anymore

These are googlers who don't work on the project anymore but are still
getting reviews assigned to them:
- @bprashanth
- @rjnagal
- @vmarmol
2017-07-14 11:46:10 -07:00
Rye Terrell ece0461cce use port configuration 2017-07-14 13:31:47 -05:00
Kubernetes Submit Queue 455e44b616 Merge pull request #48812 from crassirostris/change-fluentd-monitoring
Automatic merge from submit-queue (batch tested with PRs 48812, 48276)

Change fluentd-gcp monitoring to use metrics exposed by SD plugin

Following https://github.com/GoogleCloudPlatform/fluent-plugin-google-cloud/pull/135, make fluentd-gcp expose metrics in Prometheus registry and use them instead of counting records in the pipeline.

/cc @piosz @igorpeshansky

```release-note
Fluentd-gcp DaemonSet exposes different set of metrics.
```
2017-07-14 04:43:42 -07:00
Kubernetes Submit Queue ca0a868823 Merge pull request #48651 from shyamjvs/logexporter-supporter
Automatic merge from submit-queue (batch tested with PRs 48864, 48651, 47703)

Enable logexporter mechanism to dump logs from k8s nodes to GCS directly

Ref https://github.com/kubernetes/kubernetes/issues/48513

This adds support for logexporter from k8s side. Next I'll send a PR adding support from test-infra side.

/cc @kubernetes/sig-scalability-misc @kubernetes/test-infra-maintainers @fejta @wojtek-t @gmarek
2017-07-14 03:10:45 -07:00
Kubernetes Submit Queue ca1b408bf5 Merge pull request #48864 from jszczepkowski/ha-e2e-fix
Automatic merge from submit-queue

Fixed cluster validation for multizonal clusters.

Fixed cluster validation for multizonal clusters.
This should fix HA master e2e tests.

```release-note
```
2017-07-14 02:48:06 -07:00
Kubernetes Submit Queue d623bd4c44 Merge pull request #46748 from gyliu513/critical
Automatic merge from submit-queue (batch tested with PRs 46748, 48826)

Added `CriticalAddonsOnly` toleration for npd.

**What this PR does / why we need it**:
We should add `CriticalAddonsOnly` toleration to make sure the daemonset can be scheduled on the node even if already planned to run critical pod.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #47015

**Special notes for your reviewer**:

**Release note**:

```release-note
none
```
2017-07-14 02:03:41 -07:00
Kubernetes Submit Queue cc35759318 Merge pull request #48475 from gmile/patch-1
Automatic merge from submit-queue

Properly nest code blocks

**What this PR does / why we need it**:

Markdown code blocks are adjusted to better display on GitHub. See [rendered](c3fbec7663/cluster/addons/cluster-loadbalancing/glbc/README.md) version.

**Release note**:

```release-note
Adjust markdown code block in README for Google Load Balancer addon.
```
2017-07-13 18:50:46 -07:00
Kubernetes Submit Queue b507254bb0 Merge pull request #46825 from zjj2wry/fix_doc_spell
Automatic merge from submit-queue

Update docs for user-guide

**What this PR does / why we need it**:

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
NONE
```
2017-07-13 18:50:36 -07:00
Zihong Zheng 722c3e348c Log error when fail to execute command in with-retry() 2017-07-13 14:56:16 -07:00
Mike Danese c201553f27 remove some people from OWNERS so they don't get reviews anymore
These are googlers who don't work on the project anymore but are still
getting reviews assigned to them:
- bprashanth
- rjnagal
- vmarmol
2017-07-13 10:02:21 -07:00
gmarek afe1a2c71b Revert "Merge pull request #48560 from nicksardo/gce-network-project"
This reverts commit d4881dd491, reversing
changes made to b5c4346130.
2017-07-13 18:34:24 +02:00
Guangya Liu d874548eed Added `CriticalAddonsOnly` toleration for npd. 2017-07-13 23:44:15 +08:00
Konstantinos Tsakalozos ca0f042854 Set default snap channel on charms to 1.7 stable 2017-07-13 16:24:48 +03:00
Jerzy Szczepkowski 1cef818cd0 Fixed cluster validation for multizonal clusters.
Fixed cluster validation for multizonal clusters.
This should fix HA master e2e tests.
2017-07-13 11:03:57 +02:00
Kubernetes Submit Queue 3c080e83c7 Merge pull request #48642 from freehan/gce-api-endpint
Automatic merge from submit-queue

Support GCE alpha/beta api endpoint override

fixes: https://github.com/kubernetes/kubernetes/issues/48568
2017-07-12 18:23:37 -07:00
Kubernetes Submit Queue 6e3bcd9b02 Merge pull request #48830 from csbell/patch-conversion-typo
Automatic merge from submit-queue (batch tested with PRs 48781, 48817, 48830, 48829, 48053)

Fix yaml-quote typo

Caught this looking through CI logs.

/assign wojtek-t
2017-07-12 15:11:13 -07:00
Minhan Xia 3e8b4a27c4 use overrided api endpoint in gce cloud provider 2017-07-12 15:10:13 -07:00
Kubernetes Submit Queue d7d4346464 Merge pull request #48566 from vasartori/fluentd-prometheus
Automatic merge from submit-queue (batch tested with PRs 48279, 48566, 48319, 48794, 47952)

Add prometheus plugin on fluentd image.

**What this PR does / why we need it**:
This PR adds the prometheus plugin on Fluentd.

**Special notes for your reviewer**:
The plugin used was: https://github.com/kazegusuri/fluent-plugin-prometheus, on the latest stable version.
All configs used are default.

**Release note**:

```release-note
Fluentd-es addon now exposes a /metrics endpoint for monitoring on port 24231. 
```
2017-07-12 11:52:17 -07:00
Kubernetes Submit Queue 6396b9ddc3 Merge pull request #48279 from abgworrall/cos-nodes-by-default
Automatic merge from submit-queue

Use Container-optimzed OS images for nodes by default

Part of the deprecation of the debian-based ContainerVM images.

```release-note
kube-up and kubemark will default to using cos (GCI) images for nodes.
The previous default was container-vm (CVM, "debian"), which is deprecated.
If you need to explicitly use container-vm for some reason, you should set
KUBE_NODE_OS_DISTRIBUTION=debian
```
2017-07-12 11:46:19 -07:00
Christian Bell ad365b3479 Fix typo 2017-07-12 11:38:14 -07:00
Kubernetes Submit Queue 7001b9d9d1 Merge pull request #48425 from kawych/master
Automatic merge from submit-queue

Pass cluster name to Heapster with Stackdriver sink.

**What this PR does / why we need it**:
Passes cluster name as argument to Heapster when it's used with Stackdriver sink to allow setting resource label 'cluster_name' in exported metrics.

**Release note**:
```release-note
NONE
```
2017-07-12 10:51:09 -07:00
Mik Vyatskov 125f759907 Change fluentd-gcp monitoring to use metrics exposed by SD plugin 2017-07-12 17:07:15 +02:00
Kubernetes Submit Queue 93344ea18c Merge pull request #48571 from verult/DefaultCOSFlexDir
Automatic merge from submit-queue (batch tested with PRs 48405, 48742, 48748, 48571, 48482)

Setting default FlexVolume driver directory on COS images.

**What this PR does / why we need it**: The original default FlexVolume driver directory is not writable on COS. A new location is necessary to make FlexVolume work.

This directory doesn't exist by default. FlexVolume users need to create this directory, bind mount it, and remount with the executable permission. The other candidate is /home/kubernetes/bin, but the directory is already getting cluttered. I will submit a different PR for a script that automates this step.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #48570
2017-07-12 08:05:23 -07:00
Shyam Jeedigunta 5f8cb3d9ff Enable logexporter mechanism to dump logs from k8s nodes to GCS directly 2017-07-12 14:39:49 +02:00
Kubernetes Submit Queue 787f5e0fe5 Merge pull request #48735 from nicksardo/gce-empty-network-proj
Automatic merge from submit-queue (batch tested with PRs 48698, 48712, 48516, 48734, 48735)

GCE: Allow empty NETWORK_PROJECT_ID env var

Changes:
1. Adds `GCE_API_ENDPOINT` logic to container-linux as it was added to GCI in #47881.
1. Apply `NETWORK_PROJECT_ID` value to gce.conf only if the env var is set.

/sig network
/area platform/gce

**Release note**:
```release-note
NONE
```
2017-07-12 04:56:33 -07:00
Adam Worrall 2196be7c7f Leave testing on CVM by default
Avoid breaking tests that assume they're testing CVM.
2017-07-11 14:53:49 -07:00
Adam Worrall 72f58e09c1 Leave the test jobs running on CVM after all. 2017-07-11 14:47:35 -07:00
Adam Worrall ddeab2a69a Update NODE_OS_DISTRIBUTION from debian to gci
This in turn drives how NODE_IMAGE is set
2017-07-11 14:47:35 -07:00
Adam Worrall ba2dd9ea4c Use Container-optimzed OS images for node 2017-07-11 14:47:35 -07:00
Nick Sardo ebce7d2497 Allow missing NETWORK_PROJECT_ID env var 2017-07-10 14:26:47 -07:00
Cheng Xing 01fd6219cf Setting default FlexVolume driver directory on COS images. 2017-07-10 11:17:23 -07:00
Mik Vyatskov b11084e76c Bump event-exporter version 2017-07-10 17:32:40 +02:00
Victor Sartori de50947f30 Bump image version on makefile and DS. 2017-07-07 14:55:54 -03:00
Nick Sardo 06e328627c Use network project id for firewall/route mgmt and zone listing 2017-07-06 16:58:27 -07:00
Victor Sartori 0e52082673 Add prometheus plugin on fluentd image. 2017-07-06 18:24:04 -03:00
Kubernetes Submit Queue 20e629b1c6 Merge pull request #44394 from rthallisey/pre-existing-provider
Automatic merge from submit-queue

Launch kubemark with an existing Kubemark master

In order to expand the use of kubemark, allow developers to use kubemark with a pre-existing Kubernetes cluster.

Ref issue  #44393
2017-07-06 04:41:53 -07:00
Kazuki Suda 63f6198b4e Fix a dead link in cluster/update-storage-objects.sh 2017-07-06 18:37:52 +09:00
Kubernetes Submit Queue 40a21312d1 Merge pull request #48144 from juju-solutions/bug/worker-termination
Automatic merge from submit-queue (batch tested with PRs 48399, 48450, 48144)

Skip errors when unregistering juju kubernetes-workers

**What this PR does / why we need it**: When removing a kubernetes node from using Juju and for some reason kubernetes master fails we should not error the node, instead we should proceed with the removal of the node and the master will recognise that node as unavailable because it will fail heartbeats.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/300

**Special notes for your reviewer**:

**Release note**:

```
Clean decommission of Juju kubernetes worker units 
```
2017-07-05 08:58:39 -07:00
Kubernetes Submit Queue 7b13208d61 Merge pull request #48450 from wwwtyro/rye/lxc-disable-conntrack-max
Automatic merge from submit-queue (batch tested with PRs 48399, 48450, 48144)

configure kube-proxy to run with unset conntrack param when in lxc

**What this PR does / why we need it**: Configures the Juju Charm code to run kube-proxy with `conntrack-max-per-core` set to `0` when in an lxc as a workaround for issues when mounting `/sys/module/nf_conntrack/parameters/hashsize`

**Release note**:

```release-note
Configures the Juju Charm code to run kube-proxy with conntrack-max-per-core set to 0 when in an lxc as a workaround for issues when mounting /sys/module/nf_conntrack/parameters/hashsize
```
2017-07-05 08:58:37 -07:00
Ryan Hallisey 82e1d208f6 Launch kubemark with an existing Kubemark Master
In order to expand the use of kubemark, allow developers to
use kubemark with a pre-existing Kubemark master.
2017-07-05 09:14:53 -04:00
Konstantinos Tsakalozos 90a57931af Skip errors when unregistering juju kubernetes-workers 2017-07-05 16:00:37 +03:00
Kubernetes Submit Queue 5d21390561 Merge pull request #41790 from wojtek-t/allow_for_enabling_conversion_mismatch_detecto
Automatic merge from submit-queue

Add ability to enable patch conversion detector

Will rebase and fix once #41326 is merged.
2017-07-04 13:18:22 -07:00
Eugene Pirogov c3fbec7663 Properly nest code blocks
Markdown code blocks are adjusted to better display on GitHub
2017-07-04 19:49:48 +03:00
Kubernetes Submit Queue 3823270b9e Merge pull request #48446 from Cynerva/gkk/stop-snaps
Automatic merge from submit-queue (batch tested with PRs 47043, 48448, 47515, 48446)

Fix charms leaving services running after remove-unit

**What this PR does / why we need it**:

This fixes a case where removed charm units can sometimes leave behind running services that interfere with the rest of the cluster.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
Fix charms leaving services running after remove-unit
```
2017-07-04 09:12:57 -07:00
Wojciech Tyczynski 37b5a214bc Add ability to enable patch conversion detector 2017-07-04 14:33:24 +02:00
Rye Terrell 05fbc7a7f8 configure kube-proxy to run with unset conntrack param when in lxc 2017-07-03 21:58:54 -05:00
Kubernetes Submit Queue e1d9ab205a Merge pull request #48440 from Cynerva/gkk/snap-upgrades-restart-services
Automatic merge from submit-queue (batch tested with PRs 48439, 48440, 48394)

Fix kubernetes charms not restarting services after snap upgrades

**What this PR does / why we need it**:

This fixes a problem where the Kubernetes charms don't restart services after upgrading snaps. This can cause certain fixes not to be picked up (for example https://github.com/juju-solutions/release/pull/10)

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
Fixed kubernetes charms not restarting services after snap upgrades
```
2017-07-03 13:05:28 -07:00
Kubernetes Submit Queue 937369bc21 Merge pull request #48439 from juju-solutions/bug/namespaces-path
Automatic merge from submit-queue (batch tested with PRs 48439, 48440, 48394)

Fix: namespace-create have kubectl in path

**What this PR does / why we need it**: In juju deployed clusters namespace-create action is failing

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/326

**Special notes for your reviewer**:

**Release note**:

```Fix: namespace-create action on Juju deployed clusters
```
2017-07-03 13:05:27 -07:00
George Kraft f0e08818d8 Fix charms leaving services running after unit removal 2017-07-03 14:55:07 -05:00
Konstantinos Tsakalozos cea934bcbc Fix: namespace-create have kubectl in path 2017-07-03 18:22:20 +03:00
George Kraft c21b305fe9 Fix kubernetes charms not restarting services after snap upgrades 2017-07-03 09:47:08 -05:00
Konstantinos Tsakalozos e2571a853a Non leaders should overwrite any local copies of keys they have with what the leader has. 2017-07-03 17:45:43 +03:00
Karol Wychowaniec ca5581edcc Pass cluster name to Heapster with Stackdriver sink. 2017-07-03 11:55:57 +02:00
Kubernetes Submit Queue 9848cdb3ac Merge pull request #48281 from hogepodge/configure-swift-store
Automatic merge from submit-queue

Add configuration for swift container name

**What this PR does / why we need it:**
This review updates the OpenStack Heat provider to allow for configuring the name of the Swift object store.

**Which issue this PR fixes:**
fixes #47966

**Special notes for your reviewer**:
Note that the terminology for OpenStack Swift conflicts with K8S terminology. In this instance, container is referring to the organization structure of Swift storage objects.

**Release note**:
```release-note
Adds configuration option for Swift object store container name to OpenStack Heat provider.
```
2017-07-02 08:02:42 -07:00
Kubernetes Submit Queue dc597291c1 Merge pull request #48351 from juju-solutions/bug/get-pass
Automatic merge from submit-queue (batch tested with PRs 48317, 48313, 48351, 48357, 48115)

Ensure get_password is accessing a file that exists.

**What this PR does / why we need it**: get_password will throw an exception instead of returning None in case the basic_auth.csv file is missing but /root/cdk/ is there in a juju deployment.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/324

**Special notes for your reviewer**:

**Release note**:

```
Fix race condition where /root/cdk is not yet initialised in kubernetes-master setup by Juju  
```
2017-06-30 19:54:27 -07:00
Kubernetes Submit Queue c0337c92cc Merge pull request #47881 from cadmuxe/endpoint
Automatic merge from submit-queue (batch tested with PRs 47918, 47964, 48151, 47881, 48299)

Add ApiEndpoint support to GCE config.

**What this PR does / why we need it**:
Add the ability to change ApiEndpoint  for GCE.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:
```release-note
None
```
2017-06-30 18:42:40 -07:00
Kubernetes Submit Queue 87c6fb5de2 Merge pull request #42376 from jingxu97/Feb/mounter
Automatic merge from submit-queue (batch tested with PRs 43558, 48261, 42376, 46803, 47058)

Add bind mount /etc/resolv.conf from host to containerized mounter

Currently, in containerized mounter rootfs, there is no DNS setup. If client
try to set up volume with host name instead of IP address, it will fail to resolve
the host name. 
By bind mount the host's /etc/resolv.conf to mounter rootfs, VM hosts name
could be resolved when using host name during mount. 

```release-note
Fixes issue where you could not mount NFS or glusterFS volumes using hostnames on GCI/GKE with COS images.
```
2017-06-30 16:28:46 -07:00
gunjan5 cc162a7f15 set snat to false 2017-06-30 12:53:55 -07:00
Konstantinos Tsakalozos cd34d8f80d Ensure get_password is accessing a file that exists. 2017-06-30 20:24:35 +03:00
Kubernetes Submit Queue d19773d855 Merge pull request #47835 from juju-solutions/feature/security
Automatic merge from submit-queue (batch tested with PRs 47850, 47835, 46197, 47250, 48284)

Securing the cluster created by Juju

**What this PR does / why we need it**: This PR secures the deployments done with Juju master. Works around certain security issues inherent to kubernetes (see for example dashboard access)

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```
Securing Juju kubernetes dashboard
```
2017-06-29 15:16:39 -07:00
Chris Hoge c0621061c8 Add configuration for swift container name
Fixes Issue #47966
2017-06-29 10:15:55 -07:00
Koonwah Chen c4e84e263c Change KUBE_GCE_API_ENDPOINT to GCE_API_ENDPOINT 2017-06-29 10:04:53 -07:00
Kubernetes Submit Queue d3aa0d5a8a Merge pull request #46850 from x13n/nanny-version
Automatic merge from submit-queue (batch tested with PRs 46850, 47984)

Update addon-resizer version

Update addon-resizer version and remove the flags that have been deprecated in the new version.

**What this PR does / why we need it**:
ref kubernetes/contrib#2623

**Special notes for your reviewer**:
Need to wait for merging kubernetes/contrib#2623 first.

**Release note**:
```release-note
addon-resizer flapping behavior was removed.
```
2017-06-29 07:18:32 -07:00
Kubernetes Submit Queue 7018479968 Merge pull request #48204 from shyamjvs/logdump-only-n-nodes
Automatic merge from submit-queue

Allow log-dumping only N randomly-chosen nodes in the cluster

This should let us save "lots" (~3-4 hours) of time in our 5000-node cluster scale tests as we copy logs from all the nodes to jenkins worker and then upload all of them to gcs (while we don't need too many).
This will also prevent the jenkins container facing "No space left on device" error while dumping logs, that we saw in runs 12-13 of gce-enormous-cluster.

The longterm fix will be to enable [logexporter](https://github.com/kubernetes/test-infra/tree/master/logexporter) for our tests.

cc @kubernetes/sig-scalability-misc @kubernetes/test-infra-maintainers @gmarek @fejta
2017-06-29 04:23:58 -07:00
Daniel Kłobuszewski 63ccedcfa7 Update addon-resizer version
Also, remove the flags that have been deprecated in the new version.
2017-06-29 11:03:43 +02:00
Koonwah Chen b3956a689e Add KUBE_GCE_API_ENDPOINT for GCE API endpoint config. 2017-06-28 16:03:18 -07:00
Shyam Jeedigunta b960a0da12 Allow log-dumping only N randomly-chosen nodes in the cluster 2017-06-28 23:01:08 +02:00
Shyam Jeedigunta cc8bb857f9 Allow creating special node for heapster in GCE 2017-06-28 21:27:36 +02:00
Kubernetes Submit Queue a17f15a8a9 Merge pull request #48205 from piosz/heapster-1.4
Automatic merge from submit-queue (batch tested with PRs 48004, 48205, 48130, 48207)

Bumped Heapster to v1.4.0

``` release-note
Bumped Heapster to v1.4.0.
More details about the release https://github.com/kubernetes/heapster/releases/tag/v1.4.0
```

follow up #47961
The release candidate `v1.4.0-beta.0` turned out to be stable.
2017-06-28 10:35:12 -07:00
Kubernetes Submit Queue 63d4af44ac Merge pull request #48004 from dnardo/gke
Automatic merge from submit-queue (batch tested with PRs 48004, 48205, 48130, 48207)

Do not set CNI in cases where there is a private master and network policy provider is set.

**What this PR does / why we need it**:

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #

**Special notes for your reviewer**:

**Release note**:

```release-note
In GCE and in a "private master" setup, do not set the network-plugin provider to CNI by default if a network policy provider is given.
```
2017-06-28 10:35:10 -07:00
Kubernetes Submit Queue ec729ad66d Merge pull request #48182 from gmarek/fluentd
Automatic merge from submit-queue (batch tested with PRs 48192, 48182)

Add generic NoSchedule toleration to fluentd in gcp config as a quick…

…-fix for #44445
2017-06-28 09:33:08 -07:00
Piotr Szczesniak 43280e274d Bumped Heapster to v1.4.0 2017-06-28 16:40:35 +02:00
gmarek 10ce8e2c0d Fix bug cluster-subnet logic 2017-06-28 14:27:52 +02:00
gmarek 3f57d8dba3 Add generic NoSchedule toleration to fluentd in gcp config as a quick-fix for #44445 2017-06-28 10:35:58 +02:00
Konstantinos Tsakalozos 0525b84a45 Disable anonymous-auth 2017-06-28 10:47:45 +03:00
Zach Loafman 903bc643b1 Bump GCE ContainerVM to container-vm-v20170627
Remove the built-in kubelet (finally), pick up security fixes.
2017-06-27 16:14:55 -07:00
Kubernetes Submit Queue 89579c45a4 Merge pull request #48054 from juju-solutions/bug/terminate-etcd
Automatic merge from submit-queue (batch tested with PRs 48139, 48042, 47645, 48054, 48003)

Add a failsafe for etcd not returning a connection string

**What this PR does / why we need it**: Removing a kubernetes-master will fail as described on this issue: https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/311

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes https://github.com/juju-solutions/bundle-canonical-kubernetes/issues/311

**Special notes for your reviewer**: This is a two liner defensive code. I am not totally sold on this patch. I might not be the right place to address the above issue. However, solving the problem on the etcd side and updating the interface scope to be unit (as suggested) seems much more involving.

**Release note**:

```
Fix error when removing juju kubernetes-master unit
```
2017-06-27 14:08:19 -07:00
Kubernetes Submit Queue f1b58f4e5f Merge pull request #48139 from crassirostris/fix-fluentd-config
Automatic merge from submit-queue (batch tested with PRs 48139, 48042, 47645, 48054, 48003)

Fix fluentd-gcp configuration to facilitate JSON parsing

There's a bug in https://github.com/kubernetes/kubernetes/pull/45734, because of which each records gets additional field and google-cloud plugin thinks it's not JSON (https://github.com/GoogleCloudPlatform/fluent-plugin-google-cloud/blob/master/lib/fluent/plugin/out_google_cloud.rb#L569)

Fixes https://github.com/kubernetes/kubernetes/issues/48108

/cc @piosz @fgrzadkowski
2017-06-27 14:08:07 -07:00
Kubernetes Submit Queue ede78d9ee7 Merge pull request #47513 from gmarek/subnet
Automatic merge from submit-queue

Make big clusters work again after introduction of subnets

This PR does two things: 
  - make IP aliases automatically pick Node IP Range based on number of Nodes,
  - fix logic for starting clusters >4095 Nodes that was broken by introduction of subnets,

cc @wojtek-t @shyamjvs 

```release-note
Setting env var ENABLE_BIG_CLUSTER_SUBNETS=true will allow kube-up.sh to start clusters bigger that 4095 Nodes on GCE.
```

Ref https://github.com/kubernetes/kubernetes/issues/47344
2017-06-27 08:52:50 -07:00
Kubernetes Submit Queue d65b87a00d Merge pull request #47847 from chuckbutler/cluster-juju-approvers
Automatic merge from submit-queue

Insert Cynerva and Kjackal to approvers list

**What this PR does / why we need it**:
Per the membership reviews, we're looking to promote Konstantinos and
George to approvers to help distribute the review/bug load for the `cluster/juju` code
tree.

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: 

**Special notes for your reviewer**:
cc @marcoceppi and @tvansteenburgh 

**Release note**:

```release-note
NONE
```
2017-06-27 08:36:30 -07:00
Mik Vyatskov b6a0e442ce Fix fluentd-gcp configuration to facilitate JSON parsing 2017-06-27 16:16:00 +02:00
Maciej Pytel 04f7a96340 Fix typo in cluster-autoscaler config 2017-06-27 13:49:19 +02:00
Maciej Pytel b11175f73f Set cluster-autoscaler node balancing flag 2017-06-27 12:10:33 +02:00
Konstantinos Tsakalozos 0b01cd743b Improve security of Juju deployed clusters 2017-06-27 12:19:21 +03:00
zhengjiajin 2ab0c65506 Update docs for user-guide 2017-06-27 12:21:49 +08:00
Kubernetes Submit Queue 0dad2d0803 Merge pull request #47983 from yguo0905/memcg
Automatic merge from submit-queue (batch tested with PRs 48092, 47894, 47983)

Enables memcg notification in cluster/node e2e tests

Ref: https://github.com/kubernetes/kubernetes/issues/42676

This PR sets Kubelet flag `--experimental-kernel-memcg-notification=true` when running cluster/node e2e tests on COS and Ubuntu images.

Tested:
```
e2e-node-cos:
I0623 00:09:06.641776    1080 server.go:147] Starting server "kubelet" with command "/usr/bin/systemd-run --unit=kubelet-777178888.service --slice=runtime.slice --remain-after-exit /tmp/node-e2e-20170622T170739/kubelet --kubelet-cgroups=/kubelet.slice --cgroup-root=/ --api-servers http://localhost:8080 --address 0.0.0.0 --port 10250 --read-only-port 10255 --volume-stats-agg-period 10s --allow-privileged true --serialize-image-pulls false --pod-manifest-path /tmp/node-e2e-20170622T170739/pod-manifest571288056 --file-check-frequency 10s --pod-cidr 10.100.0.0/24 --eviction-pressure-transition-period 30s --feature-gates  --eviction-hard memory.available<250Mi,nodefs.available<10%%,nodefs.inodesFree<5%% --eviction-minimum-reclaim nodefs.available=5%%,nodefs.inodesFree=5%% --v 4 --logtostderr --network-plugin=kubenet --cni-bin-dir /tmp/node-e2e-20170622T170739/cni/bin --cni-conf-dir /tmp/node-e2e-20170622T170739/cni/net.d --hostname-override tmp-node-e2e-bfe5799d-cos-stable-59-9460-64-0 --experimental-mounter-path=/tmp/node-e2e-20170622T170739/cluster/gce/gci/mounter/mounter --experimental-kernel-memcg-notification=true"

e2e-node-ubuntu:
I0623 00:03:28.526984    2279 server.go:147] Starting server "kubelet" with command "/usr/bin/systemd-run --unit=kubelet-1407651753.service --slice=runtime.slice --remain-after-exit /tmp/node-e2e-20170622T170203/kubelet --kubelet-cgroups=/kubelet.slice --cgroup-root=/ --api-servers http://localhost:8080 --address 0.0.0.0 --port 10250 --read-only-port 10255 --volume-stats-agg-period 10s --allow-privileged true --serialize-image-pulls false --pod-manifest-path /tmp/node-e2e-20170622T170203/pod-manifest083943734 --file-check-frequency 10s --pod-cidr 10.100.0.0/24 --eviction-pressure-transition-period 30s --feature-gates  --eviction-hard memory.available<250Mi,nodefs.available<10%%,nodefs.inodesFree<5%% --eviction-minimum-reclaim nodefs.available=5%%,nodefs.inodesFree=5%% --v 4 --logtostderr --network-plugin=kubenet --cni-bin-dir /tmp/node-e2e-20170622T170203/cni/bin --cni-conf-dir /tmp/node-e2e-20170622T170203/cni/net.d --hostname-override tmp-node-e2e-e48cdd73-ubuntu-gke-1604-xenial-v20170420-1 --experimental-kernel-memcg-notification=true"

e2e-node-containervm:
I0623 00:14:35.392383    2774 server.go:147] Starting server "kubelet" with command "/tmp/node-e2e-20170622T171318/kubelet --runtime-cgroups=/docker-daemon --kubelet-cgroups=/kubelet --cgroup-root=/ --system-cgroups=/system --api-servers http://localhost:8080 --address 0.0.0.0 --port 10250 --read-only-port 10255 --volume-stats-agg-period 10s --allow-privileged true --serialize-image-pulls false --pod-manifest-path /tmp/node-e2e-20170622T171318/pod-manifest507536807 --file-check-frequency 10s --pod-cidr 10.100.0.0/24 --eviction-pressure-transition-period 30s --feature-gates  --eviction-hard memory.available<250Mi,nodefs.available<10%,nodefs.inodesFree<5% --eviction-minimum-reclaim nodefs.available=5%,nodefs.inodesFree=5% --v 4 --logtostderr --network-plugin=kubenet --cni-bin-dir /tmp/node-e2e-20170622T171318/cni/bin --cni-conf-dir /tmp/node-e2e-20170622T171318/cni/net.d --hostname-override tmp-node-e2e-9e3fdd7c-e2e-node-containervm-v20161208-image"

e2e-cos:
Jun 23 17:54:38 e2e-test-ygg-minion-group-t5r0 kubelet[2005]: I0623 17:54:38.646374    2005 flags.go:52] FLAG: --experimental-kernel-memcg-notification="true"

e2e-ubuntu:
Jun 23 18:25:27 e2e-test-ygg-minion-group-19qp kubelet[1547]: I0623 18:25:27.722253    1547 flags.go:52] FLAG: --experimental-kernel-memcg-notification="true"

e2e-containervm:
I0623 18:55:51.886632    3385 flags.go:52] FLAG: --experimental-kernel-memcg-notification="false"
```

**Release note**:
```
None
```

/sig node
/area node-e2e
/assign @dchen1107 @dashpole
2017-06-26 21:08:10 -07:00
Kubernetes Submit Queue e8701e6746 Merge pull request #47894 from xuxinkun/fixclustercentos20170622
Automatic merge from submit-queue (batch tested with PRs 48092, 47894, 47983)

fix systemd service file for custom args.

`KUBE_SCHEDULER_ARGS` and `KUBELET_ARGS` are used to custom args for scheduler or kubelet by users. 
But if there are more than one params in `KUBELET_ARGS`, for example, if I set  KUBELET_ARGS="--cgroups-per-qos=false --enforce-node-allocatable=", the kubelet will judge the `false --enforce-node-allocatable=` as the value of `cgroups-per-qos`.  Because `${KUBELET_ARGS}` in kubelet.service will expands the variable into one word. And if I take `$KUBELET_ARGS` instead, kubelet will worker perfectly.
For more info, please click [EnvironmentFiles and support for /etc/sysconfig files](http://fedoraproject.org/wiki/Packaging:Systemd#EnvironmentFiles_and_support_for_.2Fetc.2Fsysconfig_files). This bug is reported by @huanxingyouyoutoo. And I make this PR for her to fix it.

**Release note**:

```
NONE
```
2017-06-26 21:08:08 -07:00