Commit Graph

8212 Commits (af2659527f0bd2f7ad8500ffcd0e5640bfd53cc3)

Author SHA1 Message Date
André Bauer 9e2d9cfbb0 changed es image repo
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-26 09:09:21 +01:00
Kubernetes Prow Robot 3fb6e77770
Merge pull request #74549 from yujuhong/pause-image
GCE: switch to using e2eteam/pause:3.1 for pause containers
2019-02-25 23:12:00 -08:00
Kubernetes Prow Robot 0ff7e463ee
Merge pull request #73746 from mrbobbytables/kubemark-shellcheck
Fix shellcheck lint errors in Kubemark scripts
2019-02-25 17:25:13 -08:00
Michael Taufen 98edbf41bc upload Windows startup scripts to GCS for CI 2019-02-25 16:44:57 -08:00
Kubernetes Prow Robot 3814176d42
Merge pull request #74455 from SataQiu/fix-shell-2019022302
Fix shellcheck lint errors in cluster and hack scripts
2019-02-25 15:15:19 -08:00
Yu-Ju Hong b863655faa GCE: switch to using e2eteam/pause:3.1 for pause containers
Stop building pause images on node startup.
2019-02-25 14:36:49 -08:00
Kubernetes Prow Robot 2aacb77374
Merge pull request #74444 from pjh/gce-windows-no-defender
Disable Windows Defender on Windows nodes.
2019-02-25 13:54:42 -08:00
Bob Killen 9a4f4878f5
Fix shellcheck lint errors in cluster/kubemark/util.sh 2019-02-25 15:21:29 -05:00
Bob Killen 9a58913e8f
Fix shellcheck lint errors in cluster/kubemark/iks/config-default.sh 2019-02-25 15:21:25 -05:00
Bob Killen ce4c85e3fd
Fix shellcheck lint errors in cluster/kubemark/gce/config-default.sh 2019-02-25 14:55:01 -05:00
Kubernetes Prow Robot 35a258d640
Merge pull request #73272 from danielqsj/juju
fix shellcheck in cluster/juju
2019-02-25 11:33:21 -08:00
Kubernetes Prow Robot f288678cfa
Merge pull request #73261 from danielqsj/local
fix shellcheck in cluster/local
2019-02-25 11:33:11 -08:00
Florent Delannoy e627474e8f Fix fluentd-gcp addon liveness probe
Fix three issues with the fluentd-gcp liveness probe:

h1. STUCK_THRESHOLD_SECONDS was overridden by LIVENESS_THRESHOLD_SECONDS
if defined

Probably a copy/paste issue introduced in edf1ffc074
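
A minimal sketch of how this override plays out, assuming a hypothetical custom value of 120 for `LIVENESS_THRESHOLD_SECONDS` and no value set for `STUCK_THRESHOLD_SECONDS`. The variable names `buggy` and `fixed` are illustrative only and do not come from the probe script:

```shell
#!/bin/sh
# Hypothetical custom liveness threshold, for illustration only.
LIVENESS_THRESHOLD_SECONDS=120
# Make sure the stuck threshold is unset so that its default applies.
unset STUCK_THRESHOLD_SECONDS

# Buggy form: the 900 default is attached to the wrong variable, so the
# stuck threshold silently takes the liveness value whenever that is set.
buggy=${LIVENESS_THRESHOLD_SECONDS:-900}

# Intended form: the stuck threshold falls back to its own default.
fixed=${STUCK_THRESHOLD_SECONDS:-900}

echo "buggy=${buggy} fixed=${fixed}"
```

With these assumed values the buggy expansion yields 120 while the intended one yields 900.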

h1. `[[` is [a bashism](https://stackoverflow.com/a/47576482), and will always fail when called with `/bin/sh`

Introduced by a844523c20

Given that we call the liveness probe with `/bin/sh`, we cannot use the
double-bracketed `[[` syntax for test, as it is not POSIX-compliant and
will throw an error.

Annoyingly, even though it prints an error, `sh` returns with exit code 0
in this case:

```bash
root@fluentd-7mprs:/# sh liveness.sh
liveness.sh: 8: liveness.sh: [[: not found
liveness.sh: 15: liveness.sh: [[: not found
root@fluentd-7mprs:/# echo $?
0
```

Which means the liveness probe is considered successful by Kubernetes,
despite failing to test things as it was intended. This is also
probably the reason why this bug wasn't reported sooner :)

Thankfully, the test in this case can just as easily be written in a
POSIX-compliant way, as it doesn't use any bash-specific features within the
`[[` block.
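
As a rough sketch of what a POSIX-compliant emptiness test can look like, the snippet below uses the single-bracket `[` form. The `recent` and `status` variables are hypothetical stand-ins (here `recent` plays the role of a `find` command's output) and are not taken from the probe script. Note that in a POSIX shell, an `if` whose condition fails and that has no `else` branch exits with status 0, which is consistent with the exit code shown earlier:

```shell
#!/bin/sh
# Hypothetical stand-in for the output of a find command; empty means
# "nothing recent was found".
recent=""

# Single-bracket test is POSIX and works under /bin/sh; -z is true when
# the string is empty.
if [ -z "$recent" ]; then
  status="stuck"
else
  status="alive"
fi

echo "$status"
```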

h1. Buffers are transient and cannot be relied upon for monitoring

Finally, after fixing the above issue, we started seeing the fluentd
containers being restarted very often, and found an issue with the
underlying logic of the liveness probe.

The probe checks that the pod is still alive by running the following
command:

`find /var/log/fluentd-buffers -type f -newer /tmp/marker-stuck -print -quit`

This checks if any _regular_ file exists under `/var/log/fluentd-buffers`
that is more recent than a predetermined time, and will return an empty
string otherwise.

The issue is that these buffers are temporary and volatile: they get created and
deleted constantly. Here is an example of running that check every second on a
running fluentd:

```
root@fluentd-eks-playground-jdc8m:/# LIVENESS_THRESHOLD_SECONDS=${LIVENESS_THRESHOLD_SECONDS:-300};
root@fluentd-eks-playground-jdc8m:/# STUCK_THRESHOLD_SECONDS=${LIVENESS_THRESHOLD_SECONDS:-900};
root@fluentd-eks-playground-jdc8m:/# touch -d "${STUCK_THRESHOLD_SECONDS} seconds ago" /tmp/marker-stuck;
root@fluentd-eks-playground-jdc8m:/# touch -d "${LIVENESS_THRESHOLD_SECONDS} seconds ago" /tmp/marker-liveness;
root@fluentd-eks-playground-jdc8m:/# while true; do date ; find /var/log/fluentd-buffers -type f -newer /tmp/marker-stuck -print -quit ; sleep 1 ; done
Fri Feb 22 10:52:57 UTC 2019
Fri Feb 22 10:52:58 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827964ccf4c7004103c3fa7c8533f85.log
Fri Feb 22 10:52:59 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827964ccf4c7004103c3fa7c8533f85.log
Fri Feb 22 10:53:00 UTC 2019
Fri Feb 22 10:53:01 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827964fb8b2eedcccd2763ea7775cc2.log
Fri Feb 22 10:53:02 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827964fb8b2eedcccd2763ea7775cc2.log
Fri Feb 22 10:53:03 UTC 2019
Fri Feb 22 10:53:04 UTC 2019
Fri Feb 22 10:53:05 UTC 2019
Fri Feb 22 10:53:06 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827965564883997b673d703af54848b.log
Fri Feb 22 10:53:07 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827965564883997b673d703af54848b.log
Fri Feb 22 10:53:08 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827965564883997b673d703af54848b.log
Fri Feb 22 10:53:09 UTC 2019
Fri Feb 22 10:53:10 UTC 2019
Fri Feb 22 10:53:11 UTC 2019
Fri Feb 22 10:53:12 UTC 2019
Fri Feb 22 10:53:13 UTC 2019
Fri Feb 22 10:53:14 UTC 2019
Fri Feb 22 10:53:15 UTC 2019
Fri Feb 22 10:53:16 UTC 2019
```

We can see buffers being created, then disappearing. The LivenessProbe running
under these conditions has a ~50% chance of failing, despite fluentd being
perfectly happy.

I believe that check is probably ok for fluentd installs using large
amounts of buffers, in which case the liveness probe will be correct more
often than not, but fluentd installs that use buffering less intensively
will be negatively impacted by this.

My fix is to check the last-modified time of the buffer _folders_ within
`/var/log/fluentd-buffers`. These _do_ get updated when buffers are
created, and do not get deleted as buffers are emptied, making them the
perfect candidate for our use.
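
The difference between the two checks can be reproduced in isolation. This is only a sketch under stated assumptions: it uses a throwaway directory from `mktemp` instead of the real buffer path, a made-up buffer file name, and relies on GNU `touch -d` and `find -quit` as the commands in this message do:

```shell
#!/bin/sh
# Throwaway stand-in for /var/log/fluentd-buffers (assumption for the demo).
workdir=$(mktemp -d)
mkdir -p "$workdir/buffers/kubernetes.system.buffer"

# Marker file dated 900 seconds in the past (GNU touch syntax).
touch -d "900 seconds ago" "$workdir/marker-stuck"

# Simulate a transient buffer: created, then flushed and deleted.
touch "$workdir/buffers/kubernetes.system.buffer/buffer.example.log"
rm "$workdir/buffers/kubernetes.system.buffer/buffer.example.log"

# File-based check: finds nothing once the buffer file is gone.
files=$(find "$workdir/buffers" -type f -newer "$workdir/marker-stuck" -print -quit)

# Directory-based check: the directories were modified after the marker
# and are still there, so this still reports a match.
dirs=$(find "$workdir/buffers" -type d -newer "$workdir/marker-stuck" -print -quit)

echo "files: ${files:-<none>}"
echo "dirs:  ${dirs:-<none>}"

rm -rf "$workdir"
```

The file-based result is empty even though buffer activity just happened, while the directory-based result is not.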

Here's an example with the `-type d` test for directories:
```
root@fluentd-eks-playground-jdc8m:/# while true; do date ; find /var/log/fluentd-buffers -type d -newer /tmp/marker-stuck -print -quit ; sleep 1 ; done
Fri Feb 22 10:57:51 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:52 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:53 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:54 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:55 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:56 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:57 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:58 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:59 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:58:00 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:58:01 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:58:02 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:58:03 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
```

An example of the directory being updated as new buffers come in:
```
root@fluentd-eks-playground-jdc8m:/# ls -lah /var/log/fluentd-buffers/kubernetes.system.buffer
total 0
drwxr-xr-x 2 root root  6 Feb 22 11:17 .
drwxr-xr-x 3 root root 38 Feb 22 11:14 ..
root@fluentd-eks-playground-jdc8m:/# ls -lah /var/log/fluentd-buffers/kubernetes.system.buffer
total 16K
drwxr-xr-x 2 root root  224 Feb 22 11:18 .
drwxr-xr-x 3 root root   38 Feb 22 11:14 ..
-rw-r--r-- 1 root root 1.8K Feb 22 11:18 buffer.b58279be6e21e8b29fc333a7d50096ed0.log
-rw-r--r-- 1 root root  215 Feb 22 11:18 buffer.b58279be6e21e8b29fc333a7d50096ed0.log.meta
-rw-r--r-- 1 root root  429 Feb 22 11:18 buffer.b58279be6f09bdfe047a96486a525ece2.log
-rw-r--r-- 1 root root  195 Feb 22 11:18 buffer.b58279be6f09bdfe047a96486a525ece2.log.meta
root@fluentd-eks-playground-jdc8m:/# ls -lah /var/log/fluentd-buffers/kubernetes.system.buffer
total 0
drwxr-xr-x 2 root root  6 Feb 22 11:18 .
drwxr-xr-x 3 root root 38 Feb 22 11:14 ..
```
2019-02-25 11:48:31 +00:00
André Bauer 2bd6d3dc12 use image version 6.6.1
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-25 11:05:52 +01:00
André Bauer 2d15ffc9cc updated to 6.5.2
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-25 10:56:50 +01:00
André Bauer 0c29ea1a2e Update es-statefulset.yaml 2019-02-25 10:55:23 +01:00
André Bauer 53a936c359 Update Makefile 2019-02-25 10:55:23 +01:00
André Bauer 0e44fa6359 updated elasticsearch to 6.5.0 2019-02-25 10:55:23 +01:00
André Bauer fc850b5ecd fixed wording
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-25 10:49:43 +01:00
André Bauer 421fcd8262 added production note to readme
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-25 10:47:26 +01:00
danielqsj 7d051e1a75 update juju shell 2019-02-24 20:46:20 +08:00
danielqsj 7e655e8666 fix shellcheck in cluster/juju 2019-02-24 20:40:59 +08:00
danielqsj f02a986081 add comments to shell function 2019-02-24 20:35:46 +08:00
SataQiu 9cda80e836 fix shellcheck lint errors in cluster and hack scripts 2019-02-24 11:15:35 +08:00
Kubernetes Prow Robot 1cf8001e53
Merge pull request #74449 from xichengliudui/fix190223
make more of the shell pass lints
2019-02-23 12:52:34 -08:00
Kubernetes Prow Robot 8993fbc543
Merge pull request #74328 from daixiang0/delete-blank
delete all duplicate empty blanks
2019-02-23 01:43:58 -08:00
Peter Hornyack 621df2cddb Disable Windows Defender on Windows test nodes. 2019-02-22 18:35:38 -08:00
Xiang Dai 36065c6dd7 delete all duplicate empty blanks
Signed-off-by: Xiang Dai <764524258@qq.com>
2019-02-23 10:28:04 +08:00
Jeff Grafton 7a938eb541 Create work-around wrappers for pure attribute on go_binary and go_test
This enables cgo when cross-compiling certain tests and binaries to
Linux, while disabling cgo for Windows and Darwin.
2019-02-22 18:08:34 -08:00
Kubernetes Prow Robot 46d7e9c784
Merge pull request #74381 from yujuhong/add-key
GCE: add EventLog registry key for docker
2019-02-22 17:58:44 -08:00
Kubernetes Prow Robot fd7acc3275
Merge pull request #74065 from ixdy/break-up-test-tarball
break up the test tarball
2019-02-22 17:58:23 -08:00
Kubernetes Prow Robot 743f864310
Merge pull request #73819 from coffeepac/move-fluentd-es-images
Move fluentd es images
2019-02-22 17:58:12 -08:00
Peter Hornyack 3efd4ca1dc Enhance/repair detect-node-names() and related env vars for Windows nodes. 2019-02-22 14:56:55 -08:00
Patrick Christopher 1bd45ba6eb review updates 2019-02-22 10:00:10 -08:00
Aaron Crickenberger 2d868025be Promote spiffxp to approver for cluster/
Also add Katharine as reviewer
2019-02-22 09:25:31 -08:00
Kubernetes Prow Robot c7ac532816
Merge pull request #74360 from pjh/gce-windows-cluster-ssh
Enable OpenSSH on Windows nodes in test clusters.
2019-02-21 20:51:23 -08:00
Kubernetes Prow Robot 125dc6c8ea
Merge pull request #74187 from xichengliudui/fixgolint0218
Fix shellcheck lint errors in cluster/addons/fluentd-elasticsearch/fluentd-es-image/run.sh
2019-02-21 20:51:13 -08:00
Yu-Ju Hong 40d0ae311c GCE: add EventLog registry key for docker 2019-02-21 17:47:32 -08:00
Kubernetes Prow Robot 042f9ed3af
Merge pull request #74093 from blakebarnett/lower-neg-cache-ttl
Lowers the default nodelocaldns denial cache TTL
2019-02-21 17:47:16 -08:00
Blake 46c299c1b1 Match default cache size of 10000
https://github.com/coredns/coredns/blob/master/plugin/cache/cache.go#L236
This gets rounded down to the nearest multiple of 256: 9984
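
The rounding mentioned above can be checked with integer arithmetic. This is only an illustration of the arithmetic, not CoreDNS's code, and the variable names are hypothetical:

```shell
#!/bin/sh
# Requested cache size from the commit title.
requested=10000

# Integer division drops the remainder, so multiplying back by 256
# gives the nearest multiple of 256 at or below the requested size.
effective=$(( requested / 256 * 256 ))

echo "$effective"   # 9984
```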
2019-02-21 15:03:30 -08:00
Peter Hornyack 57ca6f007e Enable OpenSSH on Windows nodes in test clusters.
Also switches to the most recent 64-bit version of OpenSSH for Windows.

Tested:
PROJECT=${CLOUDSDK_CORE_PROJECT} KUBERNETES_SKIP_CONFIRM=y NUM_NODES=2 \
NUM_WINDOWS_NODES=2 KUBE_GCE_ENABLE_IP_ALIASES=true TEST_CLUSTER=true \
./cluster/kube-up.sh
2019-02-21 14:03:43 -08:00
Jeff Grafton b360f95eb3 cleanup: we always need to download client and server tarballs 2019-02-21 13:17:58 -08:00
Jeff Grafton 56949c7834 Support split test tarballs in get-kube-binaries.sh 2019-02-21 13:17:58 -08:00
Peter Hornyack 6d78f2b666 Default to Windows Server version 1809 for Windows nodes.
Removes all references to 1803, including moving "win1803" directory to
just "windows". A single Windows directory suffices for now, if
necessary in the future we can shard it into directories for each
Windows version.

We've been running tests with Windows 1809 nodes for a couple days in
our fork without major problems:
https://testgrid.k8s.io/google-windows#windows-prototype&width=20.
Testing on Azure is already using 1809:
https://testgrid.k8s.io/sig-windows#Conformance%20acs-engine%20on%20Azure&width=20.
2019-02-21 09:44:44 -08:00
Kubernetes Prow Robot f1de0b557c
Merge pull request #74324 from mtaufen/fix-windows
Fix hash if statement
2019-02-20 23:57:18 -08:00
xichengliudui 053332ad46 Fix shellcheck lint errors in cluster/addons/fluentd-elasticsearch/fluentd-es-image/run.sh
update pull request

update pull request

update pull request

update pull request

update pull request
2019-02-21 02:00:48 -05:00
Kubernetes Prow Robot 6c1f2077e5
Merge pull request #74192 from xichengliudui/fixshellcheck190218
make more of the shell pass lints
2019-02-20 21:41:25 -08:00
Kubernetes Prow Robot 054a676141
Merge pull request #74142 from javier-b-perez/master
GCE config.sh script should use headers for curl GCS apis
2019-02-20 21:41:12 -08:00
Michael Taufen cf3ad9c655 Fix hash if statement 2019-02-20 16:56:00 -08:00
Kubernetes Prow Robot f04ce297d6
Merge pull request #74100 from mtaufen/file-download-improvements
Retry downloads, respect URL list, validate tar hash
2019-02-20 11:34:06 -08:00
Michael Taufen 7ffe810f1d Retry downloads, respect URL list, validate tar hash 2019-02-20 08:52:46 -08:00
Kubernetes Prow Robot f5989303b7
Merge pull request #74060 from SataQiu/fix-shellcheck-20190214
Fix shellcheck failures on kube-down.sh kubeadm.sh get-build.sh
2019-02-19 21:41:17 -08:00
Kubernetes Prow Robot db7d930aab
Merge pull request #74109 from pjh/gce-windows-cluster-smoke-test
Detect ready pods correctly and untaint Windows nodes in smoke-test.
2019-02-19 19:57:40 -08:00
Kubernetes Prow Robot 7b203c6809
Merge pull request #74137 from rajansandeep/readinessprobe
Add readinessProbe to CoreDNS
2019-02-19 16:24:04 -08:00
Kubernetes Prow Robot cbf45eea13
Merge pull request #74138 from rramkumar1/ingress-docs-fix
Update docs for Ingress-GCE related cluster addon
2019-02-19 15:05:50 -08:00
Peter Hornyack 8a7fe26888 Detect ready pods correctly and untaint Windows nodes in smoke-test. 2019-02-19 11:48:54 -08:00
Sandeep Rajan 37c3d68a91 Add readinessProbe 2019-02-19 10:14:12 -05:00
Kubernetes Prow Robot 16cbb6b965
Merge pull request #73848 from krzysied/logexporter_custom_fix
Handling for use_custom_instance_list in dump_nodes_with_logexporter
2019-02-18 15:23:48 -08:00
xichengliudui b83b83f8e3 aaa 2019-02-18 17:50:55 -05:00
Rohit Ramkumar a50752ceb7 Update docs for Ingress-GCE related cluster addon 2019-02-18 13:17:01 -08:00
André Bauer d82d5fda35 updated kibana to 6.6.0
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-18 11:00:02 +01:00
André Bauer fa859e4644 Merge branch 'master' into kibana 2019-02-18 10:58:49 +01:00
Kubernetes Prow Robot 92e0c231fa
Merge pull request #74107 from truongnh1992/adding-shebang
Adding '/usr/bin/env bash'
2019-02-18 00:23:35 -08:00
Kubernetes Prow Robot cdbc9d8567
Merge pull request #73686 from ShivangGoswami/master
Cleanup: Audit log and error capitalization
2019-02-17 23:08:39 -08:00
xichengliudui fd044c5770 make more of the shell pass lints 2019-02-18 00:42:46 -05:00
Kubernetes Prow Robot 5aaf6cd61b
Merge pull request #72386 from logicalhan/readiness
fail faster for readiness checks
2019-02-15 18:02:30 -08:00
Kubernetes Prow Robot a22763b24e
Merge pull request #74063 from huynq0911/fix_wrong_format_yaml_influxdb
Fix incorrect influxdb yaml file
2019-02-15 16:46:18 -08:00
Javier Pérez Hernández 708d7df561 k8s: gci: configure: add token to curl calls
Modify script to use curl to get metadata and Service Account token.
When the SA doesn't have 'Storage Read' scope, it can only read
public files.
2019-02-15 11:00:35 -08:00
Ben Moss 34ac4d9ee9 Update deprecated links 2019-02-15 09:13:07 -05:00
Kubernetes Prow Robot 808f2cf0ef
Merge pull request #72525 from justinsb/owners_should_not_be_executable
Remove executable file permission from OWNERS files
2019-02-14 23:55:45 -08:00
Nguyen Hai Truong 848da13359 Adding '/usr/bin/env bash'
This commit adds '/usr/bin/env bash' as a shebang line
to indicate that the scripts use the bash shell as their interpreter.

Signed-off-by: Nguyen Hai Truong <truongnh@vn.fujitsu.com>
2019-02-14 19:18:49 -08:00
Nguyen Quang Huy ac8466444c Fix incorrect influxdb yaml file
Remove redundant attribute in container declaration
2019-02-14 14:26:05 +07:00
SataQiu 78e272a927 fix shellcheck failures on kube-down.sh kubeadm.sh get-build.sh 2019-02-14 15:12:08 +08:00
Kubernetes Prow Robot 6f6f8bdfcc
Merge pull request #73822 from justinsb/renamed_set_node_image
e2e upgrade test: rename functions to match f0f7829934
2019-02-13 19:06:20 -08:00
Kubernetes Prow Robot b9072c3fe7
Merge pull request #73984 from yujuhong/find-node-tar
Add path in find-tar for locating node binary tarball
2019-02-13 14:32:26 -08:00
Blake e51c9025ac Lowers the default nodelocaldns denial cache TTL
Similar to `--no-negcache` on dnsmasq, this prevents problems for workloads that poll DNS for orchestration, such as operators with StatefulSets. Negative caching can also be very confusing for users when a change they just made seems to be broken until the cache expires. This assumes that 5 seconds is reasonable and will still catch repeated AAAA negative responses. We could also set the denial cache size to zero, which should effectively disable it entirely, like dnsmasq in kube-dns, but testing shows this approach seems to work well in our (albeit small) test clusters.
2019-02-13 13:23:53 -08:00
Kubernetes Prow Robot 812016fa23
Merge pull request #73818 from gnufied/add-e2e-for-no-expand
add e2e test when expansion is disabled
2019-02-13 12:16:09 -08:00
Kubernetes Prow Robot aefd281143
Merge pull request #73983 from ixdy/update-bazel-deps
Update repo-infra, bazel-skylib, and rules_docker dependencies
2019-02-13 11:00:08 -08:00
Hemant Kumar 7d823e5df0 also remove slow tag from resizing tests 2019-02-13 10:48:49 -05:00
Kubernetes Prow Robot 40c91a0951
Merge pull request #74001 from mm4tt/logexporter_version_bump
Start using new version of logexporter.
2019-02-13 04:18:38 -08:00
Matt Matejczyk d7d46013cb Start using new version of logexporter. 2019-02-13 08:52:04 +01:00
Yu-Ju Hong 0a68740d08 Add path in find-tar for locating node binary tarball 2019-02-12 18:15:17 -08:00
Jeff Grafton e216995ef1 Update repo-infra, bazel-skylib, rules_docker, and rules_go dependencies
Also require bazel 0.18.0+
2019-02-12 17:55:10 -08:00
Peter Hornyack 063725230d Delete RDP firewall rule during kube-down.
This rule is only created if the cluster has one or more Windows nodes,
but delete-firewall-rules() ignores firewall rule arguments that do not
exist so it's safe to always attempt to delete this rule.
2019-02-12 15:19:16 -08:00
Kubernetes Prow Robot aa00afe231
Merge pull request #73649 from ojmhetar/coredns-priorityclass
Add priority class to CoreDNS pods
2019-02-11 22:55:45 -08:00
Jiaying Zhang 52e92ab4b9 Update nvidia-gpu-device-plugin addon.
This includes changes from GoogleCloudPlatform/container-engine-accelerators#102
2019-02-11 15:52:33 -08:00
Yu-Ju Hong 1bba009689 Fix setting NODE_ARCH and NODE_PLATFORM 2019-02-11 13:55:44 -08:00
Kubernetes Prow Robot b50c643be0
Merge pull request #73540 from rlenferink/patch-5
Updated OWNERS files to include link to docs
2019-02-08 09:05:56 -08:00
Krzysztof Siedlecki bc42602024 adding handling for use_custom_instance_list in dump_nodes_with_logexporter 2019-02-08 14:02:06 +01:00
patc 0e219f4caa boilerplate fix 2019-02-07 21:12:46 -08:00
Kubernetes Prow Robot a7966022d7
Merge pull request #73714 from tallclair/debian-base
Bump debian-* base image versions to latest
2019-02-07 19:44:44 -08:00
Justin SB 28c34d8d71
e2e upgrade test: rename functions to match f0f7829934
These functions were renamed in
f0f7829934, but cluster/gce/upgrade.sh
was missed.
2019-02-07 13:27:34 -05:00
Kubernetes Prow Robot a0aa22226e
Merge pull request #73813 from wojtek-t/fix_kubemark_nodes
Fix kubemark windows num nodes
2019-02-07 07:12:41 -08:00
wojtekt 482e307406 Fix kubemark windows num nodes 2019-02-07 14:14:41 +01:00
Benjamin Elder bccf95b10c add bentheelder to reviewers 2019-02-06 16:26:51 -08:00
Matt Matejczyk 5e6171790b Propagate dump_systemd_journal to logexporter job.
Log exporter changes have been made in
https://github.com/kubernetes/test-infra/pull/11121 and new version has
been pushed in https://github.com/kubernetes/test-infra/pull/11149
2019-02-06 15:49:29 +01:00
Jorge O. Castro 522a34f1d7 Remove inactive maintainers
Signed-off-by: Jorge O. Castro <jorgec@vmware.com>
2019-02-05 09:45:39 -05:00
Matt Matejczyk 4dcc017d7a Propagate API_SERVER_TEST_LOG_LEVEL in cluster/gce/util.sh 2019-02-05 10:30:17 +01:00
Roy Lenferink b43c04452f Updated OWNERS files to include link to docs 2019-02-04 22:33:12 +01:00
Tim Allclair 1b9d0c1094 Bump debian-* base image versions to latest 2019-02-04 11:58:51 -08:00
ShivangGoswami 2866b1a3b8 Cleanup: Audit log and error capitalization 2019-02-04 10:59:34 +05:30
Matt Matejczyk 35543f8989 Allow dumping full systemd journal in log-dump.sh.
The feature is gated behind a newly introduced 'dump-systemd-journal' flag.
We want to dump the full systemd journal in our scalability performance tests.
2019-02-03 21:28:37 +01:00
Ojas M bb9c865d47 Add priority class to CoreDNS pods 2019-02-01 12:56:48 -08:00
Kubernetes Prow Robot 556c0b8593
Merge pull request #73627 from mborsz/heapster
Fix typo in create-heapster-node
2019-02-01 07:44:48 -08:00
Kubernetes Prow Robot b1eae78fc7
Merge pull request #73610 from rajansandeep/coredns1.3.1
Bump CoreDNS version to 1.3.1
2019-02-01 06:32:11 -08:00
Maciej Borsz afeabe99b7 Fix typo in create-heapster-node 2019-02-01 11:04:23 +01:00
Kubernetes Prow Robot 244795a06a
Merge pull request #73442 from pjh/gce-windows-cluster
Update cluster/gce/ scripts to support Windows nodes.
2019-01-31 18:20:37 -08:00
Sandeep Rajan aa11f6c298 Bump CoreDNS version to 1.3.1 2019-01-31 16:55:02 -05:00
Kubernetes Prow Robot 46fc4dd96a
Merge pull request #73577 from wojtek-t/remove_vmodule
Remove vmodule from apiserver to reduce lock contention
2019-01-31 04:59:42 -08:00
wojtekt 3032d92736 Remove vmodule from apiserver to reduce lock contention 2019-01-31 10:53:44 +01:00
Kubernetes Prow Robot 38d12df696
Merge pull request #73450 from liggitt/update-cluster-proportional-autoscaler
Update cluster-proportional-autoscaler to 1.4.0 to use apps/v1
2019-01-30 20:25:52 -08:00
Peter Hornyack f0f7829934 Update cluster/gce scripts to support Windows nodes. 2019-01-30 19:49:07 -08:00
Kubernetes Prow Robot 16000e3af7
Merge pull request #73220 from yujuhong/gce-os-labels
GCE: update addon DaemonSets to select node OS
2019-01-29 18:22:21 -08:00
Kubernetes Prow Robot c501d464af
Merge pull request #73267 from rajansandeep/proxytoforward
Use forward plugin instead of proxy plugin in the default configuration of CoreDNS
2019-01-29 11:32:28 -08:00
Matt Matejczyk 12904ac911 Tune suggested master disk sizes for big clusters.
Looks like the get-master-root-disk-size() and get-master-disk-size() functions didn't anticipate
clusters bigger than 2K nodes.

In https://github.com/kubernetes/kubernetes/issues/72976 we found out
that 100GB may not be enough for large clusters (5K nodes) when it comes
to master root disk size.

Updating both get-master-root-disk-size() and get-master-disk-size() to
make them consistent and match cluster sizes with get-master-size()
function.
2019-01-29 10:48:08 +01:00
patc 64918cc2b4 update makefiles to use cloud build and update README 2019-01-28 22:44:44 -08:00
Jordan Liggitt 01434acc66 Update cluster-proportional-autoscaler to 1.4.0 to use apps/v1 2019-01-29 00:45:21 -05:00
Kubernetes Prow Robot d54716338a
Merge pull request #73302 from rajansandeep/prometheusport
Add metrics port to CoreDNS service
2019-01-25 12:04:37 -08:00
Sandeep Rajan e9220dd408 add prometheus port 2019-01-25 11:29:03 +05:30
Kubernetes Prow Robot 7d91e1223d
Merge pull request #73293 from prameshj/nodelocal-update
Modify nodelocaldns yaml to include updateStrategy and xtables.lock
2019-01-24 20:07:57 -08:00
Kubernetes Prow Robot 0210c0d869
Merge pull request #73253 from phenixblue/improve-syntax-for-shell-script
Updates to eliminate shellcheck failures - kubernetes/cluster/centos
2019-01-24 17:46:43 -08:00
Kubernetes Prow Robot d306d68a36
Merge pull request #72978 from liggitt/addon-changelog
Update addon-manager changelog/image
2019-01-24 16:30:00 -08:00
Pavithra Ramesh 9611b112eb Add xtables.lock file to mounts
nodelocaldns needs to use the same iptables lock file to avoid
contention with other entities modifying iptables rules.
2019-01-24 13:21:06 -08:00
Joe Searcy e72a683f82 Updates to eliminate shellcheck failures in #72956
Extend quotes to full filepath strings

Fixes for reviewers comments
2019-01-24 15:57:26 -05:00
Kubernetes Prow Robot c21f60f862
Merge pull request #73268 from wojtek-t/better_scale_inflights
Better scale max-inflight request limits to correlate them with master VM size
2019-01-24 10:37:25 -08:00
wojtekt d911ebbc3f Correlate max-inflight values in GCE with master VM sizes 2019-01-24 15:13:36 +01:00
Sandeep Rajan c346ae1e49 Use forward plugin instead of proxy in the default configuration 2019-01-24 17:28:55 +05:30
Kubernetes Prow Robot 9ce03717a9
Merge pull request #73229 from yujuhong/append-owner
GCE: add yujuhong to OWNERS
2019-01-24 02:56:29 -08:00
danielqsj cd5e658518 fix shellcheck in cluster/local 2019-01-24 18:06:55 +08:00
Kubernetes Prow Robot b9631a881f
Merge pull request #73231 from liggitt/add-on-arch-build
Enable multi-arch build for add-on manager
2019-01-23 16:19:35 -08:00
Jordan Liggitt af66376960 Enable multi-arch build for add-on manager 2019-01-23 18:08:09 -05:00
Yu-Ju Hong 2edd3e3f52 GCE: add yujuhong to OWNERS 2019-01-23 14:46:29 -08:00
Kubernetes Prow Robot baaaa15fdf
Merge pull request #69940 from MarcPow/master
Bind coredns containers to linux nodes to avoid Windows scheduling
2019-01-23 12:19:53 -08:00
Yu-Ju Hong 9c892243f6 GCE: update addon DaemonSets to select node OS
These DaemonSets support only Linux today, so this change updates the
specs to reflect this limitation. The labels have recently been promoted
to GA. Using the beta labels for now until the node-master version skew
problem no longer exists.
2019-01-23 09:01:40 -08:00
patc 7f873f23b9 multi stage Dockerfile builds discovery app and ES image 2019-01-23 00:01:58 -08:00
Humble Devassy Chirammal 2651c72166 This patch consists of various corrections. (#73060)
* This patch consists of various corrections.

*) Remove unwanted `else` code block
*) Adjust variable initialization.
*) Correct error strings.
*) Use recommended variable increment method.

Signed-off-by: hchiramm <hchiramm@redhat.com>

* Remove cluster/images/cluster-version-monitor from hack/.golint_failures.

Signed-off-by: Humble Chirammal <hchiramm@redhat.com>
2019-01-22 12:28:59 -08:00
Kubernetes Prow Robot b6fb1572d8
Merge pull request #72861 from BenTheElder/slightly-less-bash-fire
fix bugs in get-kube scripts
2019-01-18 18:00:28 -08:00
Pavithra Ramesh 960705ac98 Add updateStrategy section for nodelocaldns 2019-01-18 16:40:01 -08:00
Kubernetes Prow Robot 0713f29c28
Merge pull request #72062 from jpbetz/gce-logrotate-check-interval
Perform GCE master log rotation check every 5 minutes
2019-01-16 14:45:22 -08:00
Benjamin Elder 09f48a78e1 fix get-kube.sh 2019-01-16 14:29:17 -08:00
Jordan Liggitt df97217f7f Update addon-manager changelog/image 2019-01-16 13:26:39 -05:00
Kubernetes Prow Robot d95b9f16d1
Merge pull request #70144 from wenjiaswe/etcdKasTls
Add mTLS encryption between etcd and kube-apiserver in GCE
2019-01-16 02:51:06 -08:00
Jordan Liggitt 747f1f6b94 Bump kubectl in addon-manager 2019-01-15 16:36:04 -05:00
Benjamin Elder a901dd3ed3 fix get-kube-local.sh 2019-01-12 10:29:29 -08:00
Benjamin Elder b7ab09f572 fix get-kube-binaries.sh shellcheck lints 2019-01-12 09:48:32 -08:00
Kubernetes Prow Robot 7319aced08
Merge pull request #71180 from monotek/fluentd-elasticsearch
fluentd-elasticsearch - update fluentd to 1.3.3 & added filter_parser for json fields in es
2019-01-12 00:24:11 -08:00
Justin SB dd19b923b7
Remove executable file permission from OWNERS files 2019-01-11 16:42:59 -08:00
Tim Allclair 485b21e8cb Fix kube-proxy PodSecurityPolicy RoleBinding namespace 2019-01-09 17:57:15 -08:00
André Bauer 273fa998f7 updated fluentd to 1.3.3 & gem updates 2019-01-08 20:53:26 +01:00
Wenjia Zhang c17233c41c add mTLS encryption between etcd and kube-apiserver in GCE 2019-01-07 13:52:20 -08:00