8425 Commits (644632ddcfd5ceede125758d51c9aa3736763753)

Author SHA1 Message Date
Kubernetes Prow Robot 41691a031a
Merge pull request #76039 from johnSchnake/conformanceErrExit
Finish saving test results on failure
2019-04-03 11:14:49 -07:00
Yu-Ju Hong 2a37067735 GCE/Windows: disable stackdriver logging agent
The logging service could not be stopped at times, causing node startup
failures. Disable it until the issue is fixed.
2019-04-03 10:34:36 -07:00
John Schnake 4fec7c7568 Finish saving test results on failure
The conformance image should be saving its results
regardless of the results of the tests. However,
with errexit set, when ginkgo gets test failures
it exits 1 which prevents saving the results
for Sonobuoy to pick up.

Fixes: #76036
2019-04-03 08:38:43 -05:00
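
The errexit problem described in commit 4fec7c7568 can be illustrated with a minimal sketch. This is not the actual change to the conformance image; `run_tests` and `save_results` are hypothetical stand-ins, and capturing the status with `||` is just one common way to keep a script running past a failing command while errexit is set.

```shell
#!/bin/sh
set -o errexit

# Hypothetical stand-ins for the real test runner and the result-saving step.
run_tests() { return 1; }                 # simulates ginkgo exiting 1 on failures
save_results() { echo "results saved"; }

# Run the tests, always save results, then report the original exit status.
run_then_save() {
  test_status=0
  # A command on the left of `||` is exempt from errexit, so a failure here
  # no longer aborts the script before the results are saved.
  run_tests || test_status=$?
  save_results
  return "${test_status}"
}

# Call it in a context that tolerates a non-zero status.
run_then_save || echo "tests exited with $?"
```

The failure status is still propagated to the caller, so a failing run remains visible after the results have been written.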
Michelle Au d2aa8178f2 Remove alpha CRD install 2019-04-02 10:59:11 -07:00
Kubernetes Prow Robot 397f3d4c0c
Merge pull request #75856 from yuwenma/base-etcd
Rebase etcd image to k8s.gcr.io/debian-base
2019-04-01 15:48:47 -07:00
ialidzhikov ebfb92bdce Update fluentd 1.4.1
Signed-off-by: ialidzhikov <i.alidjikov@gmail.com>
2019-04-01 18:24:47 +03:00
yuwenma 529b20aadd rebase etcd to k8s debian-base 2019-03-29 18:59:55 -07:00
Peter Hornyack 1814b0c495 Disable GCE agent address management on Windows nodes.
With this metadata key set, "GCEWindowsAgent: GCE address manager
status: disabled" will appear in the VM's serial port output during
boot.

Tested:
PROJECT=${CLOUDSDK_CORE_PROJECT} KUBE_GCE_ENABLE_IP_ALIASES=true NUM_WINDOWS_NODES=2 NUM_NODES=2 KUBERNETES_NODE_PLATFORM=windows go run ./hack/e2e.go -- --up
cluster/gce/windows/smoke-test.sh

cat > iis.yaml <<EOF
apiVersion: v1
kind: Pod
metadata:
  name: iis
  labels:
    app: iis
spec:
  containers:
  - image: mcr.microsoft.com/windows/servercore/iis
    imagePullPolicy: IfNotPresent
    name: iis-server
    ports:
    - containerPort: 80
      protocol: TCP
  nodeSelector:
    beta.kubernetes.io/os: windows
  tolerations:
  - effect: NoSchedule
    key: node.kubernetes.io/os
    operator: Equal
    value: windows1809
EOF

kubectl create -f iis.yaml
kubectl expose pod iis --type=LoadBalancer --name=iis
kubectl get services
curl http://<service external IP address>
2019-03-28 17:28:18 -07:00
Kubernetes Prow Robot da018a6bfa
Merge pull request #75362 from serathius/gcp-security-patches
Update gcp images with security patches
2019-03-28 14:25:58 -07:00
Kubernetes Prow Robot 753a6edc37
Merge pull request #74616 from ialidzhikov/fluentd-1.4.0
Update fluentd to 1.4.0
2019-03-26 12:33:48 -07:00
Kubernetes Prow Robot 81d37386aa
Merge pull request #75168 from ialidzhikov/update-golang-version
Update golang to 1.12.0
2019-03-26 11:17:20 -07:00
Kubernetes Prow Robot 9e1cf75bd1
Merge pull request #75381 from rramkumar1/configure-scripts-for-ingress-yaml
Plumb CUSTOM_INGRESS_YAML into other setup scripts
2019-03-25 12:10:33 -07:00
Kubernetes Prow Robot 657951c3ed
Merge pull request #74889 from akutz/feature/get-kube-binaries-client-choice
Allow choice of os/arch when downloading client
2019-03-21 22:50:57 -07:00
Kubernetes Prow Robot 8af1bf313e
Merge pull request #75317 from jihoon-seo/patch-1
Update cluster/images/hyperkube/README.md
2019-03-20 20:35:37 -07:00
Ted Yu 793fd74a42 Use idiomatic Go to replace loop 2019-03-20 11:40:26 -07:00
Kubernetes Prow Robot 66bb1215c7
Merge pull request #75310 from chi-xu/20190313
Correct a spelling error
2019-03-20 09:53:11 -07:00
Kubernetes Prow Robot 8b19a24974
Merge pull request #75153 from ymlong/fix-20193838
Correcting spelling errors
2019-03-20 00:54:48 -07:00
ialidzhikov db6add318a Update fluentd to 1.4.0
Signed-off-by: ialidzhikov <i.alidjikov@gmail.com>
2019-03-20 08:48:26 +02:00
Kubernetes Prow Robot aa5fda22f2
Merge pull request #75269 from javier-b-perez/gce-configure
gce: configure: validate SA has storage scope
2019-03-19 11:06:22 -07:00
Łukasz Osipiuk ff18fbcebb Update Cluster Autoscaler version to 1.14.0
No changes since 1.14.0-beta.2
Changelog: https://github.com/kubernetes/autoscaler/releases/tag/cluster-autoscaler-1.14.0
2019-03-19 16:15:20 +01:00
Dan Williams bfd8ad3268 build/gci: bump CNI version to 0.7.5 2019-03-18 17:16:42 -05:00
Kubernetes Prow Robot 28910bf2d7
Merge pull request #75387 from pjh/remove-use-release-node-binaries
Remove reference to USE_RELEASE_NODE_BINARIES.
2019-03-18 09:52:26 -07:00
Rohit Ramkumar 8ac15cd54a Plumb CUSTOM_INGRESS_YAML into other setup scripts so that it can be set before a call to kubetest 2019-03-15 13:36:30 -07:00
Łukasz Osipiuk e4663f73d3 Update Cluster Autoscaler version to 1.14.0-beta.2 2019-03-15 12:01:47 +01:00
Marek Siarkowicz 9e9b906047 Update gcp images with security patches
[stackdriver addon] Bump prometheus-to-sd to v0.5.0 to pick up security fixes.
[fluentd-gcp addon] Bump fluentd-gcp-scaler to v0.5.1 to pick up security fixes.
[fluentd-gcp addon] Bump event-exporter to v0.2.4 to pick up security fixes.
[fluentd-gcp addon] Bump prometheus-to-sd to v0.5.0 to pick up security fixes.
[metadata-proxy addon] Bump prometheus-to-sd to v0.5.0 to pick up security fixes.
2019-03-15 09:24:32 +01:00
Peter Hornyack 7f364b1c5b Remove reference to USE_RELEASE_NODE_BINARIES.
This variable was used for development purposes and was accidentally
introduced in
f0f7829934.

This is its only use in the tree:
https://github.com/kubernetes/kubernetes/search?q=USE_RELEASE_NODE_BINARIES&unscoped_q=USE_RELEASE_NODE_BINARIES
2019-03-14 14:50:54 -07:00
Jihoon Seo f5a66f8c13
Update cluster/images/hyperkube/README.md
Add missing period between 'components' and `hyperkube`.
Add line separation between 2 sentences.
2019-03-13 12:01:30 +09:00
xuchi d0e15aed2c Correct a spelling error 2019-03-13 09:39:13 +08:00
Kubernetes Prow Robot a213886f9c
Merge pull request #74502 from Random-Liu/collect-pod-log
Collect pod log in fluentd-gcp
2019-03-12 01:34:37 -07:00
Javier Pérez Hernández e783f0bcd4 gce: configure: validate SA has storage scope
If the VM SA doesn't have storage scope associated, don't use the
token in the curl request or the request will fail with 403.
2019-03-11 12:04:01 -07:00
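
The scope check described in commit e783f0bcd4 could look roughly like the sketch below. The function names and the `auth/devstorage` match are assumptions made for illustration, not the contents of the GCE configure script. The scope list and token are assumed to have been fetched elsewhere, for example from the instance metadata server.

```shell
#!/bin/sh
# Returns 0 if the newline-separated scope list on stdin contains a storage scope.
# Matching on "auth/devstorage" is an assumption about what counts as storage scope.
has_storage_scope() {
  grep -q "auth/devstorage"
}

# Prints an Authorization header only when the service account can use it.
# $1: scope list, $2: access token (both hypothetical inputs).
auth_header_for() {
  if printf '%s\n' "$1" | has_storage_scope; then
    printf 'Authorization: Bearer %s\n' "$2"
  fi
}
```

A caller would pass the printed header to `curl -H` only when it is non-empty, and fall back to an unauthenticated request otherwise, which avoids the 403 described in the commit message.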
Kubernetes Prow Robot 6ec5a7d337
Merge pull request #75181 from losipiuk/lo/master-ca-1.14-beta.1
Update Cluster Autoscaler version to 1.14.0-beta.1
2019-03-11 07:15:32 -07:00
Kubernetes Prow Robot 8dd09e0b36
Merge pull request #75208 from pjh/gce-windows-readme-update
Update GCE Windows README steps for running e2e tests.
2019-03-10 21:15:26 -07:00
Lantao Liu 7ffa7da112 Configure logrotate for pod logs. 2019-03-08 16:45:41 -08:00
Kubernetes Prow Robot c534a1b9b3
Merge pull request #75149 from yguo0905/m73
GCE: bump COS image version to cos-beta-73-11647-64-0
2019-03-08 14:19:35 -08:00
Kubernetes Prow Robot d778b9308a
Merge pull request #75063 from wangzhen127/npd-test-fix
Fix NPD e2e test on Ubuntu node and update NPD container version
2019-03-08 14:19:09 -08:00
Peter Hornyack 0fbb246c30 Update GCE Windows README steps for running e2e tests. 2019-03-08 13:47:41 -08:00
Łukasz Osipiuk 2790d9151a Update Cluster Autoscaler version to 1.14.0-beta.1 2019-03-08 14:01:55 +01:00
ialidzhikov c72115dede Update golang to 1.12.0
Signed-off-by: ialidzhikov <i.alidjikov@gmail.com>
2019-03-08 11:45:46 +02:00
Kubernetes Prow Robot cf439192fb
Merge pull request #75139 from mtaufen/windows-kube-proxy-args
Windows kubeproxy flags cleanup
2019-03-08 00:53:28 -08:00
Kubernetes Prow Robot 6a86f1576f
Merge pull request #75014 from mtaufen/windows-logrotate
Windows log rotation
2019-03-07 22:43:41 -08:00
Kubernetes Prow Robot 183e6cc190
Merge pull request #74433 from tallclair/runtimeclass-internal
Migrate RuntimeClass from a CRD to an internal API
2019-03-07 21:19:30 -08:00
ymlong 8e04d61488 Correcting spelling errors 2019-03-08 10:19:13 +08:00
Yang Guo eef545abfd GCE: bump COS image version to cos-beta-73-11647-64-0 2019-03-07 16:11:02 -08:00
Michael Taufen 0d59d2a43e Windows kubeproxy flags cleanup
- Pass most of the flags via kube-env, similar to Kubelet, to make it
  easier to reconfigure.
- Use kube-proxy's kubeconfig file to pass the API server address,
  rather than the --master flag.
- Make KUBEPROXY_ARGS a required variable in Windows kube-env, similar
  to KUBELET_ARGS.
2019-03-07 14:45:34 -08:00
Tim Allclair 63f61a6714 Migrate RuntimeClass to internal API 2019-03-07 11:07:54 -08:00
Michael Taufen 0a21d644f5 Rotate windows logs 2019-03-07 10:42:31 -08:00
Davanum Srinivas 200af47128
Stop waiting on conformance image pod when it fails
Change-Id: I516833167f8e0aaa0e7962c1d6bdbe2ca422dc6e
2019-03-07 11:03:05 -05:00
Kubernetes Prow Robot 57da888ff4
Merge pull request #75055 from pjh/gce-windows-disable-defender-again
Disable Windows Defender in Windows nodes on GCE again.
2019-03-07 01:07:26 -08:00
Zhen Wang f4d9e7d992 Fix NPD e2e test on Ubuntu node and update NPD container version 2019-03-06 22:42:47 -08:00
Kubernetes Prow Robot bd8eeaaaaf
Merge pull request #74468 from ixdy/bazel-cgo-crossbuild
bazel: improve multi-arch support for docker image artifacts
2019-03-06 20:35:11 -08:00
Kubernetes Prow Robot a60d44848f
Merge pull request #74748 from pjh/gce-windows-stackdriver
Install Stackdriver agent in Windows nodes on GCE.
2019-03-06 17:59:39 -08:00
Kubernetes Prow Robot ab7a48d796
Merge pull request #70036 from pbarker/audit-etoe
dynamic audit e2e test
2019-03-06 17:58:58 -08:00
Jeff Grafton 8495f7c9ed bazel: add docker_push rules 2019-03-06 12:35:33 -08:00
Peter Hornyack 80c635880d Disable Windows Defender in Windows nodes on GCE again.
This reverts commit fbf4fe4714. Windows
Defender seems to be causing our Windows nodes to crash and reboot
during e2e tests, e.g.
https://gcsweb.k8s.io/gcs/kubernetes-jenkins/logs/ci-kubernetes-e2e-windows-gce/228/artifacts/e2e-228-36623-windows-node-group-q4.
2019-03-06 11:42:03 -08:00
Jeff Grafton d7c2c2a606 bazel: refactor multi-arch container builds into starlark 2019-03-06 11:40:33 -08:00
Kubernetes Prow Robot 45e5f6053b
Merge pull request #74424 from liggitt/drop-k8s-io-node-labels
Clean up self-set node labels
2019-03-06 08:24:26 -08:00
Kubernetes Prow Robot 5a89793f8e
Merge pull request #74612 from jingxu97/windows
Add GcePD windows support
2019-03-05 21:42:22 -08:00
Peter Hornyack 197513141e Install Stackdriver agent in Windows test clusters. 2019-03-05 15:14:07 -08:00
Patrick Barker 67a245ae61 remove unneeded println 2019-03-05 14:38:58 -07:00
Patrick Barker 45d715cdc6 adds dynamic audit e2e test 2019-03-05 14:38:58 -07:00
Kubernetes Prow Robot 365c504e2b
Merge pull request #74928 from pjh/gce-windows-reenable-defender
Don't disable Windows Defender.
2019-03-05 11:10:25 -08:00
Kubernetes Prow Robot a352b74bcc
Merge pull request #74915 from jpbetz/revert-72062-2
revert 72062: Perform GCE master log rotation check every 5 minutes
2019-03-05 09:50:55 -08:00
Kubernetes Prow Robot b4e692b68f
Merge pull request #74854 from neolit123/conformance-nodes
conformance/run_e2e.sh: pass shellcheck and autodetect ginkgo nodes
2019-03-05 09:50:12 -08:00
Kubernetes Prow Robot 95cd1d59e4
Merge pull request #74209 from monotek/fluentd-helm-readme
added production note about EFK stack to the readme
2019-03-04 17:55:12 -08:00
Peter Hornyack fbf4fe4714 Don't disable Windows Defender. 2019-03-04 16:34:21 -08:00
Lubomir I. Ivanov fe2301f17d conformance/run_e2e.sh: pass shellcheck and autodetect ginkgo nodes
- for ginkgo parallel mode pass -p to autodetect ginkgo nodes.
- disable parallel mode by default (false).
- use provider "skeleton" instead of "local".
- make run_e2e.sh pass shellcheck.
2019-03-05 01:32:57 +02:00
Joe Betz add956f00f Revert "Perform GCE log rotation check every 5 minutes"
This reverts commit 1ba05d51a6.
2019-03-04 14:10:20 -08:00
Kubernetes Prow Robot 79e8a29544
Merge pull request #74852 from dims/expand-reviewers-approvers-add-label
Expand reviewers/approvers and add labels for conformance image
2019-03-04 13:25:50 -08:00
akutz 020dc47073 Allow choice of os/arch when downloading client
This patch introduces `KUBERNETES_CLIENT_OS` and
`KUBERNETES_CLIENT_ARCH` to the script `cluster/get-kube-binaries.sh` in
order to download a client that is not the OS/Arch of the host on which
the script is executing.
2019-03-04 10:23:04 -06:00
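
How the two variables from commit 020dc47073 might take precedence over host detection is sketched below. Only `KUBERNETES_CLIENT_OS` and `KUBERNETES_CLIENT_ARCH` come from the commit message; the function name and the `uname` mapping are assumptions, not the logic of `cluster/get-kube-binaries.sh`.

```shell
#!/bin/sh
# Sketch only: the detection and mapping below are illustrative assumptions.
resolve_client_platform() {
  host_os="$(uname -s | tr '[:upper:]' '[:lower:]')"
  host_arch="$(uname -m)"
  # Map a couple of common `uname -m` values to release-style names.
  case "${host_arch}" in
    x86_64) host_arch="amd64" ;;
    aarch64) host_arch="arm64" ;;
  esac
  # Explicit overrides win over the detected host values.
  echo "${KUBERNETES_CLIENT_OS:-${host_os}}/${KUBERNETES_CLIENT_ARCH:-${host_arch}}"
}
```

With this shape, setting `KUBERNETES_CLIENT_OS=windows KUBERNETES_CLIENT_ARCH=amd64` on a Linux host would select a Windows client, which is the use case the commit describes.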
Jeff Grafton c98b3edb88 bazel: make conformance and hyperkube images multiarch 2019-03-03 21:09:22 -08:00
Davanum Srinivas b3adf5bd0d
Expand reviewers/approvers and add labels for conformance image
Change-Id: I6e13d262a8bf97ada89227a853debe58876c212a
2019-03-02 13:46:27 -05:00
Davanum Srinivas 87e87dd25b
Apparently conformance tests have to run serially
There are 10+ conformance tests marked [Serial], so we can't run things
in parallel by default.

Change-Id: I58afbb356899fb38caf4f9f6704bfb47f745b338
2019-03-02 13:30:00 -05:00
Kubernetes Prow Robot e001276b1d
Merge pull request #74836 from yujuhong/update-win-cni-config
GCE/Windows: exclude gateways from the IPAM subnet
2019-03-01 20:35:03 -08:00
Kubernetes Prow Robot 3bb22edef8
Merge pull request #74808 from dims/showcase-the-available-options-in-manifest
Showcase conformance images options in the manifest
2019-03-01 20:34:53 -08:00
Kubernetes Prow Robot 738d29b96c
Merge pull request #74767 from yuwenma/rebase-image
Rebase etcd-version-monitor base image to distroless.
2019-03-01 19:14:24 -08:00
Aaron Crickenberger 388a345efc Use no skip regex for conformance 2019-03-01 16:35:27 -08:00
Yu-Ju Hong 1f86bce95e GCE/Windows: exclude gateways from the IPAM subnet
Exclude IPs assigned for gateways in the IPAM configuration.
2019-03-01 16:35:26 -08:00
Kubernetes Prow Robot b8e05af7bf
Merge pull request #74736 from mtaufen/windows-kube-proxy-config
Stop setting --hostname-override on Windows kube-proxy and allow args to be extended remotely if necessary
2019-03-01 16:12:23 -08:00
Kubernetes Prow Robot 4bafef2234
Merge pull request #74761 from dims/better-detection-of-binaries-for-conformance-image
Better detection of binaries for conformance image
2019-03-01 14:09:08 -08:00
Davanum Srinivas da566ad647
Showcase conformance images options in the manifest
The Dockerfile has a few more options, let's surface that in the
manifest. Also use the same defaults in the manifest as the Dockerfile
itself.

Change-Id: Ib7419cf7999430db15f39ac414c80ee362fcda76
2019-03-01 16:39:28 -05:00
Kubernetes Prow Robot 98c0d15f98
Merge pull request #74762 from pjh/gce-windows-dump-versions
Dump Windows version information during cluster bringup.
2019-03-01 01:08:36 -08:00
Kubernetes Prow Robot 824883d08a
Merge pull request #74754 from pjh/gce-windows-smoke-timeout
Increase Windows smoke-test timeout to 3 minutes.
2019-02-28 19:12:18 -08:00
Davanum Srinivas 9d7d682760
Better detection of binaries for conformance image
Change-Id: Id33a0827e8cb1e3369544eea3335c80c89780786
2019-02-28 22:00:28 -05:00
Peter Hornyack 18a2a98d67 Dump Windows version information during cluster bringup. 2019-02-28 17:22:49 -08:00
Peter Hornyack 43556be50e Enhance metadata fetching functions.
Introduce Get-InstanceMetadata which can be used to fetch
non-"attribute" metadata values.
2019-02-28 17:18:34 -08:00
Yuwen Ma fd2b57926e Rebase etcd-version-monitor base image to distroless. 2019-02-28 17:06:37 -08:00
Peter Hornyack 4181422a3c Increase Windows smoke-test timeout to 3 minutes.
I recently lowered the timeout value to 2 minutes but this turned out to
be too aggressive for now. 3 minutes is usually long enough for the test
containers to come up.

Note that this test is only used during development, it does not run
continuously anywhere.
2019-02-28 14:22:35 -08:00
Kubernetes Prow Robot 4741785d29
Merge pull request #73650 from mtaufen/windows-upload-scripts
upload Windows startup scripts to GCS for CI
2019-02-28 11:51:25 -08:00
Michael Taufen a76d132162 Stop setting --hostname-override on Windows kube-proxy and allow args to be extended remotely if necessary
--hostname-override should not be necessary, we can rely on kube-proxy
to auto-detect it just like we do in Kubelet.

I also added an optional variable to kube_env, KUBEPROXY_ARGS, that we
can use to remotely extend kube-proxy's command-line if necessary.
2019-02-28 10:13:07 -08:00
John Schnake 03d0e86117 Add support for dryRun option to kube-conformance image
A common issue users run into is wanting a list of tests
a certain regexp will run, without actually running it.

ginkgo supports this with the dryRun flag but it was not
exposed via the kube-conformance image. This change
will set the flag if the E2E_DRYRUN environment variable
is set.

Fixes #74727
2019-02-28 09:21:04 -06:00
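
The behavior described in commit 03d0e86117 can be sketched as follows. `E2E_DRYRUN` and the ginkgo dryRun flag come from the commit message; the function name and the exact flag spelling `--dryRun=true` are assumptions for illustration, not a copy of the image's entrypoint script.

```shell
#!/bin/sh
# Prints the extra ginkgo argument when E2E_DRYRUN is set to a non-empty value,
# and prints nothing otherwise.
dryrun_args() {
  if [ -n "${E2E_DRYRUN:-}" ]; then
    printf '%s\n' "--dryRun=true"
  fi
}
```

A runner script could append the output of `dryrun_args` to its ginkgo command line, so that a matching regexp lists the selected tests without running them.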
Kubernetes Prow Robot 02b8056efb
Merge pull request #73288 from wangzhen127/npd-config
Decouple node-problem-detector release from kubernetes
2019-02-28 00:27:25 -08:00
Kubernetes Prow Robot ccf33be0cc
Merge pull request #73940 from jiayingz/nvidia-dp-update
Update nvidia-gpu-device-plugin addon.
2019-02-27 17:13:01 -08:00
Jing Xu 5e04d200ee Add GcePD windows support
This PR adds the support for GCEPD volume. For now, it uses a workaround
(https://github.com/pjh/gce-tools) to get disk number in windows for a given GCE PD name.
2019-02-27 17:00:32 -08:00
Kubernetes Prow Robot b2e9b2a842
Merge pull request #74608 from dims/lighter-weight-make-for-conformance-dependencies
Lighter weight make for conformance dependencies, better script and manifest
2019-02-27 07:19:55 -08:00
Davanum Srinivas 23b33f5c85
Switch to k8s.gcr.io (from staging)
Change-Id: Ib0d6f681be3537f0cbfcae1dc254f1c42a04be66
2019-02-27 08:54:45 -05:00
Davanum Srinivas 87d9903aaf
Add a script to run the conformance image and pull results
Change-Id: I1eb673fe37b5e8a719f9095473765c80fb7f2347
2019-02-27 07:38:43 -05:00
Kubernetes Prow Robot 1942c1ccb0
Merge pull request #71251 from monotek/kibana
updated kibana to 6.6.1
2019-02-26 23:40:33 -08:00
Zhen Wang efa96f7eb8 allows configuring NPD release and flags on GCI and add cluster e2e test 2019-02-26 21:21:54 -08:00
Kubernetes Prow Robot 81ec358db4
Merge pull request #74438 from pjh/gce-windows-log-dump
Support dumping logs from Windows test nodes on GCE
2019-02-26 18:12:09 -08:00
Peter Hornyack 0bb25290c8 Update log-dump.sh for Windows nodes.
Tested:
```
$ PROJECT=${CLOUDSDK_CORE_PROJECT} KUBERNETES_SKIP_CONFIRM=y NUM_NODES=2 \
  NUM_WINDOWS_NODES=2 KUBE_GCE_ENABLE_IP_ALIASES=true go run \
  ./hack/e2e.go -- --up
$ cluster/log-dump/log-dump.sh
$ ls _artifacts
```

And with: NUM_NODES=2 NUM_WINDOWS_NODES=0; NUM_NODES=0 NUM_WINDOWS_NODES=2
2019-02-26 12:10:19 -08:00
Yu-Ju Hong 093e5a50ac GCE/Windows: create a C:\tmp directory
This is required for running host path tests.
2019-02-26 10:59:06 -08:00
Kubernetes Prow Robot 7a4496429d
Merge pull request #71252 from monotek/elasticsearch
updated elasticsearch to 6.6.1
2019-02-26 09:33:44 -08:00
Jordan Liggitt e1db43ad1b Label old nodes with legacy addon labels 2019-02-26 11:43:10 -05:00
Jordan Liggitt 0174e043c5 Prepare switch from beta.kubernetes.io/masq-agent-ds-ready to node.kubernetes.io/masq-agent-ds-ready 2019-02-26 11:43:10 -05:00
Jordan Liggitt 943b32a289 Prepare switch from beta.kubernetes.io/kube-proxy-ds-ready to node.kubernetes.io/kube-proxy-ds-ready 2019-02-26 11:42:23 -05:00
Jordan Liggitt d6664a2365 Prepare switch from beta.kubernetes.io/metadata-proxy-ready to cloud.google.com/metadata-proxy-ready 2019-02-26 11:42:23 -05:00
Jordan Liggitt 8975233788 Finish migration of fluentd to daemonset 2019-02-26 11:42:23 -05:00
Davanum Srinivas 94ad1dfb11
Better manifest for running conformance image
Change-Id: I137180ed781edd4a9877cabe039e40a72aa71366
2019-02-26 10:29:12 -05:00
Kubernetes Prow Robot b8ddc7945b
Merge pull request #74522 from Pluies/master
Fix fluentd-gcp addon liveness probe
2019-02-26 06:38:24 -08:00
Davanum Srinivas 069eeb541b
Simpler make commands for ginkgo/kubectl/e2e.test
Change-Id: I78cff10231eabd53b1fc7bdd1526c861179e135a
2019-02-26 09:18:05 -05:00
André Bauer 9e2d9cfbb0 changed es image repo
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-26 09:09:21 +01:00
Kubernetes Prow Robot 3fb6e77770
Merge pull request #74549 from yujuhong/pause-image
GCE: switch to using e2eteam/pause:3.1 for pause containers
2019-02-25 23:12:00 -08:00
Kubernetes Prow Robot 0ff7e463ee
Merge pull request #73746 from mrbobbytables/kubemark-shellcheck
Fix shellcheck lint errors in Kubemark scripts
2019-02-25 17:25:13 -08:00
Michael Taufen 98edbf41bc upload Windows startup scripts to GCS for CI 2019-02-25 16:44:57 -08:00
Kubernetes Prow Robot 3814176d42
Merge pull request #74455 from SataQiu/fix-shell-2019022302
Fix shellcheck lint errors in cluster and hack scripts
2019-02-25 15:15:19 -08:00
Yu-Ju Hong b863655faa GCE: switch to using e2eteam/pause:3.1 for pause containers
Stop building pause images on node startup.
2019-02-25 14:36:49 -08:00
Kubernetes Prow Robot 2aacb77374
Merge pull request #74444 from pjh/gce-windows-no-defender
Disable Windows Defender on Windows nodes.
2019-02-25 13:54:42 -08:00
Bob Killen 9a4f4878f5
Fix shellcheck lint errors in cluster/kubemark/util.sh 2019-02-25 15:21:29 -05:00
Bob Killen 9a58913e8f
Fix shellcheck lint errors in cluster/kubemark/iks/config-default.sh 2019-02-25 15:21:25 -05:00
Bob Killen ce4c85e3fd
Fix shellcheck lint errors in cluster/kubemark/gce/config-default.sh 2019-02-25 14:55:01 -05:00
Kubernetes Prow Robot 35a258d640
Merge pull request #73272 from danielqsj/juju
fix shellcheck in cluster/juju
2019-02-25 11:33:21 -08:00
Kubernetes Prow Robot f288678cfa
Merge pull request #73261 from danielqsj/local
fix shellcheck in cluster/local
2019-02-25 11:33:11 -08:00
Florent Delannoy e627474e8f Fix fluentd-gcp addon liveness probe
Fix three issues with the fluentd-gcp liveness probe:

h1. STUCK_THRESHOLD_SECONDS was overridden by LIVENESS_THRESHOLD_SECONDS
if defined

Probably a copy/paste issue introduced in edf1ffc074

h1. `[[` is [a bashism](https://stackoverflow.com/a/47576482), and will always fail when called with `/bin/sh`

Introduced by a844523c20

Given that we call the liveness probe with `/bin/sh`, we cannot use the
double-bracketed `[[` syntax for test, as it is not POSIX-compliant and
will throw an error.

Annoyingly, even though it prints an error, `sh` returns with exit code 0
in this case:

```bash
root@fluentd-7mprs:/# sh liveness.sh
liveness.sh: 8: liveness.sh: [[: not found
liveness.sh: 15: liveness.sh: [[: not found
root@fluentd-7mprs:/# echo $?
0
```

Which means the liveness probe is considered successful by Kubernetes,
despite failing to test things as it was intended. This is also
probably the reason why this bug wasn't reported sooner :)

Thankfully, the test in this case can just as easily be written as
POSIX-compliant as it doesn't use any bash-specific features within the
`[[` block.
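
As an illustrative sketch (not the actual `liveness.sh`), a check of this kind can be written with the POSIX `[` test so that it behaves the same under `/bin/sh`. The function name and message are hypothetical.

```shell
#!/bin/sh
# Returns 1 when the given string is empty, using only POSIX test syntax.
# Quoting "$1" keeps the test well-formed even for empty input.
check_not_empty() {
  if [ -z "$1" ]; then
    echo "no recent activity found" >&2
    return 1
  fi
}
```

Because `[` is a regular command available in POSIX shells, a failing check here produces a non-zero status instead of a "not found" error that leaves the probe exiting 0.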

h1. Buffers are transient and cannot be relied upon for monitoring

Finally, after fixing the above issue, we started seeing the fluentd
containers being restarted very often, and found an issue with the
underlying logic of the liveness probe.

The probe checks that the pod is still alive by running the following
command:

`find /var/log/fluentd-buffers -type f -newer /tmp/marker-stuck -print -quit`

This checks if any _regular_ file exists under `/var/log/fluentd-buffers`
that is more recent than a predetermined time, and will return an empty
string otherwise.

The issue is that these buffers are temporary and volatile: they get created and
deleted constantly. Here is an example of running that check every second on a
running fluentd:

```
root@fluentd-eks-playground-jdc8m:/# LIVENESS_THRESHOLD_SECONDS=${LIVENESS_THRESHOLD_SECONDS:-300};
root@fluentd-eks-playground-jdc8m:/# STUCK_THRESHOLD_SECONDS=${LIVENESS_THRESHOLD_SECONDS:-900};
root@fluentd-eks-playground-jdc8m:/# touch -d "${STUCK_THRESHOLD_SECONDS} seconds ago" /tmp/marker-stuck;
root@fluentd-eks-playground-jdc8m:/# touch -d "${LIVENESS_THRESHOLD_SECONDS} seconds ago" /tmp/marker-liveness;
root@fluentd-eks-playground-jdc8m:/# while true; do date ; find /var/log/fluentd-buffers -type f -newer /tmp/marker-stuck -print -quit ; sleep 1 ; done
Fri Feb 22 10:52:57 UTC 2019
Fri Feb 22 10:52:58 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827964ccf4c7004103c3fa7c8533f85.log
Fri Feb 22 10:52:59 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827964ccf4c7004103c3fa7c8533f85.log
Fri Feb 22 10:53:00 UTC 2019
Fri Feb 22 10:53:01 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827964fb8b2eedcccd2763ea7775cc2.log
Fri Feb 22 10:53:02 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827964fb8b2eedcccd2763ea7775cc2.log
Fri Feb 22 10:53:03 UTC 2019
Fri Feb 22 10:53:04 UTC 2019
Fri Feb 22 10:53:05 UTC 2019
Fri Feb 22 10:53:06 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827965564883997b673d703af54848b.log
Fri Feb 22 10:53:07 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827965564883997b673d703af54848b.log
Fri Feb 22 10:53:08 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827965564883997b673d703af54848b.log
Fri Feb 22 10:53:09 UTC 2019
Fri Feb 22 10:53:10 UTC 2019
Fri Feb 22 10:53:11 UTC 2019
Fri Feb 22 10:53:12 UTC 2019
Fri Feb 22 10:53:13 UTC 2019
Fri Feb 22 10:53:14 UTC 2019
Fri Feb 22 10:53:15 UTC 2019
Fri Feb 22 10:53:16 UTC 2019
```

We can see buffers being created, then disappearing. The LivenessProbe running
under these conditions has a ~50% chance of failing, despite fluentd being
perfectly happy.

I believe that check is probably ok for fluentd installs using large
amounts of buffers, in which case the liveness probe will be correct more
often than not, but fluentd installs that use buffering less intensively
will be negatively impacted by this.

My solution to fix this is to check the last updated time of buffering
_folders_ within `/var/log/fluentd-buffers`. These _do_ get updated when
buffers are created, and do not get deleted as buffers are emptied,
making them the perfect candidate for our use.

Here's an example using `-type d` to match directories:
```
root@fluentd-eks-playground-jdc8m:/# while true; do date ; find /var/log/fluentd-buffers -type d -newer /tmp/marker-stuck -print -quit ; sleep 1 ; done
Fri Feb 22 10:57:51 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:52 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:53 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:54 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:55 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:56 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:57 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:58 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:59 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:58:00 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:58:01 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:58:02 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:58:03 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
```

An example of the directory being updated as new buffers come in:
```
root@fluentd-eks-playground-jdc8m:/# ls -lah /var/log/fluentd-buffers/kubernetes.system.buffer
total 0
drwxr-xr-x 2 root root  6 Feb 22 11:17 .
drwxr-xr-x 3 root root 38 Feb 22 11:14 ..
root@fluentd-eks-playground-jdc8m:/# ls -lah /var/log/fluentd-buffers/kubernetes.system.buffer
total 16K
drwxr-xr-x 2 root root  224 Feb 22 11:18 .
drwxr-xr-x 3 root root   38 Feb 22 11:14 ..
-rw-r--r-- 1 root root 1.8K Feb 22 11:18 buffer.b58279be6e21e8b29fc333a7d50096ed0.log
-rw-r--r-- 1 root root  215 Feb 22 11:18 buffer.b58279be6e21e8b29fc333a7d50096ed0.log.meta
-rw-r--r-- 1 root root  429 Feb 22 11:18 buffer.b58279be6f09bdfe047a96486a525ece2.log
-rw-r--r-- 1 root root  195 Feb 22 11:18 buffer.b58279be6f09bdfe047a96486a525ece2.log.meta
root@fluentd-eks-playground-jdc8m:/# ls -lah /var/log/fluentd-buffers/kubernetes.system.buffer
total 0
drwxr-xr-x 2 root root  6 Feb 22 11:18 .
drwxr-xr-x 3 root root 38 Feb 22 11:14 ..
```
2019-02-25 11:48:31 +00:00
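
The directory-based check proposed in commit e627474e8f can be sketched as a small function. This is an illustration under stated assumptions rather than the probe shipped in the addon: the function name is hypothetical, and it relies on GNU `touch -d` and `find -quit`, as the transcript in the commit message does.

```shell
#!/bin/sh
# Returns 0 if any directory under $1 was modified within the last $2 seconds.
dir_recently_updated() {
  buffer_root="$1"
  threshold_seconds="$2"
  marker="$(mktemp)"
  # Backdate the marker file so that `-newer` means "modified after the cutoff".
  touch -d "${threshold_seconds} seconds ago" "${marker}"
  # Match directories rather than the short-lived buffer files inside them.
  found="$(find "${buffer_root}" -type d -newer "${marker}" -print -quit)"
  rm -f "${marker}"
  [ -n "${found}" ]
}
```

A probe built this way would call something like `dir_recently_updated /var/log/fluentd-buffers 900` and fail only when no buffer directory has changed within the threshold.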
André Bauer 2bd6d3dc12 use image version 6.6.1
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-25 11:05:52 +01:00
André Bauer 2d15ffc9cc updated to 6.5.2
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-25 10:56:50 +01:00
André Bauer 0c29ea1a2e Update es-statefulset.yaml 2019-02-25 10:55:23 +01:00
André Bauer 53a936c359 Update Makefile 2019-02-25 10:55:23 +01:00
André Bauer 0e44fa6359 updated elasticsearch to 6.5.0 2019-02-25 10:55:23 +01:00
André Bauer fc850b5ecd fixed wording
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-25 10:49:43 +01:00
André Bauer 421fcd8262 added production note to readme
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-25 10:47:26 +01:00
danielqsj 7d051e1a75 update juju shell 2019-02-24 20:46:20 +08:00
danielqsj 7e655e8666 fix shellcheck in cluster/juju 2019-02-24 20:40:59 +08:00
danielqsj f02a986081 add comments to shell function 2019-02-24 20:35:46 +08:00
SataQiu 9cda80e836 fix shellcheck lint errors in cluster and hack scripts 2019-02-24 11:15:35 +08:00
Kubernetes Prow Robot 1cf8001e53
Merge pull request #74449 from xichengliudui/fix190223
make more of the shell pass lints
2019-02-23 12:52:34 -08:00
Kubernetes Prow Robot 8993fbc543
Merge pull request #74328 from daixiang0/delete-blank
delete all duplicate empty blanks
2019-02-23 01:43:58 -08:00
Peter Hornyack 621df2cddb Disable Windows Defender on Windows test nodes. 2019-02-22 18:35:38 -08:00
Xiang Dai 36065c6dd7 delete all duplicate empty blanks
Signed-off-by: Xiang Dai <764524258@qq.com>
2019-02-23 10:28:04 +08:00
Jeff Grafton 7a938eb541 Create work-around wrappers for pure attribute on go_binary and go_test
This enables cgo when cross-compiling certain tests and binaries to
Linux, while disabling cgo for Windows and Darwin.
2019-02-22 18:08:34 -08:00
Kubernetes Prow Robot 46d7e9c784
Merge pull request #74381 from yujuhong/add-key
GCE: add EventLog registry key for docker
2019-02-22 17:58:44 -08:00
Kubernetes Prow Robot fd7acc3275
Merge pull request #74065 from ixdy/break-up-test-tarball
break up the test tarball
2019-02-22 17:58:23 -08:00
Kubernetes Prow Robot 743f864310
Merge pull request #73819 from coffeepac/move-fluentd-es-images
Move fluentd es images
2019-02-22 17:58:12 -08:00
Peter Hornyack 3efd4ca1dc Enhance/repair detect-node-names() and related env vars for Windows nodes. 2019-02-22 14:56:55 -08:00
Patrick Christopher 1bd45ba6eb review updates 2019-02-22 10:00:10 -08:00
Aaron Crickenberger 2d868025be Promote spiffxp to approver for cluster/
Also add Katharine as reviewer
2019-02-22 09:25:31 -08:00
Kubernetes Prow Robot c7ac532816
Merge pull request #74360 from pjh/gce-windows-cluster-ssh
Enable OpenSSH on Windows nodes in test clusters.
2019-02-21 20:51:23 -08:00
Kubernetes Prow Robot 125dc6c8ea
Merge pull request #74187 from xichengliudui/fixgolint0218
Fix shellcheck lint errors in cluster/addons/fluentd-elasticsearch/fluentd-es-image/run.sh
2019-02-21 20:51:13 -08:00
Yu-Ju Hong 40d0ae311c GCE: add EventLog registry key for docker 2019-02-21 17:47:32 -08:00
Kubernetes Prow Robot 042f9ed3af
Merge pull request #74093 from blakebarnett/lower-neg-cache-ttl
Lowers the default nodelocaldns denial cache TTL
2019-02-21 17:47:16 -08:00
Blake 46c299c1b1 Match default cache size of 10000
https://github.com/coredns/coredns/blob/master/plugin/cache/cache.go#L236
This gets rounded down to the nearest multiple of 256: 9984
2019-02-21 15:03:30 -08:00
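
The rounding mentioned in commit 46c299c1b1 is plain integer arithmetic and can be checked in the shell. The function name is hypothetical; the values 10000 and 256 are taken from the commit message.

```shell
#!/bin/sh
# Rounds $1 down to the nearest multiple of $2 using integer division.
round_down_to_multiple() {
  echo $(( $1 / $2 * $2 ))
}

round_down_to_multiple 10000 256   # prints 9984
```

Integer division gives 10000 / 256 = 39, and 39 * 256 = 9984, matching the figure in the commit message.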
Peter Hornyack 57ca6f007e Enable OpenSSH on Windows nodes in test clusters.
Also switches to the most recent 64-bit version of OpenSSH for Windows.

Tested:
PROJECT=${CLOUDSDK_CORE_PROJECT} KUBERNETES_SKIP_CONFIRM=y NUM_NODES=2 \
NUM_WINDOWS_NODES=2 KUBE_GCE_ENABLE_IP_ALIASES=true TEST_CLUSTER=true \
./cluster/kube-up.sh
2019-02-21 14:03:43 -08:00
Jeff Grafton b360f95eb3 cleanup: we always need to download client and server tarballs 2019-02-21 13:17:58 -08:00
Jeff Grafton 56949c7834 Support split test tarballs in get-kube-binaries.sh 2019-02-21 13:17:58 -08:00
Peter Hornyack 6d78f2b666 Default to Windows Server version 1809 for Windows nodes.
Removes all references to 1803, including moving the "win1803" directory to
just "windows". A single Windows directory suffices for now; if necessary
in the future, we can shard it into directories for each
Windows version.

We've been running tests with Windows 1809 nodes for a couple days in
our fork without major problems:
https://testgrid.k8s.io/google-windows#windows-prototype&width=20.
Testing on Azure is already using 1809:
https://testgrid.k8s.io/sig-windows#Conformance%20acs-engine%20on%20Azure&width=20.
2019-02-21 09:44:44 -08:00
Kubernetes Prow Robot f1de0b557c
Merge pull request #74324 from mtaufen/fix-windows
Fix hash if statement
2019-02-20 23:57:18 -08:00
xichengliudui 053332ad46 Fix shellcheck lint errors in cluster/addons/fluentd-elasticsearch/fluentd-es-image/run.sh
update pull request

update pull request

update pull request

update pull request

update pull request
2019-02-21 02:00:48 -05:00
Kubernetes Prow Robot 6c1f2077e5
Merge pull request #74192 from xichengliudui/fixshellcheck190218
make more of the shell pass lints
2019-02-20 21:41:25 -08:00
Kubernetes Prow Robot 054a676141
Merge pull request #74142 from javier-b-perez/master
GCE config.sh script should use headers for curl GCS apis
2019-02-20 21:41:12 -08:00
Michael Taufen cf3ad9c655 Fix hash if statement 2019-02-20 16:56:00 -08:00
Kubernetes Prow Robot f04ce297d6
Merge pull request #74100 from mtaufen/file-download-improvements
Retry downloads, respect URL list, validate tar hash
2019-02-20 11:34:06 -08:00
Michael Taufen 7ffe810f1d Retry downloads, respect URL list, validate tar hash 2019-02-20 08:52:46 -08:00
Kubernetes Prow Robot f5989303b7
Merge pull request #74060 from SataQiu/fix-shellcheck-20190214
Fix shellcheck failures on kube-down.sh kubeadm.sh get-build.sh
2019-02-19 21:41:17 -08:00
Kubernetes Prow Robot db7d930aab
Merge pull request #74109 from pjh/gce-windows-cluster-smoke-test
Detect ready pods correctly and untaint Windows nodes in smoke-test.
2019-02-19 19:57:40 -08:00
Kubernetes Prow Robot 7b203c6809
Merge pull request #74137 from rajansandeep/readinessprobe
Add readinessProbe to CoreDNS
2019-02-19 16:24:04 -08:00
Kubernetes Prow Robot cbf45eea13
Merge pull request #74138 from rramkumar1/ingress-docs-fix
Update docs for Ingress-GCE related cluster addon
2019-02-19 15:05:50 -08:00
Peter Hornyack 8a7fe26888 Detect ready pods correctly and untaint Windows nodes in smoke-test. 2019-02-19 11:48:54 -08:00
Wenjia Zhang b9af5b5112 fix nit for etcd_apiserver TLS path 2019-02-19 11:19:49 -08:00
Sandeep Rajan 37c3d68a91 Add readinessProbe 2019-02-19 10:14:12 -05:00
Kubernetes Prow Robot 16cbb6b965
Merge pull request #73848 from krzysied/logexporter_custom_fix
Handling for use_custom_instance_list in dump_nodes_with_logexporter
2019-02-18 15:23:48 -08:00
xichengliudui b83b83f8e3 aaa 2019-02-18 17:50:55 -05:00
Rohit Ramkumar a50752ceb7 Update docs for Ingress-GCE related cluster addon 2019-02-18 13:17:01 -08:00
André Bauer d82d5fda35 updated kibana to 6.6.0
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-18 11:00:02 +01:00
André Bauer fa859e4644 Merge branch 'master' into kibana 2019-02-18 10:58:49 +01:00
Kubernetes Prow Robot 92e0c231fa
Merge pull request #74107 from truongnh1992/adding-shebang
Adding '/usr/bin/env bash'
2019-02-18 00:23:35 -08:00
Kubernetes Prow Robot cdbc9d8567
Merge pull request #73686 from ShivangGoswami/master
Cleanup: Audit log and error capitalization
2019-02-17 23:08:39 -08:00
xichengliudui fd044c5770 make more of the shell pass lints 2019-02-18 00:42:46 -05:00
Kubernetes Prow Robot 5aaf6cd61b
Merge pull request #72386 from logicalhan/readiness
fail faster for readiness checks
2019-02-15 18:02:30 -08:00
Kubernetes Prow Robot a22763b24e
Merge pull request #74063 from huynq0911/fix_wrong_format_yaml_influxdb
Fix incorrect influxdb yaml file
2019-02-15 16:46:18 -08:00
Javier Pérez Hernández 708d7df561 k8s: gci: configure: add token to curl calls
Modify script to use curl to get metadata and Service Account token.
When the SA doesn't have 'Storage Read' scope, it can only read
public files.
2019-02-15 11:00:35 -08:00
Ben Moss 34ac4d9ee9 Update deprecated links 2019-02-15 09:13:07 -05:00
Kubernetes Prow Robot 808f2cf0ef
Merge pull request #72525 from justinsb/owners_should_not_be_executable
Remove executable file permission from OWNERS files
2019-02-14 23:55:45 -08:00
Nguyen Hai Truong 848da13359 Adding '/usr/bin/env bash'
This commit adds '/usr/bin/env bash' as a shebang line
to indicate that the scripts use the bash shell for interpreting.

Signed-off-by: Nguyen Hai Truong <truongnh@vn.fujitsu.com>
2019-02-14 19:18:49 -08:00
Nguyen Quang Huy ac8466444c Fix incorrect influxdb yaml file
Remove redundant attribute in container declaration
2019-02-14 14:26:05 +07:00
SataQiu 78e272a927 fix shellcheck failures on kube-down.sh kubeadm.sh get-build.sh 2019-02-14 15:12:08 +08:00
Kubernetes Prow Robot 6f6f8bdfcc
Merge pull request #73822 from justinsb/renamed_set_node_image
e2e upgrade test: rename functions to match f0f7829934
2019-02-13 19:06:20 -08:00
Kubernetes Prow Robot b9072c3fe7
Merge pull request #73984 from yujuhong/find-node-tar
Add path in find-tar for locating node binary tarball
2019-02-13 14:32:26 -08:00
Blake e51c9025ac Lowers the default nodelocaldns denial cache TTL
Similar to `--no-negcache` on dnsmasq, this prevents issues for components that poll DNS for orchestration, such as operators with StatefulSets. Negative caching can also be very confusing for users when it makes a change they just made seem broken until the cache expires. This assumes that 5 seconds is reasonable and will still catch repeated AAAA negative responses. We could also set the denial cache size to zero, which should effectively disable it entirely, like dnsmasq in kube-dns, but testing shows this approach seems to work well in our (albeit small) test clusters.
2019-02-13 13:23:53 -08:00
Kubernetes Prow Robot 812016fa23
Merge pull request #73818 from gnufied/add-e2e-for-no-expand
add e2e test when expansion is disabled
2019-02-13 12:16:09 -08:00
Kubernetes Prow Robot aefd281143
Merge pull request #73983 from ixdy/update-bazel-deps
Update repo-infra, bazel-skylib, and rules_docker dependencies
2019-02-13 11:00:08 -08:00
Hemant Kumar 7d823e5df0 also remove slow tag from resizing tests 2019-02-13 10:48:49 -05:00
Kubernetes Prow Robot 40c91a0951
Merge pull request #74001 from mm4tt/logexporter_version_bump
Start using new version of logexporter.
2019-02-13 04:18:38 -08:00
Matt Matejczyk d7d46013cb Start using new version of logexporter. 2019-02-13 08:52:04 +01:00
Yu-Ju Hong 0a68740d08 Add path in find-tar for locating node binary tarball 2019-02-12 18:15:17 -08:00
Jeff Grafton e216995ef1 Update repo-infra, bazel-skylib, rules_docker, and rules_go dependencies
Also require bazel 0.18.0+
2019-02-12 17:55:10 -08:00
Peter Hornyack 063725230d Delete RDP firewall rule during kube-down.
This rule is only created if the cluster has one or more Windows nodes,
but delete-firewall-rules() ignores firewall rule arguments that do not
exist so it's safe to always attempt to delete this rule.
2019-02-12 15:19:16 -08:00
Kubernetes Prow Robot aa00afe231
Merge pull request #73649 from ojmhetar/coredns-priorityclass
Add priority class to CoreDNS pods
2019-02-11 22:55:45 -08:00
Jiaying Zhang 52e92ab4b9 Update nvidia-gpu-device-plugin addon.
This includes changes from GoogleCloudPlatform/container-engine-accelerators#102
2019-02-11 15:52:33 -08:00
Yu-Ju Hong 1bba009689 Fix setting NODE_ARCH and NODE_PLATFORM 2019-02-11 13:55:44 -08:00
Kubernetes Prow Robot b50c643be0
Merge pull request #73540 from rlenferink/patch-5
Updated OWNERS files to include link to docs
2019-02-08 09:05:56 -08:00
Krzysztof Siedlecki bc42602024 adding handling for use_custom_instance_list in dump_nodes_with_logexporter 2019-02-08 14:02:06 +01:00
patc 0e219f4caa boilerplate fix 2019-02-07 21:12:46 -08:00
Kubernetes Prow Robot a7966022d7
Merge pull request #73714 from tallclair/debian-base
Bump debian-* base image versions to latest
2019-02-07 19:44:44 -08:00
Justin SB 28c34d8d71
e2e upgrade test: rename functions to match f0f7829934
These functions were renamed in
f0f7829934, but cluster/gce/upgrade.sh
was missed.
2019-02-07 13:27:34 -05:00
Kubernetes Prow Robot a0aa22226e
Merge pull request #73813 from wojtek-t/fix_kubemark_nodes
Fix kubemark windows num nodes
2019-02-07 07:12:41 -08:00
wojtekt 482e307406 Fix kubemark windows num nodes 2019-02-07 14:14:41 +01:00
Benjamin Elder bccf95b10c add bentheelder to reviewers 2019-02-06 16:26:51 -08:00
Matt Matejczyk 5e6171790b Propagate dump_systemd_journal to logexporter job.
Log exporter changes have been made in
https://github.com/kubernetes/test-infra/pull/11121 and a new version has
been pushed in https://github.com/kubernetes/test-infra/pull/11149
2019-02-06 15:49:29 +01:00
Jorge O. Castro 522a34f1d7 Remove inactive maintainers
Signed-off-by: Jorge O. Castro <jorgec@vmware.com>
2019-02-05 09:45:39 -05:00
Matt Matejczyk 4dcc017d7a Propagate API_SERVER_TEST_LOG_LEVEL in cluster/gce/util.sh 2019-02-05 10:30:17 +01:00
Roy Lenferink b43c04452f Updated OWNERS files to include link to docs 2019-02-04 22:33:12 +01:00
Tim Allclair 1b9d0c1094 Bump debian-* base image versions to latest 2019-02-04 11:58:51 -08:00
ShivangGoswami 2866b1a3b8 Cleanup: Audit log and error capitalization 2019-02-04 10:59:34 +05:30
Matt Matejczyk 35543f8989 Allow dumping full systemd journal in log-dump.sh.
The feature is gated behind a newly introduced 'dump-systemd-journal' flag.
We want to dump the full systemd journal in our scalability performance tests.
2019-02-03 21:28:37 +01:00
Ojas M bb9c865d47 Add priority class to CoreDNS pods 2019-02-01 12:56:48 -08:00
Kubernetes Prow Robot 556c0b8593
Merge pull request #73627 from mborsz/heapster
Fix typo in create-heapster-node
2019-02-01 07:44:48 -08:00
Kubernetes Prow Robot b1eae78fc7
Merge pull request #73610 from rajansandeep/coredns1.3.1
Bump CoreDNS version to 1.3.1
2019-02-01 06:32:11 -08:00
ramnar 61f8b9e440 Bug fix 72757 2019-02-01 19:37:51 +05:30
Maciej Borsz afeabe99b7 Fix typo in create-heapster-node 2019-02-01 11:04:23 +01:00
Kubernetes Prow Robot 244795a06a
Merge pull request #73442 from pjh/gce-windows-cluster
Update cluster/gce/ scripts to support Windows nodes.
2019-01-31 18:20:37 -08:00
Sandeep Rajan aa11f6c298 Bump CoreDNS version to 1.3.1 2019-01-31 16:55:02 -05:00
Kubernetes Prow Robot 46fc4dd96a
Merge pull request #73577 from wojtek-t/remove_vmodule
Remove vmodule from apiserver to reduce lock contention
2019-01-31 04:59:42 -08:00
wojtekt 3032d92736 Remove vmodule from apiserver to reduce lock contention 2019-01-31 10:53:44 +01:00
Kubernetes Prow Robot 38d12df696
Merge pull request #73450 from liggitt/update-cluster-proportional-autoscaler
Update cluster-proportional-autoscaler to 1.4.0 to use apps/v1
2019-01-30 20:25:52 -08:00
Peter Hornyack f0f7829934 Update cluster/gce scripts to support Windows nodes. 2019-01-30 19:49:07 -08:00
Kubernetes Prow Robot 16000e3af7
Merge pull request #73220 from yujuhong/gce-os-labels
GCE: update addon DaemonSets to select node OS
2019-01-29 18:22:21 -08:00
Kubernetes Prow Robot c501d464af
Merge pull request #73267 from rajansandeep/proxytoforward
Use forward plugin instead of proxy plugin in the default configuration of CoreDNS
2019-01-29 11:32:28 -08:00
Matt Matejczyk 12904ac911 Tune suggested master disk sizes for big clusters.
Looks like the get-master-root-disk-size() and get-master-disk-size() functions didn't anticipate
clusters bigger than 2K nodes.

In https://github.com/kubernetes/kubernetes/issues/72976 we found out
that 100GB may not be enough for large clusters (5K nodes) when it comes
to master root disk size.

Updating both get-master-root-disk-size() and get-master-disk-size() to
make them consistent and match cluster sizes with get-master-size()
function.
2019-01-29 10:48:08 +01:00
patc 64918cc2b4 update makefiles to use cloud build and update README 2019-01-28 22:44:44 -08:00
Jordan Liggitt 01434acc66 Update cluster-proportional-autoscaler to 1.4.0 to use apps/v1 2019-01-29 00:45:21 -05:00
Kubernetes Prow Robot d54716338a
Merge pull request #73302 from rajansandeep/prometheusport
Add metrics port to CoreDNS service
2019-01-25 12:04:37 -08:00
Sandeep Rajan e9220dd408 add prometheus port 2019-01-25 11:29:03 +05:30
Kubernetes Prow Robot 7d91e1223d
Merge pull request #73293 from prameshj/nodelocal-update
Modify nodelocaldns yaml to include updateStrategy and xtables.lock
2019-01-24 20:07:57 -08:00
Kubernetes Prow Robot 0210c0d869
Merge pull request #73253 from phenixblue/improve-syntax-for-shell-script
Updates to eliminate shellcheck failures - kubernetes/cluster/centos
2019-01-24 17:46:43 -08:00
Kubernetes Prow Robot d306d68a36
Merge pull request #72978 from liggitt/addon-changelog
Update addon-manager changelog/image
2019-01-24 16:30:00 -08:00
Pavithra Ramesh 9611b112eb Add xtables.lock file to mounts
nodelocaldns needs to use the same iptables lock file to avoid
contention with other entities modifying iptables rules.
2019-01-24 13:21:06 -08:00
Joe Searcy e72a683f82 Updates to eliminate shellcheck failures in #72956
Extend quotes to full filepath strings

Fixes for reviewers' comments
2019-01-24 15:57:26 -05:00
Kubernetes Prow Robot c21f60f862
Merge pull request #73268 from wojtek-t/better_scale_inflights
Better scale max-inflight request limits to correlate them with master VM size
2019-01-24 10:37:25 -08:00
wojtekt d911ebbc3f Correlate max-inflight values in GCE with master VM sizes 2019-01-24 15:13:36 +01:00
Sandeep Rajan c346ae1e49 Use forward plugin instead of proxy in the default configuration 2019-01-24 17:28:55 +05:30
Kubernetes Prow Robot 9ce03717a9
Merge pull request #73229 from yujuhong/append-owner
GCE: add yujuhong to OWNERS
2019-01-24 02:56:29 -08:00
danielqsj cd5e658518 fix shellcheck in cluster/local 2019-01-24 18:06:55 +08:00
Kubernetes Prow Robot b9631a881f
Merge pull request #73231 from liggitt/add-on-arch-build
Enable multi-arch build for add-on manager
2019-01-23 16:19:35 -08:00
Jordan Liggitt af66376960 Enable multi-arch build for add-on manager 2019-01-23 18:08:09 -05:00
Yu-Ju Hong 2edd3e3f52 GCE: add yujuhong to OWNERS 2019-01-23 14:46:29 -08:00
Kubernetes Prow Robot baaaa15fdf
Merge pull request #69940 from MarcPow/master
Bind coredns containers to linux nodes to avoid Windows scheduling
2019-01-23 12:19:53 -08:00
Yu-Ju Hong 9c892243f6 GCE: update addon DaemonSets to select node OS
These DaemonSets support only Linux today, so this change updates the
specs to reflect this limitation. The labels have recently been promoted
to GA. Using the beta labels for now until the node-master version skew
problem no longer exists.
2019-01-23 09:01:40 -08:00
patc 7f873f23b9 multi stage dockerfile builds discovery app and ES image 2019-01-23 00:01:58 -08:00
Humble Devassy Chirammal 2651c72166 This patch consists of various corrections. (#73060)
* This patch consists of various corrections.

*) Remove unwanted `else` code block
*) Adjust variable initialization.
*) Correct error strings.
*) Use recommended variable increment method.

Signed-off-by: hchiramm <hchiramm@redhat.com>

* Remove cluster/images/cluster-version-monitor from hack/.golint_failures.

Signed-off-by: Humble Chirammal <hchiramm@redhat.com>
2019-01-22 12:28:59 -08:00
Kubernetes Prow Robot b6fb1572d8
Merge pull request #72861 from BenTheElder/slightly-less-bash-fire
fix bugs in get-kube scripts
2019-01-18 18:00:28 -08:00
Pavithra Ramesh 960705ac98 Add updateStrategy section for nodelocaldns 2019-01-18 16:40:01 -08:00