8425 Commits (644632ddcfd5ceede125758d51c9aa3736763753)

Author SHA1 Message Date
Kubernetes Prow Robot 31903e644b
Merge pull request #76152 from SataQiu/fix-shell-cluster-20190404
Fix some shellcheck failures of cluster/*.sh
2019-04-19 18:59:41 -07:00
Yu-Ju Hong 4cd2625719 GCE/Windows: enable stackdriver logging agent
This change bumps the stackdriver logging agent version to v1-9,
re-enables it, and changes the script/configuration to:
  * Create /var/log in the startup script, since the fluentd
    configuration expects the directory to exist
  * Add support for collecting kubelet/kube-proxy logs
2019-04-19 15:41:48 -07:00
Yu-Ju Hong 05074e9dde GCE/Windows: add cluster-location to instance metadata
This will be used by stackdriver to annotate the logs.
2019-04-19 15:41:48 -07:00
Jake Sanders d4d5afb9cc add additional approvers from GKE 2019-04-19 14:38:31 -07:00
aaa ad5bccc510 fix shellcheck in test-smoke.sh and test-network.sh
update pull request
2019-04-19 16:15:10 -04:00
Jake Sanders 42fcd5eb63 remove erroneous kube-apiserver.manifest sed line 2019-04-19 17:40:28 +00:00
Kubernetes Prow Robot e4f9d8fe3f
Merge pull request #76331 from xichengliudui/fixshellcheckfailures-17
Fix shellcheck failures in stage-upload.sh
2019-04-19 09:47:55 -07:00
Marek Siarkowicz 2fc8ac9e81 [metrics-server addon] Restore metrics-server use of IP addresses
This preference list is used to pick the preferred field from the k8s
node object. It was introduced in metrics-server 0.3 and changed the default
behaviour to use DNS instead of IP addresses. It was merged into k8s
1.12 and caused a breaking change by introducing a dependency on DNS
configuration.
2019-04-19 11:43:17 +02:00
Jake Sanders 113ab741e6 add option to set the value of the apiserver's insecure port 2019-04-18 20:35:08 +00:00
Marek Siarkowicz 37381eb384 Pick up security patches for fluentd-gcp-scaler by upgrading to version 0.5.2 2019-04-18 11:52:53 +02:00
Kubernetes Prow Robot c58864de3f
Merge pull request #76711 from dims/remove-deprecated-centos-local-support
Remove deprecated centos/local support
2019-04-17 22:58:19 -07:00
Peter Hornyack 81cc384a9a Pin GCE Windows node image to 1809 v20190312.
This is to work around
https://github.com/kubernetes/kubernetes/issues/76666.
2019-04-17 09:45:43 -07:00
Davanum Srinivas 8762bc39a7
Remove deprecated centos/local support
Change-Id: I4a451ec0125c05e66202daf781a4e5a1d895efb4
2019-04-17 11:03:34 -04:00
George Angel f40f767d94 update k8s.gcr.io/k8s-dns-node-cache image version
v1.15.0 is affected by https://github.com/kubernetes/dns/issues/282
2019-04-16 09:43:53 +01:00
Kubernetes Prow Robot 02ffdd1d5b
Merge pull request #76628 from kwmonroe/rehome-juju
re-home juju related source
2019-04-15 23:31:34 -07:00
Kubernetes Prow Robot dda0e75d36
Merge pull request #76404 from MrHohn/addon-manager-9.1
Update addon-manager to use debian-base:v1.0.0
2019-04-15 18:05:36 -07:00
Kubernetes Prow Robot dc2c9e8bd5
Merge pull request #74690 from wenjiaswe/etcdserverhttp
override ETCD_SERVER with https instead of http when mTLS is enabled
2019-04-15 16:45:07 -07:00
Kubernetes Prow Robot 8a636a3151
Merge pull request #76467 from MrHohn/addon-manager-owner
Add approver and label to addon-manager
2019-04-15 14:25:06 -07:00
Kevin W Monroe 422a7ec49f re-home juju related source 2019-04-15 16:19:09 -05:00
Kubernetes Prow Robot b4c77eff33
Merge pull request #76427 from hprateek43/Fix-#75567
Fix for #75567
2019-04-15 11:46:39 -07:00
Wenjia Zhang 80c4bccf0f override ETCD_SERVER with https instead of http when mTLS is enabled 2019-04-14 22:11:37 -07:00
Yuwen Ma 23469d6b04
Rebase etcd-empty-dir-cleanup to debian-base
Busybox has more severe CVE issues.
This is part of the effort described in KEP kubernetes/enhancements#900. We can't change the base image to distroless directly since a bash script is used.
Moved to debian-base and put it as a temp exception for now.
2019-04-12 15:59:23 -07:00
Zihong Zheng 2d635bc29d Add approver and label to addon-manager 2019-04-12 13:04:43 -07:00
Kubernetes Prow Robot 67ee03afd0
Merge pull request #76440 from mm4tt/gce_private_clusters
Ability to set up Private GCE Test Clusters with NAT.
2019-04-12 07:31:03 -07:00
Kubernetes Prow Robot 0da72a1ef3
Merge pull request #76437 from brett-elliott/master
Bump metrics-server to v0.3.2
2019-04-12 07:30:48 -07:00
Matt Matejczyk 2efea76263 Ability to set up Private GCE Clusters with NAT.
Ref. https://github.com/kubernetes/kubernetes/issues/76374
2019-04-12 13:21:56 +02:00
Kubernetes Prow Robot ae8a4fcb0d
Merge pull request #76352 from jpbetz/reapply-72062
[reapply] Perform GCE log rotation check every 5 minutes
2019-04-12 00:41:14 -07:00
aaa 9768ba9eb3 Fix shellcheck failures in stage-upload.sh
update pull request
2019-04-12 02:42:48 -04:00
Zihong Zheng 9f8d9ba847 Update addon-manager to use debian-base:v1.0.0 2019-04-11 10:18:33 -07:00
Brett Elliott da4a8aa5ce Bump metrics server to v0.3.2 2019-04-11 13:27:14 +02:00
Kubernetes Prow Robot 39073f0b9f
Merge pull request #74268 from wenjiaswe/fixingNitforetcdapiservertlspath
fix nit for etcd_apiserver TLS path
2019-04-11 01:22:11 -07:00
Harsh Singh 47275cb6cd Fix for #75567 2019-04-11 13:18:47 +05:30
Kubernetes Prow Robot 134ed2e703
Merge pull request #76353 from Random-Liu/add-cni-version-env
Add envs for CNI version.
2019-04-10 18:46:22 -07:00
Kubernetes Prow Robot e3c3f75b5b
Merge pull request #75658 from feiskyer/cri-tools-1.14
Update cri-tools to v1.14.0
2019-04-10 16:52:07 -07:00
Lantao Liu ad5b64ca10 Add envs for CNI version.
Signed-off-by: Lantao Liu <lantaol@google.com>
2019-04-09 16:38:53 -07:00
Joe Betz 6297c4f750 Perform GCE log rotation check every 5 minutes 2019-04-09 16:37:38 -07:00
yue9944882 b5e3acc5c0 remove internal client references in cluster/* 2019-04-09 21:43:54 +08:00
Pengfei Ni 1406e2b70f Update cri-tools to v1.14.0 2019-04-08 10:59:18 +08:00
Yubao Liu f7f51fab2a change default 5s ttl to 30s for coredns to be the same as kube-dns/dnsmasq 2019-04-07 20:41:25 +08:00
Kubernetes Prow Robot 3e954d3bd3
Merge pull request #76211 from wangzhen127/npd063
Use Node-Problem-Detector v0.6.3 on GCI
2019-04-05 14:34:17 -07:00
Kubernetes Prow Robot 63ae37304b
Merge pull request #75967 from ialidzhikov/fluentd-1.4.1
Update fluentd 1.4.1
2019-04-05 11:51:58 -07:00
Zhen Wang 953677d7a5 Use Node-Problem-Detector v0.6.3 on GCI 2019-04-05 11:08:24 -07:00
Maciej Borsz a0b51681c4
Revert "[Distroless] Convert the GCE manifests for master containers." 2019-04-05 12:55:14 +02:00
Ke Wu 925dc7f2e0 Fix empty array expansion error in cluster/gce/util.sh
Empty array expansion causes "unbound variable" error in
bash 4.2 and bash 4.3.
2019-04-04 16:46:58 -07:00
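The empty-array failure fixed in 925dc7f2e0 can be illustrated with a minimal sketch. Under `set -o nounset`, bash 4.2 and 4.3 treat `"${arr[@]}"` on an empty array as an unbound variable, while the guarded form `${arr[@]+"${arr[@]}"}` expands to nothing. The names `count_args` and `extra_flags` are hypothetical, and this is not necessarily the exact change made in `cluster/gce/util.sh`.

```bash
#!/usr/bin/env bash
set -o nounset

# Hypothetical helper: prints how many arguments it received.
count_args() {
  echo "$#"
}

# Hypothetical array that may legitimately be empty.
extra_flags=()

# Guarded expansion: yields zero words when the array is empty and the
# quoted elements otherwise, so nounset does not trip on bash 4.2/4.3.
count_args ${extra_flags[@]+"${extra_flags[@]}"}

# Illustrative values only; any non-empty content behaves the same way.
extra_flags+=("first value" "second value")
count_args ${extra_flags[@]+"${extra_flags[@]}"}
```

The inner quotes keep elements that contain spaces intact, so the second call still sees exactly two arguments.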
Kubernetes Prow Robot 1c105e1629
Merge pull request #75624 from yuwenma/gce-manifest
[Distroless] Convert the GCE manifests for master containers.
2019-04-04 16:21:19 -07:00
Yuwen Ma af2659527f [Distroless] Convert the GCE manifests for master containers.
* Touched containers: kube-apiserver, kube-scheduler,
kube-controller-manager.
* Remove the shell dependencies when starting up the containers.
* Reformat the command parameters to ["Exec", "Param1", "Param2"]
2019-04-04 11:16:23 -07:00
SataQiu bc279da872 fix some shellcheck failures of cluster/*.sh 2019-04-04 23:20:52 +08:00
Kubernetes Prow Robot 3e7134891f
Merge pull request #76099 from yujuhong/disable-stackdriver
GCE/Windows: disable stackdriver logging agent
2019-04-04 00:41:14 -07:00
Kubernetes Prow Robot f1acf4ad94
Merge pull request #75514 from tedyu/master
Use go idiomatic to replace loop
2019-04-04 00:40:50 -07:00
Solly Ross 837976cb59 Prune directxman12 from metrics/autoscaling OWNERS
Since I'm not really working on metrics or autoscaling stuff any more, I
figured it was time to remove myself from the approvers list.
2019-04-03 16:24:51 -07:00
Kubernetes Prow Robot 41691a031a
Merge pull request #76039 from johnSchnake/conformanceErrExit
Finish saving test results on failure
2019-04-03 11:14:49 -07:00
Yu-Ju Hong 2a37067735 GCE/Windows: disable stackdriver logging agent
The logging service could not be stopped at times, causing node startup
failures. Disable it until the issue is fixed.
2019-04-03 10:34:36 -07:00
John Schnake 4fec7c7568 Finish saving test results on failure
The conformance image should be saving its results
regardless of the results of the tests. However,
with errexit set, when ginkgo gets test failures
it exits 1 which prevents saving the results
for Sonobuoy to pick up.

Fixes: #76036
2019-04-03 08:38:43 -05:00
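The errexit problem described in 4fec7c7568 can be sketched as follows. `run_tests` and `save_results` are hypothetical stand-ins for the ginkgo invocation and the result-saving step; the actual change in the conformance image may be structured differently.

```bash
#!/usr/bin/env bash
set -o errexit

# Hypothetical stand-in for a ginkgo run that reports test failures.
run_tests() {
  return 1
}

# Hypothetical stand-in for packaging results for Sonobuoy.
save_results() {
  echo "results saved"
}

# Capture the exit status instead of letting errexit abort the script,
# so the results are saved whether or not the tests passed.
rc=0
run_tests || rc=$?

save_results
echo "tests exited with ${rc}"
```

A real script would typically finish with `exit "${rc}"` so that the caller still sees the test failure after the results have been written.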
Michelle Au d2aa8178f2 Remove alpha CRD install 2019-04-02 10:59:11 -07:00
Kubernetes Prow Robot 397f3d4c0c
Merge pull request #75856 from yuwenma/base-etcd
Rebase etcd image to k8s.gcr.io/debian-base
2019-04-01 15:48:47 -07:00
ialidzhikov ebfb92bdce Update fluentd 1.4.1
Signed-off-by: ialidzhikov <i.alidjikov@gmail.com>
2019-04-01 18:24:47 +03:00
yuwenma 529b20aadd rebase etcd to k8s debian-base 2019-03-29 18:59:55 -07:00
Peter Hornyack 1814b0c495 Disable GCE agent address management on Windows nodes.
With this metadata key set, "GCEWindowsAgent: GCE address manager
status: disabled" will appear in the VM's serial port output during
boot.

Tested:
PROJECT=${CLOUDSDK_CORE_PROJECT} KUBE_GCE_ENABLE_IP_ALIASES=true NUM_WINDOWS_NODES=2 NUM_NODES=2 KUBERNETES_NODE_PLATFORM=windows go run ./hack/e2e.go -- --up
cluster/gce/windows/smoke-test.sh

cat > iis.yaml <<EOF
apiVersion: v1
kind: Pod
metadata:
  name: iis
  labels:
    app: iis
spec:
  containers:
  - image: mcr.microsoft.com/windows/servercore/iis
    imagePullPolicy: IfNotPresent
    name: iis-server
    ports:
    - containerPort: 80
      protocol: TCP
  nodeSelector:
    beta.kubernetes.io/os: windows
  tolerations:
  - effect: NoSchedule
    key: node.kubernetes.io/os
    operator: Equal
    value: windows1809
EOF

kubectl create -f iis.yaml
kubectl expose pod iis --type=LoadBalancer --name=iis
kubectl get services
curl http://<service external IP address>
2019-03-28 17:28:18 -07:00
Kubernetes Prow Robot da018a6bfa
Merge pull request #75362 from serathius/gcp-security-patches
Update gcp images with security patches
2019-03-28 14:25:58 -07:00
Kubernetes Prow Robot 753a6edc37
Merge pull request #74616 from ialidzhikov/fluentd-1.4.0
Update fluentd to 1.4.0
2019-03-26 12:33:48 -07:00
Kubernetes Prow Robot 81d37386aa
Merge pull request #75168 from ialidzhikov/update-golang-version
Update golang to 1.12.0
2019-03-26 11:17:20 -07:00
Kubernetes Prow Robot 9e1cf75bd1
Merge pull request #75381 from rramkumar1/configure-scripts-for-ingress-yaml
Plumb CUSTOM_INGRESS_YAML into other setup scripts
2019-03-25 12:10:33 -07:00
Kubernetes Prow Robot 657951c3ed
Merge pull request #74889 from akutz/feature/get-kube-binaries-client-choice
Allow choice of os/arch when downloading client
2019-03-21 22:50:57 -07:00
Kubernetes Prow Robot 8af1bf313e
Merge pull request #75317 from jihoon-seo/patch-1
Update cluster/images/hyperkube/README.md
2019-03-20 20:35:37 -07:00
Ted Yu 793fd74a42 Use go idiomatic to replace loop 2019-03-20 11:40:26 -07:00
Kubernetes Prow Robot 66bb1215c7
Merge pull request #75310 from chi-xu/20190313
Correct a spelling error
2019-03-20 09:53:11 -07:00
Kubernetes Prow Robot 8b19a24974
Merge pull request #75153 from ymlong/fix-20193838
Correcting spelling errors
2019-03-20 00:54:48 -07:00
ialidzhikov db6add318a Update fluentd to 1.4.0
Signed-off-by: ialidzhikov <i.alidjikov@gmail.com>
2019-03-20 08:48:26 +02:00
Kubernetes Prow Robot aa5fda22f2
Merge pull request #75269 from javier-b-perez/gce-configure
gce: configure: validate SA has storage scope
2019-03-19 11:06:22 -07:00
Łukasz Osipiuk ff18fbcebb Update Cluster Autoscaler version to 1.14.0
No changes since 1.14.0-beta.2
Changelog: https://github.com/kubernetes/autoscaler/releases/tag/cluster-autoscaler-1.14.0
2019-03-19 16:15:20 +01:00
Dan Williams bfd8ad3268 build/gci: bump CNI version to 0.7.5 2019-03-18 17:16:42 -05:00
Kubernetes Prow Robot 28910bf2d7
Merge pull request #75387 from pjh/remove-use-release-node-binaries
Remove reference to USE_RELEASE_NODE_BINARIES.
2019-03-18 09:52:26 -07:00
Rohit Ramkumar 8ac15cd54a Plumb CUSTOM_INGRESS_YAML into other setup scripts so that it can be set before a call to kubetest 2019-03-15 13:36:30 -07:00
Łukasz Osipiuk e4663f73d3 Update Cluster Autoscaler version to 1.14.0-beta.2 2019-03-15 12:01:47 +01:00
Marek Siarkowicz 9e9b906047 Update gcp images with security patches
[stackdriver addon] Bump prometheus-to-sd to v0.5.0 to pick up security fixes.
[fluentd-gcp addon] Bump fluentd-gcp-scaler to v0.5.1 to pick up security fixes.
[fluentd-gcp addon] Bump event-exporter to v0.2.4 to pick up security fixes.
[fluentd-gcp addon] Bump prometheus-to-sd to v0.5.0 to pick up security fixes.
[metadata-proxy addon] Bump prometheus-to-sd to v0.5.0 to pick up security fixes.
2019-03-15 09:24:32 +01:00
Peter Hornyack 7f364b1c5b Remove reference to USE_RELEASE_NODE_BINARIES.
This variable was used for development purposes and was accidentally
introduced in
f0f7829934.

This is its only use in the tree:
https://github.com/kubernetes/kubernetes/search?q=USE_RELEASE_NODE_BINARIES&unscoped_q=USE_RELEASE_NODE_BINARIES
2019-03-14 14:50:54 -07:00
Jihoon Seo f5a66f8c13
Update cluster/images/hyperkube/README.md
Add missing period between 'components' and `hyperkube`.
Add line separation between 2 sentences.
2019-03-13 12:01:30 +09:00
xuchi d0e15aed2c Correct a spelling error 2019-03-13 09:39:13 +08:00
Kubernetes Prow Robot a213886f9c
Merge pull request #74502 from Random-Liu/collect-pod-log
Collect pod log in fluentd-gcp
2019-03-12 01:34:37 -07:00
Javier Pérez Hernández e783f0bcd4 gce: configure: validate SA has storage scope
If the VM SA doesn't have storage scope associated, don't use the
token in the curl request or the request will fail with 403.
2019-03-11 12:04:01 -07:00
Kubernetes Prow Robot 6ec5a7d337
Merge pull request #75181 from losipiuk/lo/master-ca-1.14-beta.1
Update Cluster Autoscaler version to 1.14.0-beta.1
2019-03-11 07:15:32 -07:00
Kubernetes Prow Robot 8dd09e0b36
Merge pull request #75208 from pjh/gce-windows-readme-update
Update GCE Windows README steps for running e2e tests.
2019-03-10 21:15:26 -07:00
Lantao Liu 7ffa7da112 Configure logrotate for pod logs. 2019-03-08 16:45:41 -08:00
Kubernetes Prow Robot c534a1b9b3
Merge pull request #75149 from yguo0905/m73
GCE: bump COS image version to cos-beta-73-11647-64-0
2019-03-08 14:19:35 -08:00
Kubernetes Prow Robot d778b9308a
Merge pull request #75063 from wangzhen127/npd-test-fix
Fix NPD e2e test on Ubuntu node and update NPD container version
2019-03-08 14:19:09 -08:00
Peter Hornyack 0fbb246c30 Update GCE Windows README steps for running e2e tests. 2019-03-08 13:47:41 -08:00
Łukasz Osipiuk 2790d9151a Update Cluster Autoscaler version to 1.14.0-beta.1 2019-03-08 14:01:55 +01:00
ialidzhikov c72115dede Update golang to 1.12.0
Signed-off-by: ialidzhikov <i.alidjikov@gmail.com>
2019-03-08 11:45:46 +02:00
Kubernetes Prow Robot cf439192fb
Merge pull request #75139 from mtaufen/windows-kube-proxy-args
Windows kubeproxy flags cleanup
2019-03-08 00:53:28 -08:00
Kubernetes Prow Robot 6a86f1576f
Merge pull request #75014 from mtaufen/windows-logrotate
Windows log rotation
2019-03-07 22:43:41 -08:00
Kubernetes Prow Robot 183e6cc190
Merge pull request #74433 from tallclair/runtimeclass-internal
Migrate RuntimeClass from a CRD to an internal API
2019-03-07 21:19:30 -08:00
ymlong 8e04d61488 Correcting spelling errors 2019-03-08 10:19:13 +08:00
Yang Guo eef545abfd GCE: bump COS image version to cos-beta-73-11647-64-0 2019-03-07 16:11:02 -08:00
Michael Taufen 0d59d2a43e Windows kubeproxy flags cleanup
- Pass most of the flags via kube-env, similar to Kubelet, to make it
  easier to reconfigure.
- Use kube-proxy's kubeconfig file to pass the API server address,
  rather than the --master flag.
- Make KUBEPROXY_ARGS a required variable in Windows kube-env, similar
  to KUBELET_ARGS.
2019-03-07 14:45:34 -08:00
Tim Allclair 63f61a6714 Migrate RuntimeClass to internal API 2019-03-07 11:07:54 -08:00
Michael Taufen 0a21d644f5 Rotate windows logs 2019-03-07 10:42:31 -08:00
Davanum Srinivas 200af47128
Stop waiting on conformance image pod when it fails
Change-Id: I516833167f8e0aaa0e7962c1d6bdbe2ca422dc6e
2019-03-07 11:03:05 -05:00
Kubernetes Prow Robot 57da888ff4
Merge pull request #75055 from pjh/gce-windows-disable-defender-again
Disable Windows Defender in Windows nodes on GCE again.
2019-03-07 01:07:26 -08:00
Zhen Wang f4d9e7d992 Fix NPD e2e test on Ubuntu node and update NPD container version 2019-03-06 22:42:47 -08:00
Kubernetes Prow Robot bd8eeaaaaf
Merge pull request #74468 from ixdy/bazel-cgo-crossbuild
bazel: improve multi-arch support for docker image artifacts
2019-03-06 20:35:11 -08:00
Kubernetes Prow Robot a60d44848f
Merge pull request #74748 from pjh/gce-windows-stackdriver
Install Stackdriver agent in Windows nodes on GCE.
2019-03-06 17:59:39 -08:00
Kubernetes Prow Robot ab7a48d796
Merge pull request #70036 from pbarker/audit-etoe
dynamic audit e2e test
2019-03-06 17:58:58 -08:00
Jeff Grafton 8495f7c9ed bazel: add docker_push rules 2019-03-06 12:35:33 -08:00
Peter Hornyack 80c635880d Disable Windows Defender in Windows nodes on GCE again.
This reverts commit fbf4fe4714. Windows
Defender seems to be causing our Windows nodes to crash and reboot
during e2e tests, e.g.
https://gcsweb.k8s.io/gcs/kubernetes-jenkins/logs/ci-kubernetes-e2e-windows-gce/228/artifacts/e2e-228-36623-windows-node-group-q4.
2019-03-06 11:42:03 -08:00
Jeff Grafton d7c2c2a606 bazel: refactor multi-arch container builds into starlark 2019-03-06 11:40:33 -08:00
Kubernetes Prow Robot 45e5f6053b
Merge pull request #74424 from liggitt/drop-k8s-io-node-labels
Clean up self-set node labels
2019-03-06 08:24:26 -08:00
Kubernetes Prow Robot 5a89793f8e
Merge pull request #74612 from jingxu97/windows
Add GcePD windows support
2019-03-05 21:42:22 -08:00
Peter Hornyack 197513141e Install Stackdriver agent in Windows test clusters. 2019-03-05 15:14:07 -08:00
Patrick Barker 67a245ae61 remove unneeded println 2019-03-05 14:38:58 -07:00
Patrick Barker 45d715cdc6 adds dynamic audit e2e test 2019-03-05 14:38:58 -07:00
Kubernetes Prow Robot 365c504e2b
Merge pull request #74928 from pjh/gce-windows-reenable-defender
Don't disable Windows Defender.
2019-03-05 11:10:25 -08:00
Kubernetes Prow Robot a352b74bcc
Merge pull request #74915 from jpbetz/revert-72062-2
revert 72062: Perform GCE master log rotation check every 5 minutes
2019-03-05 09:50:55 -08:00
Kubernetes Prow Robot b4e692b68f
Merge pull request #74854 from neolit123/conformance-nodes
conformance/run_e2e.sh: pass shellcheck and autodetect ginkgo nodes
2019-03-05 09:50:12 -08:00
Kubernetes Prow Robot 95cd1d59e4
Merge pull request #74209 from monotek/fluentd-helm-readme
added production note about EFK stack to the readme
2019-03-04 17:55:12 -08:00
Peter Hornyack fbf4fe4714 Don't disable Windows Defender. 2019-03-04 16:34:21 -08:00
Lubomir I. Ivanov fe2301f17d conformance/run_e2e.sh: pass shellcheck and autodetect ginkgo nodes
- for ginkgo parallel mode pass -p to autodetect ginkgo nodes.
- disable parallel mode by default (false).
- use provider "skeleton" instead of "local".
- make run_e2e.sh pass shellcheck.
2019-03-05 01:32:57 +02:00
Joe Betz add956f00f Revert "Perform GCE log rotation check every 5 minutes"
This reverts commit 1ba05d51a6.
2019-03-04 14:10:20 -08:00
Kubernetes Prow Robot 79e8a29544
Merge pull request #74852 from dims/expand-reviewers-approvers-add-label
Expand reviewers/approvers and add labels for conformance image
2019-03-04 13:25:50 -08:00
akutz 020dc47073 Allow choice of os/arch when downloading client
This patch introduces `KUBERNETES_CLIENT_OS` and
`KUBERNETES_CLIENT_ARCH` to the script `cluster/get-kube-binaries.sh` in
order to download a client that is not the OS/Arch of the host on which
the script is executing.
2019-03-04 10:23:04 -06:00
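The override behaviour introduced in 020dc47073 can be sketched as an environment variable with a host-derived fallback. Only `KUBERNETES_CLIENT_OS` and `KUBERNETES_CLIENT_ARCH` come from the commit message; the helper functions and the `uname` mapping below are assumptions for illustration, not the logic of `cluster/get-kube-binaries.sh` itself.

```bash
#!/usr/bin/env bash
set -o errexit -o nounset

# Hypothetical host detection; the real script may map uname output differently.
detect_host_os() {
  uname -s | tr '[:upper:]' '[:lower:]'
}

detect_host_arch() {
  case "$(uname -m)" in
    x86_64) echo "amd64" ;;
    aarch64|arm64) echo "arm64" ;;
    *) uname -m ;;
  esac
}

# Use the override when it is set, otherwise fall back to the host's values.
client_os="${KUBERNETES_CLIENT_OS:-$(detect_host_os)}"
client_arch="${KUBERNETES_CLIENT_ARCH:-$(detect_host_arch)}"

echo "client platform: ${client_os}/${client_arch}"
```

With this pattern, setting both variables in the environment before running the script selects a client for a platform other than the host's.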
Jeff Grafton c98b3edb88 bazel: make conformance and hyperkube images multiarch 2019-03-03 21:09:22 -08:00
Davanum Srinivas b3adf5bd0d
Expand reviewers/approvers and add labels for conformance image
Change-Id: I6e13d262a8bf97ada89227a853debe58876c212a
2019-03-02 13:46:27 -05:00
Davanum Srinivas 87e87dd25b
Apparently conformance tests have to run serially
There are 10+ conformance tests marked [Serial], so we can't run things
in parallel by default.

Change-Id: I58afbb356899fb38caf4f9f6704bfb47f745b338
2019-03-02 13:30:00 -05:00
Kubernetes Prow Robot e001276b1d
Merge pull request #74836 from yujuhong/update-win-cni-config
GCE/Windows: exclude gateways from the IPAM subnet
2019-03-01 20:35:03 -08:00
Kubernetes Prow Robot 3bb22edef8
Merge pull request #74808 from dims/showcase-the-available-options-in-manifest
Showcase conformance images options in the manifest
2019-03-01 20:34:53 -08:00
Kubernetes Prow Robot 738d29b96c
Merge pull request #74767 from yuwenma/rebase-image
Rebase etcd-version-monitor base image to distroless.
2019-03-01 19:14:24 -08:00
Aaron Crickenberger 388a345efc Use no skip regex for conformance 2019-03-01 16:35:27 -08:00
Yu-Ju Hong 1f86bce95e GCE/Windows: exclude gateways from the IPAM subnet
Exclude IPs assigned for gateways in the IPAM configuration.
2019-03-01 16:35:26 -08:00
Kubernetes Prow Robot b8e05af7bf
Merge pull request #74736 from mtaufen/windows-kube-proxy-config
Stop setting --hostname-override on Windows kube-proxy and allow args to be extended remotely if necessary
2019-03-01 16:12:23 -08:00
Kubernetes Prow Robot 4bafef2234
Merge pull request #74761 from dims/better-detection-of-binaries-for-conformance-image
Better detection of binaries for conformance image
2019-03-01 14:09:08 -08:00
Davanum Srinivas da566ad647
Showcase conformance images options in the manifest
The Dockerfile has a few more options, let's surface that in the
manifest. Also use the same defaults in the manifest as the Dockerfile
itself.

Change-Id: Ib7419cf7999430db15f39ac414c80ee362fcda76
2019-03-01 16:39:28 -05:00
Kubernetes Prow Robot 98c0d15f98
Merge pull request #74762 from pjh/gce-windows-dump-versions
Dump Windows version information during cluster bringup.
2019-03-01 01:08:36 -08:00
Kubernetes Prow Robot 824883d08a
Merge pull request #74754 from pjh/gce-windows-smoke-timeout
Increase Windows smoke-test timeout to 3 minutes.
2019-02-28 19:12:18 -08:00
Davanum Srinivas 9d7d682760
Better detection of binaries for conformance image
Change-Id: Id33a0827e8cb1e3369544eea3335c80c89780786
2019-02-28 22:00:28 -05:00
Peter Hornyack 18a2a98d67 Dump Windows version information during cluster bringup. 2019-02-28 17:22:49 -08:00
Peter Hornyack 43556be50e Enhance metadata fetching functions.
Introduce Get-InstanceMetadata which can be used to fetch
non-"attribute" metadata values.
2019-02-28 17:18:34 -08:00
Yuwen Ma fd2b57926e Rebase etcd-version-monitor base image to distroless. 2019-02-28 17:06:37 -08:00
Peter Hornyack 4181422a3c Increase Windows smoke-test timeout to 3 minutes.
I recently lowered the timeout value to 2 minutes but this turned out to
be too aggressive for now. 3 minutes is usually long enough for the test
containers to come up.

Note that this test is only used during development; it does not run
continuously anywhere.
2019-02-28 14:22:35 -08:00
Kubernetes Prow Robot 4741785d29
Merge pull request #73650 from mtaufen/windows-upload-scripts
upload Windows startup scripts to GCS for CI
2019-02-28 11:51:25 -08:00
Michael Taufen a76d132162 Stop setting --hostname-override on Windows kube-proxy and allow args to be extended remotely if necessary
--hostname-override should not be necessary, we can rely on kube-proxy
to auto-detect it just like we do in Kubelet.

I also added an optional variable to kube_env, KUBEPROXY_ARGS, that we
can use to remotely extend kube-proxy's command-line if necessary.
2019-02-28 10:13:07 -08:00
John Schnake 03d0e86117 Add support for dryRun option to kube-conformance image
A common issue users run into is wanting a list of tests
a certain regexp will run, without actually running it.

ginkgo supports this with the dryRun flag but it was not
exposed via the kube-conformance image. This change
will set the flag if the E2E_DRYRUN environment variable
is set.

Fixes #74727
2019-02-28 09:21:04 -06:00
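The behaviour described in 03d0e86117 amounts to adding a flag only when an environment variable is set. A minimal sketch, assuming the flag is spelled `--dryRun=true` and using a hypothetical helper named `dryrun_flag`; the image's actual script may build its arguments differently.

```bash
#!/usr/bin/env bash
set -o nounset

# Hypothetical helper: prints the ginkgo dry-run flag when E2E_DRYRUN is
# set to a non-empty value, and prints nothing otherwise.
dryrun_flag() {
  if [ -n "${E2E_DRYRUN:-}" ]; then
    echo "--dryRun=true"
  fi
}

# Show what would be appended to the ginkgo command line.
echo "extra ginkgo flag: $(dryrun_flag)"
```

The `${E2E_DRYRUN:-}` form keeps the check safe under `nounset` when the variable is absent.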
Kubernetes Prow Robot 02b8056efb
Merge pull request #73288 from wangzhen127/npd-config
Decouple node-problem-detector release from kubernetes
2019-02-28 00:27:25 -08:00
Kubernetes Prow Robot ccf33be0cc
Merge pull request #73940 from jiayingz/nvidia-dp-update
Update nvidia-gpu-device-plugin addon.
2019-02-27 17:13:01 -08:00
Jing Xu 5e04d200ee Add GcePD windows support
This PR adds support for GCE PD volumes. For now, it uses a workaround
(https://github.com/pjh/gce-tools) to get the disk number in Windows for a given GCE PD name.
2019-02-27 17:00:32 -08:00
Kubernetes Prow Robot b2e9b2a842
Merge pull request #74608 from dims/lighter-weight-make-for-conformance-dependencies
Lighter weight make for conformance dependencies, better script and manifest
2019-02-27 07:19:55 -08:00
Davanum Srinivas 23b33f5c85
Switch to k8s.gcr.io (from staging)
Change-Id: Ib0d6f681be3537f0cbfcae1dc254f1c42a04be66
2019-02-27 08:54:45 -05:00
Davanum Srinivas 87d9903aaf
Add a script to run the conformance image and pull results
Change-Id: I1eb673fe37b5e8a719f9095473765c80fb7f2347
2019-02-27 07:38:43 -05:00
Kubernetes Prow Robot 1942c1ccb0
Merge pull request #71251 from monotek/kibana
updated kibana to 6.6.1
2019-02-26 23:40:33 -08:00
Zhen Wang efa96f7eb8 allows configuring NPD release and flags on GCI and add cluster e2e test 2019-02-26 21:21:54 -08:00
Kubernetes Prow Robot 81ec358db4
Merge pull request #74438 from pjh/gce-windows-log-dump
Support dumping logs from Windows test nodes on GCE
2019-02-26 18:12:09 -08:00
Peter Hornyack 0bb25290c8 Update log-dump.sh for Windows nodes.
Tested:
```
$ PROJECT=${CLOUDSDK_CORE_PROJECT} KUBERNETES_SKIP_CONFIRM=y NUM_NODES=2 \
  NUM_WINDOWS_NODES=2 KUBE_GCE_ENABLE_IP_ALIASES=true go run \
  ./hack/e2e.go -- --up
$ cluster/log-dump/log-dump.sh
$ ls _artifacts
```

And with: NUM_NODES=2 NUM_WINDOWS_NODES=0; NUM_NODES=0 NUM_WINDOWS_NODES=2
2019-02-26 12:10:19 -08:00
Yu-Ju Hong 093e5a50ac GCE/Windows: create a C:\tmp directory
This is required for running host path tests.
2019-02-26 10:59:06 -08:00
Kubernetes Prow Robot 7a4496429d
Merge pull request #71252 from monotek/elasticsearch
updated elasticsearch to 6.6.1
2019-02-26 09:33:44 -08:00
Jordan Liggitt e1db43ad1b Label old nodes with legacy addon labels 2019-02-26 11:43:10 -05:00
Jordan Liggitt 0174e043c5 Prepare switch from beta.kubernetes.io/masq-agent-ds-ready to node.kubernetes.io/masq-agent-ds-ready 2019-02-26 11:43:10 -05:00
Jordan Liggitt 943b32a289 Prepare switch from beta.kubernetes.io/kube-proxy-ds-ready to node.kubernetes.io/kube-proxy-ds-ready 2019-02-26 11:42:23 -05:00
Jordan Liggitt d6664a2365 Prepare switch from beta.kubernetes.io/metadata-proxy-ready to cloud.google.com/metadata-proxy-ready 2019-02-26 11:42:23 -05:00
Jordan Liggitt 8975233788 Finish migration of fluentd to daemonset 2019-02-26 11:42:23 -05:00
Davanum Srinivas 94ad1dfb11
Better manifest for running conformance image
Change-Id: I137180ed781edd4a9877cabe039e40a72aa71366
2019-02-26 10:29:12 -05:00
Kubernetes Prow Robot b8ddc7945b
Merge pull request #74522 from Pluies/master
Fix fluentd-gcp addon liveness probe
2019-02-26 06:38:24 -08:00
Davanum Srinivas 069eeb541b
Simpler make commands for ginkgo/kubectl/e2e.test
Change-Id: I78cff10231eabd53b1fc7bdd1526c861179e135a
2019-02-26 09:18:05 -05:00
André Bauer 9e2d9cfbb0 changed es image repo
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-26 09:09:21 +01:00
Kubernetes Prow Robot 3fb6e77770
Merge pull request #74549 from yujuhong/pause-image
GCE: switch to using e2eteam/pause:3.1 for pause containers
2019-02-25 23:12:00 -08:00
Kubernetes Prow Robot 0ff7e463ee
Merge pull request #73746 from mrbobbytables/kubemark-shellcheck
Fix shellcheck lint errors in Kubemark scripts
2019-02-25 17:25:13 -08:00
Michael Taufen 98edbf41bc upload Windows startup scripts to GCS for CI 2019-02-25 16:44:57 -08:00
Kubernetes Prow Robot 3814176d42
Merge pull request #74455 from SataQiu/fix-shell-2019022302
Fix shellcheck lint errors in cluster and hack scripts
2019-02-25 15:15:19 -08:00
Yu-Ju Hong b863655faa GCE: switch to using e2eteam/pause:3.1 for pause containers
Stop building pause images on node startup.
2019-02-25 14:36:49 -08:00
Kubernetes Prow Robot 2aacb77374
Merge pull request #74444 from pjh/gce-windows-no-defender
Disable Windows Defender on Windows nodes.
2019-02-25 13:54:42 -08:00
Bob Killen 9a4f4878f5
Fix shellcheck lint errors in cluster/kubemark/util.sh 2019-02-25 15:21:29 -05:00
Bob Killen 9a58913e8f
Fix shellcheck lint errors in cluster/kubemark/iks/config-default.sh 2019-02-25 15:21:25 -05:00
Bob Killen ce4c85e3fd
Fix shellcheck lint errors in cluster/kubemark/gce/config-default.sh 2019-02-25 14:55:01 -05:00
Kubernetes Prow Robot 35a258d640
Merge pull request #73272 from danielqsj/juju
fix shellcheck in cluster/juju
2019-02-25 11:33:21 -08:00
Kubernetes Prow Robot f288678cfa
Merge pull request #73261 from danielqsj/local
fix shellcheck in cluster/local
2019-02-25 11:33:11 -08:00
Florent Delannoy e627474e8f Fix fluentd-gcp addon liveness probe
Fix three issues with the fluentd-gcp liveness probe:

h1. STUCK_THRESHOLD_SECONDS was overridden by LIVENESS_THRESHOLD_SECONDS
if defined

Probably a copy/paste issue introduced in edf1ffc074

h1. `[[` is [a bashism](https://stackoverflow.com/a/47576482), and will always fail when called with `/bin/sh`

Introduced by a844523c20

Given that we call the liveness probe with `/bin/sh`, we cannot use the
double-bracketed `[[` syntax for test, as it is not POSIX-compliant and
will throw an error.

Annoyingly, even though it prints an error, `sh` returns with exit code 0
in this case:

```bash
root@fluentd-7mprs:/# sh liveness.sh
liveness.sh: 8: liveness.sh: [[: not found
liveness.sh: 15: liveness.sh: [[: not found
root@fluentd-7mprs:/# echo $?
0
```

Which means the liveness probe is considered successful by Kubernetes,
despite failing to test things as it was intended. This is also
probably the reason why this bug wasn't reported sooner :)

Thankfully, the test in this case can just as easily be written as
POSIX-compliant as it doesn't use any bash-specific features within the
`[[` block.
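A POSIX-compliant form of such a check might look like the sketch below. It uses a temporary directory and marker file created on the spot, so the paths are hypothetical and this is not the probe script itself; it only shows `[ -z ... ]` doing the job that `[[ -z ... ]]` was meant to do.

```sh
#!/bin/sh
# Hypothetical probe fragment that avoids bash-only syntax.
workdir="$(mktemp -d)"
touch "${workdir}/marker"

# List regular files modified more recently than the marker.
recent="$(find "${workdir}" -type f -newer "${workdir}/marker" -print)"

# `[ -z ... ]` is the POSIX spelling of the emptiness test; `[[ -z ... ]]`
# fails with "not found" under a shell that lacks the bash extension.
if [ -z "${recent}" ]; then
  echo "nothing newer than marker"
fi

rm -rf "${workdir}"
```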

h1. Buffers are transient and cannot be relied upon for monitoring

Finally, after fixing the above issue, we started seeing the fluentd
containers being restarted very often, and found an issue with the
underlying logic of the liveness probe.

The probe checks that the pod is still alive by running the following
command:

`find /var/log/fluentd-buffers -type f -newer /tmp/marker-stuck -print -quit`

This checks if any _regular_ file exists under `/var/log/fluentd-buffers`
that is more recent than a predetermined time, and will return an empty
string otherwise.

The issue is that these buffers are temporary and volatile: they get created and
deleted constantly. Here is an example of running that check every second on a
running fluentd:

```
root@fluentd-eks-playground-jdc8m:/# LIVENESS_THRESHOLD_SECONDS=${LIVENESS_THRESHOLD_SECONDS:-300};
root@fluentd-eks-playground-jdc8m:/# STUCK_THRESHOLD_SECONDS=${LIVENESS_THRESHOLD_SECONDS:-900};
root@fluentd-eks-playground-jdc8m:/# touch -d "${STUCK_THRESHOLD_SECONDS} seconds ago" /tmp/marker-stuck;
root@fluentd-eks-playground-jdc8m:/# touch -d "${LIVENESS_THRESHOLD_SECONDS} seconds ago" /tmp/marker-liveness;
root@fluentd-eks-playground-jdc8m:/# while true; do date ; find /var/log/fluentd-buffers -type f -newer /tmp/marker-stuck -print -quit ; sleep 1 ; done
Fri Feb 22 10:52:57 UTC 2019
Fri Feb 22 10:52:58 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827964ccf4c7004103c3fa7c8533f85.log
Fri Feb 22 10:52:59 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827964ccf4c7004103c3fa7c8533f85.log
Fri Feb 22 10:53:00 UTC 2019
Fri Feb 22 10:53:01 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827964fb8b2eedcccd2763ea7775cc2.log
Fri Feb 22 10:53:02 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827964fb8b2eedcccd2763ea7775cc2.log
Fri Feb 22 10:53:03 UTC 2019
Fri Feb 22 10:53:04 UTC 2019
Fri Feb 22 10:53:05 UTC 2019
Fri Feb 22 10:53:06 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827965564883997b673d703af54848b.log
Fri Feb 22 10:53:07 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827965564883997b673d703af54848b.log
Fri Feb 22 10:53:08 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer/buffer.b5827965564883997b673d703af54848b.log
Fri Feb 22 10:53:09 UTC 2019
Fri Feb 22 10:53:10 UTC 2019
Fri Feb 22 10:53:11 UTC 2019
Fri Feb 22 10:53:12 UTC 2019
Fri Feb 22 10:53:13 UTC 2019
Fri Feb 22 10:53:14 UTC 2019
Fri Feb 22 10:53:15 UTC 2019
Fri Feb 22 10:53:16 UTC 2019
```

We can see buffers being created, then disappearing. The LivenessProbe running
under these conditions has a ~50% chance of failing, despite fluentd being
perfectly happy.

I believe that check is probably ok for fluentd installs using large
amounts of buffers, in which case the liveness probe will be correct more
often than not, but fluentd installs that use buffering less intensively
will be negatively impacted by this.

My solution to fix this is to check the last updated time of buffering
_folders_ within `/var/log/fluentd-buffers`. These _do_ get updated when
buffers are created, and do not get deleted as buffers are emptied,
making them the perfect candidate for our use.

Here's an example with `-type d` to match directories:
```
root@fluentd-eks-playground-jdc8m:/# while true; do date ; find /var/log/fluentd-buffers -type d -newer /tmp/marker-stuck -print -quit ; sleep 1 ; done
Fri Feb 22 10:57:51 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:52 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:53 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:54 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:55 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:56 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:57 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:58 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:57:59 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:58:00 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:58:01 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:58:02 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
Fri Feb 22 10:58:03 UTC 2019
/var/log/fluentd-buffers/kubernetes.system.buffer
```

And an example of the directory being updated as new buffers come in:
```
root@fluentd-eks-playground-jdc8m:/# ls -lah /var/log/fluentd-buffers/kubernetes.system.buffer
total 0
drwxr-xr-x 2 root root  6 Feb 22 11:17 .
drwxr-xr-x 3 root root 38 Feb 22 11:14 ..
root@fluentd-eks-playground-jdc8m:/# ls -lah /var/log/fluentd-buffers/kubernetes.system.buffer
total 16K
drwxr-xr-x 2 root root  224 Feb 22 11:18 .
drwxr-xr-x 3 root root   38 Feb 22 11:14 ..
-rw-r--r-- 1 root root 1.8K Feb 22 11:18 buffer.b58279be6e21e8b29fc333a7d50096ed0.log
-rw-r--r-- 1 root root  215 Feb 22 11:18 buffer.b58279be6e21e8b29fc333a7d50096ed0.log.meta
-rw-r--r-- 1 root root  429 Feb 22 11:18 buffer.b58279be6f09bdfe047a96486a525ece2.log
-rw-r--r-- 1 root root  195 Feb 22 11:18 buffer.b58279be6f09bdfe047a96486a525ece2.log.meta
root@fluentd-eks-playground-jdc8m:/# ls -lah /var/log/fluentd-buffers/kubernetes.system.buffer
total 0
drwxr-xr-x 2 root root  6 Feb 22 11:18 .
drwxr-xr-x 3 root root 38 Feb 22 11:14 ..
```
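
The directory-based check described above can be sketched as a small
shell function. This is only an illustration under stated assumptions:
the helper name `buffer_dir_updated_since` is hypothetical, and the
function simply wraps the same `find ... -type d -newer` invocation
shown in the transcript.

```shell
#!/bin/sh
# Hypothetical helper, not part of the manifest.
# $1 = buffer root (e.g. /var/log/fluentd-buffers)
# $2 = marker file whose mtime is the cut-off
# Exits 0 when at least one directory under $1 was modified after the marker.
buffer_dir_updated_since() {
  newest=$(find "$1" -type d -newer "$2" -print -quit)
  [ -n "$newest" ]
}

# Usage (paths taken from the transcript above):
#   buffer_dir_updated_since /var/log/fluentd-buffers /tmp/marker-stuck || exit 1
```

Because creating or deleting a buffer file updates the mtime of the
directory that contains it, this check keeps succeeding even in the
moments when no buffer file exists.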
2019-02-25 11:48:31 +00:00
André Bauer 2bd6d3dc12 use image version 6.6.1
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-25 11:05:52 +01:00
André Bauer 2d15ffc9cc updated to 6.5.2
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-25 10:56:50 +01:00
André Bauer 0c29ea1a2e Update es-statefulset.yaml 2019-02-25 10:55:23 +01:00
André Bauer 53a936c359 Update Makefile 2019-02-25 10:55:23 +01:00
André Bauer 0e44fa6359 updated elasticsearch to 6.5.0 2019-02-25 10:55:23 +01:00
André Bauer fc850b5ecd fixed wording
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-25 10:49:43 +01:00
André Bauer 421fcd8262 added prodution note to readme
Signed-off-by: André Bauer <monotek23@gmail.com>
2019-02-25 10:47:26 +01:00
danielqsj 7d051e1a75 update juju shell 2019-02-24 20:46:20 +08:00
danielqsj 7e655e8666 fix shellcheck in cluster/juju 2019-02-24 20:40:59 +08:00
danielqsj f02a986081 add comments to shell function 2019-02-24 20:35:46 +08:00
SataQiu 9cda80e836 fix shellcheck lint errors in cluster and hack scripts 2019-02-24 11:15:35 +08:00
Kubernetes Prow Robot 1cf8001e53
Merge pull request #74449 from xichengliudui/fix190223
make more of the shell pass lints
2019-02-23 12:52:34 -08:00
Kubernetes Prow Robot 8993fbc543
Merge pull request #74328 from daixiang0/delete-blank
delete all duplicate empty blanks
2019-02-23 01:43:58 -08:00
Peter Hornyack 621df2cddb Disable Windows Defender on Windows test nodes. 2019-02-22 18:35:38 -08:00
Xiang Dai 36065c6dd7 delete all duplicate empty blanks
Signed-off-by: Xiang Dai <764524258@qq.com>
2019-02-23 10:28:04 +08:00
Jeff Grafton 7a938eb541 Create work-around wrappers for pure attribute on go_binary and go_test
This enables cgo when cross-compiling certain tests and binaries to
Linux, while disabling cgo for Windows and Darwin.
2019-02-22 18:08:34 -08:00
Kubernetes Prow Robot 46d7e9c784
Merge pull request #74381 from yujuhong/add-key
GCE: add EventLog registry key for docker
2019-02-22 17:58:44 -08:00
Kubernetes Prow Robot fd7acc3275
Merge pull request #74065 from ixdy/break-up-test-tarball
break up the test tarball
2019-02-22 17:58:23 -08:00
Kubernetes Prow Robot 743f864310
Merge pull request #73819 from coffeepac/move-fluentd-es-images
Move fluentd es images
2019-02-22 17:58:12 -08:00
Peter Hornyack 3efd4ca1dc Enhance/repair detect-node-names() and related env vars for Windows nodes. 2019-02-22 14:56:55 -08:00
Patrick Christopher 1bd45ba6eb review updates 2019-02-22 10:00:10 -08:00
Aaron Crickenberger 2d868025be Promote spiffxp to approver for cluster/
Also add Katharine as reviewer
2019-02-22 09:25:31 -08:00
Kubernetes Prow Robot c7ac532816
Merge pull request #74360 from pjh/gce-windows-cluster-ssh
Enable OpenSSH on Windows nodes in test clusters.
2019-02-21 20:51:23 -08:00
Kubernetes Prow Robot 125dc6c8ea
Merge pull request #74187 from xichengliudui/fixgolint0218
Fix shellcheck lint errors in cluster/addons/fluentd-elasticsearch/fl……uentd-es-image/run.sh
2019-02-21 20:51:13 -08:00
Yu-Ju Hong 40d0ae311c GCE: add EventLog registry key for docker 2019-02-21 17:47:32 -08:00
Kubernetes Prow Robot 042f9ed3af
Merge pull request #74093 from blakebarnett/lower-neg-cache-ttl
Lowers the default nodelocaldns denial cache TTL
2019-02-21 17:47:16 -08:00
Blake 46c299c1b1 Match default cache size of 10000
https://github.com/coredns/coredns/blob/master/plugin/cache/cache.go#L236
This gets rounded down to the nearest multiple of 256: 9984
2019-02-21 15:03:30 -08:00