Commit Graph

487 Commits (1e879c69ecf09ec73fc35ecc014ada41624bd002)

Author SHA1 Message Date
Euan Kemp e023ec90f3 cluster/gce: Update master root disk size
As part of #29213, the hyperkube image will be deployed alongside
existing dependencies.

This ends up just running over the root disk size of 10 during
extraction.
2016-09-06 13:54:39 -07:00
Kubernetes Submit Queue 09e5fa62b5 Merge pull request #31294 from mml/federated-oops
Automatic merge from submit-queue

Fix e2e logic when FEDERATION=false.

Also add debugging output to GCE detect-master.

Fixes #31285.
2016-09-05 13:01:12 -07:00
Amey Deshpande 6a2201f410 Pick a specific GCI version by default on GCE.
Prior to this change, a K8s branch (master as well as release) was
pinned to a GCI milestone.  It would pick up the latest GCI release on
that milestone at the time of cluster creation.  The rationale was the
K8s users would automatically get the bug fixes in newer versions of
GCI.  However in practice, it makes the runtime environment
non-deterministic, and lack of continuous e2e tests mean we would run
into breakages sooner or later.

With this change, each K8s release will pick a specific version
of GCI by default (similar to how the Debian-based container-vm gets used).
Users can override the default version through KUBE_GCE_MASTER_IMAGE and
KUBE_GCE_NODE_IMAGE environment variables.

We expect the default GCI version will be updated relatively frequently stay
updated with newer GCI releases.  We can also automate the process to
automatically bump the hard-coded GCI version in future.
2016-08-31 17:26:00 -07:00
Jerzy Szczepkowski c5f18d517d Marked HA master feautre experimental.
Marked HA master feautre experimental.
2016-08-30 11:50:22 +02:00
Matt Liggett 2ddc08e567 Add debugging output when trying to find the name of the master. 2016-08-23 13:18:22 -07:00
Jerzy Szczepkowski 05a41623b0 Support for creation/removes of master replicas.
HA master: initial support for creation/removal of masters replicas by
kube-up/kube-down scripts for GCE on gci.
2016-08-15 22:36:49 +02:00
Vishnu kannan 2f6514bd63 Node e2e can run tests against GCE images based on a regex.
It can run tests against multiple existing images that match a regex.
GCI images will be using a regex.

Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-08-10 11:24:54 -07:00
Ryan Hitchman 5508e49ef5 Use bash ranges "{1..3}" instead of "$(seq 1 3)". 2016-08-08 16:23:39 -07:00
Jerzy Szczepkowski c8d920ea6c Reverted conversion of influx-db to PetSet.
Reverted conversion of influx-db to PetSet.
2016-08-04 17:08:46 +02:00
Bryan Boreham f5159b49c2 Correct gcloud list arg from '--zone' to '--zones'
`--zone` was accepted as an abbreviation but is now deprecated
and will be disabled entirely in gcloud 132.0.0.
2016-08-01 14:49:38 +01:00
Vishnu kannan bb4e1e62fa Mention updating node e2e config before updating gci image milestone
Signed-off-by: Vishnu kannan <vishnuk@google.com>
2016-07-29 14:26:15 -07:00
Filip Grzadkowski e9585fba91 1. Precompute REGION variable in config
2. Add timeout for waiting for loadbalancer
3. Fix kube-down so that it doesn't delete some resources if there are still masters/nodes in other zones
2016-07-20 17:25:25 +02:00
Filip Grzadkowski 70bb57a3e1 Revert "Merge pull request #29278 from kubernetes/revert-29201-ha_master"
This reverts commit ecebdb5707, reversing
changes made to 976ca09d71.
2016-07-20 16:37:31 +02:00
Filip Grzadkowski 194895740c Revert "Add and delete load balancer in front of apiserver." 2016-07-20 14:02:59 +02:00
Piotr Szczesniak 976ca09d71 Merge pull request #29201 from fgrzadkowski/ha_master
Add and delete load balancer in front of apiserver.
2016-07-20 09:38:19 +02:00
Filip Grzadkowski 5c0f0daae0 Add and delete load balancer in front of apiserver. 2016-07-19 16:44:21 +02:00
k8s-merge-robot c31882a5f2 Merge pull request #29140 from maisem/gce_on_cvm
Automatic merge from submit-queue

Fixing kube-up for CVM masters.

@roberthbailey @zmerlynn
2016-07-18 20:06:56 -07:00
Maisem Ali fe14340051 Fixing kube-up for CVM masters. 2016-07-18 14:21:41 -07:00
Zach Loafman 4c667c38c9 GCE bring-up: Differentiate NODE_TAGS from NODE_INSTANCE_PREFIX
Kubernetes-side fix to #29074 (GKE-side fix necessary as well)
2016-07-18 14:20:45 -07:00
Maisem Ali 97f3f80833 Splitting OS_DISTRIBUTION into NODE_OS_DISTRIBUTION and
MASTER_OS_DISTRIBUTION.
2016-07-15 12:02:31 -07:00
David McMahon ef0c9f0c5b Remove "All rights reserved" from all the headers. 2016-06-29 17:47:36 -07:00
Jerzy Szczepkowski d00cdf75e8 Influxdb migrated to PetSet and PersistentVolumes.
Influxdb migrated to PetSet and PersistentVolumes.
2016-06-27 15:39:09 +02:00
Jerzy Szczepkowski a745bb17e3 Cleanup in kube-down: removed deletion of old autoscaler.
Cleanup in kube-down: removed deletion of old autoscaler.
2016-06-24 13:32:37 +02:00
gmarek 6d201c9c57 kube-down deletes instance templates created by the cluster upgrade 2016-06-22 14:09:37 +02:00
nikhiljindal 864b267a8d Simplifying the code to create kubeconfig 2016-06-20 16:10:47 -07:00
k8s-merge-robot b442ba72e5 Merge pull request #27522 from adityakali/gci53
Automatic merge from submit-queue

make GCI image detection robust

This change makes sure that in case we roll back a released GCI image, the image detection logic picks a correct active image. 

@kubernetes/goog-image @Amey-D @wonderfly @dchen1107
2016-06-20 04:05:45 -07:00
Aditya Kali 4a46c948f2 make GCI image detection robust 2016-06-17 09:56:53 -07:00
Zach Loafman 2d77dc639b gce/kube-down: Parallelize IGM deletion, batch more
Batch up to 200 instance / route deletions, handle
IGM deletion in parallel.
2016-06-13 14:05:43 -07:00
Aditya Kali 4dde9a66f4 version bump for gci to milestone 53
Fixes #26455

GCI release 53 includes kubernetes v1.3.0-alpha.5 with docker-1.11.2.
2016-06-10 15:28:52 -07:00
Marcin Wielgus edf21902b5 Rename ENABLE_NODE_AUTOSCALER to ENABLE_CLUSTER_AUTOSCALER - part 1 2016-06-08 13:35:38 +02:00
Quinton Hoole c578678820 Merge pull request #26914 from nikhiljindal/FedAPIServerSecret
Updating federation up script to create secrets with federation-apiserver and k8s apiservers kubeconfigs
2016-06-07 22:29:39 -07:00
k8s-merge-robot bae931204c Merge pull request #26990 from mwielgus/ca-0.1.2
Automatic merge from submit-queue

Bump cluster autoscaler version and enable scale down by default

Follow up of https://github.com/kubernetes/contrib/pull/1148. 

cc: @piosz @fgrzadkowski @jszczepkowski
2016-06-07 19:42:19 -07:00
nikhiljindal 9443bf0f9c Adding secrets for kubernetes clusters as well 2016-06-07 17:30:15 -07:00
Marcin Wielgus c191c72063 Bump cluster autoscaler version and enable scale down by default 2016-06-07 21:42:56 +02:00
k8s-merge-robot 98c432a943 Merge pull request #25841 from ojarjur/ojarjur/registry-mirror
Automatic merge from submit-queue

GCE: Enable using gcr.io as a Docker registry mirror.

```release-note
Use gcr.io as a Docker registry mirror when setting up a cluster in GCE.
```
[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/.github/PULL_REQUEST_TEMPLATE.md?pixel)]()

This only affects clusters running under GCE.
2016-06-06 14:50:05 -07:00
Aditya Kali f7c2624849 pin GCI version to milestone 52
This is mainly for pinning the 1.2 branch to GCI milestone 52
which contains correct docker and kubelet built in.
Doing this allows us to upgrade docker to v1.11 (issue #26455)
in GCI 53 without breaking the 1.2 release branch.
2016-06-02 13:52:38 -07:00
Daniel Wang a8701baf17 Move the defaults setting of GCI to util.sh
This change recovers some of the side effects of
https://github.com/kubernetes/kubernetes/pull/26197, i.e., keeps the defaults of
`NODE_IMAGE` and `NODE_IMAGE_PROJECT` to `MASTER_IMAGE` and
`MASTER_IMAGE_PROJECT`, for backward compatibility. Although it keeps
`OS_DISTRIBUTION` defaulting to `gci`, the default settings of these vars are
moved to `cluster/gce/util.sh` and conditioned on `OS_DISTRIBUTION==gci`.
2016-06-01 15:37:02 -07:00
Marcin 74ad0def81 Add missing $ in util.sh 2016-05-31 19:41:54 +02:00
Marcin 0b7c2fa6fe Allow to enable scale down in kube-up for gce 2016-05-30 17:07:54 +02:00
Daniel Wang 3d95151971 Replace containervm with GCI as default master image for GCE clusters 2016-05-24 17:28:17 -05:00
Omar Jarjur 338b33f3f4 Enable using gcr.io as a Docker registry mirror.
This only affects clusters running under GCE.
2016-05-24 08:12:48 -07:00
k8s-merge-robot 04d6162e0f Merge pull request #26071 from gmarek/kube-up
Automatic merge from submit-queue

Make node-instance-group base names unique to prevent collisions

We create multiple IGMs for >1000 Node clusters. When we have a conflict on base name IGMs will fight over ownership of the VM that happen to have the name belonging to multiple IGMs.

This change will increase reliability of starting big clusters.

cc @wojtek-t @alex-mohr @roberthbailey @mikedanese
2016-05-24 07:31:22 -07:00
gmarek 7b629a778c Make node-instance-group base names unique to prevent collisions 2016-05-23 14:36:08 +02:00
Andy Zheng f31c4f6d69 Revert "Revert "Add support for running GCI on the GCE cloud provider""
This reverts commit 40f53b1765.
2016-05-23 00:52:08 -07:00
Filip Grzadkowski 2f2b7c8c97 Merge pull request #25734 from jszczepkowski/kac-test
Added enforcing of setting nodes numbers for cluster autoscaler.
2016-05-20 10:28:56 +02:00
Daniel Smith 40f53b1765 Revert "Add support for running GCI on the GCE cloud provider" 2016-05-18 21:31:28 -07:00
Andy Zheng a737e1eba1 Add support for running GCI on the GCE cloud provider 2016-05-18 15:15:05 -07:00
k8s-merge-robot 51203dad7a Merge pull request #25763 from andyzheng0831/gci-image
Automatic merge from submit-queue

GCI: Fix the condition for using the default image

This PR revises the condition for using the default GCI image. The old logic is not convenient for manually run e2e tests in some cases (mainly for GCI team to test custom images). The new logic by this PR is very similar to the logic in using ContainerVM. When setting distro to "gci", if master or node image is unset, we use gci-dev for it. If either is set, we respect it.

@roberthbailey @zmerlynn @dchen1107 please review it, and we should cherry pick it in release-1.2 branch. Thanks!

cc/ @kubernetes/goog-image @adityakali FYI
2016-05-17 22:55:41 -07:00
Zach Loafman 87a8905b2c Fix gce/util.sh:get-master-root-disk-size 2016-05-17 13:43:17 -07:00
Andy Zheng e47f45d1a3 GCI: Fix the condition for using the default image 2016-05-17 10:53:14 -07:00
Jerzy Szczepkowski 858eb0388e Added enforcing of setting nodes numbers for cluster autoscaler.
Added enforcing of setting nodes numbers for cluster autoscaler in kube-up script.
2016-05-17 13:15:49 +02:00
gmarek 64618c3311 Make bigger master root disks in GCE for large clusters 2016-05-16 15:57:56 +02:00
k8s-merge-robot aada051b20 Merge pull request #25357 from mwielgus/ca-salt
Automatic merge from submit-queue

Salt configuration for the new Cluster Autoscaler for GCE

Adds support for cloud autoscaler from contrib/cloud-autoscaler in kube-up.sh GCE script.

cc: @fgrzadkowski @piosz
2016-05-15 11:35:19 -07:00
Marcin Wielgus ac4b380453 Salt configuration for the new Cluster Autoscaler for GCE 2016-05-13 17:04:24 +02:00
Jeff Grafton bc7922c374 Replace awkward bash pipelines with gcloud's --format='value()'
Also remove old `gcloud compute instance-groups managed get-operation`
code. This command doesn't even exist in gcloud anymore.
2016-05-06 15:49:57 -07:00
Jeff Grafton 6cd9dcba60 Use --format='value()' instead of --fields with gcloud 2016-05-04 12:32:13 -07:00
k8s-merge-robot f8196d9048 Merge pull request #24834 from ixdy/node-names
Automatic merge from submit-queue

Fix detect-node-names to not error out if there are no nodes

Fixes #21564.

Teardown was not working correctly in rare cases because `detect-node-names` was failing before any of the actual cleanup was run. I'm pretty sure the issue was that there was an instance group, but no instances in the instance group, so we bailed out when we tried to expand the bash array.

This PR adds a guard so we don't bail if the array is empty.

cc @jlowdermilk @spxtr
2016-04-29 16:37:41 -07:00
Andy Zheng 967fd495e0 GCI: Update the command to get the image 2016-04-29 09:15:27 -07:00
k8s-merge-robot 19408e5575 Merge pull request #24893 from andyzheng0831/gci
Automatic merge from submit-queue

Add support for running clusters on GCI

Google Container-VM Image (GCI) is the next revision of Container-VM. See documentation at https://cloud.google.com/compute/docs/containers/vm-image/. This change adds support for starting a Kubernetes cluster using GCI.

With this change, users can start a kubernetes cluster using the latest kubelet and kubectl release binary built in the GCI image by running:

	$ KUBE_OS_DISTRIBUTION="gci" cluster/kube-up.sh

Or run a testing cluster on GCI by running:

	$ KUBE_OS_DISTRIBUTION="gci" go run hack/e2e.go -v --up

The commands above will choose the latest GCI image by default.
2016-04-27 23:46:26 -07:00
Andy Zheng 93929c3853 Add support for running clusters on GCI 2016-04-27 15:24:21 -07:00
CJ Cullen 3253739505 Use ABAC authz instead of AllowAll. 2016-04-27 11:38:15 -07:00
Jeff Grafton 5487a3f8f6 Fix detect-node-names to not error out if there are no nodes 2016-04-26 17:48:25 -07:00
Jeff Grafton b69b94103c Make detect-node-names less brittle for G{C,K}E 2016-04-25 12:10:25 -07:00
k8s-merge-robot 19a106f61e Merge pull request #24274 from euank/configurable-release-bucket
Automatic merge from submit-queue

jenkins: Allow configuration of release bucket

This allows others to leverage the existing E2E code to test some
patched kube binary by simply overriding the bucket and reusing many of
the existing scripts
2016-04-21 23:09:05 -07:00
Euan Kemp 9f9c54cdd2 jenkins: Allow configuration of release bucket
This allows others to leverage the existing E2E code to test some
patched kube binary by simply overriding the bucket and reusing many of
the existing scripts
2016-04-14 12:26:43 -07:00
CJ Cullen 5a785698aa Strip comments from configure-vm.sh for gce 2016-04-11 15:13:02 -07:00
k8s-merge-robot c6e995a824 Merge pull request #23558 from andyzheng0831/enhance
Automatic merge from submit-queue

Trusty: Regional release .tar.gz support

@zmerlynn and @roberthbailey please review it. This change is to support the feature added in PR #22234. The entire logic is pretty much the same as in #22234, with only few minor changes in implementation.

I had manually run e2e tests with "export RELEASE_REGION_FALLBACK=true" on two clusters: (1) Trusty on master nodes on ContainerVM; (2) Master and nodes all on trusty. All tests are green. I don't figure out a way to simulate regional fallback. But I did test the function download_or_bust() out-of-box.

cc/ @wonderfly @dchen1107 @fabioy FYI.
2016-04-01 03:39:08 -07:00
Andy Zheng a83f11c484 Trusty: Regional release .tar.gz support 2016-03-28 15:21:14 -07:00
Matt Liggett 300855c5bb Stop swallowing stdout and stderr when we check for the staging bucket.
Should help debug #23461
2016-03-25 15:01:21 -07:00
k8s-merge-robot f2d14990a5 Merge pull request #23180 from roberthbailey/dead-code
Auto commit by PR queue bot
2016-03-21 17:30:22 -07:00
Andy Zheng 2ea9ef53f3 Trusty: Support hybrid cluster with nodes on ContainerVM 2016-03-19 22:47:35 -07:00
Robert Bailey c5ca042ede Remove the restart-kube-proxy and restart-apiserver functions
since neither are used any longer.
2016-03-18 01:49:01 -07:00
gmarek c325319ccb Default MIG size to 1000 2016-03-14 12:21:14 +01:00
k8s-merge-robot 31903c8bb1 Merge pull request #22594 from justinsb/aws_ulite
Auto commit by PR queue bot
2016-03-10 10:15:28 -08:00
Joe Finney e660adbad8 Add connect timeout to ssh-to-node. 2016-03-07 13:29:04 -08:00
k8s-merge-robot bc96422e23 Merge pull request #22571 from zmerlynn/regional-registry
Auto commit by PR queue bot
2016-03-06 13:21:19 -08:00
k8s-merge-robot e5a908c898 Merge pull request #22476 from ixdy/cluster-logs
Auto commit by PR queue bot
2016-03-05 23:06:29 -08:00
Justin Santa Barbara 523e1d0d8c Ubernetes-Lite: reuse existing configuration when reusing master
In particular, we need to share the kubelet cert & key, otherwise we
can't connect to the kube-api.

Fix #22593
2016-03-05 17:00:02 -05:00
Zach Loafman 60e1fccc7f Regional registries: Hack the manifests!
Allow the gcr.io/google_containers registry to be overridden
regionally by just blasting a new KUBE_ADDON_REGISTRY out. Instead of
adding every addon to Salt and asking all of the other consumers
(Trusty, Juju, Mesos, etc) to change, just script the sed ourselves.

This is probably the 9th grossest thing I've ever done, but it works
well, and it works quickly. I kind of wish it didn't.
2016-03-04 20:00:54 -08:00
Jeff Grafton d70c516d4f Remove log collection code in cluster/gce/util.sh.
Also update some docs to mention cluster/log-dump.sh.
2016-03-03 15:28:33 -08:00
Robert Bailey 30b78e4d7f Add the GCE cluster name to instance metadata. 2016-03-03 13:57:31 -08:00
Jeff Lowdermilk 73479d5209 Merge pull request #22441 from thockin/flake-22244-gce-static-ip
Check static IP postcondition if op fails
2016-03-03 11:38:50 -08:00
Tim Hockin 6dca871693 Check static IP postcondition if op fails 2016-03-03 08:20:12 -08:00
k8s-merge-robot c9accdfd57 Merge pull request #22234 from zmerlynn/regional-buckets
Auto commit by PR queue bot
2016-03-02 03:01:54 -08:00
Joe Finney 74515a6b23 Fix detect-nodes in cluster/gce/util.sh. 2016-03-01 17:12:43 -08:00
Zach Loafman 5a031bf52c GCE startup: Regional release .tar.gz support
* In kube-up.sh, create a staging bucket with a location nearest the
zone being created. If new variable RELEASE_REGION_FALLBACK is set
(default false), create multiple buckets and stage to fallback
URLs. (In open source, this path is primarily for testing.)

* In configure-vm.sh, split the URL env variables by comma (if any
extra are present) and retry on the fallback URLs. Also factor the
hash checking into this path rather than outside, since a corrupt
release in a particular geo can be retried in a different geo.

* Remove the local already-staged .tar.gz checks. They've caused
several issues along the way, and with this code path become virtually
unmaintainable. (I could add a sentinel for each bucket it's possibly
staged to, but ew.)
2016-03-01 09:59:39 -08:00
Justin Santa Barbara ee58dc22d7 Merge pull request #21673 from gmarek/kube-down
Auto commit by PR queue bot
2016-02-27 19:21:18 -05:00
gmarek 053e882622 kube-down looks for instance templates even if instance group does not exists 2016-02-24 11:15:45 +01:00
Zach Loafman e85cf1bb97 Attempt to delete node-template before retry
I don't think we've seen this commonly, so I'm proposing a simplistic
fix for now.

Fixes #20878
2016-02-22 16:09:15 -08:00
k8s-merge-robot 1ce188e557 Merge pull request #21430 from spxtr/e2e-internal
Auto commit by PR queue bot
2016-02-21 12:08:22 -08:00
k8s-merge-robot d6192dd152 Merge pull request #21342 from spxtr/remove-e2e-test
Auto commit by PR queue bot
2016-02-21 00:39:38 -08:00
Matt Liggett 58283aff67 Also retain docker and kubelet logs.
Still trying to debug #20916
2016-02-18 13:32:18 -08:00
Joe Finney 2b756ca56d Move E2E kube-up call into test-setup from e2e-internal. 2016-02-17 16:49:07 -08:00
Joe Finney 6f7182e500 Remove hack/e2e-test.sh in favor of hack/e2e.go. 2016-02-16 14:54:50 -08:00
Quinton Hoole b4a3559807 Add Jenkins e2e test job for multi-zone Ubernetes Lite 2016-02-16 13:07:17 -08:00
k8s-merge-robot 2299547ca5 Merge pull request #21183 from mml/slow-flake
Auto commit by PR queue bot
2016-02-16 05:12:50 -08:00
k8s-merge-robot b1dedc0900 Merge pull request #21058 from gmarek/fix
Auto commit by PR queue bot
2016-02-15 06:12:27 -08:00
Matt Liggett c25331f251 Save the master logs to separate files rather than dumping to stdout. 2016-02-12 17:30:45 -08:00
Matt Liggett 7a2e649745 Include some debug output when the API server never appears.
Helps with #20916
2016-02-11 16:33:35 -08:00
gmarek aa79f04aed Fix #21027 2016-02-11 22:25:08 +01:00