k3s/DESIGN.md

# Kubernetes Design Overview

- [Overview](#overview)
- [Key Concepts](#key-concepts)
  - [Pods](#pods)
  - [Labels](#labels)
- [The Kubernetes Node](#the-kubernetes-node)
  - [Kubelet](#kubelet)
  - [Kubernetes Proxy](#kubernetes-proxy)
- [The Kubernetes Master](#the-kubernetes-master)
  - [etcd](#etcd)
  - [Kubernetes API Server](#kubernetes-api-server)
  - [Kubernetes Controller Manager Server](#kubernetes-controller-manager-server)
- [Network Model](#network-model)
- [Release Process](#release-process)
- [GCE Cluster Configuration](#gce-cluster-configuration)
  - [Cluster Security](#cluster-security)

## Overview

Kubernetes is a system for managing containerized applications across multiple hosts, providing basic mechanisms for deployment, maintenance, and scaling of applications. Its APIs are intended to serve as the foundation for an open ecosystem of tools, automation systems, and higher-level API layers.

Kubernetes uses [Docker](http://www.docker.io) to package, instantiate, and run containerized applications.

Is Kubernetes, then, a Docker "orchestration" system? Yes and no.

Kubernetes establishes robust declarative primitives for maintaining the desired state requested by the user. We see these primitives as the main value added by Kubernetes. Self-healing mechanisms, such as auto-restarting, re-scheduling, and replicating containers require active controllers, not just imperative orchestration.

Kubernetes is primarily targeted at applications comprised of multiple containers, such as elastic, distributed micro-services. It is also designed to facilitate migration of non-containerized application stacks to Kubernetes. It therefore includes abstractions for grouping containers in both loosely coupled and tightly coupled formations, and provides ways for containers to find and communicate with each other in relatively familiar ways.

Kubernetes enables users to ask a cluster to run a set of containers. The system automatically chooses hosts to run those containers on. While Kubernetes's scheduler is currently very simple, we expect it to grow in sophistication over time. Scheduling is a policy-rich, topology-aware, workload-specific function that significantly impacts availability, performance, and capacity. The scheduler needs to take into account individual and collective resource requirements, quality of service requirements, hardware/software/policy constraints, affinity and anti-affinity specifications, data locality, inter-workload interference, deadlines, and so on. Workload-specific requirements will be exposed through the API as necessary.

Architecturally, we want Kubernetes to be built as a collection of pluggable components and layers, with the ability to use alternative schedulers, storage systems, and distribution mechanisms, and we're evolving its current code in that direction.

Kubernetes is intended to run on multiple cloud providers, as well as on physical hosts.

A single Kubernetes cluster is not intended to span multiple availability zones. Instead, we recommend building a higher-level layer to replicate complete deployments of highly available applications across multiple zones.

Kubernetes is not currently suitable for use by multiple users -- see [Cluster Security](#cluster-security), below.

### Cluster Architecture

A running Kubernetes cluster contains node agents (kubelet) and master components (APIs, scheduler, etc), on top of a distributed storage solution. This diagram shows our desired eventual state, though we're still working on a few things, like making kubelet itself (all our components, really) run within docker, and making the scheduler 100% pluggable.

![Architecture Diagram](/docs/architecture.png?raw=true "Architecture overview")

## Key Concepts

While Docker itself works with individual containers, Kubernetes provides higher-level organizational constructs in support of common cluster-level usage patterns, currently focused on service applications, but which could also be expanded to batch and test workloads in the future.

### Pods

A _pod_ (as in a pod of whales or pea pod) is a relatively tightly coupled group of containers that are scheduled onto the same host. It models an application-specific "virtual host" in a containerized environment. Pods serve as units of scheduling, deployment, and horizontal scaling/replication, share fate, and share some resources, such as storage volumes and IP addresses.

[More details on pods](https://github.com/GoogleCloudPlatform/kubernetes/blob/master/docs/pods.md).

### Labels

Loosely coupled cooperating pods are organized using key/value _labels_.

Individual labels are used to specify identifying metadata, and to convey the semantic purposes/roles of pods of containers. Examples of typical pod label keys include `service`, `environment` (e.g., with values `dev`, `qa`, or `production`), `tier` (e.g., with values `frontend` or `backend`), and `track` (e.g., with values `daily` or `weekly`), but you are free to develop your own conventions.

Via a _label selector_ the user can identify a set of pods. The label selector is the core grouping primitive in Kubernetes. It could be used to identify service replicas or shards, worker pool members, or peers in a distributed application.

Kubernetes currently supports two objects that use label selectors to keep track of their members, `service`s and `replicationController`s:
- `service`: A service is a configuration unit for the [proxies](#kubernetes-proxy) that run on every worker node.  It is named and points to one or more pods.
- `replicationController`: A replication controller takes a template and ensures that there is a specified number of "replicas" of that template running at any one time.  If there are too many, it'll kill some.  If there are too few, it'll start more.

The set of pods that a `service` targets is defined with a label selector. Similarly, the population of pods that a `replicationController` is monitoring is also defined with a label selector. 

For management convenience and consistency, `services` and `replicationControllers` may themselves have labels and would generally carry the labels their corresponding pods have in common.

[More details on labels](https://github.com/GoogleCloudPlatform/kubernetes/blob/master/docs/labels.md).

## The Kubernetes Node

When looking at the architecture of the system, we'll break it down to services that run on the worker node and services that comprise the cluster-level control plane.

The Kubernetes node has the services necessary to run Docker containers and be managed from the master systems.

The Kubernetes node design is an extension of the [Container-optimized Google Compute Engine image](https://developers.google.com/compute/docs/containers/container_vms).  Over time the plan is for these images/nodes to merge and be the same thing used in different ways. It has the services necessary to run Docker containers and be managed from the master systems.

Each node runs Docker, of course.  Docker takes care of the details of downloading images and running containers.

### Kubelet
The second component on the node is called the `kubelet`.  The Kubelet is the logical successor (and rewritten in go) of the [Container Agent](https://github.com/GoogleCloudPlatform/container-agent) that is part of the Compute Engine image.

The Kubelet works in terms of a container manifest.  A container manifest (defined [here](https://developers.google.com/compute/docs/containers/container_vms#container_manifest)) is a YAML file that describes a `pod`.  The Kubelet takes a set of manifests that are provided in various mechanisms and ensures that the containers described in those manifests are started and continue running.

There are 4 ways that a container manifest can be provided to the Kubelet:

* **File** Path passed as a flag on the command line.  This file is rechecked every 20 seconds (configurable with a flag).
* **HTTP endpoint** HTTP endpoint passed as a parameter on the command line.  This endpoint is checked every 20 seconds (also configurable with a flag.)
* **etcd server**  The Kubelet will reach out and do a `watch` on an [etcd](https://github.com/coreos/etcd) server.  The etcd path that is watched is `/registry/hosts/$(hostname -f)`.  As this is a watch, changes are noticed and acted upon very quickly.
* **HTTP server** The kubelet can also listen for HTTP and respond to a simple API (underspec'd currently) to submit a new manifest.

### Kubernetes Proxy

Each node also runs a simple network proxy.  This reflects `services` as defined in the Kubernetes API on each node and can do simple TCP stream forwarding or round robin TCP forwarding across a set of backends.

Service endpoints are currently found through [Docker-links-compatible](https://docs.docker.com/userguide/dockerlinks/) environment variables specifying ports opened by the service proxy. Currently the user must select a unique port to expose the service on on the proxy, as well as the container's port to target.

## The Kubernetes Control Plane

The Kubernetes control plane is split into a set of components, but they all run on a single _master_ node.  These work together to provide an unified view of the cluster.

### etcd

All persistent master state is stored in an instance of `etcd`.  This provides a great way to store configuration data reliably.  With `watch` support, coordinating components can be notified very quickly of changes.

### Kubernetes API Server

This server serves up the main [Kubernetes API](https://github.com/GoogleCloudPlatform/kubernetes/tree/master/api).

It validates and configures data for 3 types of objects: `pod`s, `service`s, and `replicationController`s.

Beyond just servicing REST operations, validating them and storing them in `etcd`, the API Server does two other things:

* Schedules pods to worker nodes.  Right now the scheduler is very simple.
* Synchronize pod information (where they are, what ports they are exposing) with the service configuration.

### Kubernetes Controller Manager Server

The `replicationController` type described above isn't strictly necessary for Kubernetes to be useful.  It is really a service that is layered on top of the simple `pod` API.  To enforce this layering, the logic for the replicationController is actually broken out into another server.  This server watches `etcd` for changes to `replicationController` objects and then uses the public Kubernetes API to implement the replication algorithm.

## Release Process

Right now "building" or "releasing" Kubernetes consists of some scripts (in `release/`) to create a `tar` of the necessary data and then uploading it to Google Cloud Storage.  In the future we will generate Docker images for the bulk of the above described components: [Issue #19](https://github.com/GoogleCloudPlatform/kubernetes/issues/19).

## GCE Cluster Configuration

The scripts and data in the `cluster/` directory automates creating a set of Google Compute Engine VMs and installing all of the Kubernetes components.  There is a single master node and a set of worker (called minion) nodes.

`config-default.sh` has a set of tweakable definitions/parameters for the cluster.

The heavy lifting of configuring the VMs is done by [SaltStack](http://www.saltstack.com/).

The bootstrapping works like this:

1. The `kube-up.sh` script uses the GCE [`startup-script`](https://developers.google.com/compute/docs/howtos/startupscript) mechanism for both the master node and the minion nodes.
  * For the minion, this simply configures and installs SaltStack.  The network range that this minion is assigned is baked into the startup-script for that minion (see [the networking doc](https://github.com/GoogleCloudPlatform/kubernetes/blob/master/docs/networking.md) for more details).
  * For the master, the release files are downloaded from GCS and unpacked.  Various parts (specifically the SaltStack configuration) are installed in the right places.
2. SaltStack then installs the necessary servers on each node.
  * All go code is currently downloaded to each machine and compiled at install time.
  * The custom networking bridge is configured on each minion before Docker is installed.
  * Configuration (like telling the `apiserver` the hostnames of the minions) is dynamically created during the saltstack install.
3. After the VMs are started, the `kube-up.sh` script will call `curl` every 2 seconds until the `apiserver` starts responding.

`kube-down.sh` can be used to tear the entire cluster down.  If you build a new release and want to update your cluster, you can use `kube-push.sh` to update and apply (`highstate` in salt parlance) the salt config.

### Cluster Security

As there is no security currently built into the `apiserver`, the salt configuration will install `nginx`.  `nginx` is configured to serve HTTPS with a self signed certificate.  HTTP basic auth is used from the client to `nginx`.  `nginx` then forwards the request on to the `apiserver` over plain old HTTP.  Because a self signed certificate is used, access to the server should be safe from eavesdropping but is subject to "man in the middle" attacks.  Access via the browser will result in warnings and tools like curl will require an "--insecure" flag.

All communication within the cluster (worker nodes to the master, for instance) occurs on the internal virtual network and should be safe from eavesdropping.

The password is generated randomly as part of the `kube-up.sh` script and stored in `~/.kubernetes_auth`.
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00			`# Kubernetes Design Overview`

TOC to markdown Thanks Joe :) 2014-06-09 23:13:44 +00:00			`- [Overview](#overview)`
Fixed TOC links. 2014-06-18 07:43:00 +00:00			`- [Key Concepts](#key-concepts)`
			`- [Pods](#pods)`
			`- [Labels](#labels)`
TOC to markdown Thanks Joe :) 2014-06-09 23:13:44 +00:00			`- [The Kubernetes Node](#the-kubernetes-node)`
			`- [Kubelet](#kubelet)`
			`- [Kubernetes Proxy](#kubernetes-proxy)`
			`- [The Kubernetes Master](#the-kubernetes-master)`
			`- [etcd](#etcd)`
			`- [Kubernetes API Server](#kubernetes-api-server)`
			`- [Kubernetes Controller Manager Server](#kubernetes-controller-manager-server)`
			`- [Network Model](#network-model)`
			`- [Release Process](#release-process)`
			`- [GCE Cluster Configuration](#gce-cluster-configuration)`
			`- [Cluster Security](#cluster-security)`
Adding a TOC 2014-06-09 22:39:11 +00:00
			`## Overview`

Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			`Kubernetes is a system for managing containerized applications across multiple hosts, providing basic mechanisms for deployment, maintenance, and scaling of applications. Its APIs are intended to serve as the foundation for an open ecosystem of tools, automation systems, and higher-level API layers.`

			`Kubernetes uses [Docker](http://www.docker.io) to package, instantiate, and run containerized applications.`

			`Is Kubernetes, then, a Docker "orchestration" system? Yes and no.`

			`Kubernetes establishes robust declarative primitives for maintaining the desired state requested by the user. We see these primitives as the main value added by Kubernetes. Self-healing mechanisms, such as auto-restarting, re-scheduling, and replicating containers require active controllers, not just imperative orchestration.`

			`Kubernetes is primarily targeted at applications comprised of multiple containers, such as elastic, distributed micro-services. It is also designed to facilitate migration of non-containerized application stacks to Kubernetes. It therefore includes abstractions for grouping containers in both loosely coupled and tightly coupled formations, and provides ways for containers to find and communicate with each other in relatively familiar ways.`

			Kubernetes enables users to ask a cluster to run a set of containers. The system automatically chooses hosts to run those containers on. While Kubernetes's scheduler is currently very simple, we expect it to grow in sophistication over time. Scheduling is a policy-rich, topology-aware, workload-specific function that significantly impacts availability, performance, and capacity. The scheduler needs to take into account individual and collective resource requirements, quality of service requirements, hardware/software/policy constraints, affinity and anti-affinity specifications, data locality, inter-workload interference, deadlines, and so on. Workload-specific requirements will be exposed through the API as necessary.

			`Architecturally, we want Kubernetes to be built as a collection of pluggable components and layers, with the ability to use alternative schedulers, storage systems, and distribution mechanisms, and we're evolving its current code in that direction.`

			`Kubernetes is intended to run on multiple cloud providers, as well as on physical hosts.`

			`A single Kubernetes cluster is not intended to span multiple availability zones. Instead, we recommend building a higher-level layer to replicate complete deployments of highly available applications across multiple zones.`

			`Kubernetes is not currently suitable for use by multiple users -- see [Cluster Security](#cluster-security), below.`
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
Link to architecture diagram 2014-07-19 06:29:58 +00:00			`### Cluster Architecture`

			`A running Kubernetes cluster contains node agents (kubelet) and master components (APIs, scheduler, etc), on top of a distributed storage solution. This diagram shows our desired eventual state, though we're still working on a few things, like making kubelet itself (all our components, really) run within docker, and making the scheduler 100% pluggable.`

			`![Architecture Diagram](/docs/architecture.png?raw=true "Architecture overview")`

Added more motivation for pods and labels, and put them together in a subsection near the top. 2014-06-18 07:40:01 +00:00			`## Key Concepts`
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			`While Docker itself works with individual containers, Kubernetes provides higher-level organizational constructs in support of common cluster-level usage patterns, currently focused on service applications, but which could also be expanded to batch and test workloads in the future.`
Added more motivation for pods and labels, and put them together in a subsection near the top. 2014-06-18 07:40:01 +00:00
			`### Pods`

Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			`A _pod_ (as in a pod of whales or pea pod) is a relatively tightly coupled group of containers that are scheduled onto the same host. It models an application-specific "virtual host" in a containerized environment. Pods serve as units of scheduling, deployment, and horizontal scaling/replication, share fate, and share some resources, such as storage volumes and IP addresses.`
Added more motivation for pods and labels, and put them together in a subsection near the top. 2014-06-18 07:40:01 +00:00
Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			`[More details on pods](https://github.com/GoogleCloudPlatform/kubernetes/blob/master/docs/pods.md).`
Added more motivation for pods and labels, and put them together in a subsection near the top. 2014-06-18 07:40:01 +00:00
			`### Labels`

Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			`Loosely coupled cooperating pods are organized using key/value _labels_.`
Added more motivation for pods and labels, and put them together in a subsection near the top. 2014-06-18 07:40:01 +00:00
Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			Individual labels are used to specify identifying metadata, and to convey the semantic purposes/roles of pods of containers. Examples of typical pod label keys include `service`, `environment` (e.g., with values `dev`, `qa`, or `production`), `tier` (e.g., with values `frontend` or `backend`), and `track` (e.g., with values `daily` or `weekly`), but you are free to develop your own conventions.
Added more motivation for pods and labels, and put them together in a subsection near the top. 2014-06-18 07:40:01 +00:00
Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			`Via a _label selector_ the user can identify a set of pods. The label selector is the core grouping primitive in Kubernetes. It could be used to identify service replicas or shards, worker pool members, or peers in a distributed application.`
Added more motivation for pods and labels, and put them together in a subsection near the top. 2014-06-18 07:40:01 +00:00
Move definitions of service and replicationController to avoid forward references. 2014-07-09 19:31:51 +00:00			Kubernetes currently supports two objects that use label selectors to keep track of their members, `service`s and `replicationController`s:
Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			- `service`: A service is a configuration unit for the [proxies](#kubernetes-proxy) that run on every worker node. It is named and points to one or more pods.
Move definitions of service and replicationController to avoid forward references. 2014-07-09 19:31:51 +00:00			- `replicationController`: A replication controller takes a template and ensures that there is a specified number of "replicas" of that template running at any one time. If there are too many, it'll kill some. If there are too few, it'll start more.

			The set of pods that a `service` targets is defined with a label selector. Similarly, the population of pods that a `replicationController` is monitoring is also defined with a label selector.
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
Implemented jbeda's suggestions, except s/stage/environment/g. 2014-06-18 18:01:51 +00:00			For management convenience and consistency, `services` and `replicationControllers` may themselves have labels and would generally carry the labels their corresponding pods have in common.

Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			`[More details on labels](https://github.com/GoogleCloudPlatform/kubernetes/blob/master/docs/labels.md).`
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
			`## The Kubernetes Node`

Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			`When looking at the architecture of the system, we'll break it down to services that run on the worker node and services that comprise the cluster-level control plane.`
Added more motivation for pods and labels, and put them together in a subsection near the top. 2014-06-18 07:40:01 +00:00
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00			`The Kubernetes node has the services necessary to run Docker containers and be managed from the master systems.`

Linking directly to container VMs page 2014-06-10 16:50:40 +00:00			`The Kubernetes node design is an extension of the [Container-optimized Google Compute Engine image](https://developers.google.com/compute/docs/containers/container_vms). Over time the plan is for these images/nodes to merge and be the same thing used in different ways. It has the services necessary to run Docker containers and be managed from the master systems.`
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
			`Each node runs Docker, of course. Docker takes care of the details of downloading images and running containers.`

			`### Kubelet`
Minor edit rewrite -> rewritten 2014-06-16 15:13:20 +00:00			The second component on the node is called the `kubelet`. The Kubelet is the logical successor (and rewritten in go) of the [Container Agent](https://github.com/GoogleCloudPlatform/container-agent) that is part of the Compute Engine image.
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
Tiny typo fixes 2014-06-09 21:14:43 +00:00			The Kubelet works in terms of a container manifest. A container manifest (defined [here](https://developers.google.com/compute/docs/containers/container_vms#container_manifest)) is a YAML file that describes a `pod`. The Kubelet takes a set of manifests that are provided in various mechanisms and ensures that the containers described in those manifests are started and continue running.
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
			`There are 4 ways that a container manifest can be provided to the Kubelet:`

			`* File Path passed as a flag on the command line. This file is rechecked every 20 seconds (configurable with a flag).`
			`* HTTP endpoint HTTP endpoint passed as a parameter on the command line. This endpoint is checked every 20 seconds (also configurable with a flag.)`
			* etcd server The Kubelet will reach out and do a `watch` on an [etcd](https://github.com/coreos/etcd) server. The etcd path that is watched is `/registry/hosts/$(hostname -f)`. As this is a watch, changes are noticed and acted upon very quickly.
			`* HTTP server The kubelet can also listen for HTTP and respond to a simple API (underspec'd currently) to submit a new manifest.`

			`### Kubernetes Proxy`

			Each node also runs a simple network proxy. This reflects `services` as defined in the Kubernetes API on each node and can do simple TCP stream forwarding or round robin TCP forwarding across a set of backends.

Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			`Service endpoints are currently found through [Docker-links-compatible](https://docs.docker.com/userguide/dockerlinks/) environment variables specifying ports opened by the service proxy. Currently the user must select a unique port to expose the service on on the proxy, as well as the container's port to target.`

			`## The Kubernetes Control Plane`
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			`The Kubernetes control plane is split into a set of components, but they all run on a single _master_ node. These work together to provide an unified view of the cluster.`
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
			`### etcd`

			All persistent master state is stored in an instance of `etcd`. This provides a great way to store configuration data reliably. With `watch` support, coordinating components can be notified very quickly of changes.

			`### Kubernetes API Server`

			`This server serves up the main [Kubernetes API](https://github.com/GoogleCloudPlatform/kubernetes/tree/master/api).`

Move definitions of service and replicationController to avoid forward references. 2014-07-09 19:31:51 +00:00			It validates and configures data for 3 types of objects: `pod`s, `service`s, and `replicationController`s.
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
			Beyond just servicing REST operations, validating them and storing them in `etcd`, the API Server does two other things:

			`* Schedules pods to worker nodes. Right now the scheduler is very simple.`
			`* Synchronize pod information (where they are, what ports they are exposing) with the service configuration.`

			`### Kubernetes Controller Manager Server`

Fix typo in DESIGN.md 2014-06-10 20:58:07 +00:00			The `replicationController` type described above isn't strictly necessary for Kubernetes to be useful. It is really a service that is layered on top of the simple `pod` API. To enforce this layering, the logic for the replicationController is actually broken out into another server. This server watches `etcd` for changes to `replicationController` objects and then uses the public Kubernetes API to implement the replication algorithm.
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
			`## Release Process`

Tiny typo fixes 2014-06-09 21:14:43 +00:00			Right now "building" or "releasing" Kubernetes consists of some scripts (in `release/`) to create a `tar` of the necessary data and then uploading it to Google Cloud Storage. In the future we will generate Docker images for the bulk of the above described components: [Issue #19](https://github.com/GoogleCloudPlatform/kubernetes/issues/19).
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
			`## GCE Cluster Configuration`

			The scripts and data in the `cluster/` directory automates creating a set of Google Compute Engine VMs and installing all of the Kubernetes components. There is a single master node and a set of worker (called minion) nodes.

			`config-default.sh` has a set of tweakable definitions/parameters for the cluster.

			`The heavy lifting of configuring the VMs is done by [SaltStack](http://www.saltstack.com/).`

			`The bootstrapping works like this:`

			1. The `kube-up.sh` script uses the GCE [`startup-script`](https://developers.google.com/compute/docs/howtos/startupscript) mechanism for both the master node and the minion nodes.
Add networking documentation from issue #188. Refactor pod, label, and networking documentation to push details into separate documents. Add some documentation of how to connect to services. 2014-07-16 01:42:02 +00:00			`* For the minion, this simply configures and installs SaltStack. The network range that this minion is assigned is baked into the startup-script for that minion (see [the networking doc](https://github.com/GoogleCloudPlatform/kubernetes/blob/master/docs/networking.md) for more details).`
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00			`* For the master, the release files are downloaded from GCS and unpacked. Various parts (specifically the SaltStack configuration) are installed in the right places.`
			`2. SaltStack then installs the necessary servers on each node.`
			`* All go code is currently downloaded to each machine and compiled at install time.`
			`* The custom networking bridge is configured on each minion before Docker is installed.`
			* Configuration (like telling the `apiserver` the hostnames of the minions) is dynamically created during the saltstack install.
			3. After the VMs are started, the `kube-up.sh` script will call `curl` every 2 seconds until the `apiserver` starts responding.

			`kube-down.sh` can be used to tear the entire cluster down. If you build a new release and want to update your cluster, you can use `kube-push.sh` to update and apply (`highstate` in salt parlance) the salt config.

			`### Cluster Security`

Tiny typos 2014-06-10 16:40:03 +00:00			As there is no security currently built into the `apiserver`, the salt configuration will install `nginx`. `nginx` is configured to serve HTTPS with a self signed certificate. HTTP basic auth is used from the client to `nginx`. `nginx` then forwards the request on to the `apiserver` over plain old HTTP. Because a self signed certificate is used, access to the server should be safe from eavesdropping but is subject to "man in the middle" attacks. Access via the browser will result in warnings and tools like curl will require an "--insecure" flag.
Add warnings about self signed certs and MitM attacks. Also put in pointers for IRC and mailing lists. 2014-06-09 23:46:16 +00:00
			`All communication within the cluster (worker nodes to the master, for instance) occurs on the internal virtual network and should be safe from eavesdropping.`
Add DESIGN.md to document core design. Fixes #5 2014-06-09 06:02:07 +00:00
			The password is generated randomly as part of the `kube-up.sh` script and stored in `~/.kubernetes_auth`.