# DNS in Kubernetes

Kubernetes offers a DNS cluster addon, which most of the supported environments
enable by default. We use [SkyDNS](https://github.com/skynetservices/skydns)
as the DNS server, with some custom logic to slave it to the kubernetes API
server.

## What things get DNS names?

The only objects to which we are assigning DNS names are Services. Every
Kubernetes Service is assigned a virtual IP address which is stable as long as
the Service exists (as compared to Pod IPs, which can change over time due to
crashes or scheduling changes). This maps well to DNS, which has a long
history of clients that, on purpose or by accident, do not respect DNS TTLs
(see the previous remark about Pod IPs changing).

## Where does resolution work?

Kubernetes Service DNS names can be resolved using standard methods
(e.g. [`gethostbyname`](http://linux.die.net/man/3/gethostbyname)) inside any
pod, except pods which have the `hostNet` field set to `true`.
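
For example, a lookup that goes through the standard libc resolver path can be
exercised with `getent`. This is only a sketch: it assumes a pod whose image
ships `getent` (busybox does not) and a Service named `my-svc` in a namespace
named `my-namespace`.

```shell
# Resolve a Service name via the normal libc/NSS path (the gethostbyname route).
kubectl exec <some-pod> -- getent hosts my-svc.my-namespace.svc.cluster.local

# Shorter forms also work, relying on the DNS search suffixes that kubelet
# writes into the pod's /etc/resolv.conf.
kubectl exec <some-pod> -- getent hosts my-svc.my-namespace
```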

## Supported DNS schema

The following sections detail the record types and layout that are supported.
Any other layout or names or queries that happen to work are considered
implementation details and are subject to change without warning.

### A records

"Normal" (not headless) Services are assigned a DNS A record for a name of the
form `my-svc.my-namespace.svc.cluster.local`. This resolves to the cluster IP
of the Service.

"Headless" (without a cluster IP) Services are also assigned a DNS A record for
a name of the form `my-svc.my-namespace.svc.cluster.local`. Unlike normal
Services, this resolves to the set of IPs of the pods selected by the Service.
Clients are expected to consume the set or else use standard round-robin
selection from the set.
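
As a concrete illustration, the difference is visible with a plain `nslookup`
from the test pod described later in this document. The Service names here are
hypothetical: `my-svc` is a normal Service and `my-headless-svc` is a headless
one, both in namespace `my-namespace`.

```shell
# A normal Service resolves to its single, stable cluster IP.
kubectl exec busybox -- nslookup my-svc.my-namespace.svc.cluster.local

# A headless Service resolves to one A record per backing pod, so the same kind
# of query returns several pod IPs instead of one virtual IP.
kubectl exec busybox -- nslookup my-headless-svc.my-namespace.svc.cluster.local
```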

### SRV records

SRV records are created for named ports that are part of normal or headless
Services. For each named port, the SRV record has the form
`_my-port-name._my-port-protocol.my-svc.my-namespace.svc.cluster.local`.
For a regular Service, this resolves to the port number and the CNAME
`my-svc.my-namespace.svc.cluster.local`.
For a headless Service, this resolves to multiple answers, one for each pod
that is backing the Service; each answer contains the port number and a CNAME
of the pod of the form `auto-generated-name.my-svc.my-namespace.svc.cluster.local`.

SRV records always contain the 'svc' segment and are not supported for
old-style names where the 'svc' segment was omitted.
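
For example, a pod whose image includes `dig` (busybox does not) could query
such a record directly. The Service, namespace, and port names below are
hypothetical.

```shell
# Query the SRV record for a named port "http" (protocol TCP) on my-svc.
# Each answer carries a priority, weight, port number, and target name.
kubectl exec <pod-with-dig> -- dig +short _http._tcp.my-svc.my-namespace.svc.cluster.local SRV
```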

### Backwards compatibility

Previous versions of kube-dns made names of the form
`my-svc.my-namespace.cluster.local` (the 'svc' level was added later). For
compatibility, kube-dns supports both names for the time being. Users should
avoid creating a namespace named 'svc', to avoid conflicts. The old name
format is deprecated and will be removed in a future release.
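
To check whether a cluster is still serving the old-style names, you can query
one from the test pod described later in this document. This example uses the
`kubernetes` Service in the `default` namespace and assumes the default
`cluster.local` domain.

```shell
# The deprecated pre-'svc' form. If this resolves, the compatibility path is
# still active; new code should use kubernetes.default.svc.cluster.local instead.
kubectl exec busybox -- nslookup kubernetes.default.cluster.local
```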

## How do I find the DNS server?

The DNS server itself runs as a Kubernetes Service. This gives it a stable IP
address. When you run the SkyDNS service, you want to assign a static IP to use for
the Service. For example, if you assign the DNS Service IP as `10.0.0.10`, you
can configure your kubelet to pass that on to each container as a DNS server.

Of course, giving services a name is just half of the problem - DNS names need a
domain also. This implementation uses a configurable local domain, which can
also be passed to containers by kubelet as a DNS search suffix.
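
To see which IP your cluster is actually using, you can inspect the DNS Service
object itself. The Service name and namespace below are assumptions; use
whatever skydns-svc.yaml.in created in your cluster.

```shell
# Show the DNS Service and its stable IP, which is the address kubelet hands to
# every container as its nameserver.
kubectl get services kube-dns --namespace=<dns-namespace>
```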

## How do I configure it?

The easiest way to use DNS is to use a supported kubernetes cluster setup,
which should have the required logic to read some config variables and plumb
them all the way down to kubelet.

Supported environments offer the following config flags, which are used at
cluster turn-up to create the SkyDNS pods and configure the kubelets. For
example, see `cluster/gce/config-default.sh`.

```shell
ENABLE_CLUSTER_DNS=true
DNS_SERVER_IP="10.0.0.10"
DNS_DOMAIN="cluster.local"
DNS_REPLICAS=1
```

This enables DNS with a DNS Service IP of `10.0.0.10` and a local domain of
`cluster.local`, served by a single copy of SkyDNS.

If you are not using a supported cluster setup, you will have to replicate some
of this yourself. First, each kubelet needs to run with the following flags
set:

```
--cluster_dns=<DNS service ip>
--cluster_domain=<default local domain>
```

Second, you need to start the DNS server ReplicationController and Service. See
the example files ([ReplicationController](skydns-rc.yaml.in) and
[Service](skydns-svc.yaml.in)), but keep in mind that these are templated for
Salt. You will need to replace the `{{ <param> }}` blocks with your own values
for the config variables mentioned above. Other than the templating, these are
normal kubernetes objects, and can be instantiated with `kubectl create`.
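
As a rough sketch (not the exact Salt pipeline), the parameters can be filled
in with `sed` and the results created like any other objects. The `pillar[...]`
tokens and the values shown are illustrative; replace whatever `{{ <param> }}`
blocks actually appear in the checked-in templates.

```shell
# Substitute the templated parameters by hand. The token names here are
# placeholders; match them to the templates you actually have.
sed -e "s/{{ *pillar\['dns_replicas'\] *}}/1/g" \
    -e "s/{{ *pillar\['dns_domain'\] *}}/cluster.local/g" \
    skydns-rc.yaml.in > skydns-rc.yaml
sed -e "s/{{ *pillar\['dns_server'\] *}}/10.0.0.10/g" \
    skydns-svc.yaml.in > skydns-svc.yaml

# Then create them as normal kubernetes objects.
kubectl create -f skydns-rc.yaml
kubectl create -f skydns-svc.yaml
```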

## How do I test if it is working?

First deploy DNS as described above.

### 1 Create a simple Pod to use as a test environment.

Create a file named busybox.yaml with the following contents:

```yaml
apiVersion: v1beta3
kind: Pod
metadata:
  name: busybox
  namespace: default
spec:
  containers:
  - image: busybox
    command:
      - sleep
      - "3600"
    imagePullPolicy: IfNotPresent
    name: busybox
  restartPolicy: Always
```

Then create a pod using this file:

```
kubectl create -f busybox.yaml
```

### 2 Wait for this pod to go into the running state.

You can get its status with:

```
kubectl get pods busybox
```

You should see:

```
NAME      READY     REASON    RESTARTS   AGE
busybox   1/1       Running   0          <some-time>
```

### 3 Validate DNS works

Once that pod is running, you can exec nslookup in that environment:

```
kubectl exec busybox -- nslookup kubernetes
```

You should see something like:

```
Server:    10.0.0.10
Address 1: 10.0.0.10

Name:      kubernetes
Address 1: 10.0.0.1
```

If you see that, DNS is working correctly.
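
If the short name does not resolve, it can also help to try the fully qualified
form, which assumes the default `cluster.local` domain from the configuration
above.

```shell
# Look up the same Service by its fully qualified name. If this works while the
# short form fails, the pod's DNS search path (see "Inheriting DNS from the
# node" below) is the likely culprit.
kubectl exec busybox -- nslookup kubernetes.default.svc.cluster.local
```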

## How does it work?

SkyDNS depends on etcd for what to serve, but it doesn't really need all of
what etcd offers (at least not in the way we use it). For simplicity, we run
etcd and SkyDNS together in a pod, and we do not try to link etcd instances
across replicas. A helper container called [kube2sky](kube2sky/) also runs in
the pod and acts as a bridge between Kubernetes and SkyDNS. It finds the
Kubernetes master through the `kubernetes` service (via environment
variables), pulls service info from the master, and writes that to etcd for
SkyDNS to find.
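
If you are curious, you can peek at what kube2sky has written. This is purely
illustrative: the pod and container names are assumptions about how the
ReplicationController template names things, it assumes `etcdctl` is available
inside the etcd container, and SkyDNS's key layout (one path segment per
reversed domain label under `/skydns`) is an implementation detail that may
change.

```shell
# <dns-pod> and <dns-namespace> stand for the actual name and namespace of the
# pod created from skydns-rc.yaml.in; adjust them for your cluster.
kubectl exec <dns-pod> --namespace=<dns-namespace> -c etcd -- \
    etcdctl ls --recursive /skydns/local/cluster
```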

## Inheriting DNS from the node

When running a pod, kubelet will prepend the cluster DNS server and search
paths to the node's own DNS settings. If the node is able to resolve DNS names
specific to the larger environment, pods should be able to resolve them as
well. See "Known issues" below for a caveat.
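
For a concrete picture, compare the node's `/etc/resolv.conf` with what a pod
sees. The output sketched below is only an illustration of the file's shape,
using the example DNS IP and domain from the configuration section; the node's
own entries will differ per environment.

```shell
# Inspect the resolver configuration that kubelet generated for the pod.
kubectl exec busybox -- cat /etc/resolv.conf

# Illustrative output: the cluster DNS server and cluster search suffixes come
# first, followed by whatever nameserver/search entries the node already had.
#
#   nameserver 10.0.0.10
#   search default.svc.cluster.local svc.cluster.local cluster.local <node-suffixes>
#   nameserver <node-nameserver>
```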

## Known issues

Kubernetes installs do not configure the nodes' resolv.conf files to use the
cluster DNS by default, because that process is inherently distro-specific.
This should probably be implemented eventually.

Linux's libc is impossibly stuck ([see this bug from
2005](https://bugzilla.redhat.com/show_bug.cgi?id=168253)) with limits of just
3 DNS `nameserver` records and 6 DNS `search` records. Kubernetes needs to
consume 1 `nameserver` record and 3 `search` records. This means that if a
local installation already uses 3 `nameserver`s or uses more than 3 `search`es,
some of those settings will be lost. As a partial workaround, the node can run
`dnsmasq`, which will provide more `nameserver` entries, but not more `search`
entries.

## Making changes

Please observe the release process documented in
[RELEASES.md](kube2sky/RELEASES.md) when making changes to the `kube2sky`
image. Any significant change to the YAML template for `kube-dns` should result
in a bump of the version number for the `kube-dns` replication controller as
well as the `version` label. This will permit a rolling update of `kube-dns`.
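
For instance, a version bump could be rolled out with `kubectl rolling-update`.
The controller name and namespace below are hypothetical and depend on the
version suffix used in the template.

```shell
# Replace the pods of the old replication controller with ones from the updated
# template; rolling-update requires the new controller to have a different name
# and a changed label (here, the version label).
kubectl rolling-update kube-dns-v1 --namespace=<dns-namespace> -f skydns-rc.yaml
```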

[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/cluster/addons/dns/README.md?pixel)]()