k3s/docs/user-guide/pod-states.md

<!-- BEGIN MUNGE: UNVERSIONED_WARNING -->

<!-- BEGIN STRIP_FOR_RELEASE -->

<img src="http://kubernetes.io/img/warning.png" alt="WARNING"
     width="25" height="25">
<img src="http://kubernetes.io/img/warning.png" alt="WARNING"
     width="25" height="25">
<img src="http://kubernetes.io/img/warning.png" alt="WARNING"
     width="25" height="25">
<img src="http://kubernetes.io/img/warning.png" alt="WARNING"
     width="25" height="25">
<img src="http://kubernetes.io/img/warning.png" alt="WARNING"
     width="25" height="25">

<h2>PLEASE NOTE: This document applies to the HEAD of the source tree</h2>

If you are using a released version of Kubernetes, you should
refer to the docs that go with that version.

<strong>
The latest 1.0.x release of this document can be found
[here](http://releases.k8s.io/release-1.0/docs/user-guide/pod-states.md).

Documentation for other releases can be found at
[releases.k8s.io](http://releases.k8s.io).
</strong>
--

<!-- END STRIP_FOR_RELEASE -->

<!-- END MUNGE: UNVERSIONED_WARNING -->

# The life of a pod

Updated: 4/14/2015

This document covers the lifecycle of a pod.  It is not an exhaustive document, but an introduction to the topic.

## Pod Phase

As consistent with the overall [API convention](../devel/api-conventions.md#typical-status-properties), phase is a simple, high-level summary of the phase of the lifecycle of a pod. It is not intended to be a comprehensive rollup of observations of container-level or even pod-level conditions or other state, nor is it intended to be a comprehensive state machine.

The number and meanings of `PodPhase` values are tightly guarded.  Other than what is documented here, nothing should be assumed about pods with a given `PodPhase`.

* Pending: The pod has been accepted by the system, but one or more of the container images has not been created.  This includes time before being scheduled as well as time spent downloading images over the network, which could take a while.
* Running: The pod has been bound to a node, and all of the containers have been created.  At least one container is still running, or is in the process of starting or restarting.
* Succeeded: All containers in the pod have terminated in success, and will not be restarted.
* Failed: All containers in the pod have terminated, at least one container has terminated in failure (exited with non-zero exit status or was terminated by the system).
* Unknown: For some reason the state of the pod could not be obtained, typically due to an error in communicating with the host of the pod.

## Pod Conditions

A pod containing containers that specify readiness probes will also report the Ready condition. Condition status values may be `True`, `False`, or `Unknown`.

## Container Probes

A [Probe](https://godoc.org/k8s.io/kubernetes/pkg/api/v1#Probe) is a diagnostic performed periodically by the kubelet on a container. Specifically the diagnostic is one of three [Handlers](https://godoc.org/k8s.io/kubernetes/pkg/api/v1#Handler):

* `ExecAction`: executes a specified command inside the container expecting on success that the command exits with status code 0.
* `TCPSocketAction`: performs a tcp check against the container's IP address on a specified port expecting on success that the port is open.
* `HTTPGetAction`: performs an HTTP Get against the container's IP address on a specified port and path expecting on success that the response has a status code greater than or equal to 200 and less than 400.

Each probe will have one of three results:

* `Success`: indicates that the container passed the diagnostic.
* `Failure`: indicates that the container failed the diagnostic.
* `Unknown`: indicates that the diagnostic failed so no action should be taken.

Currently, the kubelet optionally performs two independent diagnostics on running containers which trigger action:

* `LivenessProbe`: indicates whether the container is *live*, i.e. still running. The LivenessProbe hints to the kubelet when a container is unhealthy. If the LivenessProbe fails, the kubelet will kill the container and the container will be subjected to it's [RestartPolicy](#restartpolicy). The default state of Liveness before the initial delay is `Success`. The state of Liveness for a container when no probe is provided is assumed to be `Success`.
* `ReadinessProbe`: indicates whether the container is *ready* to service requests. If the ReadinessProbe fails, the endpoints controller will remove the pod's IP address from the endpoints of all services that match the pod. Thus, the ReadinessProbe is sometimes useful to signal to the endpoints controller that even though a pod may be running, it should not receive traffic from the proxy (e.g. the container has a long startup time before it starts listening or the container is down for maintenance). The default state of Readiness before the initial delay is `Failure`. The state of Readiness for a container when no probe is provided is assumed to be `Success`.

## Container Statuses

More detailed information about the current (and previous) container statuses can be found in [ContainerStatuses](https://godoc.org/k8s.io/kubernetes/pkg/api/v1#PodStatus). The information reported depends on the current [ContainerState](https://godoc.org/k8s.io/kubernetes/pkg/api/v1#ContainerState), which may be Waiting, Running, or Terminated.

## RestartPolicy

The possible values for RestartPolicy are `Always`, `OnFailure`, or `Never`. If RestartPolicy is not set, the default value is `Always`. RestartPolicy applies to all containers in the pod. RestartPolicy only refers to restarts of the containers by the Kubelet on the same node. Failed containers that are restarted by Kubelet, are restarted with an exponential back-off delay, the delay is in multiples of sync-frequency 0, 1x, 2x, 4x, 8x ... capped at 5 minutes and is reset after 10 minutes of successful execution. As discussed in the [pods document](pods.md#durability-of-pods-or-lack-thereof), once bound to a node, a pod will never be rebound to another node. This means that some kind of controller is necessary in order for a pod to survive node failure, even if just a single pod at a time is desired.

Three types of controllers are currently available:

- Use a [`Job`](jobs.md) for pods which are expected to terminate (e.g. batch computations).
- Use a [`ReplicationController`](replication-controller.md) for pods which are not expected to
  terminate, and where (e.g. web servers).
- Use a [`DaemonSet`](../admin/daemons.md): Use for pods which need to run 1 per machine because they provide a
  machine-specific system service.
If you are unsure whether to use ReplicationController or Daemon, then see [Daemon Set versus
Replication Controller](../admin/daemons.md#daemon-set-versus-replication-controller).

`ReplicationController` is *only* appropriate for pods with `RestartPolicy = Always`.
`Job` is *only* appropriate for pods with `RestartPolicy` equal to `OnFailure` or `Never`.

All 3 types of controllers contain a PodTemplate, which has all the same fields as a Pod.
It is recommended to create the appropriate controller and let it create pods, rather than to
directly create pods yourself.  That is because pods alone are not resilient to machine failures,
but Controllers are.

## Pod lifetime

In general, pods which are created do not disappear until someone destroys them.  This might be a human or a `ReplicationController`, or another controller.  The only exception to this rule is that pods with a `PodPhase` of `Succeeded` or `Failed` for more than some duration (determined by the master) will expire and be automatically reaped.

If a node dies or is disconnected from the rest of the cluster, some entity within the system (call it the NodeController for now) is responsible for applying policy (e.g. a timeout) and marking any pods on the lost node as `Failed`.

## Examples

   * Pod is `Running`, 1 container, container exits success
     * Log completion event
     * If RestartPolicy is:
       * Always: restart container, pod stays `Running`
       * OnFailure: pod becomes `Succeeded`
       * Never: pod becomes `Succeeded`

   * Pod is `Running`, 1 container, container exits failure
     * Log failure event
     * If RestartPolicy is:
       * Always: restart container, pod stays `Running`
       * OnFailure: restart container, pod stays `Running`
       * Never: pod becomes `Failed`

   * Pod is `Running`, 2 containers, container 1 exits failure
     * Log failure event
     * If RestartPolicy is:
       * Always: restart container, pod stays `Running`
       * OnFailure: restart container, pod stays `Running`
       * Never: pod stays `Running`
     * When container 2 exits...
       * Log failure event
       * If RestartPolicy is:
         * Always: restart container, pod stays `Running`
         * OnFailure: restart container, pod stays `Running`
         * Never: pod becomes `Failed`

   * Pod is `Running`, container becomes OOM
     * Container terminates in failure
     * Log OOM event
     * If RestartPolicy is:
       * Always: restart container, pod stays `Running`
       * OnFailure: restart container, pod stays `Running`
       * Never: log failure event, pod becomes `Failed`

   * Pod is `Running`, a disk dies
     * All containers are killed
     * Log appropriate event
     * Pod becomes `Failed`
     * If running under a controller, pod will be recreated elsewhere

   * Pod is `Running`, its node is segmented out
     * NodeController waits for timeout
     * NodeController marks pod `Failed`
     * If running under a controller, pod will be recreated elsewhere


<!-- BEGIN MUNGE: GENERATED_ANALYTICS -->
[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/docs/user-guide/pod-states.md?pixel)]()
<!-- END MUNGE: GENERATED_ANALYTICS -->
Run gendocs and munges 2015-07-12 04:04:52 +00:00			`<!-- BEGIN MUNGE: UNVERSIONED_WARNING -->`

			`<!-- BEGIN STRIP_FOR_RELEASE -->`

Better scary message 2015-07-16 17:02:26 +00:00			`<img src="http://kubernetes.io/img/warning.png" alt="WARNING"`
			`width="25" height="25">`
			`<img src="http://kubernetes.io/img/warning.png" alt="WARNING"`
			`width="25" height="25">`
			`<img src="http://kubernetes.io/img/warning.png" alt="WARNING"`
			`width="25" height="25">`
			`<img src="http://kubernetes.io/img/warning.png" alt="WARNING"`
			`width="25" height="25">`
			`<img src="http://kubernetes.io/img/warning.png" alt="WARNING"`
			`width="25" height="25">`
Run gendocs 2015-07-13 22:15:35 +00:00
Better scary message 2015-07-16 17:02:26 +00:00			`<h2>PLEASE NOTE: This document applies to the HEAD of the source tree</h2>`
Run gendocs and munges 2015-07-12 04:04:52 +00:00
Better scary message 2015-07-16 17:02:26 +00:00			`If you are using a released version of Kubernetes, you should`
			`refer to the docs that go with that version.`
Run gendocs and munges 2015-07-12 04:04:52 +00:00
Better scary message 2015-07-16 17:02:26 +00:00			`<strong>`
			`The latest 1.0.x release of this document can be found`
			`[here](http://releases.k8s.io/release-1.0/docs/user-guide/pod-states.md).`

			`Documentation for other releases can be found at`
			`[releases.k8s.io](http://releases.k8s.io).`
			`</strong>`
			`--`
Run gendocs 2015-07-13 22:15:35 +00:00
Run gendocs and munges 2015-07-12 04:04:52 +00:00			`<!-- END STRIP_FOR_RELEASE -->`

			`<!-- END MUNGE: UNVERSIONED_WARNING -->`
Run gendocs 2015-07-17 22:35:41 +00:00
Add a doc on pod states 2014-09-23 04:35:18 +00:00			`# The life of a pod`

Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			`Updated: 4/14/2015`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			`This document covers the lifecycle of a pod. It is not an exhaustive document, but an introduction to the topic.`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			`## Pod Phase`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Move some docs from docs/ top-level into docs/{admin/,devel/,user-guide/}. 2015-07-15 05:07:44 +00:00			`As consistent with the overall [API convention](../devel/api-conventions.md#typical-status-properties), phase is a simple, high-level summary of the phase of the lifecycle of a pod. It is not intended to be a comprehensive rollup of observations of container-level or even pod-level conditions or other state, nor is it intended to be a comprehensive state machine.`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			The number and meanings of `PodPhase` values are tightly guarded. Other than what is documented here, nothing should be assumed about pods with a given `PodPhase`.
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			`* Pending: The pod has been accepted by the system, but one or more of the container images has not been created. This includes time before being scheduled as well as time spent downloading images over the network, which could take a while.`
			`* Running: The pod has been bound to a node, and all of the containers have been created. At least one container is still running, or is in the process of starting or restarting.`
			`* Succeeded: All containers in the pod have terminated in success, and will not be restarted.`
			`* Failed: All containers in the pod have terminated, at least one container has terminated in failure (exited with non-zero exit status or was terminated by the system).`
Add PodPhase 'Unknown' Add PodPhase 'Unknown' and its corresponding description. 2015-09-08 13:14:30 +00:00			`* Unknown: For some reason the state of the pod could not be obtained, typically due to an error in communicating with the host of the pod.`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			`## Pod Conditions`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			A pod containing containers that specify readiness probes will also report the Ready condition. Condition status values may be `True`, `False`, or `Unknown`.
Add a doc on pod states 2014-09-23 04:35:18 +00:00
document container probes 2015-05-27 17:03:46 +00:00			`## Container Probes`

rewrite links pointing to godoc 2015-08-12 22:27:36 +00:00			`A [Probe](https://godoc.org/k8s.io/kubernetes/pkg/api/v1#Probe) is a diagnostic performed periodically by the kubelet on a container. Specifically the diagnostic is one of three [Handlers](https://godoc.org/k8s.io/kubernetes/pkg/api/v1#Handler):`
document container probes 2015-05-27 17:03:46 +00:00
			* `ExecAction`: executes a specified command inside the container expecting on success that the command exits with status code 0.
small corrections to pod-states doc 2015-06-16 15:12:56 +00:00			* `TCPSocketAction`: performs a tcp check against the container's IP address on a specified port expecting on success that the port is open.
Copy edits for typos 2015-08-09 18:18:06 +00:00			* `HTTPGetAction`: performs an HTTP Get against the container's IP address on a specified port and path expecting on success that the response has a status code greater than or equal to 200 and less than 400.
document container probes 2015-05-27 17:03:46 +00:00
			`Each probe will have one of three results:`

			* `Success`: indicates that the container passed the diagnostic.
			* `Failure`: indicates that the container failed the diagnostic.
			* `Unknown`: indicates that the diagnostic failed so no action should be taken.

			`Currently, the kubelet optionally performs two independent diagnostics on running containers which trigger action:`

small corrections to pod-states doc 2015-06-16 15:12:56 +00:00			* `LivenessProbe`: indicates whether the container is live, i.e. still running. The LivenessProbe hints to the kubelet when a container is unhealthy. If the LivenessProbe fails, the kubelet will kill the container and the container will be subjected to it's [RestartPolicy](#restartpolicy). The default state of Liveness before the initial delay is `Success`. The state of Liveness for a container when no probe is provided is assumed to be `Success`.
			* `ReadinessProbe`: indicates whether the container is ready to service requests. If the ReadinessProbe fails, the endpoints controller will remove the pod's IP address from the endpoints of all services that match the pod. Thus, the ReadinessProbe is sometimes useful to signal to the endpoints controller that even though a pod may be running, it should not receive traffic from the proxy (e.g. the container has a long startup time before it starts listening or the container is down for maintenance). The default state of Readiness before the initial delay is `Failure`. The state of Readiness for a container when no probe is provided is assumed to be `Success`.
document container probes 2015-05-27 17:03:46 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			`## Container Statuses`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
rewrite links pointing to godoc 2015-08-12 22:27:36 +00:00			`More detailed information about the current (and previous) container statuses can be found in [ContainerStatuses](https://godoc.org/k8s.io/kubernetes/pkg/api/v1#PodStatus). The information reported depends on the current [ContainerState](https://godoc.org/k8s.io/kubernetes/pkg/api/v1#ContainerState), which may be Waiting, Running, or Terminated.`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			`## RestartPolicy`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
back off restarts of crashlooping containers Signed-off-by: Sam Abed <samabed@gmail.com> 2015-08-13 12:59:15 +00:00			The possible values for RestartPolicy are `Always`, `OnFailure`, or `Never`. If RestartPolicy is not set, the default value is `Always`. RestartPolicy applies to all containers in the pod. RestartPolicy only refers to restarts of the containers by the Kubelet on the same node. Failed containers that are restarted by Kubelet, are restarted with an exponential back-off delay, the delay is in multiples of sync-frequency 0, 1x, 2x, 4x, 8x ... capped at 5 minutes and is reset after 10 minutes of successful execution. As discussed in the [pods document](pods.md#durability-of-pods-or-lack-thereof), once bound to a node, a pod will never be rebound to another node. This means that some kind of controller is necessary in order for a pod to survive node failure, even if just a single pod at a time is desired.
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Job and DaemonSet documentation. 2015-09-16 00:29:44 +00:00			`Three types of controllers are currently available:`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Job and DaemonSet documentation. 2015-09-16 00:29:44 +00:00			- Use a [`Job`](jobs.md) for pods which are expected to terminate (e.g. batch computations).
			- Use a [`ReplicationController`](replication-controller.md) for pods which are not expected to
			`terminate, and where (e.g. web servers).`
			- Use a [`DaemonSet`](../admin/daemons.md): Use for pods which need to run 1 per machine because they provide a
			`machine-specific system service.`
			`If you are unsure whether to use ReplicationController or Daemon, then see [Daemon Set versus`
			`Replication Controller](../admin/daemons.md#daemon-set-versus-replication-controller).`

			`ReplicationController` is only appropriate for pods with `RestartPolicy = Always`.
			`Job` is only appropriate for pods with `RestartPolicy` equal to `OnFailure` or `Never`.

			`All 3 types of controllers contain a PodTemplate, which has all the same fields as a Pod.`
			`It is recommended to create the appropriate controller and let it create pods, rather than to`
			`directly create pods yourself. That is because pods alone are not resilient to machine failures,`
			`but Controllers are.`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
			`## Pod lifetime`

Job and DaemonSet documentation. 2015-09-16 00:29:44 +00:00			In general, pods which are created do not disappear until someone destroys them. This might be a human or a `ReplicationController`, or another controller. The only exception to this rule is that pods with a `PodPhase` of `Succeeded` or `Failed` for more than some duration (determined by the master) will expire and be automatically reaped.
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			If a node dies or is disconnected from the rest of the cluster, some entity within the system (call it the NodeController for now) is responsible for applying policy (e.g. a timeout) and marking any pods on the lost node as `Failed`.
Add a doc on pod states 2014-09-23 04:35:18 +00:00
			`## Examples`

Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* Pod is `Running`, 1 container, container exits success
Add a doc on pod states 2014-09-23 04:35:18 +00:00			`* Log completion event`
			`* If RestartPolicy is:`
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* Always: restart container, pod stays `Running`
			* OnFailure: pod becomes `Succeeded`
			* Never: pod becomes `Succeeded`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* Pod is `Running`, 1 container, container exits failure
Add a doc on pod states 2014-09-23 04:35:18 +00:00			`* Log failure event`
			`* If RestartPolicy is:`
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* Always: restart container, pod stays `Running`
			* OnFailure: restart container, pod stays `Running`
			* Never: pod becomes `Failed`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* Pod is `Running`, 2 containers, container 1 exits failure
Add a doc on pod states 2014-09-23 04:35:18 +00:00			`* Log failure event`
			`* If RestartPolicy is:`
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* Always: restart container, pod stays `Running`
			* OnFailure: restart container, pod stays `Running`
			* Never: pod stays `Running`
Add a doc on pod states 2014-09-23 04:35:18 +00:00			`* When container 2 exits...`
			`* Log failure event`
			`* If RestartPolicy is:`
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* Always: restart container, pod stays `Running`
			* OnFailure: restart container, pod stays `Running`
			* Never: pod becomes `Failed`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* Pod is `Running`, container becomes OOM
Add a doc on pod states 2014-09-23 04:35:18 +00:00			`* Container terminates in failure`
			`* Log OOM event`
			`* If RestartPolicy is:`
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* Always: restart container, pod stays `Running`
			* OnFailure: restart container, pod stays `Running`
			* Never: log failure event, pod becomes `Failed`
Add a doc on pod states 2014-09-23 04:35:18 +00:00
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* Pod is `Running`, a disk dies
Add a doc on pod states 2014-09-23 04:35:18 +00:00			`* All containers are killed`
			`* Log appropriate event`
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* Pod becomes `Failed`
Add a doc on pod states 2014-09-23 04:35:18 +00:00			`* If running under a controller, pod will be recreated elsewhere`

Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* Pod is `Running`, its node is segmented out
Add a doc on pod states 2014-09-23 04:35:18 +00:00			`* NodeController waits for timeout`
Updated API conventions and other details, per #6133. 2015-04-15 00:39:23 +00:00			* NodeController marks pod `Failed`
Add a doc on pod states 2014-09-23 04:35:18 +00:00			`* If running under a controller, pod will be recreated elsewhere`
Add ga-beacon analytics to gendocs scripts hack/run-gendocs.sh puts ga-beacon analytics link into all md files, hack/verify-gendocs.sh verifies presence of link. 2015-05-14 22:12:45 +00:00

Apply mungedocs changes 2015-07-14 00:13:09 +00:00			`<!-- BEGIN MUNGE: GENERATED_ANALYTICS -->`
automated link fixes 2015-07-14 16:37:37 +00:00			`[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/docs/user-guide/pod-states.md?pixel)]()`
Apply mungedocs changes 2015-07-14 00:13:09 +00:00			`<!-- END MUNGE: GENERATED_ANALYTICS -->`