mirror of https://github.com/k3s-io/k3s
215 lines
8.5 KiB
Markdown
215 lines
8.5 KiB
Markdown
<!-- BEGIN MUNGE: UNVERSIONED_WARNING -->
|
|
|
|
<!-- BEGIN STRIP_FOR_RELEASE -->
|
|
|
|
<img src="http://kubernetes.io/img/warning.png" alt="WARNING"
|
|
width="25" height="25">
|
|
<img src="http://kubernetes.io/img/warning.png" alt="WARNING"
|
|
width="25" height="25">
|
|
<img src="http://kubernetes.io/img/warning.png" alt="WARNING"
|
|
width="25" height="25">
|
|
<img src="http://kubernetes.io/img/warning.png" alt="WARNING"
|
|
width="25" height="25">
|
|
<img src="http://kubernetes.io/img/warning.png" alt="WARNING"
|
|
width="25" height="25">
|
|
|
|
<h2>PLEASE NOTE: This document applies to the HEAD of the source tree</h2>
|
|
|
|
If you are using a released version of Kubernetes, you should
|
|
refer to the docs that go with that version.
|
|
|
|
<!-- TAG RELEASE_LINK, added by the munger automatically -->
|
|
<strong>
|
|
The latest release of this document can be found
|
|
[here](http://releases.k8s.io/release-1.2/docs/design/selector-generation.md).
|
|
|
|
Documentation for other releases can be found at
|
|
[releases.k8s.io](http://releases.k8s.io).
|
|
</strong>
|
|
--
|
|
|
|
<!-- END STRIP_FOR_RELEASE -->
|
|
|
|
<!-- END MUNGE: UNVERSIONED_WARNING -->
|
|
|
|
Design
|
|
=============
|
|
|
|
# Goals
|
|
|
|
Make it really hard to accidentally create a job which has an overlapping
|
|
selector, while still making it possible to chose an arbitrary selector, and
|
|
without adding complex constraint solving to the APIserver.
|
|
|
|
# Use Cases
|
|
|
|
1. user can leave all label and selector fields blank and system will fill in
|
|
reasonable ones: non-overlappingness guaranteed.
|
|
2. user can put on the pod template some labels that are useful to the user,
|
|
without reasoning about non-overlappingness. System adds additional label to
|
|
assure not overlapping.
|
|
3. If user wants to reparent pods to new job (very rare case) and knows what
|
|
they are doing, they can completely disable this behavior and specify explicit
|
|
selector.
|
|
4. If a controller that makes jobs, like scheduled job, wants to use different
|
|
labels, such as the time and date of the run, it can do that.
|
|
5. If User reads v1beta1 documentation or reuses v1beta1 Job definitions and
|
|
just changes the API group, the user should not automatically be allowed to
|
|
specify a selector, since this is very rarely what people want to do and is
|
|
error prone.
|
|
6. If User downloads an existing job definition, e.g. with
|
|
`kubectl get jobs/old -o yaml` and tries to modify and post it, he should not
|
|
create an overlapping job.
|
|
7. If User downloads an existing job definition, e.g. with
|
|
`kubectl get jobs/old -o yaml` and tries to modify and post it, and he
|
|
accidentally copies the uniquifying label from the old one, then he should not
|
|
get an error from a label-key conflict, nor get erratic behavior.
|
|
8. If user reads swagger docs and sees the selector field, he should not be able
|
|
to set it without realizing the risks.
|
|
8. (Deferred requirement:) If user wants to specify a preferred name for the
|
|
non-overlappingness key, they can pick a name.
|
|
|
|
# Proposed changes
|
|
|
|
## API
|
|
|
|
`extensions/v1beta1 Job` remains the same. `batch/v1 Job` changes change as
|
|
follows.
|
|
|
|
Field `job.spec.manualSelector` is added. It controls whether selectors are
|
|
automatically generated. In automatic mode, user cannot make the mistake of
|
|
creating non-unique selectors. In manual mode, certain rare use cases are
|
|
supported.
|
|
|
|
Validation is not changed. A selector must be provided, and it must select the
|
|
pod template.
|
|
|
|
Defaulting changes. Defaulting happens in one of two modes:
|
|
|
|
### Automatic Mode
|
|
|
|
- User does not specify `job.spec.selector`.
|
|
- User is probably unaware of the `job.spec.manualSelector` field and does not
|
|
think about it.
|
|
- User optionally puts labels on pod template (optional). User does not think
|
|
about uniqueness, just labeling for user's own reasons.
|
|
- Defaulting logic sets `job.spec.selector` to
|
|
`matchLabels["controller-uid"]="$UIDOFJOB"`
|
|
- Defaulting logic appends 2 labels to the `.spec.template.metadata.labels`.
|
|
- The first label is controller-uid=$UIDOFJOB.
|
|
- The second label is "job-name=$NAMEOFJOB".
|
|
|
|
### Manual Mode
|
|
|
|
- User means User or Controller for the rest of this list.
|
|
- User does specify `job.spec.selector`.
|
|
- User does specify `job.spec.manualSelector=true`
|
|
- User puts a unique label or label(s) on pod template (required). User does
|
|
think carefully about uniqueness.
|
|
- No defaulting of pod labels or the selector happen.
|
|
|
|
### Rationale
|
|
|
|
UID is better than Name in that:
|
|
- it allows cross-namespace control someday if we need it.
|
|
- it is unique across all kinds. `controller-name=foo` does not ensure
|
|
uniqueness across Kinds `job` vs `replicaSet`. Even `job-name=foo` has a
|
|
problem: you might have a `batch.Job` and a `snazzyjob.io/types.Job` -- the
|
|
latter cannot use label `job-name=foo`, though there is a temptation to do so.
|
|
- it uniquely identifies the controller across time. This prevents the case
|
|
where, for example, someone deletes a job via the REST api or client
|
|
(where cascade=false), leaving pods around. We don't want those to be picked up
|
|
unintentionally. It also prevents the case where a user looks at an old job that
|
|
finished but is not deleted, and tries to select its pods, and gets the wrong
|
|
impression that it is still running.
|
|
|
|
Job name is more user friendly. It is self documenting
|
|
|
|
Commands like `kubectl get pods -l job-name=myjob` should do exactly what is
|
|
wanted 99.9% of the time. Automated control loops should still use the
|
|
controller-uid=label.
|
|
|
|
Using both gets the benefits of both, at the cost of some label verbosity.
|
|
|
|
The field is a `*bool`. Since false is expected to be much more common,
|
|
and since the feature is complex, it is better to leave it unspecified so that
|
|
users looking at a stored pod spec do not need to be aware of this field.
|
|
|
|
### Overriding Unique Labels
|
|
|
|
If user does specify `job.spec.selector` then the user must also specify
|
|
`job.spec.manualSelector`. This ensures the user knows that what he is doing is
|
|
not the normal thing to do.
|
|
|
|
To prevent users from copying the `job.spec.manualSelector` flag from existing
|
|
jobs, it will be optional and default to false, which means when you ask GET and
|
|
existing job back that didn't use this feature, you don't even see the
|
|
`job.spec.manualSelector` flag, so you are not tempted to wonder if you should
|
|
fiddle with it.
|
|
|
|
## Job Controller
|
|
|
|
No changes
|
|
|
|
## Kubectl
|
|
|
|
No required changes. Suggest moving SELECTOR to wide output of `kubectl get
|
|
jobs` since users do not write the selector.
|
|
|
|
## Docs
|
|
|
|
Remove examples that use selector and remove labels from pod templates.
|
|
Recommend `kubectl get jobs -l job-name=name` as the way to find pods of a job.
|
|
|
|
# Conversion
|
|
|
|
The following applies to Job, as well as to other types that adopt this pattern:
|
|
|
|
- Type `extensions/v1beta1` gets a field called `job.spec.autoSelector`.
|
|
- Both the internal type and the `batch/v1` type will get
|
|
`job.spec.manualSelector`.
|
|
- The fields `manualSelector` and `autoSelector` have opposite meanings.
|
|
- Each field defaults to false when unset, and so v1beta1 has a different
|
|
default than v1 and internal. This is intentional: we want new uses to default
|
|
to the less error-prone behavior, and we do not want to change the behavior of
|
|
v1beta1.
|
|
|
|
*Note*: since the internal default is changing, client library consumers that
|
|
create Jobs may need to add "job.spec.manualSelector=true" to keep working, or
|
|
switch to auto selectors.
|
|
|
|
Conversion is as follows:
|
|
- `extensions/__internal` to `extensions/v1beta1`: the value of
|
|
`__internal.Spec.ManualSelector` is defaulted to false if nil, negated,
|
|
defaulted to nil if false, and written `v1beta1.Spec.AutoSelector`.
|
|
- `extensions/v1beta1` to `extensions/__internal`: the value of
|
|
`v1beta1.SpecAutoSelector` is defaulted to false if nil, negated, defaulted to
|
|
nil if false, and written to `__internal.Spec.ManualSelector`.
|
|
|
|
This conversion gives the following properties.
|
|
|
|
1. Users that previously used v1beta1 do not start seeing a new field when they
|
|
get back objects.
|
|
2. Distinction between originally unset versus explicitly set to false is not
|
|
preserved (would have been nice to do so, but requires more complicated
|
|
solution).
|
|
3. Users who only created v1beta1 examples or v1 examples, will not ever see the
|
|
existence of either field.
|
|
4. Since v1beta1 are convertable to/from v1, the storage location (path in etcd)
|
|
does not need to change, allowing scriptable rollforward/rollback.
|
|
|
|
# Future Work
|
|
|
|
Follow this pattern for Deployments, ReplicaSet, DaemonSet when going to v1, if
|
|
it works well for job.
|
|
|
|
Docs will be edited to show examples without a `job.spec.selector`.
|
|
|
|
We probably want as much as possible the same behavior for Job and
|
|
ReplicationController.
|
|
|
|
|
|
<!-- BEGIN MUNGE: GENERATED_ANALYTICS -->
|
|
[![Analytics](https://kubernetes-site.appspot.com/UA-36037335-10/GitHub/docs/design/selector-generation.md?pixel)]()
|
|
<!-- END MUNGE: GENERATED_ANALYTICS -->
|