github/k3s - k3s - https://git.xinac.net

Commit Graph

Author	SHA1	Message	Date
Jan Safranek	282404cbc9	Add Exec interface to VolumeHost This exec should be used by volume plugins to execute mount utilities. It will eventually execute things in mount containers.	2017-08-14 12:16:25 +02:00
Jeff Grafton	a7f49c906d	Use buildozer to delete licenses() rules except under third_party/	2017-08-11 09:32:39 -07:00
Jeff Grafton	33276f06be	Use buildozer to remove deprecated automanaged tags	2017-08-11 09:31:50 -07:00
Hemant Kumar	f4e792ed42	Log attach detach controller skipping pods at higher priority This will help us in tracking down problems related to pods not getting added to desired state of world because of events arriving out of order or some other problem related to that.	2017-07-28 13:23:28 -04:00
supereagle	adc0eef43e	remove duplicated import and wrong alias name of api package	2017-07-25 10:04:25 +08:00
Jacob Simpson	b565f53822	update-bazel.sh	2017-07-17 15:06:08 -07:00
Chao Xu	9d489c8504	manual changes	2017-07-17 15:05:38 -07:00
Jacob Simpson	a765b8cfca	Migrate api.Scheme to scheme.Scheme	2017-07-17 15:05:38 -07:00
Jacob Simpson	29c1b81d4c	Scripted migration from clientset_generated to client-go.	2017-07-17 15:05:37 -07:00
Alexander Block	61275ad8d4	Fix flaky test Test_Run_OneVolumeAttachAndDetachMultipleNodesWithReadWriteMany Only relying on the NewAttacher/Detacher call counts is not enough as they happen in parallel to the testing/verification code and thus the actual attaching/detaching may not be done yet, resulting in flaky test results. Fixes #46244	2017-07-11 18:21:50 +02:00
Kubernetes Submit Queue	c662e1d7d8	Merge pull request #46949 from xingzhou/typo Automatic merge from submit-queue Fixed a comment typo Typo fix Fixed #48414 Release note: ``` None ```	2017-07-03 11:33:36 -07:00
Kubernetes Submit Queue	d19a2841e3	Merge pull request #47645 from jsafrane/integration-test-speedup Automatic merge from submit-queue (batch tested with PRs 48139, 48042, 47645, 48054, 48003) Speed up attach/detach controller integration tests Internal attach/detach controller timers should be configurable and tests should use much shorter values. `reconcilerSyncDuration` is deliberately left out of `TimerConfig` because it's the only one that's not a constant one, it's configurable by user. Fixes #47129 Before: ``` --- PASS: TestPodDeletionWithDswp (63.21s) --- PASS: TestPodUpdateWithWithADC (13.68s) --- PASS: TestPodUpdateWithKeepTerminatedPodVolumes (13.55s) --- PASS: TestPodAddedByDswp (183.01s) --- PASS: TestPersistentVolumeRecycler (12.55s) --- PASS: TestPersistentVolumeDeleter (12.54s) --- PASS: TestPersistentVolumeBindRace (3.51s) --- PASS: TestPersistentVolumeClaimLabelSelector (12.50s) --- PASS: TestPersistentVolumeClaimLabelSelectorMatchExpressions (12.54s) --- PASS: TestPersistentVolumeMultiPVs (3.05s) --- PASS: TestPersistentVolumeMultiPVsPVCs (4.36s) --- PASS: TestPersistentVolumeControllerStartup (7.29s) --- PASS: TestPersistentVolumeProvisionMultiPVCs (5.02s) --- PASS: TestPersistentVolumeMultiPVsDiffAccessModes (12.48s) ok k8s.io/kubernetes/test/integration/volume 359.727s ``` After: ``` --- PASS: TestPodDeletionWithDswp (3.71s) --- PASS: TestPodUpdateWithWithADC (3.63s) --- PASS: TestPodUpdateWithKeepTerminatedPodVolumes (3.70s) --- PASS: TestPodAddedByDswp (5.68s) --- PASS: TestPersistentVolumeRecycler (12.54s) --- PASS: TestPersistentVolumeDeleter (12.55s) --- PASS: TestPersistentVolumeBindRace (3.55s) --- PASS: TestPersistentVolumeClaimLabelSelector (12.50s) --- PASS: TestPersistentVolumeClaimLabelSelectorMatchExpressions (12.52s) --- PASS: TestPersistentVolumeMultiPVs (3.98s) --- PASS: TestPersistentVolumeMultiPVsPVCs (3.85s) --- PASS: TestPersistentVolumeControllerStartup (7.18s) --- PASS: TestPersistentVolumeProvisionMultiPVCs (5.23s) --- PASS: TestPersistentVolumeMultiPVsDiffAccessModes (12.48s) ok k8s.io/kubernetes/test/integration/volume 103.267s ``` PV controller tests are the slowest ones now. @kubernetes/sig-storage-pr-reviews /assign @gnufied ```release-note NONE ```	2017-06-27 14:08:17 -07:00
Kubernetes Submit Queue	18362beb0d	Merge pull request #42254 from justinsb/volumes_dont_leak_nodestatusupdateneeded Automatic merge from submit-queue volumes: SetNodeStatusUpdateNeeded on error If an error happened during the UpdateNodeStatuses loop, there were some code paths where we would not call SetNodeStatusUpdateNeeded, leaking the state. Add it to all paths by adding a function. Part of #40583 ```release-note NONE ```	2017-06-22 21:43:04 -07:00
Chao Xu	60604f8818	run hack/update-all	2017-06-22 11:31:03 -07:00
Chao Xu	f2d3220a11	run root-rewrite-import-client-go-api-types	2017-06-22 11:30:59 -07:00
Chao Xu	f4989a45a5	run root-rewrite-v1-..., compile	2017-06-22 10:25:57 -07:00
Kubernetes Submit Queue	d0a2beb1e7	Merge pull request #42249 from justinsb/volumes_logging Automatic merge from submit-queue (batch tested with PRs 42252, 42251, 42249, 47512, 47887) volumes: Add logging when removing node fails Part of #40583 ```release-note NONE ```	2017-06-21 22:13:30 -07:00
Kubernetes Submit Queue	b795ec7de0	Merge pull request #42251 from justinsb/simplify_append Automatic merge from submit-queue (batch tested with PRs 42252, 42251, 42249, 47512, 47887) volumes: simplify append-to-slice code Minor simplification - can append to empty/nil slice. Part of #40583 ```release-note NONE ```	2017-06-21 22:13:27 -07:00
Kubernetes Submit Queue	bebe346d5f	Merge pull request #42252 from justinsb/volumes_raise_loglevels Automatic merge from submit-queue (batch tested with PRs 42252, 42251, 42249, 47512, 47887) volumes: promote some logs from info -> warning Part of #40583 ```release-note NONE ```	2017-06-21 22:13:24 -07:00
Kubernetes Submit Queue	2df2247a82	Merge pull request #42250 from justinsb/volumes_getnodeandvolume_comment Automatic merge from submit-queue volumes: add comment on getNodeAndVolume Add comments on getNodeAndVolume to explain the code - it is a little subtle, and it confused me on first reading. Part of #40583 ```release-note NONE ```	2017-06-20 15:07:47 -07:00
Jan Safranek	b28790a63b	Speed up attach/detach controller integration tests Internal attach/detach controller timers should be configurable and tests should use much shorter values. reconcilerSyncDuration is deliberately left out of TimerConfig because it's the only one that's not a constant one, it's configurable by user.	2017-06-16 12:15:04 +02:00
Xing Zhou	750d0d8730	Fixed a comment typo	2017-06-05 10:47:59 +08:00
Justin Santa Barbara	d420531f95	volumes: SetNodeStatusUpdateNeeded on error If an error happened during the UpdateNodeStatuses loop, there were some code paths where we would not call SetNodeStatusUpdateNeeded, leaking the state. Add it to all paths by adding a function. Part of #40583	2017-06-01 00:32:20 -04:00
Shyam Jeedigunta	4425864707	Migrate kubelet configmap management logic to an interface	2017-05-31 10:39:36 +02:00
Kubernetes Submit Queue	0aad9d30e3	Merge pull request #44897 from msau42/local-storage-plugin Automatic merge from submit-queue (batch tested with PRs 46076, 43879, 44897, 46556, 46654) Local storage plugin What this PR does / why we need it: Volume plugin implementation for local persistent volumes. Scheduler predicate will direct already-bound PVCs to the node that the local PV is at. PVC binding still happens independently. Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): Part of #43640 Release note: ``` Alpha feature: Local volume plugin allows local directories to be created and consumed as a Persistent Volume. These volumes have node affinity and pods will only be scheduled to the node that the volume is at. ```	2017-05-30 23:20:02 -07:00
Kubernetes Submit Queue	c34b359bd7	Merge pull request #45923 from verult/cxing/NodeStatusUpdaterFix Automatic merge from submit-queue (batch tested with PRs 46383, 45645, 45923, 44884, 46294) Node status updater now deletes the node entry in attach updates... … when node is missing in NodeInformer cache. - Added RemoveNodeFromAttachUpdates as part of node status updater operations. What this PR does / why we need it: Fixes issue of unnecessary node status updates when node is deleted. Which issue this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged): fixes #42438 Special notes for your reviewer: Unit tested added, but a more comprehensive test involving the attach detach controller requires certain testing functionality that is currently absent, and will require larger effort. Will be added at a later time. There is an edge case caused by the following steps: 1) A node is deleted and restarted. The node exists, but is not yet recognized by Kubernetes. 2) A pod requiring a volume attach with nodeName specifically set to this node. This would make the pod stuck in ContainerCreating state. This is low-pri since it's a specific edge case that can be avoided. Release note: ```release-note NONE ```	2017-05-26 12:58:02 -07:00
Cheng Xing	f9dc2d5ca3	Node status updater now deletes the node entry in attach updates when node is missing in NodeInformer cache. Fixes #42438 . - Added RemoveNodeFromAttachUpdates as part of node status updater operations.	2017-05-24 18:31:47 -07:00
NickrenREN	add091b1fb	fix regression in UX experience for double attach volume send event when volume is not allowed to multi-attach	2017-05-25 09:27:24 +08:00
Justin Santa Barbara	35be997c2f	volumes: promote some logs from info -> warning Part of #40583	2017-05-23 22:36:42 -04:00
Michelle Au	6ade5461ad	Add GetNodeLabels to VolumeHost interface	2017-05-22 14:44:06 -07:00
Alexander Block	06baeb33b2	Don't try to attach volumes which are already attached to other nodes	2017-05-18 06:56:30 +02:00
Kubernetes Submit Queue	6dbe853e29	Merge pull request #45544 from ianchakeres/reconciler-err-cleanup Automatic merge from submit-queue (batch tested with PRs 45990, 45544, 45745, 45742, 45678) Refactor reconciler volume log and error messages What this PR does / why we need it: Utilizes volume-specific error and log messages introduced in #44969, inside files that also log volume information. Specifically: - pkg/kubelet/volumemanager/reconciler/reconciler.go, - pkg/controller/volume/attachdetach/reconciler/reconciler.go, and - pkg/kubelet/volumemanager/populator/desired_state_of_world_populator.go Which issue this PR fixes : fixes #40905 Special notes for your reviewer: Release note: ```release-note ``` NONE	2017-05-17 18:40:51 -07:00
Ian Chakeres	b1315f4491	Refactor reconciler volume log and error messages	2017-05-11 22:33:17 -07:00
Hemant Kumar	951a36aac7	Add Keepterminatedpodvolumes as an annotation on the node and make sure that the controller respects it and doesn't detach mounted volumes.	2017-05-11 22:31:14 -04:00
Hemant Kumar	9a1a9cbe08	detach the volume when pod is terminated Make sure volume is detached when pod is terminated because of any reason and not deleted from api server.	2017-05-11 22:18:22 -04:00
NickrenREN	0861688237	add and clear err message in RemoveVolumeFromReportAsAttached	2017-05-08 09:37:21 +08:00
Tomas Smetana	3064fe4d39	Fix issue #44757 : Flaky Test_AttachDetachControllerRecovery	2017-04-21 12:43:54 +02:00
Tomas Smetana	852c44ae59	Fix issue #34242 : Attach/detach should recover from a crash When the attach/detach controller crashes and a pod with attached PV is deleted afterwards the controller will never detach the pod's attached volumes. To prevent this the controller should try to recover the state from the nodes status.	2017-04-20 13:04:50 +02:00
NickrenREN	5cafb9042b	find and add active pods for dswp loops through the list of active pods and ensures that each one exists in the desired state of the world cache	2017-04-18 11:21:37 +08:00
Matthew Wong	e1ce33d944	WaitForCacheSync before running attachdetach controller	2017-04-17 14:02:33 -04:00
Mike Danese	a05c3c0efd	autogenerated	2017-04-14 10:40:57 -07:00
Andy Goldstein	e63fcf708d	Make controller Run methods consistent - startup/shutdown logging - wait for cache sync logging - defer utilruntime.HandleCrash() - wait for stop channel before exiting	2017-04-14 07:27:45 -04:00
Tomas Smetana	6898bc60ce	Attach/detach controller: fix potential race in constructor	2017-03-17 13:34:53 +01:00
Hemant Kumar	786da1de12	Implement bulk polling of volumes This implements bulk volume polling using ideas presented by justin in https://github.com/kubernetes/kubernetes/pull/39564 But it changes the implementation to use an interface and doesn't affect other implementations.	2017-03-02 14:59:59 -05:00
Justin Santa Barbara	1d357b334f	volumes: simplify append-to-slice code	2017-02-28 10:37:28 -05:00
Justin Santa Barbara	0ee71ef214	volumes: add comment on getNodeAndVolume Add comments on getNodeAndVolume to explain the code - it is a little subtle, and it confused me on first reading.	2017-02-28 10:30:10 -05:00
Justin Santa Barbara	b7edfda828	volumes: Add logging when removing node fails	2017-02-28 10:17:33 -05:00
deads2k	fd34b11e13	react to informer updates	2017-02-13 09:18:32 -05:00
deads2k	a86fabb9d2	regenerate informers	2017-02-13 07:59:34 -05:00
Andy Goldstein	70c6087600	Replace hand-written informers with generated ones Replace existing uses of hand-written informers with generated ones. Follow-up commits will switch the use of one-off informers to shared informers.	2017-02-06 13:49:27 -05:00
deads2k	8a12000402	move client/record	2017-01-31 19:14:13 -05:00
Kubernetes Submit Queue	88890f586c	Merge pull request #40126 from resouer/return-value Automatic merge from submit-queue (batch tested with PRs 40126, 40565, 38777, 40564, 40572) Do not swallow error in asw.updateNodeStatusUpdateNeeded Ref #39056 Bubble the error up to `SetNodeUpdateStatusNeeded` and log it out. NOTE: This does not modify interface of `SetNodeUpdateStatusNeeded`	2017-01-27 01:34:16 -08:00
deads2k	9488e2ba30	move testing/core to client-go	2017-01-26 13:54:40 -05:00
Dr. Stefan Schimanski	a0137e9b28	Update generated files	2017-01-25 19:49:45 +01:00
Dr. Stefan Schimanski	d7eb3b6870	pkg/util: move uuid and strategicpatch into k8s.io/apimachinery	2017-01-25 19:45:09 +01:00
Harry Zhang	70941f65bf	Do not swallow error in volume	2017-01-25 21:29:48 +08:00
deads2k	b0b156b381	make tools/cache authoritative	2017-01-25 08:29:45 -05:00
Wojciech Tyczynski	bf7138652f	SecretVolume using secret manager	2017-01-23 16:10:01 +01:00
Wojciech Tyczynski	d08abdb187	Allow for returning map[string]interface{} from patch.	2017-01-18 11:53:30 +01:00
Clayton Coleman	9a2a50cda7	refactor: use metav1.ObjectMeta in other types	2017-01-17 16:17:19 -05:00
Kubernetes Submit Queue	f74b4bbbad	Merge pull request #38094 from yarntime/fix_update_typo Automatic merge from submit-queue fix typos fix typos.	2017-01-16 18:22:33 -08:00
deads2k	6a4d5cd7cc	start the apimachinery repo	2017-01-11 09:09:48 -05:00
yarntime@163.com	f7c737e8a9	fix typos	2017-01-11 16:08:20 +08:00
Kubernetes Submit Queue	7c3fff1a95	Merge pull request #39551 from chrislovecnm/reconciler-time-increases Automatic merge from submit-queue (batch tested with PRs 39628, 39551, 38746, 38352, 39607) Increasing times on reconciling volumes fixing impact to AWS. #What this PR does / why we need it: We are currently blocked by API timeouts with PV volumes. See https://github.com/kubernetes/kubernetes/issues/39526. This is a workaround, not a fix. Special notes for your reviewer: A second PR will be dropped with CLI cobra options in it, but we are starting with increasing the reconciliation periods. I am dropping this without major testing and will test on our AWS account. Will be marked WIP until I run smoke tests. Release note: ```release-note Provide kubernetes-controller-manager flags to control volume attach/detach reconciler sync. The duration of the syncs can be controlled, and the syncs can be shut off as well. ```	2017-01-10 11:54:15 -08:00
chrislovecnm	ac49139c9f	updates from review	2017-01-09 17:20:19 -07:00
chrislovecnm	a973c38c7d	The capability to control duration via controller-manager flags, and the option to shut off reconciliation.	2017-01-09 16:47:13 -07:00
NickrenREN	639572ac68	fix redundant alias and remove unused function	2017-01-09 17:13:09 +08:00
Jeff Grafton	20d221f75c	Enable auto-generating sources rules	2017-01-05 14:14:13 -08:00
Mike Danese	161c391f44	autogenerated	2016-12-29 13:04:10 -08:00
rkouj	e7e3c55ad7	Add unit tests for MountVolume() of operation executor	2016-12-27 16:07:06 -08:00
rkouj	d5f7610b82	Refactor operation_executor to make it unit testable	2016-12-27 15:12:16 -08:00
Chao Xu	03d8820edc	rename /release_1_5 to /clientset	2016-12-14 12:39:48 -08:00
Kubernetes Submit Queue	8abbedae54	Merge pull request #38315 from mikedanese/pin-gazel Automatic merge from submit-queue Pin gazel to a version and support cgo This fixes the bazel build. @krousey who is buildcop	2016-12-12 19:32:29 -08:00
Kubernetes Submit Queue	f45e918b8b	Merge pull request #35833 from apelisse/owners-pkg-controller Automatic merge from submit-queue Curating Owners: pkg/controller cc @jsafrane @mikedanese @bprashanth @derekwaynecarr @thockin @saad-ali In an effort to expand the existing pool of reviewers and establish a two-tiered review process (first someone lgtms and then someone experienced in the project approves), we are adding new reviewers to existing owners files. ## If You Care About the Process: We did this by algorithmically figuring out who’s contributed code to the project and in what directories. Unfortunately, that doesn’t work perfectly: people that have made mechanical code changes (e.g change the copyright header across all directories) end up as reviewers in lots of places. Instead of using pure commit data, we generated an excessively large list of reviewers and pruned based on all time commit data, recent commit data and review data (number of PRs commented on). At this point we have a decent list of reviewers, but it needs one last pass for fine tuning. ## TLDR: As an owner of a sig/directory and a leader of the project, here’s what we need from you: 1. Use PR https://github.com/kubernetes/kubernetes/pull/35715 as an example. 2. The pull-request is made editable, please edit the OWNERS file to add the names of people that should be reviewing code in the future in the reviewers section. You probably do NOT need to modify the approvers section. 3. Notify me if you want some OWNERS file to be removed. Being an approver or reviewer of a parent directory makes you a reviewer/approver of the subdirectories too, so not all OWNERS files may be necessary. 4. Please use ALIAS if you want to use the same list of people over and over again (don't hesitate to ask me for help, or use the pull-request above as an example)	2016-12-12 18:51:33 -08:00
Mike Danese	c87de85347	autoupdate BUILD files	2016-12-12 13:30:07 -08:00
Kubernetes Submit Queue	43233caaf0	Merge pull request #37871 from Random-Liu/use-patch-in-kubelet Automatic merge from submit-queue (batch tested with PRs 36692, 37871) Use PatchStatus to update node status in kubelet. Fixes https://github.com/kubernetes/kubernetes/issues/37771. This PR changes kubelet to update node status with `PatchStatus`. @caesarxuchao @ymqytw told me that there is a limitation of current `CreateTwoWayMergePatch`, it doesn't support primitive type slice which uses strategic merge. * I checked the node status, the only primitive type slices in NodeStatus are as follows, they are not using strategic merge: * [`ContainerImage.Names`](https://github.com/kubernetes/kubernetes/blob/master/pkg/api/v1/types.go#L2963) * [`VolumesInUse`](https://github.com/kubernetes/kubernetes/blob/master/pkg/api/v1/types.go#L2909) * Volume package is already [using `CreateStrategicMergePath` to generate node status update patch](https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/volume/attachdetach/statusupdater/node_status_updater.go#L111), and till now everything is fine. @yujuhong @dchen1107 /cc @kubernetes/sig-node	2016-12-09 11:29:11 -08:00
Random-Liu	beba1ebbf8	Use PatchStatus to update node status in kubelet.	2016-12-08 17:13:59 -08:00
Jordan Liggitt	6819706adf	Pass addressable values to DeepCopy	2016-12-08 14:16:01 -05:00
Hemant Kumar	fcf5d79be7	Add integration tests for desired state of world populator This adds tests for code introduced here : https://github.com/kubernetes/kubernetes/issues/26994 Via integration test we can now verify that if a pod delete event is somehow missed by the AttachDetach controller, the pod still gets cleaned up by the Desired State of World populator.	2016-12-06 06:52:52 -05:00
Kubernetes Submit Queue	fb7e9d901d	Merge pull request #37939 from yarntime/fix_typo_in_node_status_updater Automatic merge from submit-queue (batch tested with PRs 37997, 37939, 37990, 36700, 37258) fix typo in node_status_updater fix typo.	2016-12-02 19:26:47 -08:00
Kubernetes Submit Queue	c552f8918b	Merge pull request #37727 from rkouj/bug-fix-upgrade-test Automatic merge from submit-queue SetNodeUpdateStatusNeeded whenever nodeAdd event is received What this PR does / why we need it: Bug fix and SetNodeStatusUpdateNeeded for a node whenever its api object is added. This is to ensure that we don't lose the attached list of volumes in the node when its api object is deleted and recreated. fixes https://github.com/kubernetes/kubernetes/issues/37586 https://github.com/kubernetes/kubernetes/issues/37585 Special notes for your reviewer: <!-- Steps to write your release note: 1. Use the release-note-* labels to set the release note state (if you have access) 2. Enter your extended release note in the below block; leaving it blank means using the PR title as the release note. If no release note is required, just write `NONE`. -->	2016-12-02 05:44:57 -08:00
yarntime@163.com	df6e9db9d9	fix typo	2016-12-02 17:33:45 +08:00
rkouj	638ef1b977	SetNodeUpdateStatusNeeded whenever nodeAdd event is received	2016-11-30 21:12:34 -08:00
deads2k	d973158a4e	make controller manager use specified stop channel	2016-11-28 15:02:21 -05:00
Chao Xu	bcc783c594	run hack/update-all.sh	2016-11-23 15:53:09 -08:00
Chao Xu	7eeb71f698	cmd/kube-controller-manager	2016-11-23 15:53:09 -08:00
ymqytw	3cc294b1e0	Revert "support patch list of primitives" This reverts commit `34891ad9f6`.	2016-11-22 21:06:36 -08:00
ymqytw	d248843b65	Revert "try old patch after new patch fails" This reverts commit `f32696e734`.	2016-11-22 21:02:30 -08:00
Kubernetes Submit Queue	b9d2d74a94	Merge pull request #37038 from ymqytw/retry_old_patch_after_new_patch_fail Automatic merge from submit-queue Fix kubectl Strategic Merge Patch compatibility As @smarterclayton pointed out in [comment1](https://github.com/kubernetes/kubernetes/pull/35647#pullrequestreview-8290820) and [comment2](https://github.com/kubernetes/kubernetes/pull/35647#pullrequestreview-8290847) in PR #35647, we cannot assume that the API servers publish their version and that they share the same version. This PR removes all the calls of GetServerSupportedSMPatchVersion(). Change the behavior of `apply` and `edit` to: Retry with the old patch version if the new version fails. Default other usage of SMPatch to the new version, since they don't update list of primitives. fixes #36916 cc: @pwittrock @smarterclayton	2016-11-19 01:02:47 -08:00
ymqytw	f32696e734	try old patch after new patch fails	2016-11-17 14:28:09 -08:00
Jing Xu	3d3e44e77e	fix issue in converting aws volume id from mount paths This PR is to fix the issue in converting aws volume id from mount paths. Currently there are three aws volume id formats supported. The following lists examples of those three formats and their corresponding global mount paths: 1. aws:///vol-123456 (/var/lib/kubelet/plugins/kubernetes.io/aws-ebs/mounts/aws/vol-123456) 2. aws://us-east-1/vol-123456 (/var/lib/kubelet/plugins/kubernetes.io/mounts/aws/us-east-1/vol-123456) 3. vol-123456 (/var/lib/kubelet/plugins/kubernetes.io/mounts/aws/us-east-1/vol-123456) For the first two cases, we need to check the mount path and convert them back to the original format.	2016-11-16 22:35:48 -08:00
Kubernetes Submit Queue	3e169be887	Merge pull request #35647 from ymqytw/patch_primitive_list Automatic merge from submit-queue Fix strategic patch for list of primitive type with merge semantics Fix strategic patch for list of primitive type when the patch strategy is `merge`. Before: we cannot replace or delete an item in a list of primitives, e.g. string, when the patch strategy is `merge`. It will always append new items to the list. This patch will generate a map to update the list of primitive type. The server with this patch will accept either a new patch or an old patch. The client will find out the API server version before generating the patch. Fixes #35163, #32398 cc: @pwittrock @fabianofranz ``` release-note Fix strategic patch for list of primitive type when patch strategy is `merge` to remove deleted objects. ```	2016-11-11 14:36:44 -08:00
Rajat Ramesh Koujalagi	d81e216fc6	Better messaging for missing volume components on host to perform mount	2016-11-09 15:16:11 -08:00
ymqytw	34891ad9f6	support patch list of primitives	2016-11-09 11:46:59 -08:00
Paul Morie	4722cb299b	Remove GetRootContext from VolumeHost	2016-11-03 12:21:19 -04:00
Antoine Pelisse	c695a54c1c	Update OWNERS approvers and reviewers: pkg/controller	2016-11-02 16:19:18 -07:00
Jing Xu	abbde43374	Add sync state loop in master's volume reconciler At master volume reconciler, the information about which volumes are attached to nodes is cached in actual state of world. However, this information might be out of date in case that node is terminated (volume is detached automatically). In this situation, reconciler assume volume is still attached and will not issue attach operation when node comes back. Pods created on those nodes will fail to mount. This PR adds the logic to periodically sync up the truth for attached volumes kept in the actual state cache. If the volume is no longer attached to the node, the actual state will be updated to reflect the truth. In turn, reconciler will take actions if needed. To avoid issuing many concurrent operations on cloud provider, this PR tries to add batch operation to check whether a list of volumes are attached to the node instead of one request per volume. More details are explained in PR #33760	2016-10-28 09:24:53 -07:00
Kubernetes Submit Queue	453bfa1f0f	Merge pull request #34368 from jingxu97/Oct/statusupdate-10-7 Automatic merge from submit-queue Node status updater should SetNodeStatusUpdateNeeded if it fails to update status When volume controller tries to update the node status, if it fails to update the nodes status, it should call SetNodeStatusUpdateNeeded so that the volume list could be updated next time.	2016-10-26 11:09:16 -07:00
Jan Safranek	ad946f4fcc	Fixed mutation warning in Attach/Detach controller Objects from shared informer must not be changed, they are shared among all controllers. This fixes CacheMutationDetector panic with this output: CACHE *api.Node[5] ALTERED! {"metadata":{"name":"ip-172-18-8-71.ec2.internal","selfLink":"/api/v1/nodes/ip-172-18-8-71.ec2.internal","uid":"73d07d16-976e-11e6-8225-0e2f14b56070","resourceVersion":"136","creationTimestamp":"2016-10-21T09:12:12Z","labels":{"beta.kubernetes.io/arch":"amd64","beta.kubernetes.io/instance-type":"t2.medium","beta.kubernetes.io/os":"linux","failure-domain.beta.kubernetes.io/region":"us-east-1","failure-domain.beta.kubernetes.io/zone":"us-east-1d","kubernetes.io/hostname":"ip-172-18-8-71.ec2.internal"},"annotations":{"volumes.kubernetes.io/controller-managed-attach-detach":"true"}},"spec":{"externalID":"i-9cb6180f","providerID":"aws:///us-east-1d/i-9cb6180f"},"status":{"capacity":{"alpha.kubernetes.io/nvidia-gpu":"0","cpu":"2","memory":"4045568Ki","pods":"110"},"allocatable":{"alpha.kubernetes.io/nvidia-gpu":"0","cpu":"2","memory":"4045568Ki","pods":"110"},"conditions":[{"type":"OutOfDisk","status":"False","lastHeartbeatTime":"2016-10-21T09:12:52Z","lastTransitionTime":"2016-10-21T09:12:12Z","reason":"KubeletHasSufficientDisk","message":"kubelet has sufficient disk space available"},{"type":"MemoryPressure","status":"False","lastHeartbeatTime":"2016-10-21T09:12:52Z","lastTransitionTime":"2016-10-21T09:12:12Z","reason":"KubeletHasSufficientMemory","message":"kubelet has sufficient memory available"},{"type":"DiskPressure","status":"False","lastHeartbeatTime":"2016-10-21T09:12:52Z","lastTransitionTime":"2016-10-21T09:12:12Z","reason":"KubeletHasNoDiskPressure","message":"kubelet has no disk pressure"},{"type":"InodePressure","status":"False","lastHeartbeatTime":"2016-10-21T09:12:52Z","lastTransitionTime":"2016-10-21T09:12:12Z","reason":"KubeletHasNoInodePressure","message":"kubelet has no inode 
pressure"},{"type":"Ready","status":"True","lastHeartbeatTime":"2016-10-21T09:12:52Z","lastTransitionTime":"2016-10-21T09:12:22Z","reason":"KubeletReady","message":"kubelet is posting ready status"}],"addresses":[{"type":"InternalIP","address":"172.18.8.71"},{"type":"LegacyHostIP","address":"172.18.8.71"},{"type":"ExternalIP","address":"54.85.104.236"}],"daemonEndpoints":{"kubeletEndpoint":{"Port":10250}},"nodeInfo":{"machineID":"78a79498db8e4fdc9ac24b5e436a982c","systemUUID":"EC2BB406-5467-4ABE-B54D-D9993C45714F","bootID":"2553d6b8-1ddb-4ef0-902a-d09a807b89ba","kernelVersion":"4.6.7-300.fc24.x86_64","osImage":"Fedora 24 (Cloud Edition)","containerRuntimeVersion":"docker://1.10.3","kubeletVersion":"v1.5.0-alpha.1.726+5aac5eddb809e4","kubeProxyVersion":"v1.5.0-alpha.1.726+5aac5eddb809e4","operatingSystem":"linux","architecture":"amd64"},"images":[{"names":["openshift/origin-release:latest"],"sizeBytes":714569002},{"names":["openshift/origin-haproxy-router-base:latest"],"sizeBytes":294417608},{"names":["openshift/origin-base:latest"],"sizeBytes":275310761},{"names":["docker.io/centos@sha256:2ae0d2c881c7123870114fb9cc7afabd1e31f9888dac8286884f6cf59373ed9b","docker.io/centos:centos7"],"sizeBytes":196744353},{"names":["gcr.io/google_containers/busybox@sha256:4bdd623e848417d96127e16037743f0cd8b528c026e9175e22a84f639eca58ff","gcr.io/google_containers/busybox:1.24"],"sizeBytes":1113554},{"names":["gcr.io/google_containers/pause-amd64@sha256:163ac025575b775d1c0f9bf0bdd0f086883171eb475b5068e7defa4ca9e76516","gcr.io/google_containers/pause-amd64:3.0"],"sizeBytes":746888}],"volumesInUse":["kubernetes.io/aws-ebs/aws://us-east-1d/vol-f4bd0352"] A: ,"volumesAttached":[{"name":"kubernetes.io/aws-ebs/aws://us-east-1d/vol-f4bd0352","devicePath":"/dev/xvdba"}]}} B: }}	2016-10-25 14:28:10 +02:00
Jing Xu	70efadc2f4	Node status updater should SetNodeStatusUpdateNeeded if it fails to update status When volume controller tries to update the node status, if it fails to update the nodes status, it should call SetNodeStatusUpdateNeeded so that the volume list could be updated next time.	2016-10-24 13:59:39 -07:00
Mike Danese	3b6a067afc	autogenerated	2016-10-21 17:32:32 -07:00
Jing Xu	9e8edf6baf	Fix issue in updating device path when volume is attached multiple times When volume is attached, it is possible that the actual state already has this volume object (e.g., the volume is attached to multiple nodes, or volume was detached and attached again). We need to update the device path in such situation, otherwise, the device path would be stale information and cause kubelet mount to the wrong device. This PR partially fixes issue #29324	2016-10-03 17:14:23 -07:00
Justin Santa Barbara	54195d590f	Use strongly-typed types.NodeName for a node name We had another bug where we confused the hostname with the NodeName. To avoid this happening again, and to make the code more self-documenting, we use types.NodeName (a typedef alias for string) whenever we are referring to the Node.Name. A tedious but mechanical commit therefore, to change all uses of the node name to use types.NodeName Also clean up some of the (many) places where the NodeName is referred to as a hostname (not true on AWS), or an instanceID (not true on GCE), etc.	2016-09-27 10:47:31 -04:00
Kubernetes Submit Queue	0a4316f11e	Merge pull request #32807 from jingxu97/stateupdateNeeded-9-15 Automatic merge from submit-queue Fix race condition in setting node statusUpdateNeeded flag This PR fixes the race condition in setting node statusUpdateNeeded flag in master's attachdetach controller. This flag is used to indicate whether a node status has been updated by the node_status_updater or not. When the updater finishes updating a node status, it is set to false. When the node status changes, such as when a volume is detached or a new volume is attached to the node, the flag is set to true so that the updater can update the status again. The previous workflow has a race condition as follows: 1. The updater gets the currently attached volume list from the node which needs to be updated. 2. A new volume A is attached to the same node right after 1 and sets the flag to TRUE. 3. The updater updates the node attached volume list (which does not include volume A) and then sets the flag to FALSE. The result is that volume A will never be added to the attached volume list, so on the node side this volume is never attached. So in this PR, the flag is set to FALSE when the updater tries to get the attached volume list (as in an atomic operation). So in the above example, after step 2 the flag will be TRUE again, and in step 3 the updater does not set the flag if the update is successful. So after that, the flag is still TRUE and in the next round of updates the node status will be updated.	2016-09-23 11:25:16 -07:00
Jing Xu	14cad206f5	Fix race condition in setting node statusUpdateNeeded flag This PR fixes the race condition in setting node statusUpdateNeeded flag in master's attachdetach controller. This flag is used to indicate whether a node status has been updated by the node_status_updater or not. When the updater finishes updating a node status, it is set to false. When the node status changes, such as when a volume is detached or a new volume is attached to the node, the flag is set to true so that the updater can update the status again. The previous workflow has a race condition as follows: 1. The updater gets the currently attached volume list from the node which needs to be updated. 2. A new volume A is attached to the same node right after 1 and sets the flag to TRUE. 3. The updater updates the node attached volume list (which does not include volume A) and then sets the flag to FALSE. The result is that volume A will never be added to the attached volume list, so on the node side this volume is never attached. So in this PR, the flag is set to FALSE when the updater tries to get the attached volume list (as in an atomic operation). So in the above example, after step 2 the flag will be TRUE again, and in step 3 the updater does not set the flag if the update is successful. So after that, the flag is still TRUE and in the next round of updates the node status will be updated. This PR also changes a unit test due to the workflow changes	2016-09-22 14:02:30 -07:00
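The fix described in the commit above amounts to reading the volume list and clearing the flag under one lock, so an attach that lands between the read and the status write re-arms the flag instead of being lost. The following is a rough sketch of that pattern only; `nodeState` and its methods are hypothetical names, not the controller's actual types.

```go
package main

import (
	"fmt"
	"sync"
)

// nodeState is a hypothetical stand-in for the controller's per-node cache entry.
type nodeState struct {
	mu                 sync.Mutex
	attachedVolumes    []string
	statusUpdateNeeded bool
}

// AddVolume records an attached volume and marks the node status as stale.
func (n *nodeState) AddVolume(name string) {
	n.mu.Lock()
	defer n.mu.Unlock()
	n.attachedVolumes = append(n.attachedVolumes, name)
	n.statusUpdateNeeded = true
}

// GetVolumesToReport returns a snapshot of the volume list and clears the
// flag in the same critical section. It reports false if no update is needed.
func (n *nodeState) GetVolumesToReport() ([]string, bool) {
	n.mu.Lock()
	defer n.mu.Unlock()
	if !n.statusUpdateNeeded {
		return nil, false
	}
	n.statusUpdateNeeded = false
	return append([]string(nil), n.attachedVolumes...), true
}

// SetUpdateNeeded re-arms the flag, for example after a failed status write.
func (n *nodeState) SetUpdateNeeded() {
	n.mu.Lock()
	defer n.mu.Unlock()
	n.statusUpdateNeeded = true
}

func main() {
	n := &nodeState{}
	n.AddVolume("vol-1")

	snapshot, ok := n.GetVolumesToReport() // step 1: read list, flag cleared
	n.AddVolume("vol-A")                   // step 2: concurrent attach sets the flag again
	// step 3: the updater writes the snapshot and leaves the flag untouched.

	_, again := n.GetVolumesToReport() // the next round still sees pending work
	fmt.Println(ok, len(snapshot), again)
}
```

With this shape the updater never writes the flag after a successful update; it only calls something like `SetUpdateNeeded` when the write fails, which matches the behaviour the later "SetNodeStatusUpdateNeeded if it fails to update status" commit describes.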
Mike Danese	a765d59932	move informer and controller to pkg/client/cache Signed-off-by: Mike Danese <mikedanese@google.com>	2016-09-15 12:50:08 -07:00
Jing Xu	efaceb28cc	Fix race condition in updating attached volume between master and node This PR tries to fix issue #29324. The cause of this issue is a race condition that happens when marking volumes as attached for node status. This PR tries to clean up the logic of when and where to mark volumes as attached/detached. Basically the workflow is as follows: 1. When a volume is attached successfully, the volume and node info is added into nodesToUpdateStatusFor to mark the volume as attached to the node. 2. When a detach request comes in, it will check whether it is safe to detach now. If the check passes, remove the volume from volumesToReportAsAttached to indicate the volume is no longer considered attached. Afterwards, the reconciler tries to update node status and trigger the detach operation. If any of these operations fails, the volume is added back to the volumesToReportAsAttached list, showing that it is still attached. These steps should make sure that kubelet gets the right (possibly outdated) information about which volumes are attached. It also guarantees that if a detach operation is pending, kubelet should not trigger any mount operations.	2016-09-12 13:51:08 -07:00
Jing Xu	b9157b7524	Post event message for volume attachment This PR is to add event message when attaching volume fails to help users to debug. For detach failure, may address in a different PR since it requires more data structure change.	2016-09-01 16:24:36 -07:00
Kubernetes Submit Queue	3d7a105d9b	Merge pull request #30903 from jingxu97/cherrypick-8-19 Automatic merge from submit-queue Avoid failure message flooding the log when node no longer exists When a node is deleted, the attach-detach controller cache may contain stale information about this node, and updating node status fails in the reconciler loop. This message easily floods the log file. This PR is just a quick fix for this issue. A more complete fix, including keeping the controller cache up to date, will be addressed in another PR.	2016-08-19 15:45:58 -07:00
Kubernetes Submit Queue	6ce405c6ee	Merge pull request #27778 from screeley44/k8-vol-executor Automatic merge from submit-queue Add Events for operation_executor to show status of mounts, failed/successful to show in describe events Fixes #27590 @saad-ali @pmorie @erinboyd After talking with @pmorie last week about the above issue, I decided to poke around and see if I could remedy. The refactoring broke my previous UXP merged PR's that correctly showed failed mount errors in the describe events. However, Not sure I implemented correctly, but it tested out and seems to be working, let me know what I missed or if this is not the correct approach. ``` Events: FirstSeen LastSeen Count From SubobjectPath Type Reason Message --------- -------- ----- ---- ------------- -------- ------ ------- 2m 2m 1 {default-scheduler } Normal Scheduled Successfully assigned nfs-bb-pod1 to 127.0.0.1 44s 44s 1 {kubelet 127.0.0.1} Warning FailedMount Unable to mount volumes for pod "nfs-bb-pod1_default(a94f64f1-37c9-11e6-9aa5-52540073d346)": timeout expired waiting for volumes to attach/mount for pod "nfs-bb-pod1"/"default". list of unattached/unmounted volumes=[nfsvol] 44s 44s 1 {kubelet 127.0.0.1} Warning FailedSync Error syncing pod, skipping: timeout expired waiting for volumes to attach/mount for pod "nfs-bb-pod1"/"default". list of unattached/unmounted volumes=[nfsvol] 38s 38s 1 {kubelet } Warning FailedMount Unable to mount volumes for pod "a94f64f1-37c9-11e6-9aa5-52540073d346": Mount failed: exit status 32 Mounting arguments: nfs1.rhs:/opt/data99 /var/lib/kubelet/pods/a94f64f1-37c9-11e6-9aa5-52540073d346/volumes/kubernetes.io~nfs/nfsvol nfs [] Output: mount.nfs: Connection timed out Resolution hint: Check and make sure the NFS Server exists (ensure that correct IPAddress/Hostname was given) and is available/reachable. Also make sure firewall ports are open on both client and NFS Server (2049 v4 and 2049, 20048 and 111 for v3). 
Use commands telnet <nfs server> <port> and showmount <nfs server> to help test connectivity. ```	2016-08-19 08:27:48 -07:00
Jing Xu	70deeb0ae4	node not exist during node status update should not block others When a node is deleted, the attach-detach controller cache may contain stale information about this node, and updating node status fails in the reconciler loop. But one node update failure should not block updating other nodes. Also, the warning message easily floods the log file. This PR is just a quick fix for this issue. A more complete fix, including making sure the controller cache is up to date, will be addressed in another PR.	2016-08-18 13:51:30 -07:00
Kubernetes Submit Queue	9696a27aa0	Merge pull request #30737 from saad-ali/fix29358Round2 Automatic merge from submit-queue Skip safe to detach check if node API object no longer exists Fixes #29358	2016-08-18 04:00:05 -07:00
Scott Creeley	782d7d9815	Add Events for operation_executor to show status of mounts, failed or successful	2016-08-17 09:53:47 -04:00
saadali	0c72568247	Skip safe to detach if node api obj doesn't exist	2016-08-16 21:30:51 -07:00
Avesh Agarwal	52a60fe3be	Fix default resource limits (node capacities) for downward api volumes	2016-08-16 14:41:17 -04:00
Dominika Hodovska	816f6d32ca	Collapse duplicate informer creation paths	2016-08-04 09:02:13 +02:00
Paul Morie	c884297990	Fix collisions issues / timeouts for mounts For non-attachable volumes, do not call GetVolumeName on the plugin and instead generate a unique name based on the identity of the pod and the name of the volume within the pod.	2016-07-27 17:53:50 -04:00
saadali	89fd358c52	Assume volume detached if node doesn't exist Fixes #29358	2016-07-22 22:07:32 -07:00
k8s-merge-robot	99e24da2ff	Merge pull request #29077 from saad-ali/fixIssue29051NamespaceDeletion Automatic merge from submit-queue Fix "PVC Volume not detached if pod deleted via namespace deletion" issue Fixes #29051: "PVC Volume not detached if pod deleted via namespace deletion" This PR: * Fixes a bug in `desired_state_of_the_world_populator.go` to check the value of `exists` returned by the `podInformer` so that it can delete pods even if the delete event is missed (or fails). * Reduces the desired state of the world populators sleep period from 5 min to 1 min (reducing the amount of time a volume would remain attached if a volume delete event is missed or fails).	2016-07-20 20:40:32 -07:00
saadali	afd8a58e5c	Reduce DSW populator sleep period from 5 min to 1	2016-07-20 01:03:04 -07:00
saadali	d210c2231f	Check pod exist in attach controller DSW populator Fix bug in desired_state_of_the_world_populator.go to check exists so that it can delete pods even if the delete event is missed (or fails)	2016-07-20 01:03:04 -07:00
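The `exists` check mentioned in the commit above can be illustrated with a small stand-alone sketch. It assumes a lookup that returns `(item, exists, err)`, and everything else is hypothetical: `podStore`, `prunePods`, and the pod keys are illustration names, not the populator's real code. The point is that a periodic pass over the desired state can drop pods whose delete event was missed, because the local cache no longer knows them.

```go
package main

import (
	"fmt"
	"sort"
)

// podStore is a hypothetical stand-in for an informer's local cache,
// mapping a pod key to the pod's name.
type podStore map[string]string

// GetByKey reports whether the key is still present in the cache.
func (s podStore) GetByKey(key string) (string, bool, error) {
	pod, exists := s[key]
	return pod, exists, nil
}

// prunePods deletes from desired every pod key the store no longer knows
// and returns the removed keys in sorted order.
func prunePods(desired map[string]bool, store podStore) []string {
	var removed []string
	for key := range desired {
		_, exists, err := store.GetByKey(key)
		if err != nil {
			continue // keep the entry and retry on the next pass
		}
		if !exists {
			delete(desired, key) // deleting during range is safe in Go
			removed = append(removed, key)
		}
	}
	sort.Strings(removed)
	return removed
}

func main() {
	store := podStore{"default/pod-a": "pod-a"}
	desired := map[string]bool{"default/pod-a": true, "default/pod-b": true}
	fmt.Println(prunePods(desired, store), len(desired))
}
```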
saadali	88d495026d	Allow mounts to run in parallel for non-attachable Allow mount volume operations to run in parallel for non-attachable volume plugins. Allow unmount volume operations to run in parallel for all volume plugins.	2016-07-19 21:54:26 -07:00
Morgan Bauer	69719167a3	close channel to prevent memory leak - wait.JitterUntil goroutine is never cleaned up when used with wait.NeverStop - fixup comment	2016-07-06 09:34:20 -07:00
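The leak described in the commit above comes from a periodic goroutine whose stop channel is never closed. The sketch below uses only the standard library: `runUntil` is a hypothetical stand-in for a periodic helper such as wait.JitterUntil, not its real implementation, and `startAndStop` is an illustration name. It shows that closing the stop channel lets the goroutine return.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// runUntil calls f every period until stop is closed. It is a hypothetical
// stand-in for a periodic helper, written with the standard library only.
func runUntil(f func(), period time.Duration, stop <-chan struct{}, done *sync.WaitGroup) {
	defer done.Done()
	ticker := time.NewTicker(period)
	defer ticker.Stop()
	for {
		select {
		case <-stop:
			return // the goroutine exits only if someone closes stop
		case <-ticker.C:
			f()
		}
	}
}

// startAndStop starts the loop, closes the stop channel, and reports whether
// the goroutine exited within one second.
func startAndStop() bool {
	stop := make(chan struct{})
	var wg sync.WaitGroup
	wg.Add(1)
	go runUntil(func() {}, time.Millisecond, stop, &wg)

	close(stop) // with a never-closed channel this goroutine would live forever

	finished := make(chan struct{})
	go func() {
		wg.Wait()
		close(finished)
	}()
	select {
	case <-finished:
		return true
	case <-time.After(time.Second):
		return false
	}
}

func main() {
	fmt.Println(startAndStop())
}
```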
saadali	0dd17fff22	Reorganize volume controllers and manager	2016-07-01 18:50:25 -07:00

224 Commits (af2659527f0bd2f7ad8500ffcd0e5640bfd53cc3)