github/k3s - k3s - https://git.xinac.net

Commit Graph

Author	SHA1	Message	Date
k8s-merge-robot	707cc2bbb8	Merge pull request #26493 from caesarxuchao/fix-gc-flake Automatic merge from submit-queue Fixes 25890 flake. Let GC convert ListOptions to v1 before passing it to the dynamic client GC's ListWatcher directly passed the api.ListOptions to the dynamic client, but the parameter codec of dynamic client converts the options to queries based on the tags in the struct, which are not present in api.ListOptions, so the queries are not sent to the server. As a result, the Watch request was sent without a resourceVersion, causing missed events. Flake #25890 is caused by the missed deletion events. This PR converts the api.ListOptions to v1.ListOptions before the GC passes it to the dynamic codec. The flaky test has successfully passed 79 times ([log](https://00e9e64bacd064560a027fbee9c5a373a1614f3a56e652ae40-apidata.googleusercontent.com/download/storage/v1_internal/b/kubernetes-jenkins/o/pr-logs%2Fpull%2F25923%2Fkubernetes-pull-test-unit-integration%2F28364%2Fbuild-log.txt?qk=AD5uMEv72OjSUqDyk5i-ZLurcmM4i7gket1c7WaqR7yuIYz7WhPYT7ewVBafijV0ymnPTYqxRYt1kp6S9YQv7chPwC-3UtrKetKfhYnvAFrPGXAIBxHytTmpFohRAYgsARN1B6j1f9vyK5lM-8jyzRGhCK3sCRsAPnbDBWIWFlbH4b1n3vUET3P71QamHrF5itYyaqRU5pMZV3Cwwr81X8q7h5hCzm3Ip78RpMzfjEqTG0RcM2TLGccUrlkWVBLh4hn0NFpUIkzVFugFA5ooJffo-0AdJnO3mGWEOnXNVFWftJbK8cKnTns0DISrYFOyH_PlOe_YHCxgIXIT-dW8G-nbqoUjn5SBqunr36rcpaYCIwe2va4W_AcLCT43xiEAezRER_U9AuIqi_22KMd6SuHTyljhmWFPvPk8-gpjthLWXhcE7LPO5dV41hnZHnbI4n_9eI1nSVm7q9XdSvX1sWKV1GCwn8oj017AnxVvl9bScultko_0dTC747UqJ6UTFakLuFcHFe-F5Tz7ItDWlBVPoXeC7gTpyuicFKLsdqGlW9F5X6kIwNrBRj9uRsS-QuzSER-fVkQCn4dUTcokttRH_0bYvyfr9oqiDXmywMgOp-L0sKayk8JOVynh2q0Tju9sdkvFr0PxoAjhofomfIC1SZ_JkOzwAT1TUW8dLjPHluMct34xW_-qna1AmkoxM4bZQLhllap96NTC-0IdtzeKDrTul8p7u3WXSJjjEMSijibTNMlnkB0AluT1_RNO94OnzuFv4YlcV24FPhJzchhbyKREkOb_wzgcnSbRwGHjIcfRgkX-IzoXHVBcMYFUrPmsXrnRcfad4XwjkUOgvivkURW2_EwnzgrLDh-IKek51_0FpT1MnFCSG0gQbVSs_iMVPr6UXNAw62LGbKVtl3ZMXyapEpcO8azNbn6Wvd550R704JXxYlU)). 
@lavalamp @krousey @smarterclayton	2016-06-04 01:52:31 -07:00
k8s-merge-robot	bd2bc25308	Merge pull request #25865 from jsafrane/devel/pv-convert-from-12 Automatic merge from submit-queue volume controller: Convert PersistentVolumes from Kubernetes 1.2 In Kubernetes 1.2 we used template PersistentVolume for provisioning. When a claim for dynamic volume was detected, Kubernetes did: - create template PV for the claim with dummy pointer to storage asset - allocate storage asset such as AWS EBS - fill real pointer to the created storage asset to the template PV In refactored volume provisioner, Kubernetes allocates the storage asset first and then creates a Kubernetes PV instance already with the correct pointer to the storage asset. To support seamless upgrade from 1.2 to 1.3 we need to remove these unprovisioned template PVs. The new controller does not use them, it will see PVC for dynamic provisioning and create real PV instead. See https://github.com/pmorie/pv-haxxz/pull/3 for pseudocode.	2016-06-03 23:27:13 -07:00
k8s-merge-robot	4877153727	Merge pull request #26772 from jsafrane/flake-controller-cache-empty Automatic merge from submit-queue Wait for all volumes/claims to get synced in unit test. Controller.HasSynced() returns true when all initial claims/volumes were sent to appropriate goroutines, not when the goroutine has actually processed them. Fixes #26712	2016-06-03 17:05:22 -07:00
k8s-merge-robot	a00dbea133	Merge pull request #26758 from mqliang/lookupcache-threadsafe Automatic merge from submit-queue bugfix:lookupcache's Get method can not be called concurrently ref https://github.com/kubernetes/kubernetes/issues/26376 @lavalamp @therc @mikedanese	2016-06-03 12:46:13 -07:00
Chao Xu	06f49f7ca7	Let the dynamic client take a customized parameter codec for List, Watch, and DeleteCollection. Let the gc's ListWatcher use api.ParameterCodec. Fixes 25890.	2016-06-03 11:22:51 -07:00
mqliang	9a0ff5a9e8	bugfix:lookupcache's Get method can not be called concurrently	2016-06-04 02:21:25 +08:00
Jan Safranek	27b11c5342	Convert PersistentVolumes from Kubernetes 1.2 In Kubernetes 1.2 we used template PersistentVolume for provisioning. When a claim for dynamic volume was detected, Kubernetes did: - create template PV for the claim with dummy pointer to storage asset - allocate storage asset such as AWS EBS - fill real pointer to the created storage asset to the template PV In refactored volume provisioner, Kubernetes allocates the storage asset first and then creates a Kubernetes PV instance already with the correct pointer to the storage asset. To support seamless upgrade from 1.2 to 1.3 we need to remove these unprovisioned template PVs. The new controller does not use them, it will see PVC for dynamic provisioning and create real PV instead.	2016-06-03 14:26:06 +02:00
k8s-merge-robot	3157e87cb2	Merge pull request #26768 from wojtek-t/routecontroller_logs Automatic merge from submit-queue Improve logging in routecontroller @zmerlynn	2016-06-03 04:51:12 -07:00
k8s-merge-robot	59e008dbcb	Merge pull request #26733 from pmorie/pv-controller-typos Automatic merge from submit-queue Fix typo and linewrap comments in PV controller Fix some typos and linewrap long comments that I found while going over this code investigating something.	2016-06-03 04:51:08 -07:00
Wojciech Tyczynski	de1d35a66d	Improve logging in routecontroller	2016-06-03 12:05:12 +02:00
Jan Safranek	962505ad01	Wait for all volumes/claims to get synced in unit test. Controller.HasSynced() returns true when all initial claims/volumes were sent to appropriate goroutines, not when the goroutine has actually processed them.	2016-06-03 10:53:56 +02:00
k8s-merge-robot	75ef1ca270	Merge pull request #26351 from saad-ali/attachDetachControllerKubeletChanges Automatic merge from submit-queue Attach/Detach Controller Kubelet Changes This PR contains changes to enable attach/detach controller proposed in #20262. Specifically it: * Introduces a new `enable-controller-attach-detach` kubelet flag to enable control by attach/detach controller. Default enabled. * Removes all references to the `SafeToDetach` annotation from the controller. * Adds the new `VolumesInUse` field to the Node Status API object. * Modifies the controller to use `VolumesInUse` instead of `SafeToDetach` annotation to gate detachment. * Modifies kubelet to set `VolumesInUse` before Mount and after Unmount. * There is a bug in the `node-problem-detector` binary that causes `VolumesInUse` to get reset to nil every 30 seconds. Issue https://github.com/kubernetes/node-problem-detector/issues/9#issuecomment-221770924 opened to fix that. * There is a bug here in the mount/unmount code that prevents resetting `VolumesInUse` in some cases; this will be fixed by the mount/unmount refactor. * Have controller process detaches before attaches so that volumes referenced by pods that are rescheduled to a different node are detached first. * Fix misc bugs in controller. * Modify GCE attacher to: remove retries, remove mutex, and not fail if volume is already attached or already detached. Fixes #14642, #19953 ```release-note Kubernetes v1.3 introduces a new Attach/Detach Controller. This controller manages attaching and detaching volumes on behalf of nodes that have the "volumes.kubernetes.io/controller-managed-attach-detach" annotation. A kubelet flag, "enable-controller-attach-detach" (default true), controls whether a node sets the "controller-managed-attach-detach" annotation or not. ```	2016-06-02 23:30:32 -07:00
k8s-merge-robot	a41d84408c	Merge pull request #26518 from jsafrane/initial-sync Automatic merge from submit-queue Fill controller caches on startup The controller needs to fill its caches before it starts binding/recycling/ deleting or provisioning volumes and claims. This was done using blocking initial 'xxx added' from going through syncClaim/syncVolume. However, when the caches were full, the controller waited for the next sync period to do actual binding/recycling etc. In this patch, the controller fills its caches directly from etcd and then processes initial 'xxx added' events to reconcile the world and bind/recycle/ delete/provision stuff, resulting in faster binding after startup. Fixes #25967 (properly)	2016-06-02 21:44:56 -07:00
Saad Ali	9dbe943491	Attach/Detach Controller Kubelet Changes This PR contains Kubelet changes to enable attach/detach controller control. * It introduces a new "enable-controller-attach-detach" kubelet flag to enable control by controller. Default enabled. * It removes all references to the "SafeToDetach" annotation from the controller. * It adds the new VolumesInUse field to the Node Status API object. * It modifies the controller to use VolumesInUse instead of SafeToDetach annotation to gate detachment. * There is a bug in node-problem-detector that causes VolumesInUse to get reset every 30 seconds. Issue https://github.com/kubernetes/node-problem-detector/issues/9 opened to fix that.	2016-06-02 16:47:11 -07:00
Paul Morie	277c0a4e90	Fix typo and linewrap comments in PV controller	2016-06-02 15:50:07 -04:00
Janet Kuo	36f704c975	List RSes only once when getting old+new RSes in deployment controller	2016-06-02 11:24:43 -07:00
k8s-merge-robot	335da9b125	Merge pull request #26410 from jsafrane/fix-test-race Automatic merge from submit-queue Fix data race in volume controller unit test. Reactor must be locked when fiddling with reactor.volumes and reactor.claims. Therefore add new functions to add/delete volume/claim with sending an event. Fixes #26345	2016-06-02 04:25:08 -07:00
k8s-merge-robot	745eb08e83	Merge pull request #26595 from janetkuo/log-test-e2e-deployment Automatic merge from submit-queue Adding logs in deployment for debugging Ref #26509	2016-06-01 20:35:42 -07:00
Jan Safranek	ee74cc4354	Fix fake event recorder race Event recorder should wait for some time to get all expected events, the event may be written by another goroutine that has just finished. It should not slow down the test in most cases, only when there is a bug and expected event is not sent.	2016-06-01 10:16:35 +02:00
Jan Safranek	2d43e4549e	Fix data race in volume controller unit test. Reactor must be locked when fiddling with reactor.volumes and reactor.claims. Therefore add new functions to add/delete volume/claim with sending an event.	2016-06-01 08:35:33 +02:00
k8s-merge-robot	04f77dd602	Merge pull request #26556 from jsafrane/fix-format Automatic merge from submit-queue Fix log arguments. 'i' is not printed. @kubernetes/sig-storage	2016-05-31 21:24:50 -07:00
k8s-merge-robot	38d5be4f36	Merge pull request #26555 from jsafrane/stabilize-test-flakes Automatic merge from submit-queue Stabilize controller unit tests. Remove test "5-1", it's flaky as it depends on order of execution of goroutines. When the controller starts, existing claim is enqueued as "initial sync event" and a new volume is enqueued to separate goroutine. It is not deterministic which goroutine processes its events first and there is no way to tell that the claim event was processed. Also, force resync of the controllers after the test to make sure all events are processed. Fixes unit test flakes. @kubernetes/sig-storage	2016-05-31 17:06:12 -07:00
Janet Kuo	310a7d2eb5	Adding logs in deployment for debugging	2016-05-31 15:59:46 -07:00
k8s-merge-robot	38181bb3fb	Merge pull request #25917 from pmorie/pv-selector Automatic merge from submit-queue Add LabelSelector to PersistentVolumeClaimSpec Implements #25413. @kubernetes/sig-storage @bgrant0607 @thockin @jsafrane @eparis	2016-05-31 08:22:07 -07:00
Jan Safranek	21059e8b6d	Fix log arguments. 'i' is not printed.	2016-05-31 12:12:15 +02:00
Jan Safranek	011eac7c8b	Stabilize controller unit tests. Remove test "5-1", it's flaky as it depends on order of execution of goroutines. When the controller starts, existing claim is enqueued as "initial sync event" and a new volume is enqueued to separate goroutine. It is not deterministic which goroutine processes its events first and there is no way to tell that the claim event was processed. Also, force resync of the controllers after the test to make sure all events are processed.	2016-05-31 12:07:47 +02:00
gmarek	7cac170214	AllocateOrOccupyCIDR returns quickly	2016-05-31 09:11:42 +02:00
k8s-merge-robot	d1277e34fd	Merge pull request #25913 from pweil-/ds-tombstone Automatic merge from submit-queue daemonset handle DeletedFinalStateUnknown During an e2e run in OpenShift we ran into the DS controller panic when handling `DeletedFinalStateUnknown`. This PR checks for `DeletedFinalStateUnknown` and queues the embedded object if it is a `DaemonSet`. @mikedanese - would you mind taking a look? @deads2k ``` panic: interface conversion: interface is cache.DeletedFinalStateUnknown, not extensions.DaemonSet goroutine 4369 [running]: k8s.io/kubernetes/pkg/controller/daemon.func·005(0x2f8a0c0, 0xc20b559680) /data/src/github.com/openshift/origin/Godeps/_workspace/src/k8s.io/kubernetes/pkg/controller/daemon/controller.go:160 +0x50 k8s.io/kubernetes/pkg/controller/framework.ResourceEventHandlerFuncs.OnDelete(0xc20a0ae090, 0xc20a0ae0a0, 0xc20a0ae0b0, 0x2f8a0c0, 0xc20b559680) /data/src/github.com/openshift/origin/Godeps/_workspace/src/k8s.io/kubernetes/pkg/controller/framework/controller.go:178 +0x41 k8s.io/kubernetes/pkg/controller/framework.(ResourceEventHandlerFuncs).OnDelete(0xc20b8ebf20, 0x2f8a0c0, 0xc20b559680) <autogenerated>:25 +0xb5 k8s.io/kubernetes/pkg/controller/framework.func·001(0x2f8a280, 0xc20b5522e0, 0x0, 0x0) /data/src/github.com/openshift/origin/Godeps/_workspace/src/k8s.io/kubernetes/pkg/controller/framework/controller.go:248 +0x4be k8s.io/kubernetes/pkg/controller/framework.(Controller).processLoop(0xc20bb727e0) /data/src/github.com/openshift/origin/Godeps/_workspace/src/k8s.io/kubernetes/pkg/controller/framework/controller.go:122 +0x6f k8s.io/kubernetes/pkg/controller/framework.Controller.(k8s.io/kubernetes/pkg/controller/framework.processLoop)·fm() /data/src/github.com/openshift/origin/Godeps/_workspace/src/k8s.io/kubernetes/pkg/controller/framework/controller.go:97 +0x27 k8s.io/kubernetes/pkg/util/wait.func·001() /data/src/github.com/openshift/origin/Godeps/_workspace/src/k8s.io/kubernetes/pkg/util/wait/wait.go:66 +0x61 
k8s.io/kubernetes/pkg/util/wait.JitterUntil(0xc209f8cfb8, 0x3b9aca00, 0x0, 0xc2080543c0) /data/src/github.com/openshift/origin/Godeps/_workspace/src/k8s.io/kubernetes/pkg/util/wait/wait.go:67 +0x8f k8s.io/kubernetes/pkg/util/wait.Until(0xc209f8cfb8, 0x3b9aca00, 0xc2080543c0) /data/src/github.com/openshift/origin/Godeps/_workspace/src/k8s.io/kubernetes/pkg/util/wait/wait.go:47 +0x4a k8s.io/kubernetes/pkg/controller/framework.(Controller).Run(0xc20bb727e0, 0xc2080543c0) /data/src/github.com/openshift/origin/Godeps/_workspace/src/k8s.io/kubernetes/pkg/controller/framework/controller.go:97 +0x1fb created by k8s.io/kubernetes/pkg/controller/daemon.(DaemonSetsController).Run /data/src/github.com/openshift/origin/Godeps/_workspace/src/k8s.io/kubernetes/pkg/controller/daemon/controller.go:212 +0xae ``` https://ci.openshift.redhat.com/jenkins/job/test_pull_requests_origin_check/1002/artifact/origin/artifacts/test-cmd/logs/openshift.log	2016-05-30 17:54:17 -07:00
Paul Morie	4ffa3c6754	Add label selector to match criteria for claims to volumes	2016-05-30 12:11:12 -04:00
Paul Morie	faa112bad1	Add selector to PersistentVolumeClaim	2016-05-30 12:09:50 -04:00
k8s-merge-robot	9aeeef1d81	Merge pull request #26414 from jsafrane/reduce-sync-period Automatic merge from submit-queue Reduce volume controller sync period fixes #24236 and most probably also fixes #25294. Needs #25881! With the cache, binder is not affected by sync period. Without the cache, binding of 1000 PVCs takes more than 5 minutes (instead of ~70 seconds). 15 seconds were chosen by fair 2d10 roll :-)	2016-05-30 05:54:51 -07:00
Jan Safranek	df161c3a7e	Fill controller caches on startup The controller needs to fill its caches before it starts binding/recycling/ deleting or provisioning volumes and claims. This was done using blocking initial 'xxx added' from going through syncClaim/syncVolume. However, when the caches were full, the controller waited for the next sync period to do actual binding/recycling etc. In this patch, the controller fills its caches directly from etcd and then processes initial 'xxx added' events to reconcile the world and bind/recycle/ delete/provision stuff, resulting in faster binding after startup. Fixes #25967 (properly)	2016-05-30 13:16:45 +02:00
k8s-merge-robot	5643b7498f	Merge pull request #25881 from jsafrane/devel/pv-add-cache Automatic merge from submit-queue volume controller: Add cache with the latest version of PVs and PVCs When the controller binds a PV to PVC, it saves both objects to etcd. However, there is still an old version of these objects in the controller Informer cache. So, when a new PVC comes, the PV is still seen as available and may get bound to the new PVC. This will be blocked by etcd, still, it creates unnecessary traffic that slows everything down. To make everything worse, when periodic sync with the old PVC is performed, this PVC is seen by the controller as Pending (while it's already Bound on etcd) and will be bound to a different PV. Writing to this PV won't be blocked by etcd, only subsequent write of the PVC fails. So, the controller will need to roll back the PV in another transaction(s). The controller can keep itself pretty busy this way. Also, we save bound PVs (and PVCs) as two transactions - we save say PV.Spec first and then .Status. The controller gets "PV.Spec updated" event from etcd and tries to fix the Status, as it seems to the controller it's outdated. This write again fails - there already is a correct version in etcd. As we can't influence the Informer cache, it is read-only to the controller, this patch introduces a second cache in the controller, which holds the latest and greatest version of PVs and PVCs to prevent these useless writes to etcd. It gets updated with events from etcd and after etcd confirms successful save of PV/PVC modified by the controller. The cache stores only pointers to PVs/PVCs, so in ideal case it shares the actual object data with the informer cache. They will diverge only for a short time when the controller modifies something and the informer cache did not get update events yet. @kubernetes/sig-storage	2016-05-30 04:13:18 -07:00
Jan Safranek	2aa9f1dd8f	Reduce volume controller sync period	2016-05-30 09:59:31 +02:00
k8s-merge-robot	577cdf937d	Merge pull request #26415 from wojtek-t/network_not_ready Automatic merge from submit-queue Add a NodeCondition "NetworkUnavailable" to prevent scheduling onto a node until the routes have been created This is new version of #26267 (based on top of that one). The new workflow is: - we have a "NetworkNotReady" condition - when Kubelet creates a node, it sets it to "true" - RouteController will set it to "false" when the route is created - Scheduler schedules only onto nodes that don't have the "NetworkNotReady ==true" condition @gmarek @bgrant0607 @zmerlynn @cjcullen @derekwaynecarr @danwinship @dcbw @lavalamp @vishh	2016-05-29 03:06:59 -07:00
k8s-merge-robot	a550cf16b9	Merge pull request #25826 from freehan/svcsourcerange Automatic merge from submit-queue promote sourceRange into service spec @thockin one more for your pile I will add docs at `http://releases.k8s.io/HEAD/docs/user-guide/services-firewalls.md` cc: @justinsb Fixes: #20392	2016-05-28 02:20:13 -07:00
k8s-merge-robot	7fae9c14e2	Merge pull request #25662 from deads2k/prevent-hotloop Automatic merge from submit-queue prevent namespace cleanup hotloop Found chasing a sentry report. Looks like we hot-loop on namespace deletion failures. @derekwaynecarr ptal	2016-05-28 01:30:51 -07:00
Alex Robinson	d577550dd0	Merge pull request #26054 from gmarek/flags Make service-range flag in controller-manager optional	2016-05-27 14:26:15 -07:00
Wojciech Tyczynski	be1b57100d	Change to NotReadyNetworking and use in scheduler	2016-05-27 19:32:49 +02:00
gmarek	7bdf480340	Node is NotReady until the Route is created	2016-05-27 19:29:51 +02:00
Alex Robinson	7522389d8d	Merge pull request #26207 from zmerlynn/fix-unneeded-updated nodecontroller: Fix log message on successful update	2016-05-27 09:56:28 -07:00
saadali	3c345abafd	Fix DATA RACE in unit tests: reconciler_test.go	2016-05-27 01:19:25 -07:00
Alex Mohr	9803393a67	Merge pull request #25960 from jsafrane/do-not-sort-bind volume controller: Speed up binding by not sorting volumes	2016-05-26 15:47:14 -07:00
Alex Mohr	edda837142	Merge pull request #25599 from caesarxuchao/orphaning-finalizer Add orphaning finalizer logic to GC	2016-05-26 13:19:19 -07:00
Minhan Xia	a1bd33f510	promote sourceRange into service spec	2016-05-26 10:42:30 -07:00
Wojciech Tyczynski	aa65a7974a	Spread creating routes over time and retry on failures	2016-05-26 13:00:53 +02:00
k8s-merge-robot	98766f4548	Merge pull request #26301 from zmerlynn/wait_proper Automatic merge from submit-queue routecontroller: Add wait.NonSlidingUntil, use it Make sure the reconciliation loop kicks in again immediately if it takes a loooooong time.	2016-05-26 03:29:21 -07:00
k8s-merge-robot	bda0dc88aa	Merge pull request #25457 from saad-ali/expectedStateOfWorldDataStructure Automatic merge from submit-queue Attach Detach Controller Business Logic This PR adds the meat of the attach/detach controller proposed in #20262. The PR splits the in-memory cache into a desired and actual state of the world.	2016-05-26 00:41:54 -07:00
k8s-merge-robot	da7d3c189a	Merge pull request #25869 from jsafrane/devel/operation-logs Automatic merge from submit-queue volume controller: use better operation names Using volume/claim.UID in the operation name is not really useful, as UIDs are not logged by rest of the controller. On the other hand, volume.Name and claim.Namespace/Name is logged pretty often and it would help to log these also in operation name. Still, I'd prefer to have the operation name really unique to be protected from users deleting a volume and quickly creating another one with the same name, so UID is still part of the operation name. This has been already proven to be very useful in controller debugging.	2016-05-25 17:58:07 -07:00
Zach Loafman	cb69960742	nodecontroller: Fix log message on successful update	2016-05-25 14:44:15 -07:00


1124 Commits (60c1ec8eac56355646f74c32b272eae36fdd894e)