github/k3s - k3s - https://git.xinac.net

Commit Graph

Author	SHA1	Message	Date
Kubernetes Submit Queue	f4d8220df5	Merge pull request #65616 from cofyc/fix56163 Automatic merge from submit-queue (batch tested with PRs 65570, 65616). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Retry scheduling on StorageClass events What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #56163 Special notes for your reviewer: I have taken over #60006. It's hard to test in e2e, because we cannot know reschedule of pod is triggered by which event (periodically service/node events will move pods to active queue too). ~~I'll add integration tests for this functionality after [this PR](https://github.com/kubernetes/kubernetes/pull/65296) get merged.~~ (already added) Release note: ```release-note NONE ```	2018-07-31 19:18:00 -07:00
Kubernetes Submit Queue	0e9b1dd20f	Merge pull request #66671 from hanxiaoshuai/cleanup07261 Automatic merge from submit-queue (batch tested with PRs 63955, 66685, 66671). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. remove unused code in pkg/scheduler/algorithm/scheduler_interface_test.go What this PR does / why we need it: remove unused code in pkg/scheduler/algorithm/scheduler_interface_test.go Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note NONE ```	2018-07-26 21:05:11 -07:00
Kubernetes Submit Queue	fea4ad2783	Merge pull request #66670 from foxyriver/fix-log Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. fix error log What this PR does / why we need it: fix error log Release note: ```release-note NONE ```	2018-07-26 19:43:19 -07:00
Mayank Kumar	a5b6d805ea	Use GetControllerOf from apimachinery and remove kubernetes copy	2018-07-26 12:20:35 -07:00
hangaoshuai	f3fb9e0f33	remove unused code in pkg/scheduler/algorithm/scheduler_interface_test.go	2018-07-26 21:01:50 +08:00
foxyriver	3b4f250c4a	fix error log	2018-07-26 19:48:48 +08:00
Kubernetes Submit Queue	e4465b6e2f	Merge pull request #66599 from cofyc/fixfeaturegate Automatic merge from submit-queue (batch tested with PRs 66540, 66599). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Invalidate CheckVolumeBinding predicate only when VolumeScheduling feature is enabled What this PR does / why we need it: Invalidate CheckVolumeBinding predicate only when VolumeScheduling feature is enabled. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note NONE ```	2018-07-26 01:55:17 -07:00
Kubernetes Submit Queue	84a15d0291	Merge pull request #66540 from hanxiaoshuai/fixut0724 Automatic merge from submit-queue (batch tested with PRs 66540, 66599). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. replace predicates string with corresponding const in TestDefaultPredicates What this PR does / why we need it: replace predicates string with corresponding const in TestDefaultPredicates. Unify with the const in func defaultPredicates(). Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note NONE ```	2018-07-26 01:55:14 -07:00
Bobby (Babak) Salamat	be55371ff2	minor cleanup of selector_spreading priority function	2018-07-25 13:43:37 -07:00
Yecheng Fu	d2fc875489	Invalidate CheckVolumeBinding predicate only when VolumeScheduling feature is enabled.	2018-07-25 15:11:23 +08:00
Kubernetes Submit Queue	4dbcf32b3c	Merge pull request #66471 from islinwb/improve_TestZeroRequest Automatic merge from submit-queue (batch tested with PRs 66291, 66471, 66499). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Improve unit test TestZeroRequest What this PR does / why we need it: Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #66468 Special notes for your reviewer: Release note: ```release-note NONE ```	2018-07-24 13:59:58 -07:00
Kubernetes Submit Queue	2119d349b0	Merge pull request #66291 from resouer/fix-extender Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Extender preemption should respect IsInterested() What this PR does / why we need it: Extender preemption should respect IsInterested() Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #66289 Special notes for your reviewer: The bug is reported and the first commit is co-authored by: @chenchun Release note: ```release-note Extender preemption should respect IsInterested() ```	2018-07-24 13:48:38 -07:00
hangaoshuai	2c59a683a2	replace predicates string with corresponding const in TestDefaultPredicates	2018-07-24 14:27:36 +08:00
Weibin Lin	972e78748a	add pod UID	2018-07-23 10:44:31 +08:00
Harry Zhang	d644162a29	Extender preemption should respect IsInterested() Co-authored-by: Harry Zhang <resouer@gmail.com> Co-authored-by: Chun Chen <ramichen@tencent.com>	2018-07-23 10:13:38 +08:00
Weibin Lin	5449d153bb	Improve unit test TestZeroRequest	2018-07-23 09:15:19 +08:00
Kubernetes Submit Queue	4797c8df8f	Merge pull request #63665 from xchapter7x/pkg-scheduler-core Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. use subtest for table units (pkg/scheduler/core) What this PR does / why we need it: Update scheduler's unit table tests to use subtest Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Special notes for your reviewer: breaks up PR: https://github.com/kubernetes/kubernetes/pull/63281 /ref #63267 Release note: ```release-note This PR will leverage subtests on the existing table tests for the scheduler units. Some refactoring of error/status messages and functions to align with new approach. ```	2018-07-21 01:52:30 -07:00
Kubernetes Submit Queue	827aa934ac	Merge pull request #66397 from gnufied/fix-default-max-volume-ebs Automatic merge from submit-queue (batch tested with PRs 66410, 66398, 66061, 66397, 65558). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix volume limit for EBS on m5 and c5 instances This is a fix for lower volume limits on m5 and c5 instance types while we wait for https://github.com/kubernetes/features/issues/554 to land GA. This problem became urgent because many of our users are trying to migrate to those instance types in light of spectre/meltdown vulnerability but lower volume limit on those instance types often causes cluster instability. Yes they can workaround by configuring the scheduler with lower limit but often this becomes somewhat difficult to do when cluster is mixed. The newer default limits were picked from https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/volume_limits.html Text about spectre/meltdown is available on - https://community.bitnami.com/t/spectre-variant-2/54961/5 /sig storage /sig scheduling ```release-note Fix volume limit for EBS on m5 and c5 instance types ```	2018-07-20 18:51:11 -07:00
John Calabrese	ad234e58be	use subtest for table units remove duplicate testname from error msg remove subtest for test setup loop do not break on test failure https://github.com/kubernetes/kubernetes/pull/63665#discussion_r203571355 remove duplicate test.name in output https://github.com/kubernetes/kubernetes/pull/63665#discussion_r203574001 https://github.com/kubernetes/kubernetes/pull/63665#discussion_r203574012	2018-07-20 16:02:50 -04:00
Yecheng Fu	8f0373792f	Retry scheduling on various events.	2018-07-20 09:54:34 +08:00
Kubernetes Submit Queue	795b7da8b0	Merge pull request #65714 from resouer/fix-63784 Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Re-design equivalence class cache to two level cache What this PR does / why we need it: The current ecache introduced a global lock across all the nodes, and this patch tried to assign ecache per node to eliminate that global lock. The improvement of scheduling performance and throughput are both significant. CPU Profile Result Machine: 32-core 60GB GCE VM 1k nodes 10k pods bench test (we've highlighted the critical function): 1. Current default scheduler with ecache enabled: ![equivlance class cache bench test 001](https://user-images.githubusercontent.com/1701782/42196992-51b0a32a-7eb3-11e8-89ee-f13383091a00.jpeg) 2. Current default scheduler with ecache disabled: ![equivlance class cache bench test 002](https://user-images.githubusercontent.com/1701782/42196993-51eb0c68-7eb3-11e8-9326-1a7762072863.jpeg) 3. Current default scheduler with this patch and ecache enabled: ![equivlance class cache bench test 003](https://user-images.githubusercontent.com/1701782/42196994-52280ed8-7eb3-11e8-8100-690e2af2cf2f.jpeg) Throughput Test Result 1k nodes 3k pods `scheduler_perf` test: Current default scheduler, ecache is disabled: ```bash Minimal observed throughput for 3k pod test: 200 PASS ok k8s.io/kubernetes/test/integration/scheduler_perf 30.091s ``` With this patch, ecache is enabled: ```bash Minimal observed throughput for 3k pod test: 556 PASS ok k8s.io/kubernetes/test/integration/scheduler_perf 11.119s ``` Design and implementation: The idea is: we re-designed ecache into a "two level cache". The first level cache holds the global lock across nodes and sync is needed only when node is added or deleted, which is of much lower frequency. The second level cache is assigned per node and its lock is restricted to per node level, thus there's no need to bother the global lock during whole predicate process cycle. For more detail, please check [the original discussion](https://github.com/kubernetes/kubernetes/issues/63784#issuecomment-399848349). Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #63784 Special notes for your reviewer: ~~Tagged as WIP to make sure this does not break existing code and tests, we can start review after CI is happy.~~ Release note: ```release-note Re-design equivalence class cache to two level cache ```	2018-07-19 16:16:02 -07:00
Hemant Kumar	45b8107378	Fix volume limit for EBS on m5 and c5 instances	2018-07-19 16:27:52 -04:00
Kubernetes Submit Queue	357decc9db	Merge pull request #63666 from xchapter7x/pkg-scheduler-factory Automatic merge from submit-queue (batch tested with PRs 58487, 63666). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. use subtest for table units (pkg/scheduler/factory) What this PR does / why we need it: Update scheduler's unit table tests to use subtest Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Special notes for your reviewer: breaks up PR: https://github.com/kubernetes/kubernetes/pull/63281 /ref #63267 Release note: ```release-note This PR will leverage subtests on the existing table tests for the scheduler units. Some refactoring of error/status messages and functions to align with new approach. ```	2018-07-19 02:09:06 -07:00
Harry Zhang	e5a7a4caf7	Fist level ecache for nodeMap Use new cache map in scheduler Add a integration test Move init before schedudling Add lock for first level cache	2018-07-18 15:11:59 +08:00
Harry Zhang	17977478e7	RWLock for cache	2018-07-18 15:11:59 +08:00
Nikhita Raghunath	c166743272	scheduler: fix panic while removing node from imageStates cache	2018-07-16 11:42:28 +05:30
tanshanshan	06fb64cdf8	fix glogformat	2018-07-14 10:22:12 +08:00
Kubernetes Submit Queue	b883f4cff8	Merge pull request #65745 from silveryfu/image-locality-scoring Automatic merge from submit-queue (batch tested with PRs 66011, 66111, 66106, 66039, 65745). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Enable adaptive scoring in ImageLocalityPriority What this PR does / why we need it: This PR replaces the original, pure image-size based scoring to an adaptive scoring scheme. The new scoring scheme considers not only the image size but also its `"spread" `- the definition of `"spread"` is described in what follows: > Given an image`i`, `spread_i = num_node_has_i / total_num_nodes` And the image receives the score: `score_i = size_i * spread_i`, as proposed by @resouer. The final node score is the summation of image scores for all images found existing on the node that are mentioned in the pod spec. The goal of this heuristic is to better _balance image locality with other scheduling policies_. In particular, it aims to mitigate and prevent the undesirable "node heating problem", _i.e._, pods get assigned to the same or a few nodes due to preferred image locality. Given an image, the larger `spread` it has the more image locality we can consider for it - since we can expect more nodes having this image. The new image state information in scheduler cache, enabled in this PR, allows other potential heuristics to be explored. Special notes for your reviewer: @resouer Additional unit tests are WIP. Release note: ```release-note NONE ```	2018-07-12 17:57:16 -07:00
Silvery Fu	2003a0db97	Rework image locality with spread-based scoring	2018-07-11 23:58:23 -07:00
Silvery Fu	c3f111f74a	Add image states to scheduler cache	2018-07-11 23:58:02 -07:00
Silvery Fu	05293233cf	Update generated bazel	2018-07-11 23:57:34 -07:00
Yecheng Fu	b841b15e27	Invalidate CheckVolumeBinding predicate cache on PV update.	2018-07-12 14:55:30 +08:00
Kubernetes Submit Queue	f2db955b9d	Merge pull request #64363 from idealhack/sub-benchmarks/scheduler/schedulercache Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. scheduler: update tests to use sub-benchmarks (pkg/scheduler/cache) What this PR does / why we need it: Go 1.7 added the subtest feature which can make table-driven tests much easier to run and debug. Some tests are not using this feature. Further reading: [Using Subtests and Sub-benchmarks](https://blog.golang.org/subtests) /kind cleanup Release note: ```release-note NONE ```	2018-07-01 19:04:19 -07:00
Yang Li	d7e12ce453	scheduler: update tests to use sub-benchmarks (pkg/scheduler/cache)	2018-07-01 00:51:42 +08:00
Kubernetes Submit Queue	ea3451f83e	Merge pull request #65188 from aveshagarwal/master-rhbz-1555057 Automatic merge from submit-queue (batch tested with PRs 65188, 65541, 65534). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Increase glog level of some scheduling errors. In our production environments, we are noticing that for every scheduling error, we are logging 3 errors at following lines: 1. https://github.com/kubernetes/kubernetes/blob/master/pkg/scheduler/scheduler.go#L194 2. https://github.com/kubernetes/kubernetes/blob/master/pkg/scheduler/factory/factory.go#L1416 3. https://github.com/kubernetes/kubernetes/blob/master/pkg/scheduler/factory/factory.go#L1323 This PR increases log levels of the last 2 errors to V(3).Infof. We can discuss if it would be helpful to increase the log level of the first error too. @kubernetes/sig-scheduling-pr-reviews @bsalamat @k82cn @liggitt @sjenning ```release-note None. ```	2018-06-29 21:42:07 -07:00
Dr. Stefan Schimanski	f8de7cea40	Update generated files	2018-06-29 20:36:17 +02:00
Avesh Agarwal	c0cffb8a34	Increase glog level of some scheduling errors.	2018-06-28 23:34:29 -04:00
Kubernetes Submit Queue	a13fe4d15d	Merge pull request #65424 from liggitt/scheduler-config Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix scheduler config decoding Fixes #65413 Implements a custom unmarshaler for a single scheduler config type which did not correctly specify JSON tags until https://github.com/kubernetes/kubernetes/issues/65414 is resolved Adds missing compatibility tests for scheduler extenders back to 1.7 ```release-note Fixes incompatibility with custom scheduler extender configurations specifying `bindVerb` ```	2018-06-25 00:21:35 -07:00
Jordan Liggitt	fcaaf59359	Fix scheduler config decoding	2018-06-24 23:28:56 -04:00
Kubernetes Submit Queue	f0311d8232	Merge pull request #65396 from bsalamat/sched_no_sort Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Improve scheduler's performance by eliminating sorting of nodes by their score What this PR does / why we need it: Profiling scheduler, I noticed that scheduler spends a significant amount of time in sorting the nodes after we score them to find nodes with the highest score. Finding nodes with the highest score does not need sorting the array. This PR replaces the sort with a linear scan. Eliminating the sort results in over 10% improvement in throughput of the scheduler. Before (3 runs for 5000 nodes, scheduling 1000 pods in a cluster running 2000 pods): BenchmarkScheduling/5000Nodes/2000Pods-12 1000 20682552 ns/op BenchmarkScheduling/5000Nodes/2000Pods-12 1000 20464729 ns/op BenchmarkScheduling/5000Nodes/2000Pods-12 1000 21188906 ns/op After: BenchmarkScheduling/5000Nodes/2000Pods-12 1000 18485866 ns/op BenchmarkScheduling/5000Nodes/2000Pods-12 1000 18457749 ns/op BenchmarkScheduling/5000Nodes/2000Pods-12 1000 18418200 ns/op Release note: ```release-note Improve scheduler's performance by eliminating sorting of nodes by their score. ```	2018-06-23 20:12:01 -07:00
Bobby (Babak) Salamat	ffc8cc2f50	Improve scheduler's performance by eliminating sorting when finding the host with the highest score	2018-06-23 11:24:43 -07:00
Kubernetes Submit Queue	582b88c879	Merge pull request #64995 from bsalamat/preempt_opt Automatic merge from submit-queue (batch tested with PRs 65388, 64995). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Add more conditions to the list of predicate failures that won't be resolved by preemption What this PR does / why we need it: Adds more conditions to the list of predicate failures that won't be resolved by preemption. This change can potentially improve performance of preemption by avoiding the nodes that won't be able to schedule the pending pod no matter how many other pods are removed from them. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes # Special notes for your reviewer: Release note: ```release-note Add more conditions to the list of predicate failures that won't be resolved by preemption. ``` /sig scheduling	2018-06-23 05:52:07 -07:00
Bobby (Babak) Salamat	8cdf83ed1e	Add tests to cover newly added unresolvable failures	2018-06-22 17:06:19 -07:00
Bobby (Babak) Salamat	fab26e470c	Add more unresolvable conditions to optimize preemption logic	2018-06-22 17:04:55 -07:00
Jeff Grafton	23ceebac22	Run hack/update-bazel.sh	2018-06-22 16:22:57 -07:00
Jeff Grafton	a725660640	Update to gazelle 0.12.0 and run hack/update-bazel.sh	2018-06-22 16:22:18 -07:00
Kubernetes Submit Queue	b45ba959c0	Merge pull request #64693 from xiechengsheng/fix-typos Automatic merge from submit-queue (batch tested with PRs 65024, 65287, 65345, 64693, 64941). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Fix some typos in code comments. Signed-off-by: xiechengsheng <XIE1995@whut.edu.cn> What this PR does / why we need it: Fix some typos in code comments. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): NONE Special notes for your reviewer: Release note: ```release-note NONE ```	2018-06-22 06:10:21 -07:00
xiechengsheng	66e0b53c3c	fix some typos Signed-off-by: xiechengsheng <XIE1995@whut.edu.cn>	2018-06-22 09:26:14 +08:00
Kubernetes Submit Queue	23b4690d00	Merge pull request #65306 from shyamjvs/fine-grained-scheduler-metric Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Split scheduler latency metric to fine-grained steps This splits the summary metric we recently added into finer steps. It should be very useful for performance experiments. /cc @wojtek-t fyi - @bsalamat @misterikkit Strictly speaking this is a breaking change, but since this metric was added only ~week ago I think it should fine (we should port this change to 1.11). ```release-note Split 'scheduling_latency_seconds' metric into finer steps (predicate, priority, premption) ```	2018-06-21 09:11:58 -07:00
Shyam Jeedigunta	b9ae20c99e	Split scheduler latency metric to fine-grained steps	2018-06-21 14:19:39 +02:00

1 2 3 4 5 ...

390 Commits (83f6efcec03168d640607a29c5015ee7eb0c058b)