86 Commits (a07b027261383127494fc6d4e3ca2c544fb4a464)

Author SHA1 Message Date
John Schnake 338bc3ce42 Move framework ssh code to new package
The framework/ssh.go code was heavily used throughout the framework
and could be useful elsewhere but reusing those methods requires
importing all of the framework.

Extracting these methods to their own package for reuse.

Only a few methods had to be copied into this package from the
rest of the framework to avoid an import cycle.
2019-05-08 11:00:36 -05:00
Kenichi Omichi 7af6ddd353 Fix golint failures of framework/metrics_util.go
This fixes golint failures of framework/metrics_util.go.

Cleanup:
- SaturationTime was only used in test/e2e/scalability/density.go.
  So this moves it into the e2e test.
- interestingClusterAutoscalerMetrics was not used in filterMetrics()
  so this removes the related code.
2019-04-02 16:25:16 +00:00
Kubernetes Prow Robot fe9d7d59d2
Merge pull request #74406 from danielqsj/clean_unused_metrics
clean nonexistent metrics in e2e test
2019-04-02 01:36:46 -07:00
Kenichi Omichi a81dd53e53 Fix golint failures of e2e/framework/p*.go
This fixes golint failures of
- test/e2e/framework/perf_util.go
- test/e2e/framework/pods.go
- test/e2e/framework/profile_gatherer.go
- test/e2e/framework/provider.go
- test/e2e/framework/psp_util.go
2019-03-29 18:50:01 +00:00
danielqsj 088773e030 add kubelet_docker_operations_errors_total into InterestingKubeletMetrics 2019-03-19 10:30:31 +08:00
danielqsj b6ce45961b clean nonexistent metrics in e2e test 2019-03-18 17:57:40 +08:00
danielqsj f7b437cae0 convert latency in metrics name to duration 2019-02-22 21:40:13 +08:00
Kubernetes Prow Robot 9e55995a31
Merge pull request #73989 from oomichi/golint-e2e-framework-metrics
Fix golint failures under test/e2e/framework/metrics
2019-02-19 16:23:48 -08:00
danielqsj 79a3eb816c rename latency to duration in metrics 2019-02-18 17:40:04 +08:00
danielqsj 9fd99a48f5 Change kubelet metrics to conform guideline 2019-02-18 14:01:58 +08:00
Kenichi Omichi 89f6f1d0cc Fix golint failures under e2e/framework/metrics 2019-02-13 18:07:16 +00:00
Kenichi Omichi dbe669aeb1 Move PrintSample() to place which uses it
PrintSample() was used at metrics_util.go only, so this moves
the method to metrics_util.go and makes it local.
2019-02-13 01:32:53 +00:00
wojtekt 0a0835e92d Add metric exposing amount of processed init events in watchcache 2019-02-08 13:03:33 +01:00
Kubernetes Prow Robot ae45068688
Merge pull request #72323 from danielqsj/dockershim
Change docker metrics to conform metrics guidelines
2019-02-06 09:30:54 -08:00
Krzysztof Siedlecki aa93e2bb94 reverting test to apiserver_request_latencies_summary metric 2019-01-30 10:36:32 +01:00
Shyam Jeedigunta bb23e0aec7 Avoid master ssh in eks 2019-01-22 20:21:03 -08:00
danielqsj 8b418631c0 Move etcd latency metrics to histogram and update test case 2019-01-11 20:38:24 +08:00
danielqsj 94d1050303 Change docker metrics to conform guideline 2018-12-26 15:58:04 +08:00
tanshanshan cb95edafe8 kube-scheduler: enable secure ports 10259 2018-11-08 16:43:59 +01:00
Shyam Jeedigunta 203664933d Add etcd DB size monitoring in density test 2018-08-30 14:40:59 +02:00
foxyriver 4baeb09f6c need ExpectNoError check 2018-08-01 18:10:14 +08:00
Krzysztof Siedlecki e5c9383b59 Collecting etcd histogram metrics 2018-07-16 14:32:54 +02:00
Kubernetes Submit Queue a8777c26fa
Merge pull request #64695 from krzysied/etcd_metrics
Automatic merge from submit-queue (batch tested with PRs 64695, 65982, 65908). If you want to cherry-pick this change to another branch, please follow the instructions [here](https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md).

Collecting etcd metrics

**What this PR does / why we need it**:
Adding etcd metrics to performance test log.

**Which issue(s) this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged)*:
ref #64030

**Special notes for your reviewer**:

**Release note**:

```release-note
NONE
```
2018-07-10 08:55:03 -07:00
Kubernetes Submit Queue 6c847f3e7a
Merge pull request #65307 from shyamjvs/fix-scheduler-reset-metrics-bug
Automatic merge from submit-queue (batch tested with PRs 65301, 65291, 65307, 63845, 65313). If you want to cherry-pick this change to another branch, please follow the instructions [here](https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md).

Fix scheduler reset metrics bug in testinfra

/cc @krzysied

```release-note
NONE
```
2018-06-22 03:08:13 -07:00
Shyam Jeedigunta b9ae20c99e Split scheduler latency metric to fine-grained steps 2018-06-21 14:19:39 +02:00
Shyam Jeedigunta cd1a5353eb Fix scheduler reset metrics bug in testinfra 2018-06-21 13:50:59 +02:00
Krzysztof Siedlecki e32910a544 Readding summary metrics 2018-06-14 15:05:12 +02:00
Krzysztof Siedlecki 0547bbf744 Revert "Fixing scheduling latency metrics"
This reverts commit 0e833bfc83.
2018-06-14 14:50:12 +02:00
Kubernetes Submit Queue 65a5e68147
Merge pull request #64521 from shyamjvs/compute-scheduler-throughput-avg
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions [here](https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md).

Compute avg and quantiles of scheduler throughput in density test

Based on my comment here - https://github.com/kubernetes/kubernetes/pull/64266#issuecomment-393189953

/sig scheduling
/kind cleanup
/priority important-soon
/milestone v1.11
/cc @wojtek-t

```release-note
NONE
```
2018-06-13 14:23:51 -07:00
Shyam Jeedigunta 979a8d73e1 Compute avg and quantiles of scheduler throughput in density test 2018-06-12 18:40:52 +02:00
Krzysztof Siedlecki aa022310a4 Collecting etcd metrics 2018-06-04 16:23:08 +02:00
Krzysztof Siedlecki 0e833bfc83 Fixing scheduling latency metrics 2018-05-30 11:20:12 +02:00
Shyam Jeedigunta f363f549c0 Measure scheduler throughput in density test 2018-05-25 14:49:11 +02:00
Shyam Jeedigunta 0f0c754eb4 Get rid of duplicate VerifyPodStartupLatency util in node density tests 2018-03-21 16:58:31 +01:00
Shyam Jeedigunta b0dd166fa3 Capture different parts of pod-startup latency as metrics 2018-03-21 16:58:25 +01:00
Kubernetes Submit Queue d4724d7e43
Merge pull request #55056 from porridge/typo-percentil
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions [here](https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md).

Fix a typo.

**Release note**:
```release-note
NONE
```
2017-11-20 01:40:50 -08:00
xiangpengzhao 32675e6f62 Remove check for SubResourcePodProxyVersion and SubResourceServiceAndNodeProxyVersion 2017-11-03 23:11:09 +08:00
Marcin Owsiany c2ab5c8246 Fix a typo. 2017-11-03 13:43:32 +01:00
Kevin 4c8539cece use core client with explicit version globally 2017-10-27 15:48:32 +08:00
Shyam Jeedigunta f373645865 Increase api latency threshold for cluster-scoped list calls 2017-09-21 13:33:22 +02:00
Clayton Coleman 30a92a8f0a Report scope in e2e test metrics 2017-09-11 22:13:55 -04:00
Kubernetes Submit Queue fdf14b8218 Merge pull request #50913 from shyamjvs/list-call-slo
Automatic merge from submit-queue (batch tested with PRs 50893, 50913, 50963, 50629, 50640)

Increase latency threshold for list api calls

This is only a short-term solution to make our density test green. In the long-term, we should measure as per our new SLIs.
From @wojtek-t's [doc](https://docs.google.com/document/d/1Q5qxdeBPgTTIXZxdsFILg7kgqWhvOwY8uROEf0j5YBw) on the new SLIs/SLOs, we have the following SLO for list calls:

```
SLO1: In default Kubernetes installation, 99th percentile of SLI2 per cluster-day:
<= 1s if total number of objects of the same type as resource in the system <= X
<= 5s if total number of objects of the same type as resource in the system <= Y
<= 30s if total number of objects of the same types as resource in the system <= Z
```

I would guess that 170,000 pods would fall into the 2nd bracket (at least) and hence the new value of 5s. WDYT?

cc @kubernetes/sig-scalability-misc @wojtek-t @gmarek
2017-08-22 05:31:07 -07:00
Shyam Jeedigunta 70123e71bb Increase latency threshold for list api calls 2017-08-19 00:55:35 +02:00
Kubernetes Submit Queue b67b0ad7eb Merge pull request #50768 from shyamjvs/fix-scheduler-metric-in-gke
Automatic merge from submit-queue (batch tested with PRs 50550, 50768)

Don't SSH to master for metrics in case of GKE

cc @kubernetes/sig-scalability-misc @crassirostris
2017-08-17 03:13:59 -07:00
Shyam Jeedigunta a938c000e3 Don't SSH to master for metrics in case of GKE 2017-08-16 15:24:50 +02:00
Aleksandra Malinowska 55682f2a55 add grabbing CA metrics in e2e tests 2017-08-10 11:22:45 +02:00
Mik Vyatskov e79a228a78 Move the sig-instrumentation test to a dedicated folder 2017-08-07 10:33:03 +02:00
Jacob Simpson 29c1b81d4c Scripted migration from clientset_generated to client-go. 2017-07-17 15:05:37 -07:00
gmarek 55880e6b4b Move metrics_grabber to test/e2e 2017-07-07 13:13:44 +02:00
Shyam Jeedigunta 04822a9672 Increase threshold for LIST apicall latencies to 2s 2017-06-13 15:49:01 +02:00