Add an explanation for the quantile aggregation operator

Sadly, just linking to the Histogram best practice document, as done for `histogram_quantile`, would be confusing here because the best practice document only deals with quantiles in the context of Histograms and Summaries, which is very different from the context of the `quantile` aggregator and `quantile_over_time` function, which is already a source of a lot of confusion. Thus, I think the least bad solution is to add a short explanation in this section directly. There isn't even a good resource on the internet we can link to. A lot of statisticians use φ-quantiles, but they don't have a generally accepted name for it. I have added the explanation after the other detailed explanations of `count_values`, `topk` and `bottomk`. I think that fits quite nicely into the flow. Signed-off-by: beorn7 <beorn@grafana.com>
2020-07-06 17:25:55 +02:00 · 2020-07-06 17:25:55 +02:00 · cf698f71e5
parent ad7da8fd35
commit cf698f71e5
1 changed files with 6 additions and 1 deletions
--- a/docs/querying/operators.md
+++ b/docs/querying/operators.md
@ -226,6 +226,11 @@ time series is the number of times that sample value was present.
 the input samples, including the original labels, are returned in the result
 vector. `by` and `without` are only used to bucket the input vector.

+`quantile` calculates the φ-quantile, the value that ranks at number φ*N among
+the N metric values of the dimensions aggregated over. φ is provided as the
+aggregation parameter. For example, `quantile(0.5, ...)` calculates the median,
+`quantile(0.95, ...)` the 95th percentile.
+
 Example:

 If the metric `http_requests_total` had time series that fan out by