mirror of https://github.com/prometheus/prometheus
You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
125 lines
4.7 KiB
125 lines
4.7 KiB
11 years ago
|
# Introduction
|
||
|
|
||
|
Prometheus is in the business of doing several things:
|
||
|
|
||
|
* exposing metrics,
|
||
|
* collecting metrics,
|
||
|
* performing computational rules upon them and generating synthetic series,
|
||
|
* triggering events based upon rule conditions, and
|
||
|
* metric series archival and deletion.
|
||
|
|
||
|
The focus of this guide is oriented exclusively toward exposing metrics with
|
||
|
an eye toward computational rules and event handling in the Prometheus core.
|
||
|
|
||
|
After reading this guide, you will understand the following:
|
||
|
|
||
|
* what types of Prometheus metrics are available and how they work,
|
||
|
* how you should embed these metrics in your applications,
|
||
|
* what metadata you attach to them, and
|
||
|
* what to instrument.
|
||
|
|
||
|
# Metric Types
|
||
|
|
||
|
Prometheus offers three core metric types:
|
||
|
|
||
|
* counters,
|
||
|
* gauges, and
|
||
|
* summaries.
|
||
|
|
||
|
Each is tailored toward a different purpose, so it is important to use the
|
||
|
right one for the right job.
|
||
|
|
||
|
## Counter
|
||
|
|
||
|
A _Counter_ tracks addition or subtraction of values from itself. It is used
|
||
|
to represent state that you control and is not external.
|
||
|
|
||
|
The following are good examples or applications of counters:
|
||
|
|
||
|
* pedometers: devices that track the number of steps you take,
|
||
|
* tickers: devices that measure the number of times an event has occurred,
|
||
|
* censuses: processes to count to number of individuals of a population, and
|
||
|
* _your_ bank account ledger: the sum of all transactions conducted on your
|
||
|
liquid assets.
|
||
|
|
||
|
The key thing to note about _Counters_ is that they are best used when you are
|
||
|
directly receiving the events that mutate their values and mutating the
|
||
|
underlying values that represent their individual state.
|
||
|
|
||
|
A good example is of an elevator that tracks how many times it has been used in
|
||
|
its duty cycle. Each time someone hits the button to dispatch it, the elevator
|
||
|
can update its tally; and this is a tally it keeps!
|
||
|
|
||
|
## Gauges
|
||
|
|
||
|
A _Gauge_ reports instantaneous values based on external state. This is
|
||
|
primarily state that you do not control directly but rather that you depend on
|
||
|
or can inspect.
|
||
|
|
||
|
The following are good examples or applications of gauges:
|
||
|
|
||
|
* thermometer,
|
||
|
* barometer,
|
||
|
* altimeter, and
|
||
|
* total system resource utilization.
|
||
|
|
||
|
Unlike a _Counter_, you don't have direct access to the events that brought the
|
||
|
_Gauge_ to its value today. This is a white-box and black-box distinction.
|
||
|
|
||
|
## Which to use: Counter or Gauge?
|
||
|
Knowing whether to use a _Counter_ or a _Gauge_ can be tough. Let's perform a
|
||
|
practical thought exercise. Below is an image of an automobile's instrument
|
||
|
dashboard:
|
||
|
|
||
|
![Vehicle Instrument Console](dashboard.jpg "Vehicle Instrument Console")
|
||
|
|
||
|
What do you see in this photo? Take a minute to make a list. (Don't worry;
|
||
|
I'll wait for you.)
|
||
|
|
||
|
From left to right and top to bottom, there are the following instruments:
|
||
|
|
||
|
* Tachometer: it measures the revolutions per minute of the engine;
|
||
|
* Clock: it measures the time;
|
||
|
* Thermometer: it measures the temperature outside the vehicle;
|
||
|
* Odometer: they measure the total miles the vehicle has traveled as well as
|
||
|
how far it has traveled on its last trip;
|
||
|
* Fuel Gauge: it measures how much fuel is in the tank;
|
||
|
* Speedometer: it measures how fast we are going.
|
||
|
|
||
|
Great! Look at this list and now think about the definitions above for
|
||
|
_Counter_ and _Gauge_. Can you identify what instruments are what? (Again,
|
||
|
I'll wait for you.)
|
||
|
|
||
|
* Tachometer: Counter. Tricky. The vehicle's sensors observe how quickly the
|
||
|
flywheel rotates and updates the instrument's value on frequent intervals.
|
||
|
* Clock: Gauge. It is an external value, and we cannot watch the clock's
|
||
|
internal events.
|
||
|
* Thermometer: Gauge. It measures an external value, and there is no way to
|
||
|
track thermodynamic events that influence the outside temperature.
|
||
|
* Odometer: Counter. It measures distances by the number of times the
|
||
|
wheels have turned, which itself is based on an event.
|
||
|
* Fuel Gauge: Gauge. Who would have guessed? It uses a floating ballast and
|
||
|
a rheostat and is not event-based.
|
||
|
* Speedmeter: Counter. Tricky. It measures wheel rotation (events) over
|
||
|
time.
|
||
|
|
||
|
## Summaries
|
||
|
|
||
|
_Summary_ is a unique metric type, and one you have probably not used before.
|
||
|
It samples events over sliding windows of time and provides instantaneous
|
||
|
insight into their distributions, frequencies, and sums.
|
||
|
|
||
|
Practically speaking, a summary collects event values and reports the
|
||
|
following information:
|
||
|
|
||
|
* streaming quantile values of the entire observed population,
|
||
|
* the total sum of all observed values, and
|
||
|
* the count of events that have been observed.
|
||
|
|
||
|
This is quite convenient, for if you are interested in tracking latencies of an
|
||
|
operation in real time, you three types of information reported for free with
|
||
|
one metric.
|
||
|
|
||
|
# Acknowledgements
|
||
|
Robert Bray: Vehicle Instrument Panel
|