InfluxDB: Flux - Distinct() vs. Unique(): Unterschied zwischen den Versionen

Version vom 10. Dezember 2021, 11:01 Uhr

Because I was confused several times about which function is the right one (distinct() or unique()) I write this site.

Inhaltsverzeichnis

[Verbergen]

1 Base Data
- 1.1 Flux query to generate this data
2 Distinct()
- 2.1 Example 1 - without parameter
- 2.2 Example 2 - with parameter "host"
3 Unique()

Base Data

All the the examples from this site are based from the following data:

Flux query to generate this data

import "array"

var = [
  {_time: time(v: "2021-12-10T00:01:00Z"), host: "HOST01", _field: "cpu_usage", _value: 37},
  {_time: time(v: "2021-12-10T00:02:00Z"), host: "HOST02", _field: "cpu_usage", _value: 72},
  {_time: time(v: "2021-12-10T00:03:00Z"), host: "HOST03", _field: "cpu_usage", _value: 88},
  {_time: time(v: "2021-12-10T00:04:00Z"), host: "HOST03", _field: "cpu_usage", _value: 11},
  {_time: time(v: "2021-12-10T00:05:00Z"), host: "HOST01", _field: "cpu_usage", _value: 37},
  {_time: time(v: "2021-12-10T00:06:00Z"), host: "HOST02", _field: "cpu_usage", _value: 90},
  {_time: time(v: "2021-12-10T00:07:00Z"), host: "HOST04", _field: "cpu_usage", _value: 90},
  {_time: time(v: "2021-12-10T00:08:00Z"), host: "HOST03", _field: "cpu_usage", _value: 77},
  {_time: time(v: "2021-12-10T00:09:00Z"), host: "HOST05", _field: "cpu_usage", _value: 57},
  {_time: time(v: "2021-12-10T00:10:00Z"), host: "HOST01", _field: "cpu_usage", _value: 13}
]

array.from(rows: var)

Distinct()

The distinct() function returns the unique values for a given column. Null is considered its own distinct value if it is present.
The _value of each output record is set to only the specified column. This means all other columns will be removed. Distinct() is a selector function.
The function distinct() by default uses the column _value.
For example I use distinct() mainly for Grafana template variables.

Function documentation: https://docs.influxdata.com/flux/v0.x/stdlib/universe/distinct/

Example 1 - without parameter

So if you use distinct() without parameter like following query:

..
array.from(rows: var)
  |> distinct()

Output

This means following rows will be removed:

Example 2 - with parameter "host"

So if you use distinct() with parameter like following query:

..
array.from(rows: var)
  |> distinct(column: "host)

Output

This means following rows will be removed:

Unique()

@@ Zeile 31: / Zeile 31: @@
 ''The distinct() function '''returns the unique values for a given column'''. Null is considered its own distinct value if it is present.'' <br>
 ''The _value of each output record is set to '''only the specified column'''. This means '''all other columns will be removed'''. Distinct() is a selector function.'' <br>
-''The function [https://docs.influxdata.com/flux/v0.x/stdlib/universe/distinct/ distinct()] by default uses the column <code>_value</code>.'' <br>
+''The function distinct() by default uses the column <code>_value</code>.'' <br>
 ''For example I use distinct() mainly for '''Grafana template variables'''.'' <br>
+Function documentation: https://docs.influxdata.com/flux/v0.x/stdlib/universe/distinct/
 === Example 1 - without parameter ===

InfluxDB: Flux - Distinct() vs. Unique(): Unterschied zwischen den Versionen

Version vom 10. Dezember 2021, 11:01 Uhr

Inhaltsverzeichnis

Base Data

Flux query to generate this data

Distinct()

Example 1 - without parameter

Example 2 - with parameter "host"

Unique()

Navigationsmenü

Meine Werkzeuge

Namensräume

Varianten

Ansichten

Aktionen

Suche

Navigation

Informatik

Werkzeuge

Drucken/exportieren