elasticsearch/docs/reference/esql/functions/auto_bucket.asciidoc

[discrete]
[[esql-auto_bucket]]
=== `AUTO_BUCKET`
Creates human-friendly buckets and returns a `datetime` value for each row that
corresponds to the resulting bucket the row falls into. Combine `AUTO_BUCKET`
with <<esql-stats-by>> to create a date histogram.

You provide a target number of buckets, a start date, and an end date, and it
picks an appropriate bucket size to generate the target number of buckets or
fewer. For example, this asks for at most 20 buckets over a whole year, which
picks monthly buckets:

[source.merge.styled,esql]
----
include::{esql-specs}/date.csv-spec[tag=auto_bucket_month]
----
[%header.monospaced.styled,format=dsv,separator=|]
|===
include::{esql-specs}/date.csv-spec[tag=auto_bucket_month-result]
|===

The goal isn't to provide *exactly* the target number of buckets, it's to pick a
range that people are comfortable with that provides at most the target number of
buckets.

If you ask for more buckets then `AUTO_BUCKET` can pick a smaller range. For example,
asking for at most 100 buckets in a year will get you week long buckets:

[source.merge.styled,esql]
----
include::{esql-specs}/date.csv-spec[tag=auto_bucket_week]
----
[%header.monospaced.styled,format=dsv,separator=|]
|===
include::{esql-specs}/date.csv-spec[tag=auto_bucket_week-result]
|===

`AUTO_BUCKET` does not filter any rows. It only uses the provided time range to
pick a good bucket size. For rows with a date outside of the range, it returns a
`datetime` that corresponds to a bucket outside the range. Combine `AUTO_BUCKET`
with <<esql-where>> to filter rows.

A more complete example might look like:

[source.merge.styled,esql]
----
include::{esql-specs}/date.csv-spec[tag=auto_bucket_in_agg]
----
[%header.monospaced.styled,format=dsv,separator=|]
|===
include::{esql-specs}/date.csv-spec[tag=auto_bucket_in_agg-result]
|===

NOTE: `AUTO_BUCKET` does not create buckets that don't match any documents. That's
why the example above is missing `1985-03-01` and other dates.

==== Numeric fields

`auto_bucket` can also operate on numeric fields like this:
[source.merge.styled,esql]
----
include::{esql-specs}/ints.csv-spec[tag=auto_bucket]
----
[%header.monospaced.styled,format=dsv,separator=|]
|===
include::{esql-specs}/ints.csv-spec[tag=auto_bucket-result]
|===

Unlike the example above where you are intentionally filtering on a date range,
you rarely want to filter on a numeric range. So you have find the `min` and `max`
separately. We don't yet have an easy way to do that automatically. Improvements
coming!
Restructure ES\|QL docs (#100806) * Break out 'Limitations' into separate page * Add REST API docs * Restructure commands, functions, and operators refs * Add placeholder for getting started guide * Group 'Syntax', 'Metafields', and 'MV fields' under 'Language' * Add placeholder for Kibana page * Add link from landing page * Apply uniform formatting to ACOS, CASE, and DATE_PARSE function refs * Reword default LIMIT * Add support for COUNT() Move 'Commands' and 'Functions and operators' to individual pages --------- Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> 2023-10-17 23:36:14 +08:00			`[discrete]`
Docs for `auto_bucket` (ESQL-1208) This adds some docs for the `auto_bucket` command. --------- Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co> 2023-06-01 01:49:39 +08:00			`[[esql-auto_bucket]]`
			=== `AUTO_BUCKET`
			Creates human-friendly buckets and returns a `datetime` value for each row that
			corresponds to the resulting bucket the row falls into. Combine `AUTO_BUCKET`
			`with <<esql-stats-by>> to create a date histogram.`

			`You provide a target number of buckets, a start date, and an end date, and it`
			`picks an appropriate bucket size to generate the target number of buckets or`
			`fewer. For example, this asks for at most 20 buckets over a whole year, which`
			`picks monthly buckets:`

Docs: compress results into query (ESQL-1259) This compresses the results and the query on the page to take up less space and make them more obviously connected. 2023-06-12 22:37:45 +08:00			`[source.merge.styled,esql]`
Docs for `auto_bucket` (ESQL-1208) This adds some docs for the `auto_bucket` command. --------- Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co> 2023-06-01 01:49:39 +08:00			`----`
			`include::{esql-specs}/date.csv-spec[tag=auto_bucket_month]`
			`----`
Docs: compress results into query (ESQL-1259) This compresses the results and the query on the page to take up less space and make them more obviously connected. 2023-06-12 22:37:45 +08:00			`[%header.monospaced.styled,format=dsv,separator=\|]`
Docs for `auto_bucket` (ESQL-1208) This adds some docs for the `auto_bucket` command. --------- Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co> 2023-06-01 01:49:39 +08:00			`\|===`
			`include::{esql-specs}/date.csv-spec[tag=auto_bucket_month-result]`
			`\|===`

			`The goal isn't to provide exactly the target number of buckets, it's to pick a`
			`range that people are comfortable with that provides at most the target number of`
			`buckets.`

			If you ask for more buckets then `AUTO_BUCKET` can pick a smaller range. For example,
			`asking for at most 100 buckets in a year will get you week long buckets:`

Docs: compress results into query (ESQL-1259) This compresses the results and the query on the page to take up less space and make them more obviously connected. 2023-06-12 22:37:45 +08:00			`[source.merge.styled,esql]`
Docs for `auto_bucket` (ESQL-1208) This adds some docs for the `auto_bucket` command. --------- Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co> 2023-06-01 01:49:39 +08:00			`----`
			`include::{esql-specs}/date.csv-spec[tag=auto_bucket_week]`
			`----`
Docs: compress results into query (ESQL-1259) This compresses the results and the query on the page to take up less space and make them more obviously connected. 2023-06-12 22:37:45 +08:00			`[%header.monospaced.styled,format=dsv,separator=\|]`
Docs for `auto_bucket` (ESQL-1208) This adds some docs for the `auto_bucket` command. --------- Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co> 2023-06-01 01:49:39 +08:00			`\|===`
			`include::{esql-specs}/date.csv-spec[tag=auto_bucket_week-result]`
			`\|===`

			`AUTO_BUCKET` does not filter any rows. It only uses the provided time range to
			`pick a good bucket size. For rows with a date outside of the range, it returns a`
			`datetime` that corresponds to a bucket outside the range. Combine `AUTO_BUCKET`
			`with <<esql-where>> to filter rows.`

			`A more complete example might look like:`

Docs: compress results into query (ESQL-1259) This compresses the results and the query on the page to take up less space and make them more obviously connected. 2023-06-12 22:37:45 +08:00			`[source.merge.styled,esql]`
Docs for `auto_bucket` (ESQL-1208) This adds some docs for the `auto_bucket` command. --------- Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co> 2023-06-01 01:49:39 +08:00			`----`
			`include::{esql-specs}/date.csv-spec[tag=auto_bucket_in_agg]`
			`----`
Docs: compress results into query (ESQL-1259) This compresses the results and the query on the page to take up less space and make them more obviously connected. 2023-06-12 22:37:45 +08:00			`[%header.monospaced.styled,format=dsv,separator=\|]`
Docs for `auto_bucket` (ESQL-1208) This adds some docs for the `auto_bucket` command. --------- Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co> 2023-06-01 01:49:39 +08:00			`\|===`
			`include::{esql-specs}/date.csv-spec[tag=auto_bucket_in_agg-result]`
			`\|===`

			NOTE: `AUTO_BUCKET` does not create buckets that don't match any documents. That's
[DOCS] Some minor ES\|QL docs fixes (#99423) 2023-09-11 22:20:10 +08:00			why the example above is missing `1985-03-01` and other dates.
Support `auto_bucket` for numeric fields (ESQL-1494) This adds support for numeric fields to `auto_bucket` and adds a new `floor` function to round numeric down to the nearest integer. That function is exposed because it's probably useful. I added it in this PR because `auto_bucket` uses it as an implementation detail as well. 2023-08-01 04:45:59 +08:00
			`==== Numeric fields`

			`auto_bucket` can also operate on numeric fields like this:
			`[source.merge.styled,esql]`
			`----`
			`include::{esql-specs}/ints.csv-spec[tag=auto_bucket]`
			`----`
			`[%header.monospaced.styled,format=dsv,separator=\|]`
			`\|===`
			`include::{esql-specs}/ints.csv-spec[tag=auto_bucket-result]`
			`\|===`

			`Unlike the example above where you are intentionally filtering on a date range,`
			you rarely want to filter on a numeric range. So you have find the `min` and `max`
			`separately. We don't yet have an easy way to do that automatically. Improvements`
			`coming!`