[[esql-agg-functions]]
==== {esql} aggregate functions

++++
<titleabbrev>Aggregate functions</titleabbrev>
++++

The <<esql-stats-by>> command supports these aggregate functions:

// tag::agg_list[]
* <<esql-agg-avg>>
* <<esql-agg-count>>
* <<esql-agg-count-distinct>>
* <<esql-agg-max>>
* <<esql-agg-median>>
* <<esql-agg-median-absolute-deviation>>
* <<esql-agg-min>>
* <<esql-agg-percentile>>
* experimental:[] <<esql-agg-st-centroid>>
* <<esql-agg-sum>>
* <<esql-agg-values>>
// end::agg_list[]
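
As a minimal sketch, several of these functions can be combined in a single
`STATS ... BY` command. The `employees` index and its `salary` and `languages`
fields are assumed sample data and are not defined on this page:

[source,esql]
----
// Assumes a hypothetical `employees` index with numeric `salary` and `languages` fields
FROM employees
| STATS avg_salary = AVG(salary), max_salary = MAX(salary), headcount = COUNT(*) BY languages
| SORT languages
----

Each aggregate is computed once per distinct `languages` value. See the
individual function pages below for supported types and exact behavior.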

include::avg.asciidoc[]
include::count.asciidoc[]
include::count-distinct.asciidoc[]
include::max.asciidoc[]
include::median.asciidoc[]
include::median-absolute-deviation.asciidoc[]
include::min.asciidoc[]
include::percentile.asciidoc[]
include::st_centroid_agg.asciidoc[]
include::sum.asciidoc[]
include::values.asciidoc[]