Commit Graph

18416 Commits

Author SHA1 Message Date
Ioana Tagirta a2393063c0
ES|QL: Add docs for FORK (#130314)
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2025-06-30 20:04:21 +02:00
Gal Lalouche 26c4354666
ESQL: TopNOperator, release Row on failure (#130330)
Handles the case where the Row was released on failure, by moving the declaration to a try-with-resource clause.

Resolves #130215, #130222, #130270.
2025-07-01 02:40:45 +10:00
Nhat Nguyen 6de476a88a
Add tests and docs for first/last_over_time and rate (#130290)
This PR adds unit tests and docs for first_over_time, last_over_time, 
and rate. For the rate function, the tests currently only verify that
the output is a double, not the actual value.
2025-06-30 09:22:13 -07:00
Graeme Mjehovich a43aaa8417
Enhancement: ILM sets indexing_complete to true from ReadOnly action (#129945)
* ILM sets indexing_complete from ReadOnly action

* Update docs/changelog/129945.yaml

---------

Co-authored-by: Joe Gallo <joe.gallo@elastic.co>
2025-06-30 11:17:21 -04:00
Martijn van Groningen b6e518f01a
Remove tmp_fdt_no_mmap feature flag. (#130308)
After running this change for a week,
no regressions where detected in
nightly benchmarks that use index sorting.
2025-06-30 16:07:17 +02:00
Lisa Cawley fe75f8d6c6
[DOCS] Fix broken link in semantic-text.md (#130265)
This PR fixes an Asciidoc-style link that lingered in https://www.elastic.co/docs/reference/elasticsearch/mapping-reference/semantic-text
2025-06-30 11:04:07 +02:00
Nhat Nguyen d887e7de9d
Add doc for min/max_over_time (#130269)
Add the initial docs structure for the TS command and time-series aggregation.
More content will be added later.
2025-06-28 10:28:31 -07:00
David Kilfoyle d3f042eea3
[Docs] Revert addition of 'Remove index block API' (#130248)
* Revert addition of 'Remove index block AP'

* Also remove 'Add index block API'
2025-06-27 15:02:04 -04:00
Mike Pellegrini 52495aa5fc
Fix incorrect accounting of semantic text indexing memory pressure (#130221) 2025-06-27 14:29:54 -04:00
Tim Brooks ea2e7b4382
Reapply "Dispatch ingest work to coordination thread pool (#130152)
This reverts commit 73b0a60.

Additionally, it adds thread pool documentation.
2025-06-27 11:34:28 -06:00
Artem Shelkovnikov b9360a49c5
Move connector generic issues to the bottom of the section (#130223) 2025-06-27 17:32:11 +02:00
Artem Shelkovnikov 404eb61dec
Update es-connectors-sharepoint-online.md (#130013) 2025-06-27 14:39:49 +02:00
Artem Shelkovnikov 945e807c24
[DOCS] Add mongodb connector UUID known issue (#129492)
* Update es-connectors-mongodb.md

* Apply suggestions from code review

Co-authored-by: Charlotte Hoblik <116336412+charlotte-hoblik@users.noreply.github.com>

* Update docs/reference/search-connectors/es-connectors-mongodb.md

Co-authored-by: Charlotte Hoblik <116336412+charlotte-hoblik@users.noreply.github.com>

---------

Co-authored-by: Charlotte Hoblik <116336412+charlotte-hoblik@users.noreply.github.com>
2025-06-27 14:39:38 +02:00
Kathleen DeRusso 10c0799e9b
Fix typo (#130195) 2025-06-27 13:39:19 +01:00
Kathleen DeRusso 2d38751fe7
Add knn query vector builder example over a semantic text field to docs (#130135) 2025-06-27 08:18:40 -04:00
Simon Cooper ff65fd1133
Turn direct IO for BBQ rescoring off by default (#130014)
Add a changelog entry for direct io option
2025-06-27 08:31:20 +01:00
Nick Tindall 77b459c454
Improve accuracy of write load forecast when shard numbers change (#129990) 2025-06-27 13:04:50 +10:00
James Baiera e54cd63fd0
Add docs for index component selector api convention (#128786)
Co-authored-by: David Kilfoyle <41695641+kilfoyle@users.noreply.github.com>
2025-06-26 16:47:38 -04:00
Tim Brooks 1d3bd46c6a
Allow larger write queues for large nodes (#130061)
With the rise of larger CPU count nodes our current write queue size
might be too conservative. Indexing pressure will still provide protect
against out of memories.
2025-06-26 12:18:38 -06:00
Kathleen DeRusso 81a6eadba2
Support returning default index_options for semantic_text fields when include_defaults is true (#129967) 2025-06-26 12:31:58 -04:00
Pawan Kartik 2667a2d4ba
Fix: prevent duplication of "invalid index name" string in the final exception error message (#130027)
* Use `throwInvalidIndexNameException()` to throw invalid ex after
dropping asterisk in `IdentifierBuilder#resolveAndValidateIndex()`

* Assert the message in test

* Refactor

* drop invalid chars from assertion string due to randomisation issue

* Re-assert invalid chars

* Update docs/changelog/130027.yaml
2025-06-26 16:11:41 +01:00
Kathleen DeRusso f2cf76f027
Update semantic text docs to suggest customization using index_options (#130028)
* Update semantic text docs to suggest using index options for customization

* Correct type of  index_options

* Move example

* PR feedback

* Copy warning fix
2025-06-26 10:10:45 -04:00
Valeriy Khakhutskyy 4c7d922eeb
[ML] Fix timeout bug in DBQ deletion of unused and orphan ML data (#130083)
There was a bug in the code for deleting unused and orphan ML data. When deletion using DBQ occurred, the bug caused the request to time out. This PR resolves the issue.
2025-06-26 13:24:59 +02:00
Gal Lalouche 6970bd24a0
ESQL: Aggressive release of shard contexts (#129454)
Keep better track of shard contexts using RefCounted, so they can be released more aggressively during operator processing. For example, during TopN, we can potentially release some contexts if they don't pass the limit filter.

This is done in preparation of TopN fetch optimization, which will delay the fetching of additional columns to the data node coordinator, instead of doing it in each individual worker, thereby reducing IO. Since the node coordinator would need to maintain the shard contexts for a potentially longer duration, it is important we try to release what we can eariler.

An even more advanced optimization is to delay fetching to the main cluster coordinator, but that would be more involved, since we need to first figure out how to transport the shard contexts between nodes.

Summary of main changes:

DocVector now maintains a RefCounted instance per shard.
Things which can build or release DocVectors (e.g., LuceneSourceOperator, TopNOperator), can also hold RefCounted instances, so they can pass them to DocVector and also ensure contexts aren't released if they can still be potentially used later.
Driver's main loop iteration (runSingleLoopIteration), now closes its operators even between different operator processing. This is extra aggressive, and was mostly done to improve testability.
Added a couple of tests to TopNOperator and a new integration test EsqlTopNShardManagementIT, which uses the pausable plugin framework to check that TopNOperator releases things as early as possible..
2025-06-26 09:49:40 +10:00
Nhat Nguyen 2bc62848e8
Avoid dropping aggregate groupings in local plans (#129370)
The local plan optimizer should not change the layout, as it has already 
been agreed upon. However, CombineProjections can violate this when some
grouping elements refer to the same attribute. This occurs when
ReplaceFieldWithConstantOrNull replaces missing fields with the same
reference for a given data type.

Closes #128054
Closes #129811
2025-06-25 11:48:15 -07:00
Stanislav Malyshev d3e00bec73
ES|QL CCS GA release notes highlight (#130032)
* ESQL CCS GA
2025-06-25 19:29:27 +01:00
Pawan Kartik c94c021d0e
ES|QL: Check if cluster aliases and index patterns are valid before executing query (#122497)
ES|QL index patterns validation: Ensure that the patterns in the query are syntactically and semantically valid

---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
Co-authored-by: Andrei Stefan <astefan@users.noreply.github.com>
Co-authored-by: Alexander Spies <alexander.spies@elastic.co>
2025-06-25 16:47:42 +01:00
Stef Nestor af735accda
(Doc+) Link Slow Logs to Read+Write models (#129810)
* (Doc+) Link Slow Logs to Read+Write models

👋 howdy team! Baby update to cross-link the reading+write models from the Slow Logs page. 🙏

* feedback

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

---------

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2025-06-25 09:40:44 -06:00
elasticsearchmachine e977fccc4f
Finalize release notes for v9.0.3 release (#129938)
* Finalize docs for v9.0.3 release

* Fix breaking changes page

* Fix deprications page

* Fix release notes index page

---------

Co-authored-by: Charlotte Hoblik <sarolta@saroltah.hu>
Co-authored-by: Charlotte Hoblik <116336412+charlotte-hoblik@users.noreply.github.com>
2025-06-25 14:02:49 +02:00
elasticsearchmachine 87e6c7462f
Finalize docs for v9.0.2 release (#128837)
Co-authored-by: Charlotte Hoblik <116336412+charlotte-hoblik@users.noreply.github.com>
2025-06-25 12:27:27 +01:00
Lorenzo Dematté b52e5a71eb
[Entitlements] Small fix on relative_path docs (#129984) 2025-06-25 11:53:54 +01:00
Liam Thompson 4df0e9930c
[DOCS] Update ESQL metadata fields page (#129939)
* [DOCS] Update ESQL metadata fields page

**esql-metadata-fields.md:**
- restructured from bullet list to table format for metadata fields
- added `_index_mode` and `_source` fields to available metadata
- improved field descriptions (more detailed)
- added "usage and limitations" section
- reorganized examples into subsections with headers
- added `_score` sorting example
- added tip box linking to search documentation

* 🚙Drive by updates to search functions ref page

moved tutorial link into tip box at top
added cross-reference to search overview documentation
minor text flow improvements and punctuation fixes

* Fix id typo

* Apply suggestions from review

Co-authored-by: Bogdan Pintea <sig11@mailbox.org>
2025-06-25 12:00:28 +02:00
Tim Vernum 8b62a55f2f
Watch SSL files instead of directories (#129738)
With the introduction of entitlements (#120243) and exclusive file
access (#123087) it is no longer safe to watch a whole directory.

In a lot of deployments, the parent directory for SSL config files
will be the main config directory, which also contains exclusive files
such as SAML realm metadata or File realm users. Watching that
directory will cause entitlement warnings because it is not
permissible for core/ssl-config to read files that are exclusively
owned by the security module (or other modules)
2025-06-25 18:24:57 +10:00
Alexander Spies 7b5e92fcb2
ESQL: Declare LOOKUP JOIN as GA in docs (#129947)
* Declare LU JOIN GA in 9.1
* Align applies_to tags for sample, change_point

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2025-06-25 09:30:37 +02:00
Mike Pellegrini 651bc39565
Simplified Linear & RRF Retrievers - Return error on empty fields param (#129962) 2025-06-24 19:25:24 -04:00
David Kyle 3a1551e0ef
[ML] Move to the Cohere V2 API for new inference endpoints (#129884) 2025-06-25 07:51:05 +10:00
HYUNSANG HAN (한현상, Travis) d16271b78d
Add RemoveBlock API to allow `DELETE /{index}/_block/{block}` (#129128)
Introduces a new `RemoveBlock` API that complements the existing `AddBlock` API by allowing users to remove index blocks using `DELETE /{index}/_block/{block}`.

Resolves #128966

---------

Co-authored-by: Niels Bauman <nielsbauman@gmail.com>
2025-06-25 06:16:14 +10:00
elasticsearchmachine cff9da2eae Prune changelogs after 8.17.8 release 2025-06-24 18:00:57 +00:00
David Turner ba103f1c24
Reverse disordered-version warning message (#129904)
The comment in `TransportHandshaker` indicates (correctly) that we emit
a warning when talking to a chronologically-newer-yet-numerically-older
version, but the wording of the warning message is inverted and says
that the remote is chronologically-older-yet-numerically-newer. This
commit straightens out the message to match the situation it is
describing.

Relates #123397
2025-06-24 18:30:11 +01:00
Mark J. Hoy 7249ac4d42
Mark Token Pruning for Sparse Vector as GA (#128854)
* remove [preview] labels sparse vec / token pruning

* Update docs/changelog/128854.yaml

* update changelog

* set changelog to feature/ml

* set proper area

* add applies_to labels

* add clarification for token pruning behaviour
2025-06-24 11:36:02 -04:00
Martijn van Groningen ae3c3601fd
Remove docs warning that synthetic source is in es|ql is experimental. (#129930) 2025-06-24 16:59:55 +02:00
elasticsearchmachine 9cc034ba44
Add release notes for v9.0.3 release (#129803)
* Update docs for v9.0.3 release

* Add merge scheduler issue to known issues pages

* Add patch release information

---------

Co-authored-by: Charlotte Hoblik <116336412+charlotte-hoblik@users.noreply.github.com>
Co-authored-by: Charlotte Hoblik <sarolta@saroltah.hu>
2025-06-24 11:34:32 +01:00
Panagiotis Bailis b855266bd1
Make bbq_hnsw the default index option for dense-vector fields with more than 384 dimensions (#129825) 2025-06-24 12:20:16 +03:00
Charlotte Hoblik 1a0ab74323
[DOCS]: Add Vector tile search API examples (#129520)
* Add vector tile examples

* Add new page to TOC

* Add internal translation example

* Update docs/reference/elasticsearch/rest-apis/vector-tile-search.md

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>

* Update docs/reference/elasticsearch/rest-apis/vector-tile-search.md

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>

---------

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
2025-06-24 09:30:16 +01:00
Charlotte Hoblik fb30e59a38
Add Query API key information examples (#129719) 2025-06-24 10:13:04 +02:00
Liam Thompson 5aeadf277e
[DOCS] Remove planning verbiage (#129900) 2025-06-24 09:53:42 +02:00
Nik Everett 59a10bdd7b
ESQL: Add counters to signature for `IS NULL` (#129670)
This adds `counter_long` and `counter_double` to the signatures of
supported fields for `IS NULL` and `IS NOT NULL`. We hadn't been
generating those signatures since the docs v3 migration, so this had to
plug those in. In addition, it changes the wording on a few things and
adds a note that if a field is only in some documents then the ones
missing the field will have `NULL` - which is important information for
`IS NULL` and `IS NOT NULL`.
2025-06-24 01:18:15 +01:00
Mark J. Hoy a671505c8a
Update sparse_vector field mapping to include default setting for token pruning (#129089)
* Initial checkin of refactored index_options code

* [CI] Auto commit changes from spotless

* initial unit testing

* complete unit tests; add yaml tests

* [CI] Auto commit changes from spotless

* register test feature for sparse vector

* Update docs/changelog/129089.yaml

* update changelog

* add docs

* explicit set default index_options if null

* [CI] Auto commit changes from spotless

* update yaml tests; update docs

* fix yaml tests

* readd auth for teardown

* only serialize index options if not default

* [CI] Auto commit changes from spotless

* serialization refactor; pass index version around

* [CI] Auto commit changes from spotless

* fix transport versions merge

* fix up docs

* [CI] Auto commit changes from spotless

* fix docs; add include_defaults unit and yaml test

* [CI] Auto commit changes from spotless

* override getIndexReaderManager for SemanticQueryBuilderTests

* [CI] Auto commit changes from spotless

* cleanup mapper/builder/tests; index vers. in type

still need to refactor / clean YAML tests

* [CI] Auto commit changes from spotless

* cleanups to mapper tests for clarity

* [CI] Auto commit changes from spotless

* move feature into mappers; fix yaml tests

* cleanups; add comments; remove redundant test

* [CI] Auto commit changes from spotless

* escape more periods in the YAML tests

* cleanup mapper and type tests

* [CI] Auto commit changes from spotless

* rename mapping for previous index test

* set explicit number of shards for yaml test

---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
Co-authored-by: Kathleen DeRusso <kathleen.derusso@elastic.co>
2025-06-24 08:21:32 +10:00
Pat Whelan aeb37189af
[ML] SageMaker Elastic Payload (#129413)
Send the Elastic API Payload to a SageMaker endpoint, and parse the
response as if it were an Elastic API response.

- SageMaker now supports all task types in the Elastic API format.
- Streaming is supported using the SageMaker client/server rpc,
  rather than SSE. Payloads must be in a complete and valid JSON
  structure.
- Task Settings can be used for additional passthrough settings, but
  they will not be saved alongside the model. Elastic cannot make
  guarantees on the structure or contents of this payload, so Elastic
  will treat it like the other input payloads and only allow them during
  inference.
2025-06-24 06:43:24 +10:00
Liam Thompson 8c06acccf8
[DOCS][ESQL] GA search functions for 9.1 (#129786)
* [DOCS][ESQL] Flip preview booleans, to GA search functions

* render docs, tweak some applies_to metadata in docs gen code

- **rendered docs (md):**
  - kql: removed serverless preview, added ga 9.1.0
  - match: removed serverless preview, added ga 9.1.0
  - match_phrase: changed from preview 9.1.0 to unavailable 9.0 + ga 9.1.0
  - qstr: removed serverless preview, added ga 9.1.0
  - search functions list: removed bullet point before term function

- **docs generation code (java):**
  - match_phrase: updated function info annotations to unavailable 9.0 + ga 9.1.0
  - query_string: uncommented ga 9.1.0 annotation
2025-06-23 20:51:54 +01:00