Commit Graph

10516 Commits

Author SHA1 Message Date
amyjtechwriter a80d9cb289
Index Management now has link to Discover in UI. (#96326)
* Index Management now has link to Discover in UI.

* updating screenshot for data streams section

* Update docs/reference/indices/index-mgmt.asciidoc

Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>

* Update docs/reference/indices/index-mgmt.asciidoc

Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>

---------

Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>
2023-07-24 12:31:15 +01:00
Abdon Pijpelink 8469883198
[DOCS] Make it clear that "both" is not a valid value for "script.allowed_types" (#97837) 2023-07-24 10:20:57 +02:00
Marantidis Kiriakos ea42c2e076
boxplot support for transform 52189 (#96515) 2023-07-24 10:11:26 +02:00
David Turner 60935c68cc
Adjust sizing guidance re. doc count (#97831)
In #87246 we describe some reasons why it's a good idea to limit the doc
count of a shard, and we started to do so in #94065, so this commit
adjusts the sizing guidance docs to match.
2023-07-20 14:56:52 +01:00
Abdon Pijpelink e3c11016b9
[DOCS] Add tech preview admonition for geo line aggs on time-series data (#97844)
* Add tech preview admonition for time-series data

* Light editing
2023-07-20 15:40:12 +02:00
Iraklis Psaroudakis 3f9279df17
Mute testResetSystemIndices and feature-migration (#97781)
Relates #97780
2023-07-20 07:30:48 -04:00
Abdon Pijpelink 40409bf8ca
[DOCS] Semantic search page (#97715)
Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
Co-authored-by: David Roberts <dave.roberts@elastic.co>
2023-07-20 10:45:13 +02:00
David Turner c7c1dc1914
Add docs on troubleshooting NFS repos (#97601)
Spell out a bit more clearly that ES works through the OS's filesystem
abstraction, giving advice about how to reproduce problems outside of
ES.
2023-07-19 13:55:09 +01:00
Ioana Tagirta d2685926f3
Document dictionary parameter for Search Applications (#97501)
* Document dictionary parameter for Search Applications

* Address PR feedback
2023-07-19 14:26:16 +02:00
István Zoltán Szabó 3fd94678e6
[DOCS] Adds important admonition to handling delayed data page (#97753) 2023-07-19 13:32:06 +02:00
István Zoltán Szabó 57fd6b84fb
[DOCS] Expands ELSER tutorial with optimization info (#97392)
Co-authored-by: David Kyle <david.kyle@elastic.co>
2023-07-19 10:38:11 +02:00
Ryan Ernst 3f8f7182be
Remove index version and transport version from main endpoint response (#97675)
The index and transport versions are low level details of how a node
behaves in a cluster. They were recently added to the main endpoint
response, but they are too low level and should be moved to another
endpoint TBD.

This commit removes those versions from the main endpoint response. Due
to the fact lucene version is now derived from index version, this
commit also adds an explicit lucene version member to the main response.
2023-07-18 06:36:46 -07:00
Chris Hegarty 8aee1673af
Add tags (#97751)
This is a very minor change that adds tags to a couple of aggs docs. These tags are currently unused, but will be referred to by ESQL docs.
2023-07-18 13:48:37 +01:00
Mary Gouseti 9aed799b51
HealthPeriodicLogger disabled by default (#97722)
Co-authored-by: Matt Culbreth <matt.culbreth@elastic.co>
2023-07-18 10:18:23 +03:00
David Kyle 85cb7e28f2
Mute DocsClientYamlTestSuiteIT test {yaml=reference/eql/eql/line_338} (#97717) 2023-07-17 14:20:25 +01:00
Michael Peterson eaa86796a7
Add completion_time time field to async_search get and status response (#97700)
The completion_time is set as the start_time (already present) plus the 'took'
time that is set in the SearchResponse object and only if the isRunning status == false
since took is set even for in-progress searches.

We use the 'took' field because it is based on relative time, not absolute wall clock time
which can go backwards due to NTP issues. See the comments in TransportSearchAction about
the SearchTimeProvider for details.

Closes #88640
2023-07-17 09:13:15 -04:00
Andrei Dan 22bc45a82f
Change default value for data stream lifecycle poll interval to 5 mins (#97583) 2023-07-17 06:52:45 -04:00
Mayya Sharipova f8c626f792
Track max_score in collapse when requested (#97703)
Before we used to track max_score in collapse when requested (track_scores=true)
or when there is no sort in collapse (see PR#27122). But this feature
was lost through refactoring and changes.

This PR restores this feature.

Closes #97653
2023-07-17 06:48:00 -04:00
Martijn van Groningen d5ddc868f6
Document ILM's implicit rollover. (#97685)
This was introduced via #94065
Relates to #87246
2023-07-14 17:29:59 +02:00
Martijn van Groningen bea09c004e
Explain tsdb counters better. (#97618) 2023-07-13 17:15:17 +02:00
Abdon Pijpelink 0f810b19e9
[DOCS] Clarify that dense vectors can be created with ES (#97636)
* [DOCS] Clarify that dense vectors can be created with ES

* Fix rendering issue

* Break up long sentence
2023-07-13 14:04:32 +02:00
Jean-Fabrice Bobo 8f2760628f
Update filtering.asciidoc (#97542) 2023-07-13 14:01:12 +02:00
Abdon Pijpelink a9b3d7ada7
[DOCS] Clarify that disk impact of TSDS varies per data set (#97571) 2023-07-13 10:14:09 +02:00
István Zoltán Szabó 9cd609f22c
[DOCS] Adds deployment_id as an option to query_vector_builder (#97576) 2023-07-12 09:35:36 +02:00
David Roberts e810d7b77b
[ML] `inference_config` is optional for the infer trained model API (#97464)
It was made optional in #92359 which was released in version 8.6.1,
but the docs weren't updated to reflect this.
2023-07-12 08:35:06 +01:00
Kathleen DeRusso 9ee32c4cc8
Add documentation for query rules CRUD APIs (#97524)
Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>
2023-07-11 14:49:40 -04:00
Max Hniebergall 3a4113801c
[NLP] Support the different mask tokens used by NLP models for Fill Mask (#97453)
Add mask_token field to fill_mask of _ml/trained_models.

This change will enable users and Kibana to get the particular mask tokens needed for deployed models by adding a mask_token field to the GET _ml/trained_models API, as an enhancement to support kibana#159577.
2023-07-11 14:42:44 -04:00
Jack Conradson f2b0434ee2
Mark rank and sub_searches as tech preview (#97573)
rank and sub_searches are in tech preview. This adds the tech preview text that is required in the docs for these features.
2023-07-11 09:28:46 -07:00
Marc-Antoine Leclercq b1d150babf
Fix typo on semantic-search-elser.asciidoc (#97551)
MACRO => MARCO
2023-07-11 11:52:26 +02:00
Luca Cavanna 7df388df64
Make terminate_after early termination friendly (#97540)
There are situations in which the terminate_after functionality causes
the collection to keep on going although there is nothing to collect,
with the only goal of incrementing the counter of collected docs and
eventually early terminating which sets the `terminated_early` flag
in the search response to true.

When docs collection early terminates, we should rather honor the
corresponding `CollectionTerminatedException` that is thrown, and
adjust expectations around the fact that `terminate_after` affects
actual collection of documents, meaning that it can't be honored if
the threshold has not been reached by the team the collection early
terminates for other reasons.

This commit adjust the QueryPhaseCollector behavior to do that, which
allows for some additional simplifications.

Closes #97269
2023-07-11 10:14:12 +02:00
Ievgen Degtiarenko bd7c0f55e5
Add `shard_stats.total_count` column description to /_cat/nodes docs. (#97549)
This column was implemented previously but not added to the docs. It might be
helpful to list it when troubleshooting cluster balance.
2023-07-11 10:09:04 +02:00
David Turner ddd4ba5e30
Fix docs for explaining unassigned shards (#97538)
Today the `current_node` parameter is given in several sample requests
illustrating how to explain an unassigned shard using the cluster
allocation explain API. This doesn't make sense, an unassigned shard has
no `current_node`. This commit removes the misleading parameter in these
cases.
2023-07-11 08:01:12 +01:00
Michael Peterson 6dd1841dbc
Allow users to run the painless execute API on a remote cluster shard (#97335)
Added a clusterAlias to the Painless execute Request object, so that index
expressions in the request of the form "myremote:myindex" will be parsed to
set clusterAlias to "myremote" and the index to "myindex".

If clusterAlias is null, then it is executed against a shard on the local cluster, as before.
If clusterAlias is non-null, then the SingleShardTransportAction is sent to the remote cluster,
where it will run the full request (doing remote coordination). Note that the new clusterAlias 
field is not Writeable so that when it is sent to the remote cluster it will only see the index
name, not the clusterAlias (which it wouldn't know how to handle correctly).

Added PainlessExecuteIT test that tests cross-cluster calls

Updated painless-execute-script end user docs to indicate support for cross-cluster executions
2023-07-10 12:27:00 -04:00
David Turner ac0c316764
Add note on troubleshooting laggy cancellations (#97485)
Today we document that tasks may not react to cancellations immediately,
but in practice it's surprising to users and kind of a bug if they run
for too long after being cancelled. This commit adds a little extra
detail about the information to collect to troubleshoot such a
situation.
2023-07-10 11:52:36 +01:00
Christoph Büscher 192597d795
Limit _terms_enum prefix size (#97488)
Currently the prefix size of the _terms_enum endpoint are not limited in size.
Since they run against a keyword field and build automata, this can lead to high memory
consumption and the danger of running OOM. This change check the size of the prefix
early in the rest request and throw a validation error in case it exceeds
IndexWriter.MAX_TERM_LENGTH, which is the same limit we apply to the length of
keyword field values anyway, so this comes at no loss in functionality.

Closes #96572
2023-07-10 12:21:07 +02:00
David Turner 09e53f9ad9
Enhance docs around network troubleshooting (#97305)
Discovery, like cluster membership, can also be affected by network-like
issues (e.g. GC/VM pauses, dropped packets and blocked threads) so this
commit duplicates the troubleshooting info across both places.
2023-07-10 10:57:44 +01:00
Ryan Ernst 6d45c57b8f
Clarify keystore add-file command behavior (#97474)
This commit adds a little more explanation to what add-file does on the
keystore.

relates #97459
2023-07-07 15:44:16 -07:00
Mayya Sharipova 6164fa53d2
Add vector search file extensions to preload info (#97375)
Add vector search file extensions to preload info

Related to #96955
2023-07-07 15:45:08 -04:00
James Rodewig fe6a42b35f
[DOCS] Update Elastic GeoIP service link (#97455)
Adds TOS-related query parameters to the Elastic GeoIP link in the [GeoIP ingest processor docs](https://www.elastic.co/guide/en/elasticsearch/reference/current/geoip-processor.html). The current link returns a 400 HTTP status.
2023-07-07 10:53:02 -04:00
James Rodewig 52a915bf31
[DOCS] Fix TOC sidebar order 2023-07-07 08:48:03 -04:00
James Rodewig 0781e60b19
[DOCS] Add TOC to landing page (#97437)
- Adds the TOC to the Elasticsearch docs landing page. Removes the right sidebar from the landing page.
- Removes the "View all Elastic docs" link from the bottom of the landing page

Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>
2023-07-07 07:44:12 -04:00
Simon Cooper 572716f7cc
Add IndexVersion to MainResponse (#97386) 2023-07-07 11:27:54 +01:00
Craig Taverner 8151092b45
Documentation for time-series geo_line (#97373)
* Documentation for time-series geo_line

* Fix incorrect ids in geoline docs

* Some updates from review

Added image of kibana map, improved first example, linked to TSDS and added section on line simplification with link to wikipedia.

* Diagrams of truncation versus simplification
2023-07-05 17:53:27 +02:00
Matt Culbreth da81686125
Add Health API Periodic Logging (#96772)
Logs the health status of the cluster and of each health indicator as observed by the Health API
2023-07-05 11:17:18 -04:00
Mary Gouseti a432313ff3
Data stream lifecycle class names (#97381) 2023-07-05 12:28:32 +03:00
Abdon Pijpelink d94461a6ea
[DOCS] Add note about Kibana warning in ILM tutorial (#97346) 2023-07-05 10:13:40 +02:00
Luca Cavanna f5a2af6c71
Query phase: fold collector wrappers into a single top level collector (#97030)
The query phase uses a number of different collectors and combines them together, pretty much one per feature that the search API exposes: there is a collector for post_filter, one for min_score, one for terminate_after, one for aggs. While this is very flexible, we always combine such collectors together in the same way (e.g. terminate_after must be the first one, post_filter is only applied to top docs collection, min score is applied to both aggs and top docs). This means that despite we could flexibly compose collectors, we need to apply each feature predictably which makes the composability not needed. Furthermore, composability causes complexity.

The terminate_after functionality is a clear example of complexity introduced as a consequence of having a complex collector tree: it relies on a multi collector, and throws an exception to force terminating the collection for all other collectors in the tree. If there was a single collector aware of post_filter, min_score and terminate_after at the same time, we could simply reuse Lucene mechanisms to early terminate the collection (CollectionTerminatedException) instead of forcing the termination throwing an exception that Lucene does not handle.

Furthermore, MultiCollector is a complex and generic collector to combine multiple collectors together, while we always every combine maximum two collectors with it, which are more or less fixed (e.g. top docs and aggs).

This PR introduces a new top-level collector that is inspired by MultiCollector in that it holds the top docs and the optional aggs collector and applies post_filter, min_score as well as terminate_after as part of its execution. This allows us to have a specialized collector for our needs, less flexibility and more control. This surfaced some strange behaviour that we may want to change as a follow-up in how terminate_after makes us collecting docs even when all possible collections have been early terminated. The goal of this PR though is to have feature parity with query phase before the refactoring, without any change of behaviour.

A nice benefit of this work is that it allows us to rely on CollectionTerminatedException for the terminate_after functionality. This simplifies the introduction of multi-threaded collector managers when it comes to handling exceptions.
2023-06-30 12:48:13 +02:00
James Rodewig ff84ad1469
[DOCS] Note license requirements for CCS (#97252)
Notes that CCS requires both clusters to use the same license level for full capabilities.
2023-06-29 16:55:10 -04:00
Simon Cooper 5d1f616c50
Fix format of DiscoveryNode xcontent index version fields (#97223)
Also changed the value to an int rather than a string to differentiate it from node version
2023-06-29 16:53:38 +01:00
Kathleen DeRusso e35d9e7982
Update Search Application API docs to discuss warnings (#97188)
* Update Search Application API docs to discuss warnings

* Update docs/reference/search-application/apis/get-search-application.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

* Update docs/reference/search-application/apis/search-application-search.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

---------

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2023-06-29 09:16:07 -04:00