elasticsearch

Commit Graph

Author	SHA1	Message	Date
Adrien Grand	feb6620d14	`indices.query.bool.max_clause_count` now limits all query clauses (#75297 ) In the upcoming Lucene 9 release, `indices.query.bool.max_clause_count` is going to apply to the entire query tree rather than per `bool` query. In order to avoid breaks, the limit has been bumped from 1024 to 4096. The semantics will effectively change when we upgrade to Lucene 9, this PR is only about agreeing on a migration strategy and documenting this change. To avoid further breaks, I am leaning towards keeping the current setting name even though it contains `bool`. I believe that it still makes sense given that `bool` queries are typically the main contributors to high numbers of clauses. Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>	2021-07-21 12:16:30 +02:00
James Rodewig	b207aac9ed	[DOCS] Increase `search.max_bucket` default value by one Relates to #70645.	2021-06-29 08:38:24 -04:00
David Turner	3660d863db	Fork the sending of file chunks during recovery (#74164 ) Today if sending file chunks is CPU-bound (e.g. when using compression) then we tend to concentrate all that work onto relatively few threads, even if `indices.recovery.max_concurrent_file_chunks` is increased. With this commit we fork the transmission of each chunk onto its own thread so that the CPU-bound work can happen in parallel.	2021-06-16 11:58:13 +01:00
David Turner	43ddd4a580	Fix docs rendering around recovery rate table (#73879 ) - Replaces ⇐ with ≤ - Removes table caption - Adjust table headers - Fixes leading + on subsequent paragraphs	2021-06-08 15:00:00 +01:00
Luca Belluccini	3e41d753e3	[DOCS] Note circuit breakers reject requests with 429 HTTP status code (#69864 ) We mention Elasticsearch returns 429 if the circuit breaker trips in https://www.elastic.co/blog/improving-node-resiliency-with-the-real-memory-circuit-breaker, but there is no mention in the docs. This adds an xref to circuit breaker errors section. Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>	2021-06-02 10:31:24 -04:00
James Rodewig	d3c56e6fca	[DOCS] Remove unneeded articles for Elasticsearch Service and Elastic Agent	2021-04-02 16:01:59 -04:00
James Rodewig	693807a6d3	[DOCS] Fix double spaces (#71082 )	2021-03-31 09:57:47 -04:00
David Turner	dd69ae95d7	Note recovery settings affect searchable snapshots (#70771 ) Adds a short note that `max_restore_bytes_per_sec` and `indices.recovery.max_bytes_per_sec` also affect the recovery of a searchable snapshot index.	2021-03-24 09:22:44 +00:00
Lee Hinman	3f9f007545	Add the frozen tier node role and ILM phase (#68605 ) This commit adds the `data_frozen` node role as part of the formalization of data tiers. It also adds the `"frozen"` phase to ILM, currently allowing the same actions as the existing cold phase. The frozen phase is intended to be used for data even less frequently searched than the cold phase, and will eventually be loosely tied to data using partial searchable snapshots (as opposed to full searchable snapshots in the cold phase). Relates to #60848	2021-02-05 14:38:13 -07:00
Jason Tedor	6e94e67ae9	Set recovery rate for dedicated cold nodes (#68480 ) This commit sets the recovery rate for dedicated cold nodes. The goal here is to enhance the performance of recovery in a dedicated cold tier, where we expect such nodes to be predominantly using searchable snapshots to back the indices located on them. This commit follows a simple approach where we increase the recovery rate as a function of the node size, for nodes that appear to be dedicated cold nodes.	2021-02-04 10:36:07 -05:00
James Rodewig	4a2a97a058	[DOCS] Document the `stack.templates.enabled` setting (#68328 )	2021-02-02 08:35:21 -05:00
Yang Cheng	168d98b7dd	limit the depth of nested bool queries (#66204 ) limit the depth of nested bool queries Introduce a new node level setting `indices.query.bool.max_nested_depth` that controls the depth of nested bool queries. Throw an error if a nested depth of a bool query exceeds the maximum allowed nested depth. Closes #55303	2021-01-12 09:36:09 -05:00
Nik Everett	3e3152406a	Bust the request cache when the mapping changes (#66295 ) This makes sure that we only serve a hit from the request cache if it was built using the same mapping and that the same mapping is used for the entire "query phase" of the search. Closes #62033	2020-12-23 13:19:02 -05:00
James Rodewig	77dc63b2de	[DOCS] Fix `search.max_buckets` default (#66311 )	2020-12-14 21:55:27 -05:00
Wylie Conlon	10ee0f2878	Clarify field data cache behavior in docs (#64375 ) * Clarify that field data cache includes global ordinals * Describe that the cache should be cleared once the limit is reached * Clarify that the `_id` field does not support aggregations anymore * Fold the `fielddata` mapping parameter page into the `text` field docs * Improve cross-linking	2020-11-20 13:53:23 -08:00
James Rodewig	bbcd8078ce	[DOCS] Document dynamic index mgmt and buffer settings (#61753 )	2020-09-04 10:19:42 -04:00
James Rodewig	a70c00a62c	[DOCS] Document dynamic cluster settings (#61760 ) Co-authored-by: Adam Locke <adam.locke@elastic.co>	2020-09-01 15:48:45 -04:00
James Rodewig	d077a4f5a1	[DOCS] Document static field cache settings (#61424 )	2020-08-26 17:10:08 -04:00
James Rodewig	8359232c45	[DOCS] Document dynamic circuit breaker settings (#61334 )	2020-08-19 10:58:04 -04:00
James Rodewig	ae01606785	[DOCS] Replace `twitter` dataset in docs (#60604 )	2020-08-03 12:49:56 -04:00
James Rodewig	441c3a21b1	[DOCS] Update my-index examples (#60132 ) Changes the following example index names to `my-index-000001` for consistency: * `my-index` * `my_index` * `myindex`	2020-07-27 14:46:39 -04:00
James Rodewig	2774cd6938	[DOCS] Swap `[float]` for `[discrete]` (#60124 ) Changes instances of `[float]` in our docs for `[discrete]`. Asciidoctor prefers the `[discrete]` tag for floating headings: https://asciidoctor.org/docs/asciidoc-asciidoctor-diffs/#blocks	2020-07-23 11:48:22 -04:00
Nhat Nguyen	961db311f0	Sending operations concurrently in peer recovery (#58018 ) Today, we send operations in phase2 of peer recoveries batch by batch sequentially. Normally that's okay as we should have a fairly small number of operations in phase 2 due to the file-based threshold. However, if phase1 takes a lot of time and we are actively indexing, then phase2 can have a lot of operations to replay. With this change, we will send multiple batches concurrently (defaults to 1) to reduce the recovery time.	2020-07-07 18:00:03 -04:00
Adam Locke	3a1258fe97	[DOCS] Add supported ESS settings to ES docs (#57953 ) * Adding ESS icons to supported ES settings. * Adding new file for supported ESS settings. * Adding supported ESS settings for HTTP and disk-based shard allocation. * Adding more supported settings for ESS. * Adding descriptions for each Cloud section, plus additional settings. * Adding new warehouse file for Cloud, plus additional settings. * Adding node settings for Cloud. * Adding audit settings for Cloud. * Resolving merge conflict. * Adding SAML settings (part 1). * Adding SAML realm encryption and signing settings. * Adding SAML SSL settings. * Adding Kerberos realm settings. * Adding OpenID Connect Realm settings. * Adding OpenID Connect SSL settings. * Resolving leftover Git merge markers. * Removing Cloud settings page and link to it. * Add link to mapping source * Update docs/reference/docs/reindex.asciidoc * Incorporate edit of HTTP settings * Remove "cloud" from tag and ID * Remove "cloud" from tag and update description * Remove "cloud" from tag and ID * Change "whitelists" to "specifies" * Remove "cloud" from end tag * Removing cloud from IDs and tags. * Changing link reference to fix build issue. * Adding index management page for missing settings. * Removing warehouse file for Cloud and moving settings elsewhere. * Clarifying true/false usage of http.detailed_errors.enabled. * Changing underscore to dash in link to fix ci build.	2020-07-02 14:13:06 -04:00
Yannick Welsch	118521d022	Account for recovery throttling when restoring snapshot (#58658 ) Restoring from a snapshot (which is a particular form of recovery) does not currently take recovery throttling into account (i.e. the `indices.recovery.max_bytes_per_sec` setting). While restores are subject to their own throttling (repository setting `max_restore_bytes_per_sec`), this repository setting does not allow for values to be configured differently on a per-node basis. As restores are very similar in nature to peer recoveries (streaming bytes to the node), it makes sense to configure throttling in a single place. The `max_restore_bytes_per_sec` setting is also changed to default to unlimited now, whereas previously it was set to `40mb`, which is the current default of `indices.recovery.max_bytes_per_sec`. This means that no behavioral change will be observed by clusters where the recovery and restore settings were not adapted. Relates https://github.com/elastic/elasticsearch/issues/57023 Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-06-30 13:08:21 +02:00
James Rodewig	6ef9506beb	[DOCS] Correct setting type for `indices.query.bool.max_clause_count` (#56640 ) #56449 incorrectly labelled this as a dynamic setting. This corrects that error.	2020-05-12 16:25:41 -04:00
James Rodewig	af2d13144f	[DOCS] Add reference docs for `search.max_buckets` setting (#56449 ) Adds reference-style setting documentation for the `search.max_buckets` setting. This setting was previously only documented on the [bucket aggregations][0] page. [0]: https://www.elastic.co/guide/en/elasticsearch/reference/master/search-aggregations-bucket.html	2020-05-11 08:35:24 -04:00
Stuart Tettemer	bd64da0960	Scripting: Deprecate general cache settings (#55038 ) * Scripting: Deprecate general cache settings * Add script.disable_max_compilations_rate setting * Move construction to ScriptCache * Use ScriptService to do updates of CacheHolder * Remove fallbacks * Add SCRIPT_DISABLE_MAX_COMPILATIONS_RATE_SETTING to ClusterSettings * Node scope * Use back compat * 8.0 for bwc * script.max_compilations_rate=2048/1m -> script.disable_max_compilations_rate=true in docker compose * do not guard in esnode * Doc update * isSnapshotBuild() -> systemProperty 'es.script.disable_max_compilations_rate', 'true' * Do not use snapshot in gradle to set max_compilations_rate * Expose cacheHolder as package private * monospace 75/5m in cbreaker docs, single space in using * More detail in general compilation rate error * Test: don't modify defaultConfig on upgrade	2020-04-22 12:33:33 -06:00
James Rodewig	5cbd05bb10	[DOCS] Relocate `indices` module content (#54903 ) Moves `indices` content from the [Modules][0] section to the [Configuring Elasticsearch][1] section. Also removes the [Indices][2] landing page and adds a related redirect. [0]: https://www.elastic.co/guide/en/elasticsearch/reference/master/modules.html [1]: https://www.elastic.co/guide/en/elasticsearch/reference/master/settings.html [2]: https://www.elastic.co/guide/en/elasticsearch/reference/master/modules-indices.html	2020-04-10 12:00:02 -04:00
David Turner	ac1b6eb5e9	indices.recovery.max_bytes_per_sec may be per-node (#54633 ) The `indices.recovery.max_bytes_per_sec` recovery bandwidth limit can differ between nodes if it is not set dynamically, but today this is not obvious. This commit adds a paragraph to its documentation clarifying how to set different bandwidth limits on each node. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2020-04-02 18:14:34 +01:00
István Zoltán Szabó	451eb1fa1f	[DOCS] Expands the documentation of Node Query Cache (#51105 ) Co-authored-by: debadair <debadair@elastic.co>	2020-01-20 11:11:57 +01:00
Stuart Tettemer	fb6ef69c6b	[DOCS] Deterministic scripted queries are cached (#50408 ) Refs: #49321	2019-12-19 16:16:57 -07:00
Patryk Krawaczyński	de4f701a19	[DOCS] Document `index.queries.cache.enabled` as a static setting (#49886 )	2019-12-10 14:23:14 -05:00
James Rodewig	7583c07fa8	[DOCS] Reorder index APIs alphabetically (#46981 )	2019-10-01 15:13:27 -04:00
James Rodewig	5c78f606c2	[DOCS] Change // CONSOLE comments to [source,console] (#46440 )	2019-09-09 10:45:37 -04:00
Daniel Mitterdorfer	1f23fc704a	Clarify which circuit breaker settings are static (#44992 ) Most of the circuit breaker settings are dynamically configurable. However, `indices.breaker.total.use_real_memory` is not. With this commit we add a clarifying note that this specific setting is static. Closes #44974	2019-07-31 13:13:39 +02:00
Sam Mingo	0ce3a28ebb	Update search-settings.asciidoc (#43016 ) Grammar and spelling fixes	2019-06-10 10:14:27 +01:00
James Rodewig	45e1e59371	[DOCS] Rewrite 'rewrite' parameter docs (#42018 )	2019-05-13 08:42:26 -04:00
James Rodewig	0225af44a0	[DOCS] Clarify Recovery Settings for Shard Relocation (#40329 ) * Clarify that peer recovery settings apply to shard relocation * Fix awkward wording of 1st sentence * [DOCS] Remove snapshot recovery reference. Call out link to [[cat-recovery]]. Separate expert settings.	2019-04-26 10:23:30 -04:00
Christoph Büscher	25aac4f77f	Remove `include_type_name` in asciidoc where possible (#37568 ) The "include_type_name" parameter was temporarily introduced in #37285 to facilitate moving the default parameter setting to "false" in many places in the documentation code snippets. Most of the places can simply be reverted without causing errors. In this change I looked for asciidoc files that contained the "include_type_name=true" addition when creating new indices but didn't look like they made use of the "_doc" type for mappings. This is mostly the case e.g. in the analysis docs where index creation often only contains settings. I manually corrected the use of types in some places where the docs still used an explicit type name and not the dummy "_doc" type.	2019-01-18 09:34:11 +01:00
Julie Tibshirani	36a3b84fc9	Update the default for include_type_name to false. (#37285 ) * Default include_type_name to false for get and put mappings. * Default include_type_name to false for get field mappings. * Add a constant for the default include_type_name value. * Default include_type_name to false for get and put index templates. * Default include_type_name to false for create index. * Update create index calls in REST documentation to use include_type_name=true. * Some minor clean-ups around the get index API. * In REST tests, use include_type_name=true by default for index creation. * Make sure to use 'expression == false'. * Clarify the different IndexTemplateMetaData toXContent methods. * Fix FullClusterRestartIT#testSnapshotRestore. * Fix the ml_anomalies_default_mappings test. * Fix GetFieldMappingsResponseTests and GetIndexTemplateResponseTests. We make sure to specify include_type_name=true during xContent parsing, so we continue to test the legacy typed responses. XContent generation for the typeless responses is currently only covered by REST tests, but we will be adding unit test coverage for these as we implement each typeless API in the Java HLRC. This commit also refactors GetMappingsResponse to follow the same approach as the other mappings-related responses, where we read include_type_name out of the xContent params, instead of creating a second toXContent method. This gives better consistency in the response parsing code. * Fix more REST tests. * Improve some wording in the create index documentation. * Add a note about types removal in the create index docs. * Fix SmokeTestMonitoringWithSecurityIT#testHTTPExporterWithSSL. * Make sure to mention include_type_name in the REST docs for affected APIs. * Make sure to use 'expression == false' in FullClusterRestartIT. * Mention include_type_name in the REST templates docs.	2019-01-14 13:08:01 -08:00
Nhat Nguyen	15aa3764a4	Reduce recovery time with compress or secure transport (#36981 ) Today file-chunks are sent sequentially one by one in peer-recovery. This is a correct choice since the implementation is straightforward and recovery is network-bound most of the time. However, if the connection is encrypted, we might not be able to saturate the network pipe because encrypting/decrypting are CPU-bound rather than network-bound. With this commit, a source node can send multiple (default to 2) file-chunks without waiting for the acknowledgments from the target. Below are the benchmark results for PMC and NYC_taxis. - PMC (20.2 GB) \| Transport \| Baseline \| chunks=1 \| chunks=2 \| chunks=3 \| chunks=4 \| \| ----------\| ---------\| -------- \| -------- \| -------- \| -------- \| \| Plain \| 184s \| 137s \| 106s \| 105s \| 106s \| \| TLS \| 346s \| 294s \| 176s \| 153s \| 117s \| \| Compress \| 1556s \| 1407s \| 1193s \| 1183s \| 1211s \| - NYC_Taxis (38.6GB) \| Transport \| Baseline \| chunks=1 \| chunks=2 \| chunks=3 \| chunks=4 \| \| ----------\| ---------\| ---------\| ---------\| ---------\| -------- \| \| Plain \| 321s \| 249s \| 191s \| * \| * \| \| TLS \| 618s \| 539s \| 323s \| 290s \| 213s \| \| Compress \| 2622s \| 2421s \| 2018s \| 2029s \| n/a \| Relates #33844	2019-01-14 15:14:46 -05:00
David Turner	d9e2ebca67	Add more detail to recovery bandwidth limit docs (#37156 )	2019-01-09 08:18:25 +00:00
Yu	d01b30acba	lower fielddata circuit breaker's default limit (#27162 ) * Lower fielddata circuit breaker default limit Lower fielddata circuit breaker default limit from 60% to 40% as we have moved to doc_values for most of the cases. * merge master in * update tests * update docs	2018-12-11 11:30:58 +01:00
Alexandru Rusanescu	f3e150b0ea	[Docs] Update query_cache.asciidoc (#33340 ) Add note about non-visibility of cache content.	2018-11-01 10:22:36 +01:00
Christoph Büscher	c0c6a28e86	[Docs] Add `indices.query.bool.max_clause_count` setting (#34779 ) This change adds a section about the global search setting `indices.query.bool.max_clause_count` that limits the number of boolean clauses allowed in a Lucene BooleanQuery. Closes #19858	2018-10-25 17:59:59 +02:00
Daniel Mitterdorfer	f174f72fee	Circuit-break based on real memory usage With this commit we introduce a new circuit-breaking strategy to the parent circuit breaker. Contrary to the current implementation which only accounts for memory reserved via child circuit breakers, the new strategy measures real heap memory usage at the time of reservation. This allows us to be much more aggressive with the circuit breaker limit so we bump it to 95% by default. The new strategy is turned on by default and can be controlled with the new cluster setting `indices.breaker.total.use_real_memory`. Note that we turn it off for all integration tests with an internal test cluster because it leads to spurious test failures which are of no value (we cannot fully control heap memory usage in tests). All REST tests, however, will make use of the real memory circuit breaker. Relates #31767	2018-07-13 10:08:28 +02:00
Daniel Mitterdorfer	3d53daeb2f	Account for XContent overhead in in-flight breaker So far the in-flight request circuit breaker has only accounted for the on-the-wire representation of a request. However, we convert the raw request into XContent internally which increases the overhead. Therefore, we increase the value of the corresponding setting `network.breaker.inflight_requests.overhead` from one to two. While this value is still rather conservative (we assume that the representation as structured objects has no overhead compared to the byte[]), it is closer to reality than the current value. Relates #31613	2018-07-03 09:17:16 +02:00
Colin Goodheart-Smithe	360b09f148	[DOCS] Fixes accounting setting names (#30863 ) The documentation for the accounting circuit breaker listed the settings for its limit and overhead to be `network.breaker.accounting.limit` and `network.breaker.accounting.overhead` when in `HierarchyCircuitBreakerService` it seems the settings are actually `indices.breaker.accounting.limit` and `indices.breaker.accounting.overhead`.	2018-06-04 09:20:54 +01:00
Lee Jones	37f67d9e21	[Docs] Fix typo in circuit breaker docs (#29659 ) The previous description had a part that didn't fit and was probably from a copy/paste of the in flight requests description above.	2018-05-22 16:43:45 +02:00

72 Commits