elasticsearch

Commit Graph

Author	SHA1	Message	Date
David Turner	3660d863db	Fork the sending of file chunks during recovery (#74164 ) Today if sending file chunks is CPU-bound (e.g. when using compression) then we tend to concentrate all that work onto relatively few threads, even if `indices.recovery.max_concurrent_file_chunks` is increased. With this commit we fork the transmission of each chunk onto its own thread so that the CPU-bound work can happen in parallel.	2021-06-16 11:58:13 +01:00
David Turner	43ddd4a580	Fix docs rendering around recovery rate table (#73879 ) - Replaces ⇐ with ≤ - Removes table caption - Adjust table headers - Fixes leading + on subsequent paragraphs	2021-06-08 15:00:00 +01:00
David Turner	dd69ae95d7	Note recovery settings affect searchable snapshots (#70771 ) Adds a short note that `max_restore_bytes_per_sec` and `indices.recovery.max_bytes_per_sec` also affect the recovery of a searchable snapshot index.	2021-03-24 09:22:44 +00:00
Lee Hinman	3f9f007545	Add the frozen tier node role and ILM phase (#68605 ) This commit adds the `data_frozen` node role as part of the formalization of data tiers. It also adds the `"frozen"` phase to ILM, currently allowing the same actions as the existing cold phase. The frozen phase is intended to be used for data even less frequently searched than the cold phase, and will eventually be loosely tied to data using partial searchable snapshots (as oppposed to full searchable snapshots in the cold phase). Relates to #60848	2021-02-05 14:38:13 -07:00
Jason Tedor	6e94e67ae9	Set recovery rate for dedicated cold nodes (#68480 ) This commit sets the recovery rate for dedicated cold nodes. The goal is here is enhance performance of recovery in a dedicated cold tier, where we expect such nodes to be predominantly using searchable snapshots to back the indices located on them. This commit follows a simple approach where we increase the recovery rate as a function of the node size, for nodes that appear to be dedicated cold nodes.	2021-02-04 10:36:07 -05:00
James Rodewig	2774cd6938	[DOCS] Swap `[float]` for `[discrete]` (#60124 ) Changes instances of `[float]` in our docs for `[discrete]`. Asciidoctor prefers the `[discrete]` tag for floating headings: https://asciidoctor.org/docs/asciidoc-asciidoctor-diffs/#blocks	2020-07-23 11:48:22 -04:00
Nhat Nguyen	961db311f0	Sending operations concurrently in peer recovery (#58018 ) Today, we send operations in phase2 of peer recoveries batch by batch sequentially. Normally that's okay as we should have a fairly small of operations in phase 2 due to the file-based threshold. However, if phase1 takes a lot of time and we are actively indexing, then phase2 can have a lot of operations to replay. With this change, we will send multiple batches concurrently (defaults to 1) to reduce the recovery time.	2020-07-07 18:00:03 -04:00
Yannick Welsch	118521d022	Account for recovery throttling when restoring snapshot (#58658 ) Restoring from a snapshot (which is a particular form of recovery) does not currently take recovery throttling into account (i.e. the `indices.recovery.max_bytes_per_sec` setting). While restores are subject to their own throttling (repository setting `max_restore_bytes_per_sec`), this repository setting does not allow for values to be configured differently on a per-node basis. As restores are very similar in nature to peer recoveries (streaming bytes to the node), it makes sense to configure throttling in a single place. The `max_restore_bytes_per_sec` setting is also changed to default to unlimited now, whereas previously it was set to `40mb`, which is the current default of `indices.recovery.max_bytes_per_sec`). This means that no behavioral change will be observed by clusters where the recovery and restore settings were not adapted. Relates https://github.com/elastic/elasticsearch/issues/57023 Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-06-30 13:08:21 +02:00
James Rodewig	5cbd05bb10	[DOCS] Relocate `indices` module content (#54903 ) Moves `indices` content from the [Modules][0] section to the [Configuring Elasticsearch][1] section. Also removes the [Indices][2] landing page and adds a related redirect. [0]: https://www.elastic.co/guide/en/elasticsearch/reference/master/modules.html [1]: https://www.elastic.co/guide/en/elasticsearch/reference/master/settings.html [2]: https://www.elastic.co/guide/en/elasticsearch/reference/master/modules-indices.html	2020-04-10 12:00:02 -04:00
David Turner	ac1b6eb5e9	indices.recovery.max_bytes_per_sec may be per-node (#54633 ) The `indices.recovery.max_bytes_per_sec` recovery bandwidth limit can differ between nodes if it is not set dynamically, but today this is not obvious. This commit adds a paragraph to its documentation clarifying how to set different bandwidth limits on each node. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2020-04-02 18:14:34 +01:00
James Rodewig	7583c07fa8	[DOCS] Reorder index APIs alphabetically (#46981 )	2019-10-01 15:13:27 -04:00
James Rodewig	0225af44a0	[DOCS] Clarify Recovery Settings for Shard Relocation (#40329 ) * Clarify that peer recovery settings apply to shard relocation * Fix awkward wording of 1st sentence * [DOCS] Remove snapshot recovery reference. Call out link to [[cat-recovery]]. Separate expert settings.	2019-04-26 10:23:30 -04:00
Nhat Nguyen	15aa3764a4	Reduce recovery time with compress or secure transport (#36981 ) Today file-chunks are sent sequentially one by one in peer-recovery. This is a correct choice since the implementation is straightforward and recovery is network bound in most of the time. However, if the connection is encrypted, we might not be able to saturate the network pipe because encrypting/decrypting are cpu bound rather than network-bound. With this commit, a source node can send multiple (default to 2) file-chunks without waiting for the acknowledgments from the target. Below are the benchmark results for PMC and NYC_taxis. - PMC (20.2 GB) \| Transport \| Baseline \| chunks=1 \| chunks=2 \| chunks=3 \| chunks=4 \| \| ----------\| ---------\| -------- \| -------- \| -------- \| -------- \| \| Plain \| 184s \| 137s \| 106s \| 105s \| 106s \| \| TLS \| 346s \| 294s \| 176s \| 153s \| 117s \| \| Compress \| 1556s \| 1407s \| 1193s \| 1183s \| 1211s \| - NYC_Taxis (38.6GB) \| Transport \| Baseline \| chunks=1 \| chunks=2 \| chunks=3 \| chunks=4 \| \| ----------\| ---------\| ---------\| ---------\| ---------\| -------- \| \| Plain \| 321s \| 249s \| 191s \| * \| * \| \| TLS \| 618s \| 539s \| 323s \| 290s \| 213s \| \| Compress \| 2622s \| 2421s \| 2018s \| 2029s \| n/a \| Relates #33844	2019-01-14 15:14:46 -05:00
David Turner	d9e2ebca67	Add more detail to recovery bandwidth limit docs (#37156 )	2019-01-09 08:18:25 +00:00
alamzeeshan	a1cc683cff	Updated document as per code change. (#22878 ) Updated document as per this change : https://github.com/elastic/elasticsearch/pull/15235	2017-01-31 13:36:09 +01:00
Simon Willnauer	f5e4cd4616	Remove recovery threadpools and throttle outgoing recoveries on the master Today we throttle recoveries only for incoming recoveries. Nodes that have a lot of primaries can get overloaded due to too many recoveries. To still keep that at bay we limit the number of threads that are sending files to the target to overcome this problem. The right solution here is to also throttle the outgoing recoveries that are today unbounded on the master and don't start the recovery until we have enough resources on both source and target nodes. The concurrency aspects of the recovery source also added a lot of complexity and additional threadpools that are hard to configure. This commit removes the concurrent streamns notion completely and sends files in the thread that drives the recovery simplifying the recovery code considerably. Outgoing recoveries are not throttled on the master via a allocation decider.	2015-12-22 14:59:43 +01:00
Clinton Gormley	f123a53d72	Docs: Refactored modules and index modules sections	2015-06-22 23:49:45 +02:00

17 Commits