43 Commits

Author SHA1 Message Date
David Turner 5dff56a00e
Mention network handler logging in docs (#100118)
Mentions the `InboundHandler` (and `OutboundHandler`) as potential
sources of useful log messages when tracking down a network threading
bug.
2023-10-02 08:52:16 +01:00
David Turner bf34036c8c
Discovery troubleshooting next steps (#99743)
Adds a little more detail on how to react if you see evidence that the
Elasticsearch process is pausing for a long time due to long GCs or VM
pauses.
2023-09-21 13:00:13 +01:00
Felix Barnsteiner ebd5ead943
Remove ineffective options of preventing mapping explosions (#99665)
Removes the recommendations to use the object field type and to set index: false.
Neither of these options is effective at avoiding mapping explosions.
2023-09-20 13:59:03 +02:00
Abdon Pijpelink af76a3a436
[DOCS] Add 'Troubleshooting an unstable cluster' to nav (#99287)
* [DOCS] Add 'Troubleshooting an unstable cluster' to nav

* Adjust docs links in code

* Revert "Adjust docs links in code"

This reverts commit f3846b1d78.

---------

Co-authored-by: David Turner <david.turner@elastic.co>
2023-09-08 13:42:50 +02:00
Stef Nestor 0781bafac1
[DOC+][Hot Spotting] Pull detailed Node Tasks (#98879)
Co-authored-by: David Turner <david.turner@elastic.co>
2023-08-29 14:25:10 -04:00
David Turner ddd4ba5e30
Fix docs for explaining unassigned shards (#97538)
Today the `current_node` parameter is given in several sample requests
illustrating how to explain an unassigned shard using the cluster
allocation explain API. This doesn't make sense: an unassigned shard has
no `current_node`. This commit removes the misleading parameter in these
cases.
2023-07-11 08:01:12 +01:00
David Turner 09e53f9ad9
Enhance docs around network troubleshooting (#97305)
Discovery, like cluster membership, can also be affected by network-like
issues (e.g. GC/VM pauses, dropped packets and blocked threads) so this
commit duplicates the troubleshooting info across both places.
2023-07-10 10:57:44 +01:00
David Turner 846d640ddf
Suggest capturing a heap dump to diagnose high heap (#96526)
The `high-jvm-memory-pressure.html` troubleshooting docs give some
suggestions, but vitally they omit the advice to capture a heap dump
which is what we really need users to do if they want to understand
their high heap usage. This commit adds a note to the docs to that
effect.
2023-06-02 09:43:52 -04:00
debadair 777598d602
[DOCS] Remove redirect pages (#88738)
* [DOCS] Remove manual redirects

* [DOCS] Removed refs to modules-discovery-hosts-providers

* [DOCS] Fixed broken internal refs

* Fixing bad cross links in ES book, and adding redirects.asciidoc[] back into docs/reference/index.asciidoc.

* Update docs/reference/search/point-in-time-api.asciidoc

Co-authored-by: James Rodewig <james.rodewig@elastic.co>

* Update docs/reference/setup/restart-cluster.asciidoc

Co-authored-by: James Rodewig <james.rodewig@elastic.co>

* Update docs/reference/sql/endpoints/translate.asciidoc

Co-authored-by: James Rodewig <james.rodewig@elastic.co>

* Update docs/reference/snapshot-restore/restore-snapshot.asciidoc

Co-authored-by: James Rodewig <james.rodewig@elastic.co>

* Update repository-azure.asciidoc

* Update node-tool.asciidoc

* Update repository-azure.asciidoc

---------

Co-authored-by: amyjtechwriter <61687663+amyjtechwriter@users.noreply.github.com>
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Co-authored-by: Amy Jonsson <amy.jonsson@elastic.co>
Co-authored-by: James Rodewig <james.rodewig@elastic.co>
2023-05-24 12:32:46 +01:00
Stef Nestor 65b4fe28d4
[+DOC] Troubleshooting / Mapping Explosion (#95397)
* [+DOC] Troubleshooting / Mapping Explosion

---------

Co-authored-by: Steffanie Nestor <steffanie.nestor@elastic.co>
Co-authored-by: Amy Jonsson <amy.jonsson@elastic.co>
2023-04-27 11:08:56 -06:00
David Kilfoyle 626db84fac
[Docs] Small fixes for hot spotting page (#95627)
2023-04-27 10:18:21 -04:00
Stef Nestor 4c5a3fb4da
[+Doc] Troubleshooting / Hot Spotting (#95429)
* [+Doc] Troubleshooting / Hot Spotting

---------

Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>
2023-04-26 12:29:47 -06:00
Stef Nestor 1ee528dc3f
[Doc] Troubleshoot Cluster State / Linkable subsections (#95468)
👋🏼 howdy, team! Could we make these sub-sections sub-header link-able?
2023-04-25 10:35:14 +02:00
Pablo Alcantar Morales 253fe6325d
Add shards capacity troubleshooting guide (#95208)
2023-04-19 09:24:07 +02:00
David Turner b4b9292ce9
Small changes to corruption troubleshooting docs (#95265)
- Mention that third-party software may be to blame too
- Mention `strace` as a last resort
- Minor rewordings
2023-04-17 09:07:27 +01:00
Stef Nestor e12e83fa37
Search-Troubleshoot | Most Recent Record (#94409)
May we add a section to [this page](https://www.elastic.co/guide/en/elasticsearch/reference/master/troubleshooting-searches.html#troubleshooting-check-field-values) to query for the latest record on an index (pattern)? This will be helpful to decide between Kibana Discover filter and Elasticsearch ingest lag problems.

---------

Co-authored-by: David Kilfoyle <41695641+kilfoyle@users.noreply.github.com>
Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>
2023-03-23 08:04:49 -06:00
Abdon Pijpelink 2808512397
[DOCS] Improve watermark troubleshooting documentation (#94222)
2023-03-01 14:34:14 +01:00
Iraklis Psaroudakis 555a4d91ee
Update add-repository.asciidoc (#92945)
Our guide on re-registering a corrupt repository should link to the warnings about the potential side-effects of corruption.
2023-01-16 17:20:21 +02:00
Stef Nestor d9cbefc19c
[DOC] Troubleshooting Expensive Searches (#92725)
* [DOC] Troubleshooting Expensive Searches

👋 re: https://github.com/elastic/elasticsearch/issues/73222 adds content that we can point users to on how to find the source of expensive searches.

* Several edits

* Apply suggestions from code review

Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>
2023-01-13 09:55:13 +01:00
Cleydyr Bezerra de Albuquerque ee452bd143
Update circuit-breaker-errors.asciidoc (#92070)
Fix typo fieldata -> fielddata
2022-12-05 10:53:06 +01:00
Mary Gouseti cfd23d512f
Disk indicator troubleshooting guides (#90504)
2022-10-14 15:24:21 +02:00
Ievgen Degtiarenko 4d6d979e0e
Deprecate state field in `/_cluster/reroute` response (#90399)
2022-10-05 08:18:27 +02:00
Iraklis Psaroudakis 34471b1cd2
Introduce max headroom for disk watermark stages (#88639)
Introduce max headroom settings for the low, high, and flood disk watermark stages, similar to the existing max headroom setting for the flood stage of the frozen tier. Introduce new max headrooms in HealthMetadata and in ReactiveStorageDeciderService. Add multiple tests in DiskThresholdDeciderUnitTests, DiskThresholdDeciderTests and DiskThresholdMonitorTests. Also add addition, subtraction, and min operations for ByteSizeValue.
2022-09-19 14:59:18 +03:00
James Baiera db73aa0498
Add repeated snapshot failure troubleshooting guide (#89762)
This troubleshooting guide is what will be returned from the SLM health indicator
when an SLM policy has suffered too many repeated failures without a successful
execution.
2022-09-15 17:01:32 -04:00
Abdon Pijpelink 346f7848e6
[DOCS] Add troubleshooting searches guide (#89583)
* [DOCS] Adds troubleshooting searches guide

* Additional troubleshooting steps

* Apply review suggestions

* Replace separate _cat aliases/indices requests with one get indices call

* Reorder steps to move field caps forward

* Add note about ignore_unavailable

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2022-09-08 14:30:21 +02:00
Leaf-Lin 942e5fd9fc
Adding specific items into troubleshooting guide (#88105)
* Update troubleshooting.asciidoc

Adding items into the troubleshooting guide

* Resolve conflicts

* Reorganizes troubleshooting links

Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>
2022-08-03 17:00:34 +02:00
David Turner 74ce7a4603
Fix typo (#89063)
2022-08-03 10:23:57 +01:00
David Turner 7103053f03
Add troubleshooting docs about data corruption (#88760)
Adds some docs giving more detailed background about what data
corruption really means and some suggestions about how to narrow down
the root cause.

Co-authored-by: Henning Andersen <33268011+henningandersen@users.noreply.github.com>
2022-07-28 11:23:23 +01:00
Mary Gouseti 89903bbe23
Troubleshooting docs for ACTION_RESTORE_FROM_SNAPSHOT (#87692)
Troubleshooting guide to restore indices and data streams that have
missing data from a snapshot.

This will be associated with the user action
`ACTION_RESTORE_FROM_SNAPSHOT`.

Preview link:
https://elasticsearch_87692.docs-preview.app.elstc.co/guide/en/elasticsearch/reference/master/restore-from-snapshot.html
2022-07-27 23:37:08 +09:30
Abdon Pijpelink 26cc87360e
Split common cluster issues page into separate pages (#88495)
2022-07-18 17:54:02 +02:00
Andrei Dan f3431e1bff
Add troubleshooting guide for corrupt repository (#88391)
Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2022-07-14 13:37:02 +01:00
Leaf-Lin 945cb27782
[DOCS] Adding discovery troubleshooting link in the master get help page (#87344)
* Adding discovery troubleshooting link

* Add tags to pull in discovery troubleshooting content

* Move discovery troubleshooting to separate page and add redirects

Co-authored-by: Adam Locke <adam.locke@elastic.co>
2022-07-06 15:51:43 -04:00
Andrei Dan 6e98072db5
Add start slm user action (#87854)
This creates a user action for the SLM health indicator that will help
the user to start SLM.
2022-06-23 11:04:45 +01:00
Andrei Dan a4e7064b0e
Create ILM not running user action (#87852)
This creates a user action for the ILM health indicator that will help
the user to start ILM.
2022-06-23 09:54:31 +01:00
Andrei Dan 2ec4a9e006
Add troubleshooting doc for missing tier (#87526)
Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2022-06-17 12:24:33 +01:00
Leaf-Lin ed0fb8b819
Move fix common cluster issues to troubleshooting (#87440)
* Move fix common cluster issues to troubleshooting

* Include fix common cluster issues in the troubleshooting doc

* [DOCS] Remove extra include from How-To

Co-authored-by: Deb Adair <debadair@elastic.co>
2022-06-13 17:16:17 -07:00
Keith Massey 6caf39c109
How to increase node capacity docs (#87188)
This adds troubleshooting documentation for the case when the ShardsAvailabilityHealthIndicatorService
reports that there are not enough nodes in the data tier (user action "increase_node_capacity_for_allocations" or
"increase_tier_capacity_for_allocations_"). This covers both the cloud and self-managed environments. For
cloud we first recommend increasing the number of availability zones (because you cannot directly add nodes), and
decreasing index.number_of_replicas if that is not possible. For self-managed, we first recommend adding nodes,
and decreasing index.number_of_replicas if that is not possible.
2022-06-08 14:06:47 -05:00
Leaf-Lin 7bd4708886
Revert "Move fix common issues into troubleshooting"
This reverts commit 4a563e9bfb.
2022-06-07 17:14:38 +10:00
Leaf-Lin 4a563e9bfb
Move fix common issues into troubleshooting
2022-06-07 17:07:03 +10:00
Andrei Dan 08b323131f
Troubleshooting guides for disabled allocations (#86789)
This adds the troubleshooting guides for when index and cluster allocations are
disabled.

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2022-05-24 10:27:15 +01:00
Andrei Dan 20802a9f66
Add migrate to tiers troubleshooting doc (#86738)
This adds a troubleshooting doc for indices that mix index filtering allocation
with data tiers routing.

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2022-05-24 10:12:28 +01:00
Andrei Dan 490f417efd
Troubleshooting guide for diagnosing unassigned shards (#86996)
Co-authored-by: Leaf-Lin <39002973+Leaf-Lin@users.noreply.github.com>
Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2022-05-24 09:56:23 +01:00
Andrei Dan 21785c9a77
How-to docs for increasing the total number of shards per node (#86214)
Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
Co-authored-by: Leaf-Lin <39002973+Leaf-Lin@users.noreply.github.com>
2022-05-10 09:13:27 +01:00