Commit Graph

9008 Commits

Author SHA1 Message Date
Armin Braun 5f89f8be3f
Introduce Next Field in Paginated GetSnapshots Response (#74236)
Follow up to #73952 adding documentation for the `after` query parameter
and the related `next` response field.
2021-06-28 22:28:30 +02:00
James Rodewig 139eabad2d
[DOCS] Query strings are normalized for fuzzy (`~`) operator (#73921)
Notes that `fuzzy` queries made using the query string query's `~`
operator are normalized.

Closes #73299
2021-06-28 13:13:41 -04:00
James Rodewig 0c205b0d68
[DOCS] Document how to switch ILM policies (#73967)
To switch an index's lifecycle policy, you must first remove the existing
policy. Otherwise, phase execution for the index may silently fail.

Closes #70151
2021-06-28 10:32:01 -04:00
Gao Ruifeng 4c67195d44
update monitoring cluster node name (#74500)
`es-mon2` -> `es-mon-2`
2021-06-28 09:30:55 -04:00
James Rodewig ed1999e935
[DOCS] Update wildcard support for removing aliases (#74561)
You can now use a wildcard pattern to remove data stream and index
aliases in the same action/request.

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2021-06-28 08:53:45 -04:00
Andrei Dan 636aa7c0da
Add migrate to data tiers API (#74264)
This adds the _ilm/migrate_to_data_tiers API to expose the service for
migrating the elasticsearch abstractions (indices, ILM policies and an 
optional legacy template to delete) to data tiers routing allocation 
(away from custom node attributes)
2021-06-28 12:07:39 +01:00
Przemko Robakowski 9706d676a1
Enable GeoIP downloader by default (#74323)
This change will enable geoip downloader by default again. It also brings back relevant documentation.
2021-06-28 09:04:49 +02:00
Ignacio Vera d7ef5b6d21
Remove bounding box query type parameter (#74536)
The parameter has been deprecates in 7.14 as it is a no-op.
2021-06-28 07:37:04 +02:00
Armin Braun 939c4c6c3f
Remove Repository Size Limit Default (#74592)
Now that we actively improve the scalability there is no point
of having a `500` limit in `master`.
2021-06-27 21:14:56 +02:00
James Rodewig 2fe07014d9
[DOCS] Move ES glossary to Stack docs (#74579)
The ES glossary is now incorporated into the [Elastic glossary](https://www.elastic.co/guide/en/elastic-stack-glossary/current/terms.html).

Depends on https://github.com/elastic/stack-docs/pull/1722 and https://github.com/elastic/docs/pull/2141
2021-06-24 19:04:31 -04:00
Christos Soulios df941367df
Add dimension mapping parameter (#74450)
Added the dimension parameter to the following field types:

    keyword
    ip
    Numeric field types (integer, long, byte, short)

The dimension parameter is of type boolean (default: false) and is used 
to mark that a field is a time series dimension field.

Relates to #74014
2021-06-24 20:16:27 +03:00
Adam Locke 88330250a7
[DOCS] Remove beta label for most service accounts docs (#74555)
* [DOCS] Remove beta label for most service accounts docs

* Remove beta label from additional service account files

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2021-06-24 11:57:05 -04:00
Lisa Cawley 64af39b759
[DOCS] Add memory limit details in update job API (#74517)
Co-authored-by: David Roberts <dave.roberts@elastic.co>
2021-06-24 08:50:19 -07:00
Armin Braun cbf48e0633
Flatten Get Snapshots Response (#74451)
This PR returns the get snapshots API to the 7.x format (and transport client behavior) and enhances it for requests that ask for multiple repositories.
The changes for requests that target multiple repositories are:
* Add `repository` field to `SnapshotInfo` and REST response
* Add `failures` map alongside `snapshots` list instead of returning just an exception response as done for single repo requests
* Pagination now works across repositories instead of being per repository for multi-repository requests

closes #69108
closes #43462
2021-06-24 16:58:33 +02:00
Ignacio Vera 28b4982df4
Deprecate Bounding box query type parameter (#74493)
This parameter has no effect on the query execution.
2021-06-24 07:34:57 +02:00
Lee Hinman 997db17852
Add usage to get ILM policy response (#74518)
This commit adds the "in_use_by" object to the response for ILM policies. This map shows the
indices, data streams, and composable templates that use the ILM policy.

An example output may look like:

```json
{
  "logs" : {
    "version" : 1,
    "modified_date" : "2021-06-23T18:42:08.381Z",
    "policy" : {
      ...
    },
    "in_use_by" : {
      "indices" : [".ds-logs-foo-barbaz-2021.06.23-000001", ".ds-logs-foo-other-2021.06.23-000001"],
      "data_streams" : ["logs-foo-barbaz", "logs-foo-other"],
      "composable_templates" : ["logs"]
    }
  }
}
```

Resolves #73869
2021-06-23 16:01:19 -06:00
Robin Clarke 7aaf209e47
[DOCS] Fix capitalization (#74470) 2021-06-23 12:02:57 -04:00
James Rodewig 47adf545c4
[DOCS] SQL: Document `null` handling for functions (#74444)
Closes #74193.
2021-06-23 11:11:26 -04:00
Benjamin Trent 0303e6d733
[ML] add datafeed field to the job config (#74265)
This is a quality of life improvement for typical users. Almost all anomaly jobs will receive their data through a datafeed.

The datafeed config can now be supplied and is available in the datafeed field in the job config for creation and getting jobs.
2021-06-23 08:06:58 -04:00
bellengao 42c0be4c67
Add template snippets support for KV ingest processor (#73758) 2021-06-22 16:32:11 -05:00
James Rodewig 3c88fbea75
[DOCS] Note remote reindex does not support slicing (#73959)
A reindex from a remote cluster doesn't support automatic or manual slicing.
This reuses a related note from the reindex docs in the upgrade docs.

Closes #54243.
2021-06-22 09:00:37 -04:00
James Rodewig 4ccff315ed
[DOCS] Note bulk API supports `application/json` content type (#73949)
The bulk API supports a `Content-Type` header of `application/json` or
`application/x-ndjson`.

Closes #26280
2021-06-22 08:57:50 -04:00
David Roberts 6e9b959450
[ML] Closing an anomaly detection job now automatically stops its datafeed if necessary (#74257)
Previously it was a requirement of the close job API that if the
job had an associated datafeed that that datafeed was stopped
before the job could be closed. Experience has shown that this
is just a pedantic nuisance. If a user closes the job without
first stopping the datafeed then it's just a mistake, and they
then have to make two further calls, to stop the datafeed and
then attempt to close the job again.

This PR changes the behaviour so that if you ask to close a job
whose datafeed is running then the datafeed gets stopped first
as part of the same call. Datafeeds are stopped with the same
level of force as the job close request specified.
2021-06-22 12:56:11 +01:00
Henning Andersen 77938381ea
Autoscale frozen tier into existence (#73435)
This commit adds two related changes:
* ILM WaitForDataTierStep
* Autoscaling frozen_existence decider

The first part ensures that we wait mounting an index until a node that
can hold the index is available, avoiding a failed restore and red
cluster state. This is in particular important for the frozen phase, but
is done generically in the searchable snapshot action.

The second part triggers on indices in the ILM frozen phase to scale the
tier into existence by requiring a minimal amount of memory and storage.

Closes #72771
2021-06-22 13:21:04 +02:00
Nik Everett 8904ffe2be
Add extra profiling information to terms agg (#73636)
I was helping some folks debug an issue with the terms agg and noticed
that we didn't always have the `total_buckets` debug information. I also
noticed that we can't tell how many buckets we build, so I added that
too as `built_buckets`.

Finally, I noticed that when we're using segment ords we count segments
without any values as "multi-valued". We can do better there and count
them as no-valued. That will, mostly, just improve the profiling. When
we collect from global ords we have no way to tell how many values are
on the segment so segments without any values will, sadly, in this case
still be miscounted as multi-valued.
2021-06-21 10:10:41 -04:00
Yang Cheng 98a6ef9ee3
[DOCS] Add missing`xpack` role (#74331) 2021-06-21 10:01:35 -04:00
Luca Cavanna 5bfdcd2ec7
[DOCS] add missing dynamic runtime option (#74294) 2021-06-21 09:13:21 -04:00
Hendrik Muhs 86df1b3aec
[DOCS] enhance transform example with range filter (#74284)
enhance transform example using range instead of terms for 5xx error codes
2021-06-21 10:37:11 +02:00
Luca Cavanna 1d88fe639b
Dynamic runtime to not dynamically create objects (#74234)
When we introduced dynamic:runtime (#65489) we decided to have it create objects dynamically under properties, as the runtime section did not (and still does not) support object fields. That proved to be a poor choice, because the runtime section is flat, supports dots in field names, and does not really need objects. Also, these end up causing unnecessary mapping conflicts.

With this commit we adapt dynamic:runtime to not dynamically create objects.

Closes #70268
2021-06-18 14:12:43 +02:00
István Zoltán Szabó 2e820fcab6
[DOCS] Clarifies terminology in Performing population analysis page. (#74237) 2021-06-18 09:03:38 +02:00
David Turner 31e1b1ae22
Improve check_on_startup docs and logging (#74233)
Today we don't really describe why using `index.shard.check_on_startup`
is such a bad idea, or what to do instead. This commit expands the docs
to clarify what it does, why it's not really necessary and what to do
instead. It also now logs a warning every time the startup checks run to
encourage users to stop using this setting.
2021-06-18 07:46:51 +01:00
James Rodewig 853f2436ee
[DOCS] SQL: Move `null` handling to arg descriptions (#74255)
https://github.com/elastic/elasticsearch/pull/74201 documents `null` handling  to the arg descriptions of several string functions.

This PR moves pre-existing docs for `null` handling and similar edge case handling for string functions to arg descriptions for consistency.

Relates to #74193
2021-06-17 15:50:02 -04:00
Nik Everett 1338a11d1c
Document types `terms` agg can consume (#73272)
Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-06-17 14:58:20 -04:00
James Rodewig 94d6978abe
[DOCS] Add missing request section for delete stored script API (#74240) 2021-06-17 09:07:32 -04:00
bellengao 6705f9db74
[DOCS] Fix URL of get stored script API (#74221) 2021-06-17 09:03:39 -04:00
James Rodewig 23cc658f00 [DOCS] Minor edits for better readability 2021-06-17 08:49:16 -04:00
James Rodewig e111c39da0 [DOCS] Remove unneeded on-page TOC 2021-06-17 08:40:56 -04:00
Armin Braun c1e9590a69
Pagination and Sorting for Get Snapshots API (#73952)
Pagination and snapshots for get snapshots API, build on top of the current implementation to enable work that needs this API for testing. A follow-up will leverage the changes to make things more efficient via pagination.

Relates https://github.com/elastic/elasticsearch/pull/73570 which does part of the under-the-hood changes required to efficiently implement this API on the repository layer.
2021-06-17 09:00:11 +02:00
Adam Locke b1803e4514
[DOCS] Update central reporting image (#74195) 2021-06-16 17:29:30 -04:00
James Rodewig 5fc6d73aa5
[DOCS] SQL: Document `null` handing for string functions (#74201) 2021-06-16 14:21:08 -04:00
Armin Braun e8662e3cc0
Fix Snapshot Docs Listing Query Params in Body Incorrectly (#74196)
Both of these APIs don't parse request bodies, the parameters are all taken
from the query string. Also, included the master timeout param include
as it was missing here also.
2021-06-16 20:07:25 +02:00
James Rodewig c7d59f0a4d
[DOCS] EQL: Note EQL uses `fields` parameter (#74194) 2021-06-16 13:01:02 -04:00
François-Clément Brossard 0ea7cbd429
[DOC] Add watcher to the threadpool doc (#73935)
Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
Co-authored-by: Joe Gallo <joegallo@gmail.com>
2021-06-16 11:07:36 -04:00
Nicole Albee e124824f22
Replace deprecated `script.cache.*` settings with `script.context.$constext.cache_*` in documentation. (#74144) 2021-06-16 09:49:58 -04:00
David Turner 8055f680d5
Fix clone API settings docs bug (#74175)
In #74138 we noted that index settings aren't copied in a clone. In fact
that's not true, we copy everything except explicitly-excluded ones,
`number_of_replicas` and `auto_expand_replicas`. This fixes the mistake.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-06-16 13:54:29 +01:00
David Turner 3660d863db
Fork the sending of file chunks during recovery (#74164)
Today if sending file chunks is CPU-bound (e.g. when using compression)
then we tend to concentrate all that work onto relatively few threads,
even if `indices.recovery.max_concurrent_file_chunks` is increased. With
this commit we fork the transmission of each chunk onto its own thread
so that the CPU-bound work can happen in parallel.
2021-06-16 11:58:13 +01:00
James Rodewig 27a90d68b7
[DOCS] Add `multi-field` def to glossary (#74147) 2021-06-15 16:39:46 -04:00
James Rodewig 48a8405213
[DOCS] Clone index API doesn't apply index templates or settings (#74138)
Changes:
* Notes that Elasticsearch doesn't apply index templates to cloned indices.
* Notes the clone API doesn't copy index settings.
2021-06-15 16:38:36 -04:00
James Rodewig 8a899419bc [DOCS] Change `multi field` to `multi-field` 2021-06-15 11:40:03 -04:00
James Rodewig c4741abf77
[DOCS] Note ESS must use custom bundles for custom GeoIP database files (#73978) (#74128)
Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>

Co-authored-by: Jennie Soria <predogma@users.noreply.github.com>
2021-06-15 09:46:49 -04:00
James Rodewig 51ec860b48
[DOCS] Clarify criteria for restore completion (#74094)
A restore operation is complete when all attempts to recover primary shards have finished, even if unsuccessful.

Closes #70854
2021-06-15 08:30:51 -04:00
James Rodewig 3540b7ccc7
[DOCS] Fix typo in CCR connect example (#74100) 2021-06-14 14:55:34 -04:00
ymao1 c727b40d0b
[Docs] Update cross-document links to Kibana Alerting docs (#74034)
* Updating cross-document links

* PR fixes
2021-06-14 12:23:47 -04:00
Dimitris Athanasiou dc61a72c9e
[ML] Reset anomaly detection job API (#73908)
Adds a new API that allows a user to reset
an anomaly detection job.

To use the API do:

```
POST _ml/anomaly_detectors/<job_id>_reset
```

The API removes all data associated to the job.
In particular, it deletes model state, results and stats.

However, job notifications and user annotations are not removed.

Also, the API can be called asynchronously by setting the parameter
`wait_for_completion` to `false` (defaults to `true`). When run
that way the API returns the task id for further monitoring.

In order to prevent the job from opening while it is resetting,
a new job field has been added called `blocked`. It is an object
that contains a `reason` and the `task_id`. `reason` can take
a value from ["delete", "reset", "revert"] as all these
operations should block the job from opening. The `task_id` is also
included in order to allow tracking the task if necessary.

Finally, this commit also sets the `blocked` field when
the revert snapshot API is called as a job should not be opened
while it is reverted to a different model snapshot.
2021-06-14 18:56:28 +03:00
Stef Nestor 8b8466b42c
[DOCS] Fix ILM action order (#74021)
Following [this code](https://github.com/elastic/elasticsearch/blob/master/x-pack/plugin/core/src/main/java/org/elasticsearch/xpack/core/ilm/TimeseriesLifecycleType.java#L54) I believe these are the necessary changes to bring the doc into alignment.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-06-10 14:28:44 -04:00
Stef Nestor fc1ea9c317
[DOCS] Fix `operation_mode` response property def (#73976) 2021-06-10 13:31:15 -04:00
James Rodewig c054f963e2
[DOCS] Note get license API can return a `404` (#73951)
If the master node is generating a new cluster state, the get license API may
return a `404` response.

Closes #70848
2021-06-10 09:19:20 -04:00
Benjamin Trent 8d882863d7
[ML] adding running_state to datafeed stats object (#73926)
It is useful to know the following information when reading datafeed stats:

 - Is the datafeed a "real-time" datafeed, i.e. a datafeed without a configured `end` time
 - Has the datafeed processed all past data available at the time of starting.

This object is only available if the datafeed task has been created.

It has the form:

```
"running_state": {
  "is_real_time": <boolean>,
  "look_back_finished": <boolean>
}
```
2021-06-10 08:08:49 -04:00
James Rodewig 95e7f3faf9
[DOCS] Remove `PrintCompressedOopsMode` JVM option (#73741)
Changes:
* Removes a reference to the
  `-XX:+UnlockDiagnosticVMOptions -XX:+PrintCompressedOopsMode` JVM option. This
  option is no longer supported.
* Combines `Xms/Xmx` recommendations for compressed oops.

Closes #71644.

Co-authored-by: Rick Boyd <boyd.richardj@gmail.com>
2021-06-09 09:43:33 -04:00
Igor Motov db36b6c89a
Add keep_values gap policy (#73297)
Adds a new keep_values gap policy that works like skip, except if the metric
calculated on an empty bucket provides a non-null non-NaN value, this value is
used for the bucket.

Fixes #27377

Co-authored-by: Mark Tozzi <mark.tozzi@gmail.com>
2021-06-08 09:47:29 -10:00
James Rodewig 0360ce48b4
[DOCS] Clarify supported fields for `top_metrics` agg (#73907)
Changes:
* Notes `metrics.field` supports `boolean` fields and runtime fields.
* Notes `metrics.field` doesn't support array values.

Closes #72889
2021-06-08 13:19:43 -04:00
David Turner 43ddd4a580
Fix docs rendering around recovery rate table (#73879)
- Replaces ⇐ with ≤
- Removes table caption
- Adjust table headers
- Fixes leading + on subsequent paragraphs
2021-06-08 15:00:00 +01:00
James Rodewig ff0cb8ed97
[DOCS] Make doc_count error docs more searchable (#73870)
Changes:
* Combines the `Document counts are approximate` and `Calculating document count
  error` sections.
* Rewrites the section to include `sum_other_doc_count` and
  `doc_count_error_upper_bound` for easier on-page (ctrl+f) searching.

Closes #73200
2021-06-08 09:33:10 -04:00
István Zoltán Szabó 20d0dc300f
[DOCS] Updates datafeed related runtime field examples (#73725) 2021-06-08 11:27:55 +02:00
Mark Tozzi 2d4d3d40a0
Docvalueformat errors (#73121)
Improve the error message when inconsistent mappings cause doc value formatting errors.  For example, trying to format a binary encoded IP address as a UTF8 string often fails with something unexpected, like `ArrayIndexOutOfBounds`.  This change catches that and wraps it with a message suggesting the user check their mappings.  Also gets rid of anonymous instances for doc value formatters, which made it hard to see what format was failing to be applied.
2021-06-07 15:24:27 -04:00
Daisuke Harada fa61bf814e
Update runtime.asciidoc (#73802)
it looks typo in a few numbers there.
2021-06-07 09:36:05 -04:00
James Rodewig 3fbc182076
[DOCS] Clone index API doesn't copy index metadata (#73787)
Notes that the clone index API doesn't copy index metadata and highlights some affected features.

Closes #69490
2021-06-07 08:36:58 -04:00
Henning Andersen a11e6f5c6e
Breaking change for single data node setting (#73737)
In #55805, we added a setting to allow single data node clusters to
respect the high watermark. In #73733 we added the related deprecations.
This commit ensures the only valid value for the setting is true and
adds deprecations if the setting is set. The setting will be removed
in a future release.

Co-authored-by: David Turner <david.turner@elastic.co>
2021-06-07 13:12:04 +02:00
David Turner 9146606e49
Note repo must not be registered during restore (#73740)
This commit adds a short note to the docs on repository backups
indicating that the repository must not be modified while registered, so
that a restore from a repository backup must complete before
registration.

Relates #73730
2021-06-07 10:14:13 +01:00
Glen Smith 1b6d2f522c
[doc] Fix typo in slowlog.asciidoc (#73803)
Clarify `index.indexing.slowlog.source` setting.
2021-06-07 09:50:50 +02:00
Benjamin Trent 30cf4dc8be
[ML] adding new KS test pipeline aggregation (#73334)
This adds a new pipeline aggregation for calculating Kolmogorov–Smirnov test for a given sample and buckets path.

For now, the buckets path resolution needs to be `_count`. But, this may be relaxed in the future. 

It accepts a parameter `fractions` that indicates the distribution of documents from some other pre-calculated sample. 

This particular version of the K-S test is Two-sample, meaning, it calculates if the `fractions` and the distribution of `_count` values in the buckets_path are taken from the same distribution.

This in combination with the hypothesis alternatives (`less`, `greater`, `two_sided`) and sampling logic (`upper_tail`, `lower_tail`, `uniform`) allow for flexibility and usefulness when comparing two samples and determining the likelihood of them being from the same overall distribution.

Usage:

```
POST correlate_latency/_search?size=0&filter_path=aggregations
{
  "aggs": {
    "buckets": {
      "terms": { <1>
        "field": "version",
        "size": 2
      },
      "aggs": {
        "latency_ranges": {
          "range": { <2>
            "field": "latency",
            "ranges": [
              { "to": 0.0 },
              { "from": 0, "to": 105 },
              { "from": 105, "to": 225 },
              { "from": 225, "to": 445 },
              { "from": 445, "to": 665 },
              { "from": 665, "to": 885 },
              { "from": 885, "to": 1115 },
              { "from": 1115, "to": 1335 },
              { "from": 1335, "to": 1555 },
              { "from": 1555, "to": 1775 },
              { "from": 1775 }
            ]
          }
        },
        "ks_test": { <3>
          "bucket_count_ks_test": {
            "buckets_path": "latency_ranges>_count",
            "alternative": ["less", "greater", "two_sided"]
          }
        }
      }
    }
  }
}
```
2021-06-04 10:04:41 -04:00
Francisco Fernández Castaño a8101f346f
Add support for index pattern exclusion in CCR AutoFollow (#72935)
This commit adds the ability to specify exclusion patterns in Auto-Follow patterns. This allows excluding indices that match any of the inclusion patterns and also match some of the exclusion patterns giving more fine grained control in scenarios where this is important.

Related #67686
2021-06-04 13:53:30 +02:00
Tim Vernum 7ce9398fcf
[DOCS] Fix client_authentication setting default (#73743)
The value of `*.ssl.client_authentication` is `required` for
everything except `xpack.security.http.ssl.client_authentication`, for
which is it `none`.

The doc template for this setting was configured to have a default
value, and allow an override. However, the default was set to `none`
when it should have been `required`.

The override for `http` was correctly set to `none` (but that didn't
really do anything, since that was the same as the default).

This commit changes the default to `required`, which matches the code
(see `XPackSettings.CLIENT_AUTH_DEFAULT`), and leaves the override for
http as `none` (see `XPackSettings.HTTP_CLIENT_AUTH_DEFAULT`).
2021-06-04 14:22:47 +10:00
Lisa Cawley a6339918ac
[DOCS] Adds defaults to get ML results APIs (#73540)
Co-authored-by: David Roberts <dave.roberts@elastic.co>
2021-06-03 10:05:47 -07:00
István Zoltán Szabó e197577003
[DOCS] Fixes typo in transforms at scale page. (#73726) 2021-06-03 18:27:36 +02:00
William Brafford 1c295a92d8
Add threadpool for critical operations on system indices (#72625)
* Add new thread pool for critical operations
* Split critical thread pool into read and write
* Add POJO to hold thread pool names
* Add tests for critical thread pools
* Add thread pools to data streams
* Update settings for security plugin
* Retrieve ExecutorSelector from SystemIndices where possible
* Use a singleton ExecutorSelector
2021-06-03 12:07:37 -04:00
Adam Locke e2c470abed
[DOCS] Retrieve values from flattened fields w/ runtime fields (#73630)
* [DOCS] Add retriving from flattened fields

* Clarify sub-field syntax

* Moving sub-field retrieval to flattened field docs

* Remove full example and de-emphasize runtime fields

* Remove extraneous sample tag
2021-06-03 11:52:53 -04:00
James Rodewig 68bb763343
[DOCS] Note `include_aliases` supports data stream aliases (#73687)
With #73595, data stream aliases now support the restore snapshot API's
`include_aliases` option.
2021-06-03 08:43:14 -04:00
James Rodewig 975ae227fc [DOCS] Fix typo in modulo example 2021-06-03 08:21:29 -04:00
Lisa Cawley 5273011f98
[DOCS] Update transforms tutorial with index mapping details (#73395) 2021-06-02 13:45:01 -07:00
James Rodewig 712dbdc037 Remove unneeded word 2021-06-02 15:12:34 -04:00
James Rodewig c696bc949d
[DOCS] Remove outdated OSS homebrew tap (#73688) (#73694)
With
230b860d95,
the `elastic/tap/elasticsearch-oss` tap was removed from Homebrew. This
removes outdated references to the tap from our docs.

It also notes that Homebrew installs the latest version of Elasticsearch.
2021-06-02 14:19:41 -04:00
James Rodewig 7f93d07ee3
[DOCS] Revise shard size guidelines to 50GB (#73686)
Changes:
* Revises the size your shards guide to use a 50GB shard guideline. This better aligns with our default in the ILM policy UI.
* Updates the language to indicate that the 50GB shard guideline is not a hard limit. Larger shards may work depending on the network and use case.

Reverts some changes added in #71367.
2021-06-02 13:41:40 -04:00
István Zoltán Szabó 44c26c8bdc
[DOCS] Removes Kibana charts-related advise about agg interval and bucket span. (#73673) 2021-06-02 16:47:01 +02:00
Luca Belluccini 3e41d753e3
[DOCS] Note circuit breakers reject requests with 429 HTTP status code (#69864)
We mention Elasticsearch returns 429 if the circuit breaker trips in https://www.elastic.co/blog/improving-node-resiliency-with-the-real-memory-circuit-breaker, but there is no mention in the docs.

This adds an xref to circuit breaker errors section.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-06-02 10:31:24 -04:00
Alexander Reelsen f5a1d94cc9
[DOCS] Fix create component template API request path (#72874) 2021-06-02 09:04:58 -04:00
James Rodewig f1e1661088
[DOCS] Reuse `aliases` object properties in API docs (#73421)
Changes:
* Reuses the same `aliases` object properties in the following API docs:
  * Clone index API
  * Create index API
  * Put component template API
  * Put legacy index template API
  * Put index template API
  * Rollover index API
  * Shrink index API
  * Simulate template API
  * Split index API
* Updates the `aliases` object properties for the simulate index API docs.

Closes #73044
2021-06-02 08:31:42 -04:00
Tim Vernum d58fe5159c
[DOCS] Add notes on SSL version by JDK version (#73401)
Recent JDK releases have disabled TLS v1.0 and TLS v1.1 by default

See
 - https://java.com/en/jre-jdk-cryptoroadmap.html
 - https://bugs.openjdk.java.net/browse/JDK-8202343

This change adds documentation clarifying which TLS versions are
supported on which JDKs (in general terms, rather than specific builds)
and how to change the configuration if necessary.

Co-authored-by: Adam Locke <adam.locke@elastic.co>
2021-06-02 17:33:33 +10:00
bellengao b6fd1bbb06
Add _meta field to ilm policy (#73515)
Relates to #70755.

The main changes of this PR are:

    Add an optional _meta field to ILM policy.
    Add some test code about the change.
    Update the doc of Create or update lifecycle policy API.
2021-06-01 11:17:53 -06:00
James Rodewig 5180f75c70
[DOCS] Use `bool` query in alias filter example (#73619)
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2021-06-01 12:05:40 -04:00
Rene Groeschke 94f07c4942
Fix docs path for docs PR to pass 2021-06-01 16:55:13 +02:00
James Rodewig 3e019a4d56
[DOCS] Note only ES should lock `path.data` files (#73596)
If another service, such as an antivirus or backup program, opens and
locks files in the `path.data` directory, Elasticsearch may return errors.
2021-06-01 10:36:47 -04:00
James Rodewig 8212f15424
[DOCS] Update `is_write_index` example (#73601)
Changes:
* Updates the write index snippet to use data streams
* Notes data streams aliases don't set an implicit write stream, even if the alias points to one data stream.
2021-06-01 10:30:17 -04:00
David Roberts 0059c59e25
[ML] Make ml_standard tokenizer the default for new categorization jobs (#72805)
Categorization jobs created once the entire cluster is upgraded to
version 7.14 or higher will default to using the new ml_standard
tokenizer rather than the previous default of the ml_classic
tokenizer, and will incorporate the new first_non_blank_line char
filter so that categorization is based purely on the first non-blank
line of each message.

The difference between the ml_classic and ml_standard tokenizers
is that ml_classic splits on slashes and colons, so creates multiple
tokens from URLs and filesystem paths, whereas ml_standard attempts
to keep URLs, email addresses and filesystem paths as single tokens.

It is still possible to config the ml_classic tokenizer if you
prefer: just provide a categorization_analyzer within your
analysis_config and whichever tokenizer you choose (which could be
ml_classic or any other Elasticsearch tokenizer) will be used.

To opt out of using first_non_blank_line as a default char filter,
you must explicitly specify a categorization_analyzer that does not
include it.

If no categorization_analyzer is specified but categorization_filters
are specified then the categorization filters are converted to char
filters applied that are applied after first_non_blank_line.

Closes elastic/ml-cpp#1724
2021-06-01 15:11:32 +01:00
Adam Locke 88dfe1aebf
[DOCS] Remove errant note (#73597) (#73600) 2021-06-01 09:41:09 -04:00
James Rodewig 74bc81c23e
[DOCS] Update alias API docs for data stream aliases (#73419)
Updates the docs for several alias APIs for data stream aliases. Also
removes some redundant examples now covered in the [aliases guide][0].

[0]: https://www.elastic.co/guide/en/elasticsearch/reference/master/alias.html
2021-06-01 08:45:22 -04:00
István Zoltán Szabó 7477a644fb
[DOCS] Changes relative links to absolute ones in Node changes. (#73583) 2021-06-01 10:47:32 +02:00
David Turner e027ce977c
Extend version barrier to all upgrades (#73358)
Today when upgrading to the next major version we have a so-called
_major version barrier_: once the cluster comprises nodes of the new
major version then nodes of the previous major version are prevented
from joining the cluster. This means we can be certain that
`clusterState.nodes().getMinNodeVersion().major` will never decrease, so
we can implement upgrade logic that relies on the cluster remaining in
its wholly-upgraded state.

This commit generalises this behaviour to apply to all upgrades, so that
we can be certain that `clusterState.nodes().getMinNodeVersion()` will
never decrease in a running cluster.

Closes #72911
2021-06-01 09:07:23 +01:00
David Turner 2feb48731a
Document removal of MDP as a breaking change (#73578)
Records that the removal of multiple data paths (#71205) is a breaking
change. Includes a copy of the migration process introduced in #73367.
2021-06-01 08:56:02 +01:00
Martijn van Groningen afc17bdb74
Add support for is_write_index flag to data stream aliases. (#73462)
This allows indexing documents into a data stream alias.
The ingestion is that forwarded to the write index of the data stream
that is marked as write data stream.
The `is_write_index` parameter can be used to indicate what the write data stream is,
when updating / adding a data steam alias.

Relates to #66163
2021-05-31 15:08:39 +02:00
James Rodewig a375d232b5
[DOCS] Fix `_tasks` API endpoint reference (#73379) (#73531)
Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>

Co-authored-by: Stéphane Bour <sbour@welcoming.com>
2021-05-28 11:19:12 -04:00
James Rodewig 1de9bd4d2a
[DOCS] Update `<alias>` parameter for cat aliases API (#73526) 2021-05-28 10:03:46 -04:00
Dan Hermann 7dd8fb7c66
[DOCS] Mark unfreeze API as deprecated (#73505) 2021-05-28 08:51:41 -05:00
Adam Locke e64a029be4
[DOCS] Create a new page for dissect content in scripting docs (#73437)
* [DOCS] Create a new page for dissect in scripting docs

* Expanding a bit more

* Adding a section for using dissect patterns

* Adding tests

* Fix test cases and other edits
2021-05-27 17:08:52 -04:00
Dan Hermann 40a029f5ba
Deprecate freeze index API (#72618) 2021-05-27 15:14:50 -05:00
James Rodewig c04fd6e301 [DOCS] Remove 'closed data stream' reference 2021-05-27 16:07:09 -04:00
James Rodewig 5729bb8d49
[DOCS] Update alias references (#73427)
Updates several `index aliases` references to `aliases`.
2021-05-27 16:00:57 -04:00
Adam Locke 0aa0171ce1
[DOCS] Create a new page for grok content in scripting docs (#73118)
* [DOCS] Moving grok to its own scripting page

* Adding examples

* Updating cross link for grok page

* Adds same runtime field in a search request for #73262

* Clarify titles and shift navigation

* Incorporating review feedback

* Updating cross-link to Painless
2021-05-27 15:18:34 -04:00
James Rodewig 2fca666b47
[DOCS] Update snapshot/restore for data stream aliases (#73438)
In 7.14+, you can now create data stream aliases. This updates the related
snapshot/restore documentation.
2021-05-27 10:57:06 -04:00
James Rodewig 4e6e3fb51d
[DOCS] Fix typo (#73444) (#73472)
Co-authored-by: Kevin Lowe <kevin@meetsoci.com>
2021-05-27 09:36:18 -04:00
Adam Locke 3d36df5cbd
[DOCS] Move common scripting use cases up a level (#73445) 2021-05-27 07:38:55 -04:00
James Rodewig 2aa4c3602b [DOCS] Fix typos 2021-05-26 09:43:26 -04:00
James Rodewig 325406235a [DOCS] Change 'wildcard expression' to 'wildcard pattern' 2021-05-26 09:20:46 -04:00
István Zoltán Szabó 0bb41ba4f8
[DOCS] Adds screenshots to Transform examples (#72939)
Co-authored-by: Lisa Cawley <lcawley@elastic.co>
2021-05-26 12:06:54 +02:00
James Rodewig 39a0314d30
[DOCS] Update alias xrefs (#73380)
Updates several internal 'alias' xrefs to point to the aliases guide rather than
API docs.
2021-05-25 16:19:00 -04:00
Adam Locke 89ed0c8e29
[DOCS] Expand information on using a runtime field without a script (#73219)
* [DOCS] Expand information on when to use a runtime field without a script

* Reworking information based on review feedback

* Clarify case where doc_values are disabled

* A few minor changes from review feedback
2021-05-25 15:09:31 -04:00
Adam Locke 13b8715a56
[DOCS] Adding redirect for cross-link from Cloud (#73351) (#73383) 2021-05-25 15:07:35 -04:00
Adam Locke 0336e012ee
[DOCS] Update bi-directional CCR note (#72266)
* [DOCS] Update bi-directional CCR note

* Integrating note with a new sub-section

* Add "particularly"
2021-05-25 14:23:58 -04:00
James Rodewig a4e7fcf8ab [DOCS] Fix typo 2021-05-25 12:23:42 -04:00
James Rodewig b1ad71b077
[DOCS] Add alias guide (#73062)
Creates a guide for aliases. We can use this guide to house concepts, examples,
and tips for aliases.
2021-05-25 11:52:16 -04:00
Yang Cheng 7458b472cd
[Doc]Missing xpack role in cat transforms (#73310) 2021-05-25 08:44:59 -07:00
Adam Locke aa66724d1c
[DOCS] Add redirects for Cloud pages to SAML and OIDC docs (#73343) (#73348) 2021-05-24 17:43:46 -04:00
Lee Hinman 95bccda599
Remove deprecated ._tier allocation filtering settings (#73074)
These settings were deprecated in 7.13+ in #72835 and are now removed by this commit.

This commit also ensures that the settings are removed from index metadata when the metadata is
loaded. The reason for this is that if we allow the settings to remain (because they are not
technically "invalid"), then the index will not be able to be allocated, because the
FilterAllocationDecider will be looking for nodes with the _tier attribute.
2021-05-24 14:38:34 -06:00
James Rodewig 12c2c1f12a
[DOCS] Fix formatting (#73338) 2021-05-24 14:19:52 -04:00
Andrei Dan 6a38aff777
[DOCS] Clarify that at least one met condition triggers rollover (#73224) 2021-05-24 19:15:16 +01:00
James Rodewig 0f1fb46d58
[DOCS] Expand and reuse `aliases` parameters (#73296)
Changes:

* Expands the `aliases` parameter for the create index API to better document
  supported properties.

* Reuses `aliases` parameter in the following API docs:
  * Clone index API
  * Shrink index API
  * Split index API
2021-05-24 08:39:05 -04:00
James Rodewig 6c5ab8c04c
[DOCS] Fix data stream ref in index template docs (#73292)
Updates an outdated reference to empty `data_stream` objects. The create index
template API's `data_stream` parameter now supports the `hidden` property.
2021-05-20 15:42:54 -04:00
James Rodewig 41a9eae803
[DOCS] Update `hidden` anchor (#73287) 2021-05-20 14:40:10 -04:00
James Rodewig cb943dbe9b
[DOCS] Reuse index template body params in simulate template API (#73177)
Changes:

* Reuses and reorders the index template API's body parameters in the simulate template API docs.
* Replaces several includes with a shorter xref.
* Reformats a sidebar on naming collisions with built-in index templates.
2021-05-20 12:21:20 -04:00
James Rodewig a11ea87041
[DOCS] Update exists API for data streams (#73180)
Updates the exists API docs to better reflect its support of data streams and
aliases.

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2021-05-20 12:08:40 -04:00
Bogdan Pintea 522283c90e
SQL: Add ODBC proxy support documentation (#73263)
This documents how to configure the proxy support for ODBC.
It also removes the documentation of the connection string values, these
are now all covered by the GUI settings.

Co-authored-by: Andrei Stefan <astefan@users.noreply.github.com>
2021-05-20 16:56:02 +02:00
James Rodewig 81de369635
[DOCS] Fix query parameters for restore API (#73015) 2021-05-20 08:38:37 -04:00
Henning Andersen 0f0cdcb3b0
[DOCS] Use partially mounted in autoscaling docs (#73229)
Fixed autoscaling docs to no longer call partially mounted indices or
shards for frozen indices/shards, now uses partially mounted indices or
shards.

Closes #73132

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-05-20 11:18:42 +02:00
Julie Tibshirani f85a9dddb9
Support field collapsing with search_after (#73023)
This change adds support for using `search_after` with field collapsing. When
using these in conjunction, the same field must be used for both sorting and
field collapsing. This helps keep the behavior simple and predictable.
Otherwise it would be possible for a group to appear on multiple pages of
results.

Currently search after is handled directly in `CollapsingTopDocsCollector`. As
a follow-up, we could generalize the logic and move support to the Lucene
grouping framework.

Closes #53115.
2021-05-19 14:21:18 -07:00
David Turner 8170376bf0
Restore global state does not merge (#73226)
Today the docs indicate that restoring a snapshot with
`include_global_state` set will merge the ingest pipelines, ILM
policies, settings etc in the snapshot with those already in the
cluster. This isn't the case, we simply replace all the things. This
commit corrects the docs.
2021-05-19 16:16:41 +01:00
James Rodewig 25dcfae30a
[DOCS] Fix reindex API intro (#73188)
The destination doesn't need to pre-exist.
2021-05-19 09:44:51 -04:00
James Rodewig fb212e415b [DOCS] Fix typo 2021-05-19 08:50:19 -04:00
James Rodewig c9a09e33e5
[DOCS] Remove `_all` examples from 'Fix common cluster issues' (#73217)
We plan to deprecate the `_all` wildcard pattern. This removes several
`_all` examples from the 'Fix common cluster issues' guide.
2021-05-19 08:37:16 -04:00
Martijn van Groningen 4b2c3ab0b7
The get aliases api should not return entries for data streams with no aliases (#72953)
The get alias api should take into account the aliases parameter when
returning aliases that refer to data streams and don't return entries
for data streams that don't have any aliases pointing to it.

Relates to #66163
2021-05-19 10:07:11 +02:00
Armin Braun da242856fd
Introduce SNAPSHOT_META Threadpool for Fetching Repository Metadata (#73172)
Adds new snapshot meta pool that is used to speed up the get snapshots API
by making `SnapshotInfo` load in parallel. Also use this pool to load
`RepositoryData`.
A follow-up to this would expand the use of this pool to the snapshot status
API and make it run in parallel as well.
2021-05-18 14:40:39 +02:00
Ryan Ernst 77d756b534
Deprecate shared and index data path settings (#73178)
This commit adds deprecation warnings for use of the path.shared_data
setting as well as the index setting index.data_path.

relates #73168
2021-05-18 05:38:35 -07:00
James Rodewig eeb1098874
[DOCS] Use query parameters in search API example (#73158)
The current search API documentation doesn't include any examples of query
parameter usage.

This updates the docs to include a simple syntax example using the `from` and
`size` query parameters.
2021-05-17 12:18:50 -04:00
James Rodewig e9add28e4f
[DOCS] Update anchor and xrefs for `alias` glossary entry (#73169) 2021-05-17 11:39:12 -04:00
James Rodewig 25421df5fd
[DOCS] Add `alias` to glossary (#73065)
https://github.com/elastic/elasticsearch/pull/72613 adds data stream support to aliases.
This adds an `alias` glossary entry and removes out the current `index alias` entry.
2021-05-17 11:08:07 -04:00
markharwood dbc37f9f6e
TermsEnum api - allow null search strings (#73144)
Allow null search strings (matches all)

Closes #73141
2021-05-17 15:54:53 +01:00
James Rodewig 7b64caee16
[DOCS] Clarify when SLM deletes expired snapshots (#73155)
Changes:
* Removes an error in the create SLM policy API's `schedule` parameter
  def. `schedule` is not used to delete expired snapshots.

* Updates the `expire_after` parameter def to mention the
  `slm.retention_schedule` cluster setting.
2021-05-17 10:02:08 -04:00
Seth Michael Larson ff406a88cc
Rename 'termsenum' API to 'terms_enum' for better readability 2021-05-17 08:08:06 -05:00
David Turner b2956b3ae7
Identify cancelled tasks in list tasks API (#72931)
This commit adds a `cancelled` flag to each cancellable task in the
response to the list tasks API, allowing users to see that a task has
been properly cancelled and will complete as soon as possible.

Closes #72907
2021-05-17 11:02:50 +01:00
David Turner eabe2d1b34
Increase PeerFinder verbosity on persistent failure (#73128)
If a node is partitioned away from the rest of the cluster then the
`ClusterFormationFailureHelper` periodically reports that it cannot
discover the expected collection of nodes, but does not indicate why. To
prove it's a connectivity problem, users must today restart the node
with `DEBUG` logging on `org.elasticsearch.discovery.PeerFinder` to see
further details.

With this commit we log messages at `WARN` level if the node remains
disconnected for longer than a configurable timeout, which defaults to 5
minutes.

Relates #72968
2021-05-17 10:52:18 +01:00
James Rodewig c9223a25a4
[DOCS] Update anchor for alias write index (#73108)
Updates the anchor for our alias write index content. Changes are no-op, but this helps prepare for future work on alias docs.
2021-05-14 12:35:25 -04:00
James Rodewig 34d5f6c246
[DOCS] Consolidate `routing` parameter definitions (#73107)
Combines duplicate definitions for the `routing` parameter.
2021-05-14 12:27:19 -04:00
James Rodewig fa60dfddd4
[DOCS] Update single index APIs reference (#73103)
Many single index APIs, such as the index document API, can now target
data streams. This updates updates references to single index APIs.
2021-05-14 11:53:34 -04:00
markharwood ebb113a7e3
Add "search_after" support to new termsEnum api (#72933)
Adds an optional parameter to the _terms_enum request designed to allow paging.
The last term from a previous result can be passed as the search_after parameter to a subsequent request, meaning only terms after the given term (but still matching the provided string prefix) are returned
Relates to #72910
2021-05-14 15:00:21 +01:00
James Rodewig b564650ed0
[DOCS] Rename `Bulk index alias` API to `Aliases` API (#73077)
In 7.14, the API supports both index and data stream aliases.
2021-05-14 07:41:11 -04:00
James Rodewig 8ec893a425
[DOCS] Change field alias anchor (#73043) 2021-05-13 09:32:36 -04:00
James Rodewig 81c666e785
[DOCS] Fix typo in SLM docs (#73025) (#73040)
Co-authored-by: Daisuke Harada <1519063+dharada@users.noreply.github.com>
2021-05-13 08:36:14 -04:00
Yang Wang 576fe750de
Service Accounts - add beta version documentation (#71729)
* wip

* Service Accounts - add beta documentation

* consistent names

* fix test

* Update service accounts overview and token creation files.

* Rename get service tokens to get service credentials

* fix tests

* Changes for create and get service tokens.

* Changes for get token creds, delete token, clear token cache, and token auth.

* add manage_service_account privilege to list

* List service accounts APIs

* Move xpack setting to Security API page, plus other cleanup.

* Shorten secret tokens in examples, add cross links, plus other cleanup.

* Clarifying parameter descriptions.

* Clarify language for authenticating with a token.

* Tweaks

* Typo fix

* Adding redirects to work around CI build checks

* Revert "Adding redirects to work around CI build checks"

This reverts commit 20a1b53591.

* Remove redirects that were implemented to satisfy CI checks in master branch

* Move note about not supporting basic auth

* Clarify what service accounts are specifically for

* Apply suggestions from code review

Co-authored-by: Tim Vernum <tim@adjective.org>

* Addressing review feedback

* tweak

* Improve doc tests

* fix test

Co-authored-by: Adam Locke <adam.locke@elastic.co>
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Co-authored-by: Tim Vernum <tim@adjective.org>
2021-05-13 07:55:42 -04:00
Felix Barnsteiner a5e39ce086
Add missing word in data streams checkpoints docs (#73005) 2021-05-13 08:34:34 +02:00
James Rodewig ce41fd7e2a [DOCS] Mark `filter` param as optional 2021-05-12 16:25:39 -04:00
James Rodewig b3a197715e
[DOCS] Update allowed operations on data stream write index (#72994)
With #70908, you can now close a write index in 7.12.1 and later versions.
This removes an outdated reference from the data stream overview docs.
2021-05-12 14:26:22 -04:00
James Rodewig 7909d3bd77
[DOCS] Fix JSON spec link for terms enum API (#72996) 2021-05-12 12:57:56 -04:00
James Rodewig 8dddca77aa
[DOCS] Remove and redirect frozen index overview content (#72990)
Changes:

* Removes and adds redirects for the frozen indices [overview][0], [best
  practices][1], [search][2], and [monitoring][3] pages.
* Removes glossary terms related to frozen indices.
* Updates several xrefs to point to the freeze index API docs.

Relates to elastic/elasticsearch#72946 and elastic/elasticsearch#70192.

[0]: https://www.elastic.co/guide/en/elasticsearch/reference/7.12/frozen-indices.html
[1]: https://www.elastic.co/guide/en/elasticsearch/reference/master/best_practices.html
[2]: https://www.elastic.co/guide/en/elasticsearch/reference/master/searching_a_frozen_index.html
[3]: https://www.elastic.co/guide/en/elasticsearch/reference/master/monitoring_frozen_indices.html
2021-05-12 12:54:20 -04:00
markharwood 2dca93fbda
Links new TermsEnum api into docs. Remove superfluous “API” from page title. (#72992)
Closes #72982
2021-05-12 16:21:36 +01:00
Dominic Page dc43d05816
docs amendment match-only-text has limited support for aggs (#72985) 2021-05-12 16:48:57 +02:00
James Rodewig dc1bf6eff9
[DOCS] EQL: Note CCS is not supported (#72975) 2021-05-12 09:19:29 -04:00
James Rodewig 4386008ae2
[DOCS] Add xrefs to script APIs (#72954) 2021-05-12 08:30:49 -04:00
Dan Hermann d0fd686600
[DOCS] Mark freeze API as deprecated (#72946) 2021-05-11 14:27:40 -05:00
Henning Andersen 2ebb58800c
[DOCS] recommend dedicated frozen tier (#72882)
* [DOCS] recommend dedicated frozen tier

Using a dedicated frozen tier is recommended.
2021-05-11 20:41:10 +02:00
Lee Hinman 539a614d9b
Deprecate single-tier allocation filtering settings (#72835)
* Deprecate single-tier allocation filtering settings

`(index|cluster).routing.allocation.(include|exclude|require)._tier` settings are now deprecated in
favor of using `index.routing.allocation.include._tier_preference`.

* Update deprecation message
2021-05-11 09:45:16 -06:00
Christoph Büscher f34c9a8a40
Enhance error message for copy-to (#72820)
We currently don't support `copy_to` for fields that take the form of objects
(e.g. `date_range` or certain kinds of `geo_point` variants). The current
problem with objects is that when DocumentParser parses anything other than
single values, it potentially advances the underlying parser past the value that
we would need to stay on for parsing the value again. While we might want to
support this in the future, for now this PR enhances the otherwise confusing
MapperParsingException with something more helpful and adds a short note in the
documentation about this restriction.

Closes #49344
2021-05-11 13:27:45 +02:00
Martijn van Groningen 6689b8bf1c
Add basic alias support for data streams (#72613)
Aliases to data streams can be defined via the existing update aliases api.
Aliases can either only refer to data streams or to indices (not both).
Also the existing get aliases api has been modified to support returning
aliases that refer to data streams.

Aliases for data streams are stored separately from data streams and
and refer to data streams by name and not to the backing indices of
a data stream. This means that when backing indices are added or removed
from a data stream that then the data stream alias doesn't need to be
updated.

The authorization model for aliases that refer to data streams is the
same as for aliases the refer to indices. In security privileges can
be defined on aliases, indices and data streams. When a privilege is
granted on an alias then access is also granted on the indices that
an alias refers to (irregardless whether privileges are granted or denied
on the actual indices). The same will apply for aliases that refer
to data streams. See for more details:
https://github.com/elastic/elasticsearch/issues/66163#issuecomment-824709767

Relates to #66163
2021-05-11 09:51:05 +02:00
Nik Everett a43b166d11
More debugging info for significant_text (#72727)
Adds some extra debugging information to make it clear that you are
running `significant_text`. Also adds some using timing information
around the `_source` fetch and the `terms` accumulation. This lets you
calculate a third useful timing number: the analysis time. It is
`collect_ns - fetch_ns - accumulation_ns`.

This also adds a half dozen extra REST tests to get a *fairly*
comprehensive set of the operations this supports. It doesn't cover all
of the significance heuristic parsing, but its certainly much better
than what we had.
2021-05-10 12:50:46 -04:00
Benjamin Trent 8069e9b233
[ML] add new bucket_correlation aggregation with initial count_correlation function (#72133)
This commit adds a new pipeline aggregation that allows correlation within the aggregation frame work in bucketed values. 

The initial function is a `count_correlation` function. The purpose of which is to correlate the count in a consistent number of buckets with a pre calculated indicator. The indicator and the aggregated buckets should related to the same metrics with in documents. 

Example for correlating terms within a `service.version.keyword` with latency percentiles. The percentiles and provided correlation indicator both refer to the same source data where the indicator was previously calculated.:
```
GET apm-7.12.0-transaction-generated/_search
{
  "size": 0,
  "aggs": {
    "field_terms": {
      "terms": {
        "field": "service.version.keyword",
        "size": 20
      },
      "aggs": {
        "latency_range": {
          "range": {
            "field": "transaction.duration.us",
            "ranges": [<snip>],
            "keyed": true
          }
        },
        "correlation": {
          "bucket_correlation": {
            "buckets_path": "latency_range>_count",
            "count_correlation": {
              "indicator": {
                 "expectations": [<snip>],
                 "doc_count": 20000
               }
            }
          }
        }
      }
    }
  }
}
```
2021-05-10 12:46:11 -04:00
André Letterer aa544950b6
[DOCS] Include link to ES_TMPDIR with docs on jna.tmpdir (#72845) 2021-05-10 10:52:52 -05:00
David Turner f3792169b8
Recommend cluster scope for Metricbeat (#70865)
Today we mention Metricbeat's `scope` parameter but offer no guidance
about how it should be used. This commit adds guidance to use `scope:
cluster`, especially on clusters with dedicated master-eligible nodes.
2021-05-10 14:16:46 +01:00
István Zoltán Szabó bbfe962cae
[DOCS] Adds top_metrics aggs examples to Transform docs (#72738)
Co-authored-by: Lisa Cawley <lcawley@elastic.co>
2021-05-10 14:41:51 +02:00
Przemko Robakowski a7ff5e7141
Disable GeoIP downloader (#72716)
Due to problems discovered in #72572 we have to disable geoip downloader for now. We use ingest.geoip.downloader.enabled.default as feature flag.
This change also reverts changes to docs.
2021-05-10 13:03:30 +02:00
Jason Tedor 8b4b2f9534
Remove bootstrap.system_call_filter setting (#72848)
This commit removes the bootstrap.system_call_filter setting, as
starting in Elasticsearch 8.0.0 we are going to require that system call
filters be installed and that this is not user configurable. Note that
while we force bootstrap to attempt to install system call filters, we
only enforce that they are installed via a bootstrap check in production
environments. We can consider changing this behavior, but leave that for
future consideration and thus a potential follow-up change.
2021-05-07 18:46:27 -04:00
Jason Tedor 694229f0cd
Deprecate bootstrap.system_call_filter (#72834)
We are going to require system call filters. This commit is the first
step in that journey, which is to deprecate the setting that allows
disabling system call filters.
2021-05-07 08:31:55 -04:00
Nik Everett 5808f2febb
Update docs for `filter` agg (#72508)
The docs for the `filter` agg seemed to suggest that it was the
preferred way to filter results for aggs but its really mostly for when
you need to filter things under another bucketing agg.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-05-06 14:51:16 -04:00
James Rodewig 965baad5c4
[DOCS] EQL: Update tiebreaker docs for implicit tiebreaker (#72808) 2021-05-06 14:48:46 -04:00
István Zoltán Szabó 090690d17a
[DOCS] Revises required privileges info in Transforms API docs (#72803) 2021-05-06 15:19:28 +02:00
Dan Hermann 3c08d821e1
[DOCS] Data stream snapshot behavior with excluded backing indices (#72761) 2021-05-06 07:35:08 -05:00
markharwood 73e0662f09
New TermsEnum API for discovering terms in the index. (#66452)
New api designed for use by apps like Kibana for auto-complete use cases.
A search string is supplied which is used as prefix for matching terms found in a given field in the index.
Supported field types are keyword, constant_keyword and flattened.
A timeout can limit the amount of time spent looking for matches (default 1s) and an `index_filter` query can limit indices e.g. those in the hot or warm tier by querying the `_tier` field

Closes #59137
2021-05-06 10:45:37 +01:00
David Turner 3e0959f308
Fix Watcher HTTP connection config for longevity (#72736)
Watcher uses a connection pool for outgoing HTTP traffic, which means
that some HTTP connections may live for a long time, possibly in an idle
state. Such connections may be silently torn down by a remote device, so
that when we re-use them we encounter a `Connection reset` or similar
error.

This commit introduces a setting allowing users to set a finite expiry
time on these connections, and also enables TCP keepalives on them by
default so that a remote teardown will be actively detected sooner.

Closes #52997
2021-05-06 08:28:39 +01:00
István Zoltán Szabó 1ce2308e2a
[DOCS] Adds max_trees hyperparameter to GET TM API docs (#72298) 2021-05-06 08:18:19 +02:00
James Rodewig dbad9d0a0d
[DOCS] Update 'shared_cache' references for searchable snapshots (#72775) 2021-05-05 17:49:15 -04:00
James Rodewig ba66669eb3
[DOCS] Rename mount types for searchable snapshots (#72699)
Changes:

* Renames 'full copy searchable snapshot' to 'fully mounted index.'
* Renames 'shared cache searchable snapshot' to 'partially mounted index.'
* Removes some unneeded cache setup instructions for the frozen tier. We added a default cache size with #71844.
2021-05-05 16:35:33 -04:00
James Rodewig 15e42fd748 [DOCS] Fix `<repository>` param 2021-05-05 15:56:28 -04:00
Luca Belluccini 647ba8f124
[DOCS] Clarify remote_cluster_client is required to run ML (#72569)
Co-authored-by: Lisa Cawley <lcawley@elastic.co>
2021-05-04 18:25:42 -07:00
James Rodewig bd84e8a394
[DOCS] Refactor script processor docs (#72691) 2021-05-04 15:55:38 -04:00
James Rodewig 0224621423 [DOCS] Fix formatting 2021-05-04 12:29:14 -04:00
James Rodewig bbfa090a19
[DOCS] Fix bulk API xref (#72685) 2021-05-04 11:07:19 -04:00
James Rodewig 0edb8f4708
[DOCS] Document missing script APIs (#72175)
Documents the following APIs:

* [Delete stored script][0]
* [Get stored script][1]
* [Get script contexts][2]
* [Get script languages][3]

[0]: https://github.com/elastic/elasticsearch/blob/master/rest-api-spec/src/main/resources/rest-api-spec/api/delete_script.json
[1]: https://github.com/elastic/elasticsearch/blob/master/rest-api-spec/src/main/resources/rest-api-spec/api/get_script.json
[2]: https://github.com/elastic/elasticsearch/blob/master/rest-api-spec/src/main/resources/rest-api-spec/api/get_script_context.json
[3]: https://github.com/elastic/elasticsearch/blob/master/rest-api-spec/src/main/resources/rest-api-spec/api/get_script_languages.json
2021-05-04 11:03:30 -04:00
James Rodewig 7283107e41
[DOCS] Clarify location of custom JVM options files (#72656) (#72679)
Clarify the files to be modified are in the `jvm.options.d/` folder

Co-authored-by: Leaf-Lin <39002973+Leaf-Lin@users.noreply.github.com>
2021-05-04 08:32:45 -04:00
David Kilfoyle c1c5e67f37
[DOCS] Remove frozen tier restriction for ESS (#72624)
* Remove frozen tier restriction for ESS

* Remove section from 'Use ES for time series data'

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-05-03 15:39:14 -07:00
Adam Locke 44a19732ab
[DOCS] Add field extraction use cases to scripting docs (#71596)
* [DOCS] Add field extraction use cases to scripting docs

* Adding file

* Remove extra space

* Add dissect pattern to split and retrieve data

* Fix list spacing

* Incorporating review feedback
2021-05-03 16:24:28 -04:00
James Rodewig 1ef098c0c0
[DOCS] Shorten quick start response (#72622) 2021-05-03 11:17:47 -04:00
István Zoltán Szabó d07c174aaf
[DOCS] Revises required privileges info in Anomaly Detection API docs (#72483) 2021-05-03 10:20:14 +02:00
Benjamin Trent 2ce4d175f0
[ML] increase the default value of xpack.ml.max_open_jobs from 20 to 512 for autoscaling improvements (#72487)
This commit increases the xpack.ml.max_open_jobs from 20 to 512. Additionally, it ignores nodes that cannot provide an accurate view into their native memory.

If a node does not have a view into its native memory, we ignore it for assignment.

This effectively fixes a bug with autoscaling. Autoscaling relies on jobs with adequate memory to assign jobs to nodes. If that is hampered by the xpack.ml.max_open_jobs scaling decisions are hampered.
2021-04-30 07:55:57 -04:00
Ignacio Vera 4fff3788f3
Disallow creating geo_shape mappings with deprecated parameters (#70850)
With the introduction of BKD-based geo shape indexing in #32039, the prefix tree indexing method has 
been deprecated. From 8.0.0, it will not be allowed to create new mappings using deprecated parameters.
2021-04-30 11:08:58 +02:00
Ignacio Vera 793166fd1f
[GeoPoint] Grid aggregations with bounds should exclude touching tiles (#72493) 2021-04-30 08:43:18 +02:00
Lisa Cawley 0074ba6d5a
[DOCS] Clarify update transform API (#72427) 2021-04-29 07:57:58 -07:00
Dan Hermann 30cd47e7f8
[DOCS] Clarify CCR auto-follow requirements for data streams (#72476) 2021-04-29 09:41:58 -05:00
Yang Wang 4bd5647fd2
Expose API Key metadata to SetSecurityUser ingest processor (#72137)
This PR ensures SetSecurityUserProcessor adds the API key metadata
inside the existing api_key object if the metadata is not null or empty.
2021-04-29 14:47:36 +10:00
debadair bef9b8a9b3
[DOCS] Edit JVM settings info. Closes #72259 (#72350)
* [DOCS] Edit JVM settings info. Closes #72259

* Apply suggestions from code review

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>

* Incorporated review feedback.
2021-04-28 18:30:32 -07:00
James Rodewig 44f3551786
[DOCS] EQL: Use ECS example in EQL syntax docs (#72414) 2021-04-28 14:02:12 -04:00
James Rodewig f1075e755c [DOCS] Fix several breaking changes 2021-04-28 13:38:17 -04:00
James Rodewig c5a38b4bf5 [DOCS] Fix breaking change formatting 2021-04-28 12:49:13 -04:00
James Rodewig 889197f8d0 [DOCS] Fix formatting 2021-04-28 11:40:33 -04:00
James Rodewig 84eebbbdd0 [DOCS] Decrement 'Search your data' headings 2021-04-28 11:37:57 -04:00
James Rodewig 0f02783139 [DOCS] Remove unneeded words 2021-04-28 08:51:56 -04:00
James Rodewig f8666b930f
[DOCS] Add 'Fix common cluster issues' docs (#72097) 2021-04-28 08:28:51 -04:00
István Zoltán Szabó ce9dd74cf5
[DOCS] Expands DFA and TM API docs with required privileges info (#71335) 2021-04-28 08:33:42 +02:00
debadair 633e1322f1
[DOCS] How to migrate to node roles from node attrs. Closes #65855 (#71160)
* [DOCS] Document how to migrate to node roles from node attrs. Closes #65855

* [DOCS] Incorporated review comments

* Update docs/reference/data-management/migrate-index-allocation-filters.asciidoc

Co-authored-by: Andrei Dan <andrei.dan@elastic.co>
2021-04-27 14:39:54 -07:00
James Rodewig 13179c71b6
[DOCS] EQL: Shorten response snippets (#72330)
Shortens several lengthy response snippets to better highlight the
relevant parts.
2021-04-27 16:02:45 -04:00
James Rodewig 97054d6570
[DOCS] Replace 'log into' with 'log in to' (#72324) 2021-04-27 15:27:22 -04:00
William Brafford fc7c06d8a1
Make feature reset API response more informative (#71240)
Previously, the ResetFeatureStateStatus object captured its status in a
String, which meant that if we wanted to know if something succeeded or
failed, we'd have to parse information out of the string. This isn't a
good way of doing things.

I've introduced a SUCCESS/FAILURE enum for status constants, and added a
check for failures in the transport action. We return a 207 if some but not all
reset actions fail, and for every failure, we also return information about the
exception or error that caused it.

Co-authored-by: Jay Modi <jaymode@users.noreply.github.com>
2021-04-27 13:47:10 -04:00
David Turner 1c4791e398
Abort writes in repo analyzer (#72077)
We rely on the repository implementation correctly handling the case where a
write is aborted before it completes. This is not guaranteed for third-party
repositories.

This commit adds a rare action during analysis which aborts the write
just before it completes and verifies that the target blob is not found
by any node.
2021-04-27 14:13:22 +01:00
Hendrik Muhs 7fff5df7a3
[Transform] add support for top metrics (#71850)
add support for the stats and top metrics aggregation in transform. With this change it became
easier to add more multi value aggregations to transform

Limitations:
 - only the 1st element of top_metrics gets consumed by transform[*].
 - all values of stats will be mapped to double if mapping deduction is used, including count,
   sum, min, max

fixes #52236
relates #51925
2021-04-27 13:45:53 +02:00
David Turner 12b60f64ba Trivial typo: bindiing -> binding 2021-04-27 12:20:46 +01:00
Henning Andersen 6fbd2601df
[DOCS] Disable shard allocation - data nodes only (#72265)
When doing a rolling restart we recommend disabling shard allocation to
avoid unnecessary recoveries. However, this advise is unnecessary or
even harmful when restarting nodes that do not carry any data like a
pure ML node.
2021-04-27 11:19:03 +02:00
Ryan Ernst 99c15cd05a
Remove documentation for multiple data paths (#72267)
relates #71205
2021-04-26 15:03:49 -07:00
James Rodewig 1ff080538a [DOCS] Update quick start heading 2021-04-26 17:48:01 -04:00
Adam Locke 881356ce18
[DOCS] Update id for service account redirect (#72268)
* [DOCS] Update id for service account redirect

* Remove duplicate redirect
2021-04-26 14:58:42 -04:00
Adam Locke 2ab277cc17
[DOCS] Adds temporary redirects for service account docs (#72262) 2021-04-26 14:06:00 -04:00
David Turner dd7f555ca5
Open with better cluster allocation explain ex. (#72245)
Today the only example of calling the cluster allocation explain API above the
fold is the bare `GET /_cluster/allocation/explain` which kind of works but is
not usually what the user wants. This commit changes the docs so that we open
with an example showing how we usually expect it to be called. This will make
it clearer that you should normally specify exactly for which shard you want an
explanation. It also tidies up a few other wrinkles in these docs.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-04-26 17:41:22 +01:00
Pierre Grimaud 3c44dfec60
[DOCS] Fix typos (#72227) 2021-04-26 12:40:38 -04:00
James Rodewig fd72a2690a [DOCS] Remove unneeded escapes 2021-04-26 12:14:45 -04:00
James Rodewig 32b5e610bf
[DOCS] Fix cat test snippets (#72233)
* Updates a cat test snippet to always return by index name in asc order
* Removes several leading slashes
* Reduces length of several snippet delimiters

Closes https://github.com/elastic/elasticsearch/issues/71683
2021-04-26 09:22:46 -04:00
István Zoltán Szabó 2f122f03b2
[DOCS] Adds anomaly detection rule advanced settings to docs (#72072)
Co-authored-by: Lisa Cawley <lcawley@elastic.co>
2021-04-26 09:55:02 +02:00
James Rodewig 53fb357da5 [DOCS] Clarify `<script-id>` parameter def 2021-04-23 15:05:35 -04:00
James Rodewig 93784444f7 [DOCS] Fix created stored script API request paths 2021-04-23 11:38:06 -04:00
James Rodewig 230b0c35bd [DOCS] Fix data type of `include_defaults` parameter 2021-04-23 10:07:26 -04:00
James Rodewig d94af832c5 [DOCS] Remove uneeded 'How to' from title 2021-04-23 10:07:13 -04:00
Wylie Conlon 15806b648b
[DOCS] Clarify `timeout` and `terminate_after` parameters (#71713)
Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-04-23 09:26:21 -04:00
James Rodewig 56068a777e
[DOCS] Update Docker instructions for quick start (#72117) 2021-04-23 09:22:47 -04:00
James Rodewig f8d2578ede
[DOCS] EQL: Remove wildcard function (#72121) 2021-04-22 15:49:07 -04:00
Yang Cheng fc2d6f89c9
[DOCS] Remove internal versioning for concurrency control (#71570) 2021-04-22 10:54:58 -04:00
Adrien Grand 83113ec8d3
Add `match_only_text`, a space-efficient variant of `text`. (#66172)
This adds a new `match_only_text` field, which indexes the same data as a `text`
field that has `index_options: docs` and `norms: false` and uses the `_source`
for positional queries like `match_phrase`. Unlike `text`, this field doesn't
support scoring.
2021-04-22 08:41:47 +02:00
William Brafford d1bcd2a88b
Add docs for feature reset API (#71759)
* Add docs for feature reset API
* Prose and style much improved by Deb Adair.

Co-authored-by: debadair <debadair@elastic.co>
2021-04-21 17:40:10 -04:00
James Rodewig 3f2eb32afc
[DOCS] Sync EQL docs with `fields` param updates (#72008) 2021-04-21 09:13:27 -04:00
Jay Modi a7dbb31765
Add Fleet action results system data stream (#71667)
This commit adds support for system data streams and also the first use
of a system data stream with the fleet action results data stream. A
system data stream is one that is used to store system data that users
should not interact with directly. Elasticsearch will manage these data
streams. REST API access is available for external system data streams
so that other stack components can store system data within a system
data stream. System data streams will not use the system index read and
write threadpools.
2021-04-20 13:33:12 -06:00
Adam Locke 6dfd92c46f
[DOCS] Focus retrieving selected fields on fields parameter (#71506)
* [DOCS] Focus retrieving selected fields on fields parameter

* Incorporating changes from reviews

* Adding clarifications from review feedback

* Slight wording revisions.

* Clarify language around format parameter and move text out of callout.
2021-04-20 15:11:35 -04:00
Tim Brooks bfb85bcecb
Wait on index when polling global checkpoints (#71890)
Currently when the fleet global checkpoints API returns immediately if
the index is not ready or shards are not ready. This commit modifies the
API to wait on the index and primary shards active up until the timeout
period.

Related to #71449.
2021-04-20 13:04:42 -06:00
Tanguy Leroux f50b35719e
Revive documentation for experimental Searchable Snapshots APIs (#71927)
This commit revives the documentation of the "Clear Cache" and 
"Shard Stats" APIs of Searchable Snapshots that was removed 
in #62217. This is a partial revert of the commit b545c55 with 
some light wording changes.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-04-20 18:34:48 +02:00
James Rodewig 7a8db42e23
[DOC] Fix typos (#71869) (#71948)
Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>

Co-authored-by: Mimi Tsuruga <mimi.tsuruga@elastic.co>
2021-04-20 11:47:29 -04:00
Henning Andersen c57fbe8eaf
Frozen default cache size (#71844)
This commit adds a default cache size to frozen tier of the greater of
90% and total disk size minus 100 GB.
2021-04-20 17:37:39 +02:00
Henning Andersen 794869cfbb
Add separate flood stage limit for frozen (#71855)
Dedicated frozen nodes can survive less headroom than other data nodes.
This commits introduces a separate flood stage threshold for frozen as
well as an accompanying max_headroom setting that caps the amount of
free space necessary on frozen.

Relates #71844
2021-04-20 15:51:52 +02:00
Yannick Welsch 8c1938a524
Remove experimental status from frozen tier docs (#71923)
Removes the experimental status for the frozen tier / shared_cache searchable snapshots for the 7.13 release. 
Also adapts docs that URL repositories are now supported in 7.13 for searchable snapshots.
2021-04-20 15:37:56 +02:00