Commit Graph

368 Commits

Author SHA1 Message Date
Keith Massey d5b4584612
Adding more detail about ingest.geoip.downloader.endpoint (#91182) 2022-11-09 09:17:33 -06:00
Roberto Seldner 8e35a6a846
Update documentation with supported IANA numbers (#90531)
Based on this:
https://github.com/elastic/elasticsearch/blob/main/modules/ingest-common/src/main/java/org/elasticsearch/ingest/common/CommunityIdProcessor.java#L440-L451
2022-10-19 08:23:11 -05:00
Lee Hinman 4fe9fc488c
Deprecate 'remove_binary' default of false for ingest attachment processor (#90460)
This commit adds deprecation warning for when the `remove_binary`
setting is unset. In the future we want to change the default to `true`
(it is currently `false`), so this will let a user know they should be
explicit about setting this to ensure the behavior does not change in a
future (breaking) release.

Relates to #86014
2022-10-04 01:04:40 +10:30
Abdon Pijpelink 00d4953df5
[DOCS] Fixes broken example in pipeline tutorial (#89315) 2022-08-16 08:40:10 +02:00
István Zoltán Szabó 5372c51dfd
[DOCS] Fixes a link that breaks the docs build. (#88111) 2022-06-28 10:22:23 +02:00
Ryan Ernst eed8da3919
Move the ingest attachment processor to the default distribution (#87989)
The ingest attachment processor is currently available as a plugin. This
commit moves the processor to the default distribution so it is always
available.
2022-06-28 02:10:36 -04:00
Stuart Tettemer d42211c431
Ingest: IngestDocument requires non-null version (#87665)
Changes the type of the version parameter in `IngestDocument` from
`Long` to `long` and moves it to the third argument, so all required
values occur before nullable arguments.

The `IngestService` expects a non-null version for a document and will
throw an `NullPointerException` if one is not provided.

Related: #87309
2022-06-15 07:50:45 -05:00
Martijn van Groningen 7154608abf
Allow pipeline processor to ignore missing pipelines (#87354)
Add `ignore_missing_pipeline` option to `pipeline` processor. This
controls whether the `pipeline` processor should fail with an error if
no pipeline with a name specified in the `name` option exists.

This enhancement is useful to setup a pipeline infrastructure that
lazily adds extension points for overwrites. So that for specific
cluster setups custom pre-processing can be added at a later point in
time.

Relates to #87323
2022-06-07 07:02:18 -04:00
wallrik 10f53f8766
Clarify environments with strict firewalls and GEOIP (RE: #85637) (#86648) 2022-05-23 06:43:26 -06:00
Luca Belluccini 1c52081b1f
[DOC] Air gapped environments and GEOIP (#85637)
* [DOC] Air gapped environments and GEOIP

Closing https://github.com/elastic/elasticsearch/issues/85542

* Use variable name for Elasticsearch

Co-authored-by: Adam Locke <adam.locke@elastic.co>
2022-05-10 16:34:28 -04:00
Benjamin Trent 258d2b71e2
[ML] add roberta/bart docs (#85001)
adds roberta section to NLP tokenization documentation.
2022-03-17 12:14:57 -04:00
Benjamin Trent 45deac4c96
[ML] add windowing support for text_classification (#83989)
This commit adds initial windowing support for text_classification tasks.

Specifically, a user can now indicate a span (non-negative) indicating the tokenization windowing span when creating
sub-sequences.

Default value is span: -1 indicates that no windowing should take place.
2022-03-01 08:29:12 -05:00
James Rodewig d3d468e5f1
[DOCS] Update screenshots for ingest pipeline docs (#83845)
https://github.com/elastic/kibana/pull/101216 adds a new ECS mapper feature to the Ingest Pipelines UI. This updates the ES docs to cover the new feature.
2022-02-23 10:50:02 -05:00
Chris 3e72ffcac9
[DOCS] Change license abbreviation (#82266)
As far as I can see the correct abbreviation for the CC `Attribution-ShareAlike 4.0 International` License is `CC BY-SA 4.0` https://creativecommons.org/licenses/by-sa/4.0/
2022-01-13 09:38:42 -05:00
David Kyle 1473b09415
[ML] Add NLP inference configs to the inference processor docs (#82320) 2022-01-11 08:50:45 +00:00
James Rodewig f1004ee698
[DOCS] Fix xref for conditionally running ingest processor (#82001)
Closes #81966
2021-12-21 11:37:20 -05:00
Lisa Cawley 076343933f
[DOCS] Update link in inference processor (#81897) 2021-12-17 15:49:59 -08:00
Dan Hermann b1f5373e02
Correct docs on output_format option for date processor (#81557) 2021-12-17 06:07:03 -06:00
Lisa Cawley b18f5fd2c6
[DOCS] Fixes link to language identification example (#81347) 2021-12-03 17:21:04 -08:00
Jan Doberstein 73b3d8f639
Update execute-enrich-policy.asciidoc (#80750)
Changed the wording, as the execution of the policy does not trigger the delete. That delete is done periodical and can be configured with the `enrich.cleanup_period` 

https://www.elastic.co/guide/en/elasticsearch/reference/7.16/enrich-setup.html#ingest-enrich-settings
2021-11-16 11:57:04 +01:00
James Rodewig f56a0f4b66
[DOCS] Remove `testenv` annotations from doc snippet tests (#80023)
Removes `testenv` annotations and related code. These annotations originally let you skip x-pack snippet tests in the docs. However, that's no longer possible.

Relates to #79309, #31619
2021-11-05 18:38:50 -04:00
edh-oss 3c23a9e9cd
[DOCS] Remove `[testenv="gold+"]` attributes (#79309)
Changes:

* Removes several `[testenv="gold+"]` attributes from the docs. `gold+` is not a valid [subscription level](https://www.elastic.co/subscriptions) or testenv value.
* Moves two `[testenv="basic"]` attributes to the file header. This makes the `testenv` placement consistent and fixes the yml file generated from `docs/reference/snapshot-restore/register-repository.asciidoc`.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-10-27 16:32:30 -04:00
Michael Bischoff c30ab868ee
[DOCS] Document range enrich policy (#79607)
Adding docs for the range enrich policy

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-10-26 15:15:53 +02:00
Dan Hermann a23f58f809
[DOCS] `if_version` parameter for OCC on pipeline updates (#79640) 2021-10-25 08:25:26 -05:00
James Rodewig 58abbe941f
[DOCS] Fix cluster update settings refs (#79580)
The API is named 'cluster update settings,' not 'update cluster settings.'
2021-10-20 13:16:35 -04:00
Nikola Grcevski 055c770083
Deprecation of transient cluster settings (#78794)
This PR changes uses of transient cluster settings to
persistent cluster settings. 

The PR also deprecates the transient settings usage.

Relates to #49540
2021-10-15 13:00:52 -04:00
Martijn van Groningen 230e866842
Document a number of enrich node settings. (#78930)
Add a section in the docs that describe a number of node level settings
for the enrich processor.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-10-14 15:00:45 +02:00
Martijn van Groningen 04e5823a69
Remove default maxmind geoip databases from distribution (#78362)
* Adjusted integration tests to use geoip test fixture or to use test databases provided via config dirs (for qa module / docs).
* Kept the geolite2-databases dependency for most of the unit tests only.
* Made fallback_to_default_databases parameter on geoip processor a noop and emit deprecation warning upon using it.
* If no geoip databases are available yet to a node then the geoip processor factory returns a processor implementation that flags documents that databases are unavailable. This allows these documents to be reindex later with a pipeline. These documents will have a tag string array field, which contains a string _geoip_database_unavailable_{database_name} for each missing database in a pipeline.
* Added reload pipeline capabilities is IngestService, so that when databases are available again on a node then pipelines with geoip processor definition can be reloaded.

Relates to #68920
2021-10-13 14:52:18 +02:00
James Rodewig a763a86a0d
[DOCS] Update ingest node pipeline refs (#78770)
In https://github.com/elastic/kibana/pull/113783, we renamed Kibana's **Ingest Pipelines** feature to **Ingest Pipelines**. This updates screenshots and references for the feature. It also replaces a few remaining `ingest node pipeline` references.
2021-10-12 08:18:24 -04:00
edh-oss 62a471aefe
Update JSON parser and snippets (#77983)
Related to issue  #77823

This does the following:

- Updates several asciidoc files that contained code snippets with
  invalid JSON, most involving unnecessary trailing commas.

- Makes the switch from the Groovy JSON parser to the Jackson parser,
  pursuant to the general goal of eliminating Groovy dependence.

- Makes testing of JSON validity at build time more strict.

Note that this update still allows backslash escaping for any
character. Currently that matters because of the file
"docs/reference/ml/anomaly-detection/apis/get-datafeed-stats.asciidoc",
specifically this part:

    "attributes" : {
      "ml.machine_memory" :
        "$body.datafeeds.0.node.attributes.ml\.machine_memory",
      "ml.max_open_jobs" : "512"
    }

It's not clear to me what change, if any, is appropriate there. So,
I've left in the escaped period and configured the parser to ignore
it for the time being.
2021-09-20 11:08:26 +01:00
Dan Hermann 09004d30dc
[DOCS] ECS support for the grok processor (#77059) 2021-09-10 13:10:28 -05:00
Martijn van Groningen 1ae4f3c937
Add enrich node cache (#76800)
Introduce a LRU cache to avoid searches that occur frequently
from the enrich processor.

Relates to #48988
2021-09-03 09:33:44 +02:00
Francois-Clement Brossard ec11f9f931
Execute enrich policy wait_for_completion docfix (#77046) 2021-08-31 14:02:24 -04:00
Dan Hermann c4aad2965f
[DOCS] Map iteration support in ForEach processor (#76972) 2021-08-27 07:35:11 -05:00
James Rodewig c238296b7a
[DOCS] Add missing `timeout` param to create pipeline API docs (#76432) 2021-08-12 14:32:34 -04:00
Keith Massey 498684a696
Add support for _meta field to ingest pipelines (#75905)
We are adding a _meta field to many of our REST APIs so that users can attach whatever metadata they
want. The data in this field will not be used by Elasticsearch. This commit add the _meta field to ingest
pipelines.
2021-08-11 08:30:36 -05:00
Dan Hermann c81cf2f7fe
Configurable media_type for mustache template encoding on append processor (#76210) 2021-08-10 15:13:36 -05:00
James Rodewig c277110a14
[DOCS] Merge ingest APIs index to one page (#76264)
Adds some `discrete` tags to merge the ingest APIs index into a single
page.
2021-08-10 09:30:47 -04:00
James Rodewig fc0ac1923d
[DOCS] Correct spelling for geo terms (#76028)
Changes:
* Use "geopoint" when not referring to the literal field type
* Use "geoshape" when not referring to the literal field type or query type
* Use "GeoJSON" consistently
2021-08-03 09:55:48 -04:00
David Pilato ec588cb74a
[DOCS] Fix extracted date (#75918)
In the example, we have a date with year `2018` but the extracted date is `1998`.
2021-08-02 12:50:01 -04:00
James Rodewig eaeb7723f3
[DOCS] Update ingest pipeline screenshots for new EUI theme (#75744) 2021-07-27 13:36:12 -04:00
István Zoltán Szabó 6a4de77e11
[DOCS] Adds classification and regression links back to DFA docs. (#74930) 2021-07-08 16:37:16 +02:00
Felix Barnsteiner 0a8f725647
Support for wildcards and override option for dot_expander processor (#74601) 2021-07-08 14:39:27 +02:00
Felix Barnsteiner 404e9865b6
Json processor: add_to_root_conflict_strategy option (#74967) 2021-07-08 14:36:22 +02:00
Felix Barnsteiner 67fbc337ea
Json processor: allow duplicate keys (#74956) 2021-07-06 15:02:32 +02:00
István Zoltán Szabó 6c6e6874ff
[DOCS] Removes link to classification and regression. (#74926) 2021-07-05 16:28:14 +02:00
James Rodewig d4ed43c5a4
[DOCS] Remove deprecated `geo_shape` parameters (#74519)
* Removes docs and references for the following `geo_shape` mapping parameters:
  * `tree`
  * `tree_levels`
  * `strategy`
  * `distance_error_pct`
* Updates a related breaking change.

Relates to #70850
2021-06-29 08:52:05 -04:00
Przemko Robakowski 9706d676a1
Enable GeoIP downloader by default (#74323)
This change will enable geoip downloader by default again. It also brings back relevant documentation.
2021-06-28 09:04:49 +02:00
bellengao 42c0be4c67
Add template snippets support for KV ingest processor (#73758) 2021-06-22 16:32:11 -05:00
Yang Cheng 98a6ef9ee3
[DOCS] Add missing`xpack` role (#74331) 2021-06-21 10:01:35 -04:00