elasticsearch

Commit Graph

Author	SHA1	Message	Date
malpani	08de504b44	Support ignore_keywords flag for word delimiter graph token filter (#59563 ) This commit allows customizing the word delimiter token filters to skip processing tokens tagged as keyword through the `ignore_keywords` flag Lucene's WordDelimiterGraphFilter already exposes. Fix for #59491	2020-07-21 16:11:11 +01:00
James Rodewig	79c45a9099	[DOCS] Mark data stream stats API as stable (#59978 ) Removes experimental admon from data stream stats API. Relates to #59860.	2020-07-21 11:05:34 -04:00
Howard	b8e3ba783a	[DOCS] Fix missing punctuation in agg docs (#59822 )	2020-07-21 10:17:59 -04:00
Przemysław Witek	2a12dcf2e0	Rename binary_soft_classification evaluation to outlier_detection (#59951 )	2020-07-21 14:27:57 +02:00
Tim Brooks	08506de861	Add indexing pressure documentation (#59456 ) This commit adds documentation about the new indexing pressure memory limit setting and exposure of this metrics in node stats.	2020-07-20 19:35:26 -06:00
Lisa Cawley	fb0157460f	[DOCS] Changes level offset of anomaly detection pages (#59911 )	2020-07-20 16:33:54 -07:00
Julie Tibshirani	6b21a4a87a	Add 'point' to the top-level field type docs. (#59731 ) Before it was missing from the list. This PR also renames the 'geo data types' section to 'spatial data types' and consolidates the geo and cartesian types into that section.	2020-07-20 16:29:32 -07:00
Lisa Cawley	823c337e76	[DOCS] Changes level offset for anomaly detection APIs (#59920 )	2020-07-20 12:38:09 -07:00
Lisa Cawley	42be287b57	[DOCS] Changes level offset in data frame analytics APIs (#59919 )	2020-07-20 12:11:47 -07:00
James Rodewig	2c5d6e9c95	[DOCS] Reformat agg snippets to use two-space indents (#59912 )	2020-07-20 15:08:04 -04:00
James Rodewig	8a57800f1b	[DOCS] Add performance warning for scripts (#59890 )	2020-07-20 14:04:35 -04:00
Armin Braun	9b155d9201	Fix Snapshot Status API Docs Test (#59902 ) The clock resolution for this API is our default 200ms. It is unlikely but possible that a shard snapshot starts and ends on separate clock ticks and that breaks the test. Just allowing any value here seems fine to me (seems we can't match for integer specifically).	2020-07-20 18:26:05 +02:00
Nik Everett	f87b31f973	Document supported scenarios for CCS (#58120 ) Documents the supported scenarios for CCS. Co-authored-by: Adam Locke <adam.locke@elastic.co>	2020-07-20 10:03:08 -04:00
Igor Motov	6bfde550f9	Add hard_bounds documentation (#59809 ) Fixes #59774	2020-07-20 09:54:02 -04:00
David Turner	7bb748da8c	Remove sporadic min/max usage estimates from stats (#59755 ) Today `GET _nodes/stats/fs` includes `{least,most}_usage_estimate` fields for some nodes. These fields have rather strange semantics. They are only reported on the elected master and on nodes that have been the elected master since they were last restarted; when a node stops being the elected master these stats remain in place but we stop updating them so they may become arbitrarily stale. This means that these statistics are pretty meaningless and impossible to use correctly. Even if they were kept up to date they're never reported for data-only nodes anyway, despite the fact that data nodes are the ones where we care most about disk usage. The information needed to compute the path with the least/most available space is already provided in the rest the stats output, so we can treat the inclusion of these stats as a bug and fix it by simply removing them in this commit. Since these stats were always optional and mostly omitted (for opaque reasons) this is not considered a breaking change.	2020-07-20 14:48:53 +01:00
James Rodewig	1d8143deae	[DOCS] Fix `requests_per_second` reindex param (#59871 ) Corrects the `requests_per_second` query parameter used in the reindex, delete by query, and update by query API docs. The parameter defaults to `-1` (no throttle). `0` is not an allowed value.	2020-07-20 09:42:01 -04:00
James Rodewig	cb6d1c00f0	[DOCS] Document data stream stats API (#59435 )	2020-07-20 09:33:01 -04:00
Rui Almeida	2c450214ac	[DOCS] Fix keyword marker docs (#59834 )	2020-07-20 08:54:55 -04:00
James Rodewig	861892add4	[DOCS] EQL: Remove collapsible sections from EQL search docs (#59819 )	2020-07-20 08:50:19 -04:00
James Rodewig	8170cb9cf0	[DOCS] Remove collapsible examples (#59820 ) Snippets are now visible without additional clicks.	2020-07-20 08:42:56 -04:00
James Rodewig	6a02528e91	[DOCS] Fix erroneous data stream ref (#59805 ) Removes an erroneous data stream reference added in #58513. While technically possible, we don't encourage using date math to name data streams.	2020-07-17 13:43:43 -04:00
Nik Everett	27efb5f3b8	Clean up a few of vwh's rough edges (#59341 ) This cleans up a few rough edged in the `variable_width_histogram`, mostly found by @wwang500: 1. Setting its tuning parameters in an unexpected order could cause the request to fail. 2. We checked that the maximum number of buckets was both less than 50000 and MAX_BUCKETS. This drops the 50000. 3. Fixes a divide by 0 that can occur of the `shard_size` is 1. 4. Fixes a divide by 0 that can occur if the `shard_size * 3` overflows a signed int. 5. Requires `shard_size * 3 / 4` to be at least `buckets`. If it is less than `buckets` we will very consistently return fewer buckets than requested. For the most part we expect folks to leave it at the default. If they change it, we expect it to be much bigger than `buckets`. 6. Allocate a smaller `mergeMap` in when initially bucketing requests that don't use the entire `shard_size * 3 / 4`. Its just a waste. 7. Default `shard_size` to `10 * buckets` rather than `100`. It looks like that was our intention the whole time. And it feels like it'd keep the algorithm humming along more smoothly. 8. Default the `initial_buffer` to `min(10 * shard_size, 50000)` like we've documented it rather than `5000`. Like the point above, this feels like the right thing to do to keep the algorithm happy. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-07-17 13:39:28 -04:00
Adam Locke	c143bb56cb	[DOCS] Updating snapshot/restore pages to align with API changes (#59730 ) * Updating snapshot/restore pages to align with API changes. * Fixing texts in delete snapshot page. * Removing duplicate code sample and making editorial changes. * Change "deleted" to "delete" * Incorporating review feedback and making minor editorial changes. * Remove titleabbrev * Add paragraph break * Remove titleabbrev from restore page * Remove titleabbrev from create page * Change "Create" to lowercase * Change API names to lowercase * Remove extraneous delimiters * Change "Delete" to lowercase * Single-sourcing warning and clarifying warning text.	2020-07-17 13:08:13 -04:00
Shahzad	24e5da7851	Update regex file for es user agent node processor (#59697 )	2020-07-17 16:54:34 +02:00
Armin Braun	3cef12368c	Fix Snapshot Status API Docs Test (#59775 ) We can't just assume a fixed number for the overall file count. Depending on how the merging/flushing works out we won't always have 4 files for the index across all versions, systems etc. Also, we could have x-pack concurrently create some system indices which could mess up the total numbers here. Fixed by only snapshotting a single index+shard in the snapshot that we get the status for and verifying consistency instead of equality for total file counts. Closes #59767	2020-07-17 16:34:29 +02:00
Rory Hunter	4db094c008	Remove dangling index auto import functionality (#59698 ) Closes #48366. Remove all traces of automatically importing dangling indices. This functionality is deprecated from 7.9.0.	2020-07-17 15:17:58 +01:00
James Rodewig	aa3ddfeefb	[DOCS] Move highlighting docs to separate page (#59768 ) Moves the highlighting docs from the deprecated 'Request Body Search' chapter to the new subpage of the 'Run a search chapter' section. No substantive changes were made to the content.	2020-07-17 10:15:20 -04:00
Benjamin Trent	f72b893fd3	Adding new `require_alias` option to indexing requests (#58917 ) This commit adds the `require_alias` flag to requests that create new documents. This flag, when `true` prevents the request from automatically creating an index. Instead, the destination of the request MUST be an alias. When the flag is not set, or `false`, the behavior defaults to the `action.auto_create_index` settings. This is useful when an alias is required instead of a concrete index. closes https://github.com/elastic/elasticsearch/issues/55267	2020-07-17 08:45:46 -04:00
James Rodewig	8b6e310070	[DOCS] Reformat `predicate_token_filter` tokenfilter (#57705 )	2020-07-16 13:07:19 -04:00
István Zoltán Szabó	edccf14478	[DOCS] Adds security privilege info to inference bucket aggregation (#59604 )	2020-07-16 18:02:17 +02:00
Tim Brooks	e05858132d	Update thread pool docs about WRITE queue size (#59643 ) This commit updates the thread pool documentation to reflect the change in the WRITE thread pool default queue size.	2020-07-16 09:32:51 -06:00
Benjamin Trent	b551f75ec3	[ML] add new `custom` field to trained model processors (#59542 ) This commit adds the new configurable field `custom`. `custom` indicates if the preprocessor was submitted by a user or automatically created by the analytics job. Eventually, this field will be used in calculating feature importance. When `custom` is true, the feature importance for the processed fields is calculated. When `false` the current behavior is the same (we calculate the importance for the originating field/feature). This also adds new required methods to the preprocessor interface. If users are to supply their own preprocessors in the analytics job configuration, we need to know the input and output field names.	2020-07-16 09:35:56 -04:00
Patrick Jiang(白泽)	647a413d9b	SQL: Implement DATE_PARSE function for parsing strings into DATE values (#57391 ) Implement DATE_PARSE(<date_str>, <pattern_str>) function which allows to parse a date string according to the specified pattern into a date object. The patterns allowed are those of java.time.format.DateTimeFormatter. Closes #54962 Co-authored-by: Marios Trivyzas <matriv@users.noreply.github.com>	2020-07-16 15:33:46 +02:00
Dan Hermann	09407045fd	[DOCS] write_index_only option for put mapping (#59610 )	2020-07-16 07:46:44 -05:00
István Zoltán Szabó	907a7d6b29	[DOCS] Sorts agg and grouping names alphabetically in PUT Transforms API docs. (#59688 )	2020-07-16 12:43:45 +02:00
James Rodewig	5be36b41d4	[DOCS] EQL: Update EQL search response format (#59554 )	2020-07-15 16:52:32 -04:00
Adam Locke	556f19d55d	[DOCS] Adding get snapshot status API docs (#59355 ) * Adding get snapshot status API docs. * Adding more fields and a link to the new page. * Adding missing spaces in TESTRESPONSES * Adding more parameters and making some edits. * Marking snapshot as optional * Marking repository as optional * Add data type for stats * Add data type for shard_stats * Incorporating review feedback. * Lots of review feedback incorporated. * Fixing tests to unbreak CI builds. * Changing indices to index.	2020-07-15 16:28:43 -04:00
Luca Cavanna	7abf8c7ea5	Add missing update by query breaking change entry (#59586 ) This should have been added with #59507	2020-07-15 21:06:47 +02:00
James Rodewig	d250f94374	[DOCS] Fix syntax and wording in EQL docs (#59623 )	2020-07-15 14:27:02 -04:00
Adam Locke	2f47a03b78	[DOCS] Update similarity.asciidoc (#59400 ) (#59646 ) Community contribution to fix linking issues in the Similarity module docs. Co-authored-by: Xin Yan <SHU_Yanx@hotmail.com>	2020-07-15 14:19:35 -04:00
James Rodewig	d27c286e9b	[DOCS] Add `write_index_only` param to ds mapping tutorials (#59618 )	2020-07-15 12:20:57 -04:00
Przemysław Witek	dfbb47dcaa	Add a "verbose" option to the data frame analytics stats endpoint (#59589 )	2020-07-15 15:59:56 +02:00
James Rodewig	adc520b7c2	[DOCS] Note that EQL timestamp field can also be date_nanos	2020-07-15 09:53:43 -04:00
James Rodewig	e22088d504	[DOCS] Update ds overview for optional `@timestamp` mapping (#59558 )	2020-07-15 09:12:34 -04:00
James Rodewig	5f01ffddec	[DOCS] Add example of ds index template with date_nanos mapping (#59535 )	2020-07-14 16:39:29 -04:00
Costin Leau	bccfbcd81f	EQL: Improve retrieval of results (#59552 ) Instead of retrieving an entire SearchHit, get just a reference and postpone the document retrieval when assembling the final results. Remove sort information from results to make them consistent. Move TumblingWindow under the sequence package. Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-07-14 23:26:25 +03:00
Julie Tibshirani	0e15cc588d	Expand docs for component template merging. (#59466 ) This change clarifies the order in which components are merged. It also adds information on mapping merging, now that this has been implemented.	2020-07-14 11:07:26 -07:00
James Rodewig	0f145ace6f	[DOCS] Simplify index template snippets for data streams (#59533 ) Removes the `@timestamp` field mapping from several data stream index template snippets. With #59317, the `@timestamp` field defaults to a `date` field data type for data streams.	2020-07-14 12:08:54 -04:00
James Rodewig	1e8970985d	[DOCS] Add data streams to index template API docs (#59462 )	2020-07-14 11:49:24 -04:00
Andrei Dan	04b46bff8b	Fix sentence in data stream docs (#59518 )	2020-07-14 14:00:00 +01:00

1 2 3 4 5 ...

7451 Commits