elasticsearch

Commit Graph

Author	SHA1	Message	Date
Lee Hinman	29c05544ec	Fix template name in mapping composition yml test (#58788 ) The warning was copied from elsewhere and just needed to use the correct template and index name.	2020-06-30 17:03:47 -06:00
Julie Tibshirani	416cb6b31e	Adjust the skip version for template mapping merging REST test.	2020-06-30 13:22:16 -07:00
Martijn van Groningen	906aed4a88	Add data stream support to put mapping and update index settings APIs. (#58231 ) Change update index setting and put mapping api to execute on all backing indices if data stream is targeted. Relates #53100	2020-06-30 17:23:27 +02:00
Lee Hinman	3b68df2355	Add default composable templates for new indexing strategy (#57629 ) This commit adds the component and composable templates, as well as ILM policies, for the new default indexing strategy. It installs: - logs-default-mappings (component) - logs-default-settings (component) - logs-default-policy (ilm policy) - logs-default-template (composable template) - metrics-default-mappings (component) - metrics-default-settings (component) - metrics-default-policy (ilm policy) - metrics-default-template (composable template) These templates and policies are managed by a new x-pack module, `stack`, and can be disabled by setting `stack.templates.enabled` to `false`. These ensure that patterns for the `logs--` and `metrics--` indices are set up to create data streams with the proper mappings and settings. This also makes changes to the `IndexTemplateRegistry` to support installing component and composable templates (previously it supported only legacy templates). Resolves #56709	2020-06-30 09:19:37 -06:00
Yannick Welsch	e4df92815e	Adapt BWC after backport of (#58094 )	2020-06-30 14:09:03 +02:00
Yannick Welsch	5e345e115b	Add index block api (#58094 ) Adds an API for putting an index block in place, which also ensures for write blocks that, once successfully returning to the user, all shards of the index are properly accounting for the block, for example that all in-flight writes to an index have been completed after adding the write block. This API allows coordinating more complex workflows, where it is crucial that an index is no longer receiving writes after the API completes, useful for example when marking an index as read-only during an upgrade in order to reindex its documents.	2020-06-30 09:33:15 +02:00
Julie Tibshirani	676893a263	Merge mappings for composable index templates (#58521 ) This PR implements recursive mapping merging for composable index templates. When creating an index, we perform the following: * Add each component template mapping in order, merging each one in after the last. * Merge in the index template mappings (if present). * Merge in the mappings on the index request itself (if present). Some principles: * All 'structural' changes are disallowed (but everything else is fine). An object mapper can never be changed between `type: object` and `type: nested`. A field mapper can never be changed to an object mapper, and vice versa. * Generally, each section is merged recursively. This includes `object` mappings, as well as root options like `dynamic_templates` and `meta`. Once we reach 'leaf components' like field definitions, they always overwrite an existing one instead of being merged. Relates to #53101.	2020-06-29 15:00:40 -07:00
Enrico Zimuel	9a7a28958a	Added PERL reserved words in REST keywords (#58535 )	2020-06-26 12:11:22 +02:00
Dan Hermann	edc15d7c90	Add data stream support to open index API (#58487 )	2020-06-25 09:13:53 -05:00
Dan Hermann	603c5f7a48	Data stream support for get field mappings API (#58488 ) Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-06-25 08:08:51 -05:00
Dan Hermann	991f635c7f	Data stream support for search shards API (#58486 )	2020-06-25 07:57:56 -05:00
Rory Hunter	48c9a0776b	Update rest-api-spec keyword list Follow-up to `35aecf4c9a`. Somehow I missed the fact that there's an ILM API named `retry`, which is a keyword in Ruby. I've removed it from the keywords list.	2020-06-25 09:53:16 +01:00
Rory Hunter	35aecf4c9a	Validate that REST API names do not contain keywords (#58452 ) If an API name (or components of a name) overlaps with a reserved word in the programming language for an ES client, then it's possible that the code that is generated from the API will not compile. This PR adds validation to check for such overlaps.	2020-06-25 09:47:05 +01:00
Martijn van Groningen	0166e0e5a3	Re-enable data streams yaml tests in bwc mode (#58403 )	2020-06-24 14:55:57 +02:00
James Dorfman	e99d287fbb	Add Variable Width Histogram Aggregation (#42035 ) Implements a new histogram aggregation called `variable_width_histogram` which dynamically determines bucket intervals based on document groupings. These groups are determined by running a one-pass clustering algorithm on each shard and then reducing each shard's clusters using an agglomerative clustering algorithm. This PR addresses #9572. The shard-level clustering is done in one pass to minimize memory overhead. The algorithm was lightly inspired by [this paper](https://ieeexplore.ieee.org/abstract/document/1198387). It fetches a small number of documents to sample the data and determine initial clusters. Subsequent documents are then placed into one of these clusters, or a new one if they are an outlier. This algorithm is described in more details in the aggregation's docs. At reduce time, a [hierarchical agglomerative clustering](https://en.wikipedia.org/wiki/Hierarchical_clustering) algorithm inspired by [this paper](https://arxiv.org/abs/1802.00304) continually merges the closest buckets from all shards (based on their centroids) until the target number of buckets is reached. The final values produced by this aggregation are approximate. Each bucket's min value is used as its key in the histogram. Furthermore, buckets are merged based on their centroids and not their bounds. So it is possible that adjacent buckets will overlap after reduction. Because each bucket's key is its min, this overlap is not shown in the final histogram. However, when such overlap occurs, we set the key of the bucket with the larger centroid to the midpoint between its minimum and the smaller bucket’s maximum: `min[large] = (min[large] + max[small]) / 2`. This heuristic is expected to increases the accuracy of the clustering. Nodes are unable to share centroids during the shard-level clustering phase. In the future, resolving https://github.com/elastic/elasticsearch/issues/50863 would let us solve this issue. It doesn’t make sense for this aggregation to support the `min_doc_count` parameter, since clusters are determined dynamically. The `order` parameter is not supported here to keep this large PR from becoming too complex.	2020-06-23 09:26:54 -04:00
Martijn van Groningen	085ba99fba	Keep track of timestamp_field mapping as part of a data stream (#58096 ) Relates to #53100 * use mapping source direcly instead of using mapper service to extract the relevant mapping details * moved assertion to TimestampField class and added helper method for tests * Improved logic that inserts timestamp field mapping into an mapping. If the timestamp field path consisted out of object fields and if the final mapping did not contain the parent field then an error occurred, because the prior logic assumed that the object field existed.	2020-06-22 12:01:01 +02:00
Jim Ferenczi	b0f4024879	Adapt bwc version after backport of #58299 (#58300 ) This commit adapts the bwc version in preparation of the backport to 7.x. The bwc tests are disabled in order to allow the merge of #58299. Relates #58299	2020-06-18 10:22:54 +02:00
Rory Hunter	1f6c953194	Rename dangling index APIs (#58266 ) The dangling_indices.import API name could cause issues in the client libs because import is a reserved word in many languages. Rename the API to avoid this, and rename the other APIs for consistency. Related to #48366.	2020-06-18 08:57:39 +01:00
Jim Ferenczi	90c9b95ca0	Allow index filtering in field capabilities API (#57276 ) * Add index filtering in field capabilities API This change allows to use an `index_filter` in the field capabilities API. Indices are filtered from the response if the provided query rewrites to `match_none` on every shard: ```` GET metrics-* { "index_filter": { "bool": { "must": [ "range": { "@timestamp": { "gt": "2019" } } } } } ```` The filtering is done on a best-effort basis, it uses the can match phase to rewrite queries to `match_none` instead of fully executing the request. The first shard that can match the filter is used to create the field capabilities response for the entire index. Closes #56195	2020-06-17 22:53:53 +02:00
Rory Hunter	ebe8951879	Implement dangling indices API (#50920 ) Part of #48366. Implement an API for listing, importing and deleting dangling indices. Co-authored-by: David Turner <david.turner@elastic.co>	2020-06-16 15:19:17 +01:00
Dan Hermann	cce279bbb3	Prohibit clone, shrink, and split on a data stream's write index (#58104 )	2020-06-16 08:37:48 -05:00
Nik Everett	7c7fe0152d	Save memory when auto_date_histogram is not on top (#57304 ) This builds an `auto_date_histogram` aggregator that natively aggregates from many buckets and uses it when the `auto_date_histogram` used to use `asMultiBucketAggregator` which should save a significant amount of memory in those cases. In particular, this happens when `auto_date_histogram` is a sub-aggregator of a multi-bucketing aggregator like `terms` or `histogram` or `filters`. For the most part we preserve the original implementation when `auto_date_histogram` only collects from a single bucket. It isn't possible to "just port the aggregator" without taking a pretty significant performance hit because we used to rewrite all of the buckets every time we switched to a coarser and coarser rounding configuration. Without some major surgery to how to delay sub-aggs we'd end up rewriting the delay list zillions of time if there are many buckets. The multi-bucket version of the aggregator has a "budget" of "wasted" buckets and only rewrites all of the buckets when we exceed that budget. Now that we don't rebucket every time we increase the rounding we can no longer get an accurate count of the number of buckets! So instead the aggregator uses an estimate of the number of buckets to trigger switching to a coarser rounding. This estimate is likely to be terrible when buckets are far apart compared to the rounding. So it also uses the difference between the first and last bucket to trigger switching to a coarser rounding. Which covers for the shortcomings of the bucket estimation technique pretty well. It also causes the aggregator to emit fewer buckets in cases where they'd be reduced together on the coordinating node. This is wonderful! But probably fairly rare. All of that does buy us some speed improvements when the aggregator is a child of multi-bucket aggregator: Without metrics or time zone: 25% faster With metrics: 15% faster With time zone: 22% faster Relates to #56487	2020-06-15 14:33:31 -04:00
Dan Hermann	e515adb07e	Fix REST test for resolve index API (#58043 )	2020-06-12 13:14:58 -05:00
Dan Hermann	f9f39d75fa	Mute failing REST tests with correct syntax (#58048 )	2020-06-12 09:28:00 -05:00
Dan Hermann	780603d9f7	Mute failing REST tests	2020-06-12 08:54:12 -05:00
Dan Hermann	9724fa9dc8	Resolve index API (#57626 )	2020-06-12 06:25:16 -05:00
Martijn van Groningen	eb6f46a342	Enforce valid field mapping exists for timestamp_field in templates. (#57741 ) Relates to #53100	2020-06-12 13:22:20 +02:00
Martijn van Groningen	01b70b4068	Prohibit append-only writes targeting backing indices directly. (#57788 ) Append-only writes can only target the corresponding data stream. Relates to #53100	2020-06-11 11:29:27 +02:00
Russ Cam	a85f2bede8	Mark Component and Index template APIs as experimental (#57910 ) This commit marks the Component Template and Index Template APIs as experimental.	2020-06-10 14:06:32 +10:00
Lee Hinman	c688eb69c7	Disallow merging existing mapping field definitions in templates (#57701 ) * Disallow merging existing mapping field definitions in templates This commit changes the merge strategy introduced in #55607 and #55982. Instead of overwriting these fields, we now prevent them from being merged with an exception when a user attempts to overwrite a field. As part of this, a more robust validation has been added. The existing validation checked whether templates (composable and component) were valid on their own, this new validation now checks that the composite template (mappings/settings/aliases) is valid. This means that when a composable template is added or updated, we confirm that it is valid with its component pieces. When a component template is updated we ensure that all composable templates that make use of the component template continue to be valid before allowing the component template to be updated. This change also necessitated changes in the tests, however, I have left tests that exercise mapping merging with nested object fields as `@AwaitsFix`, as we intend to change the behavior soon to allow merging in a recursive-with-replacement fashion (see: #57393). I have added tests that check the new disallowing behavior in the meantime. * Use functional instead of imperative prefix collection * Use IndexService.withTempIndexService * Rename tests * Fix tests Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-06-08 08:57:13 -06:00
Dan Hermann	904bdae9ff	Change default backing index naming scheme (#57721 )	2020-06-08 08:39:55 -05:00
Dan Hermann	c4334ee074	Prohibit closing the write index for a data stream (#57692 )	2020-06-05 10:00:13 -05:00
Nik Everett	1c7bd29f4c	update skip after backport of #57397 (#57694 )	2020-06-04 15:37:53 -04:00
Nik Everett	4b9e378d4a	Bump skip before backport	2020-06-04 12:16:20 -04:00
Nik Everett	69cd4435b2	Merge remaining sig_terms into terms (#57397 ) Merges the remaining implementation of `significant_terms` into `terms` so that we can more easilly make them work properly without `asMultiBucketAggregator` which should save memory and speed them up. Relates #56487	2020-06-04 11:22:03 -04:00
Russ Cam	f77005a0e2	Update snapshot.delete.json to make snapshot a list (#57326 ) Relates: elastic/elasticsearch#55474 This commit updates the snapshot.delete.json REST API spec to make snapshot a list type, now that it can accept a list of comma-separated snapshot names	2020-06-03 09:48:51 +10:00
Nik Everett	474a3fc49f	Update skip after backport of #57438 (#57550 )	2020-06-02 16:22:03 -04:00
Nik Everett	b072f5f002	Fix an optimization in terms agg (#57438 ) When the `terms` agg runs against strings and uses global ordinals it has an optimization when it collects segments that only ever have a single value for the particular string. This is very common. But I broke it in #57241. This fixes that optimization and adds `debug` information that you can use to see how often we collect segments of each type. And adds a test to make sure that I don't break the optimization again. We also had a specialiation for when there isn't a filter on the terms to aggregate. I had removed that specialization in #57241 which resulted in some slow down as well. This adds it back but in a more clear way. And, hopefully, a way that is marginally faster when there is a filter. Closes #57407	2020-06-02 13:57:27 -04:00
Nik Everett	27bff25cf8	Update skip after backport of #57277 (#57379 )	2020-05-29 16:20:07 -04:00
Nik Everett	460b204f8e	Save memory when histogram agg is not on top (#57277 ) This saves some memory when the `histogram` aggregation is not a top level aggregation by dropping `asMultiBucketAggregator` in favor of natively implementing multi-bucket storage in the aggregator. For the most part this just uses the `LongKeyedBucketOrds` that we built the first time we did this.	2020-05-29 09:54:47 -04:00
Nik Everett	d0a253db5b	Update skip after backport of #57241 (#57316 )	2020-05-29 08:03:34 -04:00
Martijn van Groningen	9d07229879	Change cluster info actions to be able to resolve data streams. (#56878 ) With this change the following APIs will be able to resolve data streams: get index, get mappings and ilm explain APIs. Relates to #53100	2020-05-29 11:04:55 +02:00
Russ Cam	0b041cccd8	Deprecate local param in get_mapping.json (#57265 ) Relates: elastic/elasticsearch#55014 This commit deprecates the local param in get_mapping.json. This parameter is a no-op and field mappings are always retrieved locally.	2020-05-29 12:24:44 +10:00
Nik Everett	29e9e79656	Update skip before backport I accidentally didn't put the customary "skip the last version" on #57241 and the PR tests didn't catch it. This adds it.	2020-05-28 16:01:37 -04:00
Martijn van Groningen	9f6bc6856b	Re-able data stream bwc tests (#57293 ) after merging #57275	2020-05-28 21:36:03 +02:00
Nik Everett	974d236fbc	Make global ords terms simpler to understand (#57241 ) When the `terms` enum operates on non-numeric data it can collect it via global ordinals. It actually has two separate collection strategies for, one "dense" and one "remapping". Each of those strategies has two "iteration" strategies that it uses to build buckets, depending on whether or not we need buckets with `0` docs in them. Previously this was done with several `null` checks and never really explained. This change replaces those checks with two `CollectionStrategy` classes which have good stuff like documentation.	2020-05-28 15:29:31 -04:00
Christoph Büscher	3d4f9fedaf	Check for negative "from" values in search request body (#54953 ) Today we already disallow negative values for the "from" parameter in the search API when it is set as a request parameter and setting it on the SearchSourceBuilder, but it is still parsed without complaint from a search body, leading to differing exceptions later. This PR changes this behavior to be the same regardless of setting the value directly, as url parameter or in the search body. While we silently accepted "-1" as meaning "unset" and used the default value of 0 so far, any negative from-value is now disallowed. Closes #54897	2020-05-28 16:25:19 +02:00
Martijn van Groningen	f8b090b641	Ensure template exists when creating data stream (#56888 ) Limit the creation of data streams only for namespaces that have a composable template with a data stream definition. This way we ensure that mappings/settings have been specified and will be used at data stream creation and data stream rollover. Also remove `timestamp_field` parameter from create data stream request and let the create data stream api resolve the timestamp field from the data stream definition snippet inside a composable template. Relates to #53100	2020-05-28 13:11:15 +02:00
Dan Hermann	7a67395807	Limit _cat/indices test to versions with fix (#57244 )	2020-05-27 16:06:33 -05:00
Nik Everett	69661252fc	Update skip after backport of #56789 (#57238 )	2020-05-27 16:30:31 -04:00
Lee Hinman	4dc32611fc	Rename template V2 classes to ComposableTemplate (#57183 ) This PR changes the name of the Index Template V2 classes to "Composable Templates", it also ensures there are no mentions of "V2" in the documentation or error/warning messages. V1 templates are referred to as "legacy" templates. Resolves #56609	2020-05-27 09:32:10 -06:00
Nik Everett	9aaab6efdd	Save memory on numeric sig terms when not top (#56789 ) This saves memory when running numeric significant terms which are not at the top level by merging its collection into numeric terms and relying on the optimization that we made in #55873.	2020-05-27 10:53:09 -04:00
Russ Cam	38a17f299f	Update track_total_hits to union type (#51846 ) * Update track_total_hits to union type This commit updates track_total_hits parameter type to a union of boolean and number, to reflect the possible values that can be passed. * Update rest-api-spec/src/main/resources/rest-api-spec/api/search.json Co-Authored-By: Karel Minarik <karel.minarik@gmail.com> * Update rest-api-spec/src/main/resources/rest-api-spec/api/search.json Co-authored-by: Karel Minarik <karel.minarik@gmail.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-05-27 12:25:45 +10:00
James Rodewig	eae4a1c953	[DOCS] Add delete snapshot repo API docs (#57043 ) Changes: * Adds API reference docs for the delete snapshot repo API. * Corrects an error in the delete snapshot repo API spec. Comma-separated repository names are not supported. * Relocates the existing delete snapshot repo API example docs.	2020-05-21 13:59:48 -04:00
Nik Everett	7790e815fe	Update skip after backport of #56921 (#56974 )	2020-05-20 11:20:20 -04:00
Dan Hermann	4eb9d18b72	Handle exceptions when building _cat/indices response (#56993 )	2020-05-20 09:44:40 -05:00
Nik Everett	bea2341c9e	Save memory when date_histogram is not on top (#56921 ) When `date_histogram` is a sub-aggregator it used to allocate a bunch of objects for every one of it's parent's buckets. This uses the data structures that we built in #55873 rework the `date_histogram` aggregator instead of all of the allocation. Part of #56487	2020-05-19 13:48:25 -04:00
Ioannis Kakavas	127646c496	Adjust version mute for reload secure settings (#56938 ) We can safely run the reload_secure_settings tests after 7.7.0 , the relevant changes have long been backported there	2020-05-19 18:13:44 +03:00
Tomas Della Vedova	f5360fc984	[DOCS] Fix component template API link in JSON specs (#56884 )	2020-05-19 09:59:06 -04:00
Ioannis Kakavas	9ae9bdc9a3	Adjust reload keystore test to pass in FIPS (#56889 ) In KeystoreWrapper class we determine if the error to decrypt a given keystore is caused by a wrong password based on the exception that the SunJCE implementation of AES is throwing(AEADBadTagException). Other implementations from other Security Providers fail with a different exception and as such we cannot differentiate between a corrupted file and a wrong password in a foolproof way. As in other tests such as in KeyStoreWrapperTests#testDecryptKeyStoreWithWrongPassword we handle this by matching both possible exception messages.	2020-05-19 15:25:45 +03:00
Lee Hinman	d3ccada06f	Add template simulation API for simulating template composition (#56842 ) This adds an API for simulating template composition with or without an index template. It looks like: ``` POST /_index_template/_simulate/my-template ``` To simulate a template named `my-template` that already exists, or, to simulate a template that does not already exist: ``` POST /_index_template/_simulate { "index_patterns": ["my-index"] "composed_of": ["ct1", "ct2"], } ``` This is related to #55686, which adds an API to simulate composition based on an index name (hence the `_simulate_index` vs `_simulate`). This commit also adds reference documentation for both simulation APIs. Relates to #53101 Resolves #56390 Resolves #56255	2020-05-18 15:11:42 -06:00
Dan Hermann	a62483f7b5	Rename endpoint from plural "_data_streams" to singular "_data_stream" (#56762 )	2020-05-15 08:23:43 -05:00
Ryan Ernst	c0ee68b0a0	Move publishing configuration to a separate plugin (#56727 ) This is another part of the breakup of the massive BuildPlugin. This PR moves the code for configuring publications to a separate plugin. Most of the time these publications are jar files, but this also supports the zip publication we have for integ tests.	2020-05-14 18:56:59 -07:00
Lee Hinman	cad030d8d7	Don't allow invalid template combinations (#56397 ) This commit removes the ability to put V2 index templates that reference missing component templates. It also prevents removing component templates that are being referenced by an existing V2 index template. Relates to #53101 Resolves #56314	2020-05-14 15:33:35 -06:00
Nik Everett	f433fd472c	Update skip after backport of #56208 (#56719 )	2020-05-13 17:36:31 -04:00
Nik Everett	4a8d93f55b	Add list of defered aggregations to the profiler (#56208 ) This adds a few things to the `breakdown` of the profiler: * `histogram` aggregations now contain `total_buckets` which is the count of buckets that they collected. This could be useful when debugging a histogram inside of another bucketing agg that is fairly selective. * All bucketing aggs that can delay their sub-aggregations will now add a list of delayed sub-aggregations. This is useful because we sometimes have fairly involved logic around which sub-aggregations get delayed and this will save you from having to guess. * Aggregtations wrapped in the `MultiBucketAggregatorWrapper` can't accurately add anything to the breakdown. Instead they the wrapper adds a marker entry `"multi_bucket_aggregator_wrapper": true` so we can be quickly pick out such aggregations when debugging. It also fixes a bug where `_count` breakdown entries were contributing to the overall `time_in_nanos`. They didn't add a large amount of time so it is unlikely that this caused a big problem, but I was there. To support the arbitrary breakdown data this reworks the profiler so that the `breakdown` can contain any data that is supported by `StreamOutput#writeGenericValue(Object)` and `XContentBuilder#value(Object)`.	2020-05-13 08:30:38 -04:00
Martijn van Groningen	e9cc3de173	Fix allowed warning in data stream rest test. (#56630 )	2020-05-12 21:02:37 +02:00
Jake Landis	525522e187	json spec: allow null for documentation url (#55749 ) This commit allows the JSON schema's documentation.url property to have a null value. This can useful for cases where a feature is under development, and does not have documentation published yet. This commit also adds a documentation.url for two ml resources.	2020-05-12 12:51:24 -05:00
Martijn van Groningen	c4082384db	Enable bwc tests after backporting index templates v2 data stream integration (#56615 ) Relates to #55377	2020-05-12 18:10:20 +02:00
James Rodewig	3ebbf895f2	[DOCS] Add clean up snapshot repository API docs (#56519 )	2020-05-12 08:56:29 -04:00
Martijn van Groningen	74e2c01138	Auto create data streams using index templates v2 (#55377 ) This commit adds the ability to auto create data streams using index templates v2. Index templates (v2) now have a data_steam field that includes a timestamp field, if provided and index name matches with that template then a data stream (plus first backing index) is auto created. Relates to #53100	2020-05-12 13:42:59 +02:00
Lee Hinman	fc708ccca4	Remove prefer_v2_templates query string parameter (#56546 ) This commit removes the `prefer_v2_templates` flag and setting. This was a brief setting that allowed specifying whether V1 or V2 template should be used when an index is created. It has been removed in favor of V2 templates always having priority. Relates to #53101 Resolves #56528 This is not a breaking change because this flag was never in a released version.	2020-05-11 14:56:48 -06:00
Nik Everett	7c367de13b	Update skip after backport of #56252 (#56379 )	2020-05-07 15:41:45 -04:00
Nik Everett	923fc988ad	Fix auto_date_histogram interval (#56252 ) `auto_date_histogram` was returning the incorrect `interval` because of a combination of two things: 1. When pipeline aggregations rewrote `auto_date_histogram` we reset the interval to 1. Oops. Fixed that. 2. Every bucket aggregation was rewriting its buckets as though there was a pipeline aggregation even if there aren't any. This is a bit silly so we skip that too. Closes #56116	2020-05-07 08:19:53 -04:00
Przemko Robakowski	7ca47f52e8	Add prefer_v2_templates parameter to Reindex (#56253 ) * prefer_v2_templates for reindex	2020-05-06 22:01:13 +02:00
Jake Landis	32269f1a6d	_cat/threadpool remove "size" and add "time" params (#55736 ) The rest spec and documentation for _cat/threadpool supports a "size" parameter. However, the "size" parameter will have no impact since there are no values of type "SizeValue" of the return value of this _cat api. This commit removes the "size" param from the spec and documentation. This commit also adds support for the "time" param since and support to format the time param for the "keep_alive" column. By default, the output should not change since the "TimeValue" rendered default (via RestTable) is toString(), and the code prior to this also called toString(). closes #54478	2020-05-06 14:26:08 -05:00
Dan Hermann	117055d49e	Get index includes parent data stream for backing indices (#56022 )	2020-05-05 13:40:15 -05:00
Jake Landis	e392ce939a	deprecrate size from cat.thread_pool in json spec (#55984 )	2020-04-30 11:36:20 -05:00
Andrei Dan	e256becad7	Conditionally run tests asserting overlapping templates (#56028 ) Only run the tests verifyin the overlapping index templates when there is no `global` index template (ie. when the default shards are not changed)	2020-04-30 16:07:34 +01:00
Andrei Dan	475790c34e	Add HLRC support for simulate index template api (#55936 )	2020-04-30 14:24:46 +01:00
Andrei Dan	e3e9782b20	Update template v2 api rest spec (#55948 ) This removed the specification of `order` as it is not a parameter of the v2 put template api (the priority is the equivalent of `order` and is defined in the body) and add a bit of description for the `cause` parameter (which is currently used as a cluster update task tracking)	2020-04-30 10:52:52 +01:00
Andrei Dan	1a5845edce	Add simulate template composition API _index_template/_simulate_index/{name} (#55686 ) This adds a new api to simulate matching the given index name against the index templates in the system. The syntax for the new API takes the following form: POST _index_template/_simulate_index/{index_name} { "index_patterns": ["logs-*"], "priority": 15, "template": { "settings": { "number_of_shards": 3 } ... } } Where the body is optional, but we support the entire body used by the PUT _index_template/{name} api. When the body is specified we'll simulate matching the given index against a system that'd have the given index template together with the index templates that exist in the system. The response, in both cases, will return the matching template's resolved settings, mappings and aliases, together with a special field that'll print any overlapping templates and their corresponding index patterns.	2020-04-29 11:27:15 +01:00
David Roberts	e0f38896fd	[ML] Adjust BWC after daily_model_snapshot_retention_after_days backport (#55911 ) Simplifying BWC code after merging #55891	2020-04-29 10:49:40 +01:00
zacharymorn	498fc66cc8	Add API specs for voting config exclusions (#55760 ) Closes #48131	2020-04-29 08:34:10 +01:00
Lee Hinman	3fc17b1b55	Adjust skip version for _cat/templates yml tests (#55871 ) Now that #55829 has been backported (#55866) we can adjust these skip versions to allow testing with 7.8+. Relates to #53101	2020-04-28 11:05:00 -06:00
Dan Hermann	0077714bfe	REST test for rolling data streams (#55802 )	2020-04-28 11:52:36 -05:00
Lee Hinman	61acf602fc	Add support for V2 index templates to /_cat/templates (#55829 ) This adds support for V2 index templates to the cat templates API. It uses the `order` field as priority in order not to break compatibility, while adding the `composed_of` field to show component templates that are used from an index template. Relates to #53101	2020-04-28 09:25:54 -06:00
Dan Hermann	bcf86000e5	Delete index API properly handles backing indices for data streams (#55690 )	2020-04-24 16:34:57 -05:00
Zachary Tong	76170eded1	Update version skip after backport	2020-04-24 10:19:03 -04:00
Zachary Tong	9f165bd44e	Aggs must specify a `field` or `script` (or both) (#52226 ) * Aggs must specify a `field` or `script` (or both) This adds a validation to VSParserHelper to ensure that a field or script or both are specified by the user. This is technically required today already, but throws an exception much deeper in the agg framework and has a very unintuitive error for the user (as well as eating more resources instead of failing early) * Fix StringStats test * Add yaml test * Skip test on older versions Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-04-23 14:26:38 -04:00
Jake Landis	2b0900d33b	Validate REST specs against schema (#55117 ) A JSON schema was recently introduced for the REST API specification. #54252 This PR introduces a 3rd party validation tool to ensure that the REST specification conforms to the schema. The task is applied to the 3 projects that contain REST API specifications. The plugin wires this task into the precommit commit task, and should be considered as part of the public API for the build tools for any plugin developer to contribute their plugin's specification. An ignore parameter has been introduced for the task to allow specific file to be ignored from the validation. The ignored files in this PR will soon get issues logged and a link so they can be fixed. Closes #54314	2020-04-21 18:18:18 -05:00
Fernando Briano	4e9dd2b292	Add skip arbitrary_key to nodes.reload_secure_settings YAML test (#55402 )	2020-04-21 16:19:38 +01:00
Lee Hinman	93021f72aa	Adjust serialization versions for prefer_v2_templates flag (#55478 ) This adjusts the minimum version for serialization for #55411. It should only be merged after #55476 has been merged	2020-04-20 14:07:40 -06:00
Lee Hinman	0202e1ae96	Add prefer_v2_templates flag and index setting (#55411 ) This commit adds a new querystring parameter on the following APIs: - Index - Update - Bulk - Create Index - Rollover These APIs now support a `?prefer_v2_templates=true\|false` flag. This flag changes the preference creation to use either V2 index templates or V1 templates. This flag defaults to `false` and will be changed to `true` for 8.0+ in subsequent work. Additionally, setting this flag internally sets the `index.prefer_v2_templates` index-level setting. This setting is used so that actions that automatically create a new index (things like rollover initiated by ILM) will inherit the preference from the original index. This setting is dynamic so that a transition from v1 to v2 templates can occur for long-running indices grouped by an alias performing periodic rollover. This also adds support for sending this parameter to the High Level Rest Client. Relates to #53101	2020-04-20 10:04:42 -06:00
zhichen	05066aecf0	Add Bulk stats track the bulk per shard (#52208 ) * Add Bulk stats track the bulk sizes per shard and the time spent on the bulk shard request (#50536)(#47345)	2020-04-20 11:09:29 +02:00
Dan Hermann	4a8b84349d	Mute data stream YML tests until backport (#55406 )	2020-04-17 10:48:43 -05:00
Dan Hermann	e1730452e3	Add explicit generation attribute to data streams (#55342 )	2020-04-17 09:02:09 -05:00
Martijn van Groningen	d649826a94	Re-enable data stream yaml bwc tests. (#55367 ) After backporting #55337	2020-04-17 09:44:31 +02:00
Martijn van Groningen	fada09a132	Make data streams in APIs resolvable. (#54726 ) The INCLUDE_DATA_STREAMS indices option controls whether data streams can be resolved in an api for both concrete names and wildcard expressions. If data streams cannot be resolved then a 400 error is returned indicating that data streams cannot be used. In this pr, the INCLUDE_DATA_STREAMS indices option is enabled in the following APIs: search, msearch, refresh, index (op_type create only) and bulk (index requests with op type create only). In a subsequent later change, we will determine which other APIs need to be able to resolve data streams and enable the INCLUDE_DATA_STREAMS indices option for these APIs. Whether an api resolve all backing indices of a data stream or the latest index of a data stream (write index) depends on the IndexNameExpressionResolver.Context.isResolveToWriteIndex(). If isResolveToWriteIndex() returns true then data streams resolve to the latest index (for example: index api) and otherwise a data stream resolves to all backing indices of a data stream (for example: search api). Relates to #53100	2020-04-16 19:46:13 +02:00
bellengao	338798a9db	Fix creating filtered alias using now in a date_nanos range query failed (#54785 ) Modify the value of nowInMillis in queryShardContext to current timestamp, because the value will be used lately when validating the filtered alias which uses now in a date_nanos range query. Closes #54315	2020-04-16 18:41:44 +02:00
Tomas Della Vedova	d7ee30c276	Yaml test: Fixed bad indentation (#55170 )	2020-04-15 08:49:39 +02:00
Julie Tibshirani	13053c6ad9	Remove the object format for indices_boost. (#55078 ) This format has been deprecated since version 5.2.	2020-04-14 21:01:07 -07:00
Yang Wang	92427d3758	Remove local parameter for get field mapping API (#55100 ) The local parameter of get field mapping API is marked as deprecated in 7.x. This PR removes it for v8.0	2020-04-15 12:02:10 +10:00
muachilin	db1236ce3d	Remove deprecated endpoints of hot threads API (#55109 ) This removes deprecated endpoints in hot threads action. Closes #52640	2020-04-14 15:36:57 -04:00
Mark Vieira	0e55fdeae9	Re-add origin url information to publish POM files (#55171 )	2020-04-14 11:48:36 -07:00
Igor Motov	3583bc7188	Tests: unmute test after t-test backport (#55122 ) Unmutes weighted avg rest test	2020-04-13 14:49:42 -04:00
Igor Motov	6eab90b3db	Prepare for backport of TTest filters (#55072 ) Preparation for backport of #55066	2020-04-13 11:32:22 -04:00
Yang Wang	6ce88038f2	Deprecate local parameter for get field mapping request (#55014 ) The usage of local parameter for GetFieldMappingRequest has been removed from the underlying transport action since v2.0. This PR deprecates the parameter from rest layer. It will be removed in next major version.	2020-04-12 12:34:44 +10:00
Ioannis Kakavas	5ee27927e8	Mute test in versions that do not support pwd protected keystores (#55069 ) Mute test in versions that do not support password protected keystores. This didn't fail in the PR check since we run MixedClusterClientYamlTestSuiteIT against 4 nodes (2 old and 2 new ) and this happened to hit a node that was on master rather than one that was on 7.8.0-SNAPSHOT	2020-04-10 18:56:45 +03:00
Ioannis Kakavas	16e9433ead	Fix ReloadSecureSettings API to consume password (#54771 ) The secure_settings_password was never taken into consideration in the ReloadSecureSettings API. This commit fixes that and adds necessary REST layer testing. Doing so, it also - Allows TestClusters to have a password protected keystore so that it can be set for tests. - Adds a parameter to the run task so that elastisearch can be run with a password protected keystore from source.	2020-04-10 16:48:36 +03:00
Lee Hinman	2e73fe3c6d	Bump minimum version for component template CRUD test (#54992 ) These tests do CRUD for component templates, however, for 7.7 some changes weren't backported in the `_doc` wrapping/unwrapping done for the APIs, this can cause test failures. This bumps the minimum version for these tests to 7.8, which is okay because component templates are hidden behind a flag and have no compatibility guarantees for 7.7. Relates to #53101	2020-04-08 16:38:50 -06:00
Lee Hinman	c7dc03345a	Add allowed warnings to index template composition tests (#54916 ) We occasionally add a global template for our YAML tests, and this can cause warnings for these template tests. This commit adds these warnings so they don't cause test failures. Resolves #54822 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-04-08 09:22:10 -06:00
Przemko Robakowski	b8ca70b8aa	HLRC support for Index Templates V2 (#54838 ) * HLRC support for Index Templates V2 This change adds High Level Rest Client support for Index Templates V2. Relates to #53101	2020-04-08 01:11:02 +02:00
Tal Levy	cf9603c6fd	Create new `geo` module and migrate geo_shape registration (#53562 ) This commit introduces a new `geo` module that is intended to be contain all the geo-spatial-specific features in server. As a first step, the responsibility of registering the geo_shape field mapper is moved to this module. Co-authored-by: Nicholas Knize <nknize@gmail.com>	2020-04-07 12:27:29 -07:00
Nik Everett	c546cc37aa	Update skip after backport of #54298 (#54900 ) We can run these tests again 7.8.0 now that it has the fix.	2020-04-07 13:52:41 -04:00
Nik Everett	f942655f22	More pipeline aggregation cleanup (#54298 ) This replaces the last bit of validation that pipeline aggregations performed on the data nodes with explicit checks in a few `PipelineAggregationBuilders`. We were already catching these validation errors for pipeline aggregations that require that their parent be squentially ordered. This just adds validation for pipelines that require any parent like `bucket_selector` and `bucket_sort`.	2020-04-07 09:20:43 -04:00
Lee Hinman	7747cfa5f3	Adjust skip version for Index Template V2 tests (#54754 ) These were skipped for all 7.x until the backport was finished, it's now backported so these can be adjusted to be run on 7.8+ Relates to #53101	2020-04-06 22:03:57 -06:00
Przemko Robakowski	ad8590e190	HLRC support for Component Templates APIs (#54635 ) * HLRC support for Component Templates * hlrc * hlrc * merge fix * removed unused import * checkstyle fixes * metaData -> metadata * move to ClusterClient * checkstyle fixes * checkstyle fixes * checkstyle fixes * method in spec fixed * PR comments * PR comments * PR comments * unused imports fixed * review comment Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-04-06 18:52:25 +02:00
Nhat Nguyen	61e5350e77	Support hierarchical task cancellation (#54757 ) With this change, when a task is canceled, the task manager will cancel not only its direct child tasks but all also its descendant tasks. Closes #50990	2020-04-06 12:00:02 -04:00
Dan Hermann	19da22ebf6	Delete backing indices with data stream (#54693 )	2020-04-06 09:55:37 -05:00
Lee Hinman	9f9ade7dcb	Use V2 index templates during index creation (#54669 ) * Use V2 index templates during index creation This commit changes our index creation code to use (and favor!) V2 index templates during index creation. The creation precedence goes like so, in order of precedence: - Existing source `IndexMetadata` - for example, when recovering from a peer or a shrink/split/clone where index templates should not be applied - A matching V2 index template, if one is found - When a V2 template is found, all component templates (in the `composed_of` field) are applied in the order that they appear, with the index template having the 2nd highest precedence (the create index request always has the top priority when it comes to index settings) - All matching V1 templates (the old style) This also adds index template validation when `PUT`-ing a new v2 index template (because this was required) and ensures that all index and component templates specify no top-level mapping type (it is automatically added when the template is added to the cluster state). This does not yet implement fine-grained component template merging of mappings, where we favor merging only a single field's configuration, that will be done in subsequent work. This also keeps the existing hidden index behavior present for v1 templates, where a hidden index will match v2 index templates unless they are global (`*`) templates. Relates to #53101	2020-04-03 09:34:50 -06:00
Dan Hermann	959f41e3d1	Get data stream accepts single search parameter (#54530 )	2020-04-03 09:32:42 -05:00
Dan Hermann	42f513c810	Create first backing index when creating data stream (#54467 )	2020-04-02 10:58:06 -05:00
Russ Cam	da37e01d32	Update rest API specs (#54252 ) This commit updates the rest API specs to validate against a JSON schema for the specifications. Most updates are to add a description, whilst others fix typos and unify conventions e.g. deprecations, descriptions, urls starting with /. The schema conforms to draft-07 JSON schema.	2020-04-02 10:40:50 +10:00
Nhat Nguyen	ee3d40320a	Broadcast cancellation to only nodes have outstanding child tasks (#54312 ) Today when canceling a task we broadcast ban/unban requests to all nodes in the cluster. This strategy does not scale well for hierarchical cancellation. With this change, we will track outstanding child requests and broadcast the cancellation to only nodes that have outstanding child tasks. This change also prevents a parent task from sending child requests once it got canceled. Relates #50990 Supersedes #51157 Co-authored-by: Igor Motov <igor@motovs.org> Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2020-04-01 11:22:13 -04:00
Lee Hinman	2a77551b12	Add allowed warnings to index template v2 YAML tests (#54535 ) (#54541 ) There is a setting in `ESClientYamlSuiteTestCase` under `usually()` that can install a `global` template changing the number of shards for all indices. This can cause warnings when installing v2 templates (see #54367). This adds these as optional warnings so they don't cause failures regardless of whether the global template is installed or not. These warnings can be removed when our internal template usage has been moved to index templates v2 Relates to #53101	2020-03-31 16:27:48 -06:00
Nik Everett	2437c5ebeb	Update skip after backport (#54450 ) Now that #54161 is backported we can run its tests against nodes that have the fix. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-03-31 16:27:41 -04:00
Nik Everett	472dd1b370	Fix a master->7.x serialization bug (#54429 ) `master` wasn't sending `auto_date_histogram`'s `minimumIntervalExpression` over the wire to `7.x` nodes even though everything above `7.3` expected it. We never noticed this because we didn't have any yml tests for `auto_date_histogram` until I added some in #54161. Closes #54396	2020-03-30 11:37:38 -04:00
Benjamin Trent	49c30ec4de	muting test (#54404 )	2020-03-30 08:45:47 -04:00
Jason Tedor	922e09ba7b	Adjust BWC version on node roles being sorted Node roles are sorted now as of 7.8.0. This commit adjusts the BWC version for tests.	2020-03-28 15:30:02 -04:00
Jason Tedor	489c7091c4	Ensure that the output of node roles are sorted (#54376 ) This commit ensures that node roles are sorted by node role name, which makes the output easier to consume, and also makes it easier to rely on the behavior of the output in assertions.	2020-03-28 12:47:49 -04:00
Nik Everett	a0f7c4a6a4	Clean up how pipeline aggs check for multi-bucket (#54161 ) Pipeline aggregations like `stats_bucket`, `sum_bucket`, and `percentiles_bucket` only operate on buckets that have multiple buckets. This adds support for those aggregations to `geo_distance`, `ip_range`, `auto_date_histogram`, and `rare_terms`. This all happened because we used a marker interface to mark compatible aggs, `MultiBucketAggregationBuilder` and it was fairly easy to forget to implement the interface. This replaces the marker interface with an abstract method in `AggregationBuilder`, `bucketCardinality` which makes you return `NONE`, `ONE`, or `MANY`. The `bucket` aggregations can check for `MANY`. At this point `ONE` and `NONE` amount to about the same thing, but I suspect that'll be a useful distinction when validating bucket sorts. Closes #53215	2020-03-28 11:47:01 -04:00
Dan Hermann	133b743f4c	Test to enforce response to invalid data stream names	2020-03-27 12:20:42 -05:00
Lee Hinman	e89e916738	Add REST APIs for IndexTemplateV2Metadata CRUD (#54039 ) * Add REST APIs for IndexTemplateV2Metadata CRUD This commit adds the get/put/delete APIs for interacting with the now v2 versions of index templates. These APIs are behind the existing `es.itv2_feature_flag_registered` system property feature flag. Relates to #53101 * Add exceptions for HLRC tests * Add skips for 7.x versions * Use index_template instead of template_v2 in action names * Add test for MetaDataIndexTemplateService.addIndexTemplateV2 * Move removal to static method and add test * Add unit tests for request classes (implement hashCode & equals) Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-03-27 07:56:29 -06:00
Mark Tozzi	a90c1de874	Add ValuesSource Registry and associated logic (#54281 ) * Remove ValuesSourceType argument to ValuesSourceAggregationBuilder (#48638) * ValuesSourceRegistry Prototype (#48758) * Remove generics from ValuesSource related classes (#49606) * fix percentile aggregation tests (#50712) * Basic thread safety for ValuesSourceRegistry (#50340) * Remove target value type from ValuesSourceAggregationBuilder (#49943) * Cleanup default values source type (#50992) * CoreValuesSourceType no longer implements Writable (#51276) * Remove genereics & hard coded ValuesSource references from Matrix Stats (#51131) * Put values source types on fields (#51503) * Remove VST Any (#51539) * Rewire terms agg to use new VS registry (#51182) Also adds some basic AggTestCases for untested code paths (and boilerplate for future tests once the IT are converted over) * Wire Cardinality aggregation to work with the ValuesSourceRegistry (#51337) * Wire Percentiles aggregator into new VS framework (#51639) This required a bit of a refactor to percentiles itself. Before, the Builder would switch on the chosen algo to generate an algo-specific factory. This doesn't work (or at least, would be difficult) in the new VS framework. This refactor consolidates both factories together and introduces a PercentilesConfig object to act as a standardized way to pass algo-specific parameters through the factory. This object is then used when deciding which kind of aggregator to create Note: CoreValuesSourceType.HISTOGRAM still lives in core, and will be moved in a subsequent PR. * Remove generics and target value type from MultiVSAB (#51647) * fix checkstyle after merge (#52008) * Plumb ValuesSourceRegistry through to QuerySearchContext (#51710) * Convert RareTerms to new VS registry (#52166) * Wire up Value Count (#52225) * Wire up Max & Min aggregations (#52219) * ValuesSource refactoring: Wire up Sum aggregation (#52571) * ValuesSource refactoring: Wire up SigTerms aggregation (#52590) * Soft immutability for VSConfig (#52729) * Unmute testSupportedFieldTypes, fix Percentiles/Ranks/Terms tests (#52734) Also fixes Percentiles which was incorrectly specified to only accept numeric, but in fact also accepts Boolean and Date (because those are numeric on master - thanks `testSupportedFieldTypes` for catching it!) * VS refactoring: Wire up stats aggregation (#52891) * ValuesSource refactoring: Wire up string_stats aggregation (#52875) * VS refactoring: Wire up median (MAD) aggregation (#52945) * fix valuesourcetype issue with constant_keyword field (#53041) this commit implements `getValuesSourceType` for the ConstantKeyword field type. master was merged into feature/extensible-values-source introducing a new field type that was not implementing `getValuesSourceType`. * ValuesSource refactoring: Wire up Avg aggregation (#52752) * Wire PercentileRanks aggregator into new VS framework (#51693) * Add a VSConfig resolver for aggregations not using the registry (#53038) * Vs refactor wire up ranges and date ranges (#52918) * Wire up geo_bounds aggregation to ValuesSourceRegistry (#53034) This commit updates the geo_bounds aggregation to depend on registering itself in the ValuesSourceRegistry relates #42949. * VS refactoring: convert Boxplot to new registry (#53132) * Wire-up geotile_grid and geohash_grid to ValuesSourceRegistry (#53037) This commit updates the geo_grid aggregations to depend on registering itself in the ValuesSourceRegistry relates to the values-source refactoring meta issue #42949. Wire-up geo_centroid agg to ValuesSourceRegistry (#53040) This commit updates the geo_centroid aggregation to depend on registering itself in the ValuesSourceRegistry. relates to the values-source refactoring meta issue #42949. * Fix type tests for Missing aggregation (#53501) * ValuesSource Refactor: move histo VSType into XPack module (#53298) - Introduces a new API (`getBareAggregatorRegistrar()`) which allows plugins to register aggregations against existing agg definitions defined in Core. - This moves the histogram VSType over to XPack where it belongs. `getHistogramValues()` still remains as a Core concept - Moves the histo-specific bits over to xpack (e.g. the actual aggregator logic). This requires extra boilerplate since we need to create a new "Analytics" Percentile/Rank aggregators to deal with the histo field. Doubly-so since percentiles/ranks are extra boiler-plate'y... should be much lighter for other aggs * Wire up DateHistogram to the ValuesSourceRegistry (#53484) * Vs refactor parser cleanup (#53198) Co-authored-by: Zachary Tong <polyfractal@elastic.co> Co-authored-by: Zachary Tong <zach@elastic.co> Co-authored-by: Christos Soulios <1561376+csoulios@users.noreply.github.com> Co-authored-by: Tal Levy <JubBoy333@gmail.com>	2020-03-26 15:01:07 -04:00
Martijn Laarman	fbe173723d	Document known features on rest-api-spec tests (#52916 ) * Document known features on rest-api-spec tests Features dictate wheter a `rest-api-spec` test runner can execute a test. This PR documents all the know features in the java implementation of the runner. * Apply suggestions from code review Co-Authored-By: Luca Cavanna <javanna@users.noreply.github.com> Co-authored-by: Luca Cavanna <javanna@users.noreply.github.com>	2020-03-25 16:27:45 +01:00
Jason Tedor	1fc0432b24	Introduce formal role for remote cluster client (#53924 ) This commit introduce a formal role for identifying nodes that are capable of making connections to remote clusters.	2020-03-24 19:21:56 -04:00
Dan Hermann	81d8510887	Unmute data stream YML tests	2020-03-24 10:15:42 -05:00
Dan Hermann	aed8ce7cda	Cluster state and CRUD operations for data streams (#53877 )	2020-03-24 06:08:48 -05:00
Jim Ferenczi	04bd154037	Add heuristics to compute pre_filter_shard_size when unspecified (#53873 ) This commit changes the pre_filter_shard_size default from 128 to unspecified. This allows to apply heuristics based on the request and the target indices when deciding whether the can match phase should run or not. When unspecified, this pr runs the can match phase automatically if one of these conditions is met: * The request targets more than 128 shards. * The request contains read-only indices. * The primary sort of the query targets an indexed field. Users can opt-out from this behavior by setting the `pre_filter_shard_size` to a static value. Closes #39835	2020-03-23 19:06:32 +01:00
weizijun	face37514b	/_cat/shards support path stats (#53461 ) * _cat/shards support path stats * fix some style case * fix some style case * fix rest-api-spec cat.shards error * fix rest-api-spec cat.shards bwc error Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-03-23 14:27:13 +01:00
Martijn van Groningen	abcee01a96	adjusted skip version (#53969 )	2020-03-23 14:04:17 +01:00
James Rodewig	1df73828ed	[DOCS] Note doc links should be live in REST API JSON specs (#53871 ) Downstream Elasticsearch clients, such as the Elaticsearch-JS client, use the documentation links in our REST API JSON specifications to create their docs. Using a broken link or linking to yet-to-be-created doc pages can break the docs build for these clients. This PR adds a related note to the README for the REST API JSON Specs.	2020-03-23 07:36:37 -04:00
Martijn van Groningen	12046084f9	Initial data stream commit (#53666 ) * Initial data stream commit This commits adds a data stream feature flag, initial definition of a data stream and the stubs for the data stream create, delete and get APIs. Also simple serialization tests are added and a rest test to thest the data stream API stubs. This is a large amount of code and mainly mechanical, but this commit should be straightforward to review, because there isn't any real logic. The data stream transport and rest action are behind the data stream feature flag and are only intialized if the feature flag is enabled. The feature flag is enabled if elasticsearch is build as snapshot or a release build and the 'es.datastreams_feature_flag_registered' is enabled. The integ-test-zip sets the feature flag if building a release build, otherwise rest tests would fail. Relates to #53100 * fixed hlrc test * ignore bwc until this change has been backported to 7.x branch * changed data stream apis to be a cluster based action. before this commit the data steams api were indices based actions, but data streams aren't indices, data streams encapsulates indices, but are indices themselves. It is a cluster level attribute, and therefor cluster based action fits best for now. Perhaps in the future we will have data stream based actions and then this would be a right fit for the data stream crud apis. * this should have been part of the previous commit * fixed yaml test * Also add feature flag in other modules that run the yaml test if a release build is executed * Reverted the commits that make data stream a cluster based api This reverts commit `e362eeb669`. * Make data stream crud apis work like a indices based api. * renamed timestamp field * fixed compile error after merging in master * fixed merge mistake * moved setting system property * applied review comments	2020-03-20 11:22:18 +01:00
Lee Hinman	d47d74a558	Fix feature flag setting for ComponentTemplate APIs (#53758 ) The feature flag was set for most of the builds, but there are a couple where it was missing. Resolves #53708	2020-03-19 08:00:06 -06:00
James Rodewig	03caeaad79	[DOCS] Remove incorrect parms from put index template API docs (#53750 ) Removes the `flat_settings` and `timeout` query parameters from the JSON spec and asciidoc docs for the put index template API. These parameters are not supported by the API.	2020-03-18 14:33:40 -04:00
Jake Landis	3ef3cc571f	Add Watcher to available rest resources (#53620 ) Prior to this commit Watcher explicitly copied test between two projects with a copy task. This commit removes the explicit copy in favor of adding the Watcher tests to the available restResources that may be copied between projects. This is how inter-project dependencies should be modeled. However, only Watcher is included here since it is (currently) the only project with inter-project test dependencies. Note - this re-introduces: commit: `4f48e053f9` with some additional fixes.	2020-03-18 09:08:13 -05:00
Ioannis Kakavas	baccbec5f5	Mute failing test (#53709 ) see https://github.com/elastic/elasticsearch/issues/53708	2020-03-18 10:36:16 +02:00
Lee Hinman	263e525e49	Add REST API for ComponentTemplate CRUD (#53558 ) * Add REST API for ComponentTemplate CRUD This adds the Put/Get/DeleteComponentTemplate APIs that allow inserting, retrieving, and removing ComponentTemplateMetadata into the cluster state metadata. These APIs are currently only available behind a feature flag system property - `es.itv2_feature_flag_registered`. Relates to #53101 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-03-17 10:55:07 -06:00
Gordon Brown	3c25005648	Update the Skip version in hidden index YAML tests (#53641 ) This commit adjusts the version ranges used for the _cat API hidden index/alias tests.	2020-03-17 10:20:48 -06:00
Hendrik Muhs	68a698f9ae	[Transform] add transform discovery node role (#53616 ) Enhancement of #52712: Add a discovery node role using the letter t for transform. Fixes #53156	2020-03-17 11:34:22 +01:00
Jim Ferenczi	ff94792e41	Shortcut query phase using the results of other shards (#51852 ) This commit, built on top of #51708, allows to modify shard search requests based on informations collected on other shards. It is intended to speed up sorted queries on time-based indices. For queries that are only interested in the top documents. This change will rewrite the shard queries to match none if the bottom sort value computed in prior shards is better than all values in the shard. For queries that mix top documents and aggregations this change will reset the size of the top documents to 0 instead of rewriting to match none. This means that we don't need to keep a search context open for this shard since we know in advance that it doesn't contain any competitive hit.	2020-03-17 10:54:44 +01:00
Nik Everett	f2f0fc03de	Update skip before backport PR #53617 has yet to be backported so we should skip its integration test on nodes earlier than 8.0.0	2020-03-16 15:51:29 -04:00
Nik Everett	3b7843d774	Fix sorting agg buckets by doc_count (#53617 ) I broke sorting aggregations by `doc_count` in #51271 by mixing up true and false. This flips that comparison and adds a few tests to double check that we don't so this again.	2020-03-16 14:41:56 -04:00
Mayya Sharipova	01eee1a97f	Highlighters skip ignored keyword values (#53408 ) Keyword field values with length more than ignore_above are not indexed. But highlighters still were retrieving these values from _source and were trying to highlight them. This sometimes lead to errors if a field length exceeded max_analyzed_offset. But also this is a wrong behaviour to attempt to highlight something that was not ignored during indexing. This PR checks if a keyword value was ignored because of its length, and if yes, skips highlighting it. Closes #43800	2020-03-16 06:49:37 -04:00
Gordon Brown	d7bbc9df1d	Allow _cat indices & aliases to use indices options (#53248 ) This commit adjusts the _cat/indices and _cat/aliases APIs to allow specifying indices options, so that these APIs can handle hidden indices/aliases in the same way as other APIs. Also adds the hidden option to the expand_wildcards parameter in the YAML spec for every API that accepts it.	2020-03-13 11:57:00 -06:00
Marios Trivyzas	314145294e	Fix YAML test for search.allow_expensive_queries (#53541 ) Remove excessive testing and keep only the checks for when the queries are disallowed. Fix also the check for the initial value of the setting to be conmbatible with Go client tests.	2020-03-13 14:42:09 +01:00
Nik Everett	d0addbd142	Update skip after backport (#53421 ) Now that we've backported #53315 we can run the backwards compatibility tests against it.	2020-03-11 16:43:43 -04:00
Andy Bristol	4095df443b	aggregator and yaml tests for missing agg (#53214 ) Tests for unmapped fields, the missing parameter, scripting, and correct ValuesSource types in MissingAggregatorTests. Basic yaml tests for the missing agg For #42949	2020-03-11 13:23:38 -07:00
Nik Everett	5b15aa6b0b	Update skip after backport (#53346 ) Now that #53296 is backported to 7.x we can run its tests against 7.7.0.	2020-03-10 13:08:36 -04:00
Nik Everett	a63232d2bc	Fix date_nanos in composite aggs (#53315 ) It looks like `date_nanos` fields weren't likely to work properly in composite aggs because composites iterate field values using points and we weren't converting the points into milliseconds. Because the doc values were coming back in milliseconds we ended up geting very confused and just never collecting sub-aggregations. This fixes that by adding a method to `DateFieldMapper.Resolution` to `parsePointAsMillis` which is similarly in name and function to `NumberFieldMapper.NumberType`'s `parsePoint` except that it normalizes to milliseconds which is what aggs need at the moment. Closes #53168	2020-03-10 11:40:14 -04:00
Nik Everett	57853de283	Fix composite agg sort bug (#53296 ) When an composite aggregation is run against an index with a sort that starts with the "source" fields from the composite but has additional fields it'd blow up in while trying to decide if it could use the sort. This changes it to decide that it can use the sort. Closes #52480	2020-03-10 09:23:44 -04:00
Nik Everett	fbdb4105fe	Add `allowed_warnings` to yaml tests (#53139 ) When we test backwards compatibility we often end up in a situation where we sometimes get a warning, and sometimes don't. Like, we won't get the warning if we're testing against an older version, but we will in a newer one. Or we won't get the warning if the request randomly lands on a node with an old version of the code. But we wouldn't if it randomed into a node with newer code. This adds `allowed_warnings` to our yaml test runner for those cases: warnings declared this way are "allowed" but not "required". Blocks #52959	2020-03-05 08:12:33 -05:00
Filip M. Nowak	ad8c6ccb45	tiny typo fix (#52929 )	2020-02-27 23:50:14 +01:00
David Turner	a3a98c7003	Cache completion stats between refreshes (#51991 ) Computing the stats for completion fields may involve a significant amount of work since it walks every field of every segment looking for completion fields. Innocuous-looking APIs like `GET _stats` or `GET _cluster/stats` do this for every shard in the cluster. This repeated work is unnecessary since these stats do not change between refreshes; in many indices they remain constant for a long time. This commit introduces a cache for these stats which is invalidated on a refresh, allowing most stats calls to bypass the work needed to compute them on most shards. Closes #51915	2020-02-27 07:33:16 +00:00
Nhat Nguyen	827f62c990	Fix translog stats on closed indices yaml test (#52800 ) We need to wait for no initializing shards before closing; otherwise, we might fail to close some recovering replicas. Closes #52701	2020-02-26 08:13:31 -05:00
Jake Landis	810dc9fce3	Smarter copying of the rest specs and tests (#52114 ) This PR addresses the unnecessary copying of the rest specs and allows for better semantics for which specs and tests are copied. By default the rest specs will get copied if the project applies `elasticsearch.standalone-rest-test` or `esplugin` and the project has rest tests or you configure the custom extension `restResources`. This PR also removes the need for dozens of places where the x-pack specs were copied by supporting copying of the x-pack rest specs too. The plugin/task introduced here can also copy the rest tests to the local project through a similar configuration. The new plugin/task allows a user to minimize the surface area of which rest specs are copied. Per project can be configured to include only a subset of the specs (or tests). Configuring a project to only copy the specs when actually needed should help with build cache hit rates since we can better define what is actually in use. However, project level optimizations for build cache hit rates are not included with this PR. Also, with this PR you can no longer use the includePackaged flag on integTest task. The following items are included in this PR: * new plugin: `elasticsearch.rest-resources` * new tasks: CopyRestApiTask and CopyRestTestsTask - performs the copy * new extension 'restResources' ``` restResources { restApi { includeCore 'foo' , 'bar' //will include the core specs that start with foo and bar includeXpack 'baz' //will include x-pack specs that start with baz } restTests { includeCore 'foo', 'bar' //will include the core tests that start with foo and bar includeXpack 'baz' //will include the x-pack tests that start with baz } } ```	2020-02-25 18:46:32 -06:00
Marios Trivyzas	e42c4d1b0b	[Tests] Update skip version for YAML tests (#52324 ) Update skip versions upper boundary to match the release or intended release version of the feature/fix. Relates to #52310	2020-02-13 20:03:48 +01:00
Nik Everett	24e36ba326	Enable BWC test after backport (#52299 ) Now that we've backported #52016 we can run its tests when we're performance backwards compatibility testing.	2020-02-13 07:56:35 -05:00
Nik Everett	7346cd05ed	Update skip after backport (#52287 ) Now that #51868 is fully backported we can run its tests in the backwards compatibility tests.	2020-02-12 17:01:24 -05:00
Nik Everett	b38c04f0c6	Update skip for backported fix (#52240 ) Now that #51172 is fully backported we can fix the `skip` clause in the bwc tests for it.	2020-02-12 13:55:44 -05:00
Marios Trivyzas	a8b39ed842	Add a cluster setting to disallow expensive queries (#51385 ) Add a new cluster setting `search.allow_expensive_queries` which by default is `true`. If set to `false`, certain queries that have usually slow performance cannot be executed and an error message is returned. - Queries that need to do linear scans to identify matches: - Script queries - Queries that have a high up-front cost: - Fuzzy queries - Regexp queries - Prefix queries (without index_prefixes enabled - Wildcard queries - Range queries on text and keyword fields - Joining queries - HasParent queries - HasChild queries - ParentId queries - Nested queries - Queries on deprecated 6.x geo shapes (using PrefixTree implementation) - Queries that may have a high per-document cost: - Script score queries - Percolate queries Closes: #29050	2020-02-12 18:06:04 +01:00
Nik Everett	da2b67d6e5	Fix a DST error in date_histogram (#52016 ) When `date_histogram` attempts to optimize itself it for a particular time zone it checks to see if the entire shard is within the same "transition". Most time zone transition once every size months or thereabouts so the optimization can usually kicks in. But it crashes when you attempt feed it a time zone who's last DST transition was before epoch. The reason for this is a little twisted: before this patch it'd find the next and previous transitions in milliseconds since epoch. Then it'd cast them to `Long`s and pass them into the `DateFieldType` to check if the shard's contents were within the range. The trouble is they are then converted to `String`s which are then parsed back to `Instant`s which are then convertd to `long`s. And the parser doesn't like most negative numbers. And everything before epoch is negative. This change removes the `long` -> `Long` -> `String` -> `Instant` -> `long` chain in favor of passing the `long` -> `Instant` -> `long` which avoids the fairly complex parsing code and handles a bunch of interesting edge cases around epoch. And other edge cases around `date_nanos`. Closes #50265	2020-02-11 16:04:36 -05:00
Zachary Tong	ba9c4fb987	Refactor Percentiles/Ranks aggregation builders and factories (#51887 ) - Consolidates HDR/TDigest factories into a single factory - Consolidates most HDR/TDigest builder into an abstract builder - Deprecates method(), compression(), numSigFig() in favor of a new unified PercentileConfig object - Disallows setting algo options that don't apply to current algo The unified config method carries both the method and algo-specific setting. This provides a mechanism to reject settings that apply to the wrong algorithm. For BWC the old methods are retained but marked as deprecated, and can be removed in future versions. Co-authored-by: Mark Tozzi <mark.tozzi@gmail.com>	2020-02-11 15:32:03 -05:00
Nhat Nguyen	ebc4681473	Use local checkpoint to calculate min translog gen for recovery (#51905 ) Today we use the translog_generation of the safe commit as the minimum required translog generation for recovery. This approach has a limitation, where we won't be able to clean up translog unless we flush. Reopening an already recovered engine will create a new empty translog, and we leave it there until we force flush. This commit removes the translog_generation commit tag and uses the local checkpoint of the safe commit to calculate the minimum required translog generation for recovery instead. Closes #49970	2020-02-10 08:26:01 -05:00
Martijn Laarman	1c3b341960	Time parameter includes description (#49368 ) * Time parameter includes description In option enumeration causing codegenerators to pick up the description as a value to send. * cat.shards missing ending quotes	2020-02-06 17:18:22 +01:00
Jim Ferenczi	eb69c6fe7c	Always rewrite search shard request outside of the search thread pool (#51708 ) This change ensures that the rewrite of the shard request is executed in the network thread or in the refresh listener when waiting for an active shard. This allows queries that rewrite to match_no_docs to bypass the search thread pool entirely even if the can_match phase was skipped (pre_filter_shard_size > number of shards). Coordinating nodes don't have the ability to create empty responses so this change also ensures that at least one shard creates a full empty response while the other can return null ones. This is needed since creating true empty responses on shards require to create concrete aggregators which would be too costly to build on a network thread. We should move this functionality to aggregation builders in a follow up but that would be a much bigger change. This change is also important for #49601 since we want to add the ability to use the result of other shards to rewrite the request of subsequent ones. For instance if the first M shards have their top N computed, the top worst document in the global queue can be pass to subsequent shards that can then rewrite to match_no_docs if they can guarantee that they don't have any document better than the provided one.	2020-02-06 08:55:20 +01:00
Nik Everett	b6d06c91a0	Fix a sneaky bug in rare_terms (#51868 ) When the `rare_terms` aggregation contained another aggregation it'd break them. Most of the time. This happened because the process that it uses to remove buckets that turn out not to be rare was incorrectly merging results from multiple leaves. This'd cause array index out of bounds issues. We didn't catch it in the test because the issue doesn't happen on the very first bucket. And the tests generated data in such a way that the first bucket always contained the rare terms. Randomizing the order of the generated data fixed the test so it caught the issue. Closes #51020	2020-02-05 13:38:46 -05:00
Adrien Grand	28e2f16734	Prepare backport of #51260 . (#51876 ) Backport: #51875	2020-02-05 11:02:46 +01:00
Karel Minarik	2ed9e95100	Fix the type for "slices" in the Reindex and Update By Query REST API specification (#51908 ) This patch supplements #51792 and #51535 where the type of the "slices" parameter has been fixed.	2020-02-05 09:50:49 +01:00
Julie Tibshirani	e0d57c5181	Correct the serialization version for index field caps. It needs to be lowered to 7.7 now that the PR has been backported.	2020-02-04 17:00:29 -08:00
Gordon Brown	375e39e579	Mute failing field_caps tests (#51897 )	2020-02-04 17:11:32 -07:00
Adrien Grand	d5bc6d6de0	Move analysis/mappings stats to cluster-stats. (#51260 ) Closes #51138	2020-02-04 16:56:49 +01:00
Jonathan Budzenski	23b31d6abe	[DOCS] Change http://elastic.co -> https (#48479 )	2020-02-03 08:52:34 -05:00
Karel Minarik	68db7fc611	Fix the type for "slices" in the Delete By Query REST API specification (#51792 ) The previous patch in `c1d9966d35` incorrectly set the `type` to `number\|auto`, which is incorrect — the "polymorphic" type, denoted with the `\|` sign, should contain only other types, ie. number, string, bool, etc. Fixes #51535	2020-02-02 18:09:07 +01:00
Karel Minarik	c1d9966d35	Fix the "slices" parameter for the Delete By Query API in the REST specification (#51535 ) This patch updates the `type` parameter in the Delete By Query API: according to [the documentation](https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-delete-by-query.html#docs-delete-by-query-slice), it can be set to "auto", but the type in the documentation allows only numerical values. This prevents people from setting the parameter to "auto" eg. in the Go client, which generates source from the specification, and sets the corresponding Go type as number. The patch uses the `\|` notation, which we have discussed previously for encoding a "polymorphic" parameter like this. Related: https://github.com/elastic/go-elasticsearch/issues/77	2020-02-02 09:59:48 +01:00
Christoph Büscher	54e8d1ce1c	Lower range/10_basic.yml skip version after backport After backport of #50237 in #51741 the skip version in this test can be lowered.	2020-01-31 16:10:49 +01:00
Nhat Nguyen	6e0fbbd4db	Remove translog retention settings (#51697 ) The translog retention settings index.translog.retention.size and index.translog.retention.age were effectively ignored in 7.4, deprecated in 7.7, and now removed in 8.0 in favor of soft-deletes. Closes #50775	2020-01-31 08:18:07 -05:00
Christoph Büscher	7cec5f93be	Make `date_range` query rounding consistent with `date` (#50237 ) Currently the rounding used in range queries can behave differently for `date` and `date_range` as explained in #50009. The behaviour on `date` fields is the one we document in https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-range-query.html#range-query-date-math-rounding. This change adapts the rounding behaviour for RangeType.DATE so it uses the same logic as the `date` for the `date_range` type. Closes #50009	2020-01-31 14:15:13 +01:00
Nhat Nguyen	49a6729af1	Adjust bwc for #51588	2020-01-30 10:38:32 -05:00
Nhat Nguyen	2aa650c75e	Deprecate translog retention settings (#51588 ) This change deprecates the translog retention settings as they are effectively ignored since 7.4. Relates #50775 Relates #45473	2020-01-29 10:19:22 -05:00
Nik Everett	0483f7c1b7	Begin moving date_histogram to offset rounding (take two) (#51271 ) We added a new rounding in #50609 that handles offsets to the start and end of the rounding so that we could support `offset` in the `composite` aggregation. This starts moving `date_histogram` to that new offset. This is a redo of #50873 with more integration tests. This reverts commit `d114c9db3e`.	2020-01-27 12:24:52 -05:00
Nik Everett	d060d73a7d	Support time_zone on composite's date_histogram (#51172 ) We've been parsing the `time_zone` parameter on `date_hitogram` for a while but it hasn't done anything. This wires it up. Closes #45199 Inspired by #45200	2020-01-27 11:10:53 -05:00
Dan Pickett	c3b5f713e2	[DOCS] Fix typos in several REST API specs (#51197 ) Changes `effected` to `affected` in several REST API spec files.	2020-01-22 12:17:33 -05:00
Przemyslaw Gomulka	a2f23f7ea0	Remove TODO in a test for verifying start of the week (#50894 ) The test should be run against 7.7 version at least, as this was only backported and released in that version relates SPI based implementation #48209 relates backport #50916	2020-01-21 16:58:19 +01:00
Nik Everett	73991daa91	Add "did you mean" to unknown queries (#51177 ) This replaces the message we return for unknown queries with the standard one that we use for unknown fields from `ObjectParser`. This is nice because it includes "did you mean". One day we might convert parsing queries to using object parser, but that looks complex. This change is much smaller and seems useful.	2020-01-20 15:24:59 -05:00
Nhat Nguyen	c893a3e495	Make soft-deletes mandatory in 8.0 (#51122 ) Creating indices with soft deletes disabled is no longer supported in 8.0.	2020-01-17 17:34:22 -05:00
Maxim	09ebc11b06	Deprecates _upgrade API (#47678 ) (#50484 ) * Deprecates _upgrade API Ref #47678 * Move deprecation flags to path section. Add deprecation warning tests for _upgrade API. Ref #47678	2020-01-17 14:52:46 -07:00
Nik Everett	1a727ba277	Update skip after backport (#51175 ) Now that we've backported #50869 we can update the skip config for its test.	2020-01-17 13:32:07 -05:00
Nik Everett	224640a3ca	"did you mean" for ObjectParser with top named (#51018 ) When you declare an ObjectParser with top level named objects like we do with `significant_terms` we didn't support "did you mean". This fixes that. Relates #50938	2020-01-17 10:41:14 -05:00

... 2 3 4 5 6 ...

2277 Commits