`MappedFieldType` only allows configuring `match` and `prefix` queries today.
This change makes it possible to configure how to create `wildcard` and `fuzzy`
queries as well.
This will allow making the upcoming `match_only_text` field fully support
intervals queries.
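Roughly, the shape of the new extension point looks like this (a minimal sketch; all names and signatures below are placeholders rather than the actual `MappedFieldType` methods):

```java
// Illustrative sketch only - not the real MappedFieldType API. The pattern is the
// point: the base field type rejects wildcard and fuzzy interval queries by default,
// and text-like field types override the factory methods to build the right query.
import java.util.Locale;

abstract class FieldTypeSketch {
    interface IntervalSketch {} // placeholder for whatever the override builds

    abstract String name();

    IntervalSketch wildcardIntervals(String pattern) {
        throw new IllegalArgumentException(String.format(Locale.ROOT,
            "Can only use wildcard intervals on text fields - not on [%s]", name()));
    }

    IntervalSketch fuzzyIntervals(String term, int maxEdits, int prefixLength, boolean transpositions) {
        throw new IllegalArgumentException(String.format(Locale.ROOT,
            "Can only use fuzzy intervals on text fields - not on [%s]", name()));
    }
}
```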
To write the contents from an InputStream we convert it into a
Flux<ByteBuffer>. Since we only have one IO thread for the Azure
SDK, the reads from the underlying InputStream are dispatched
to different threads in order to avoid blocking the IO thread.
This could cause visibility problems when the underlying InputStream
holds state that is not synchronized. To avoid this, this commit
introduces explicit synchronization barriers to ensure visibility.
The overhead should be fairly minimal, since contention should be low.
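As a minimal illustration of the idea (not the actual repository code): routing every read through the same monitor gives the required happens-before edges, so state the delegate stream mutates during a read on one thread is visible to the next read on another thread.

```java
import java.io.IOException;
import java.io.InputStream;

// Minimal sketch: every read happens inside the same monitor, so internal state
// written by a read on one thread is visible to subsequent reads on other threads.
final class SynchronizedReads {
    private final InputStream delegate;
    private final Object lock = new Object();

    SynchronizedReads(InputStream delegate) {
        this.delegate = delegate;
    }

    int read(byte[] buffer, int offset, int length) throws IOException {
        synchronized (lock) { // acquire barrier on entry, release barrier on exit
            return delegate.read(buffer, offset, length);
        }
    }
}
```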
* Warn users if security is implicitly disabled
Elasticsearch has security features implicitly disabled by default for
Basic and Trial licenses, unless explicitly set in the configuration
file.
This may be good for onboarding, but it also leads to unintentionally insecure
clusters.
This change introduces clear warnings when security features are
implicitly disabled:
- a warning header in each REST response if security is implicitly
disabled;
- a log message during cluster boot.
- Update Gradle wrapper to Gradle 7.0
- Remove deprecated usages to make the build 7.0 compatible
- Fix excludes in docs snippet tasks (see https://github.com/gradle/gradle/issues/16160 for details)
- Fix deprecation warnings in 7.0
- Add explicit dependencies that have been missed
- Make extract native licenses tasks output dir more explicit
- Use a snapshot of the ospackage plugin that includes a fix for 7.0 already
- Fix test runtime classpath setup in repository-hdfs
- Make task dependency explicit to fix further deprecation warnings
- Remove manual check for http repo usages that has been deprecated in Gradle 7.0
- Update Spock to the latest 2.0 milestone, required for Groovy 3
It's in the title. Currently we count requests as having been executed even if the
request was broken due to a network issue and no response was ever received.
In order not to overcount requests relative to what the S3 endpoint would measure for e.g. billing
purposes, we should ignore these requests.
Closes #70060
Adds support for "Default Application Credentials" for GCS repositories, making it easier to set up a repository on GCP,
as all relevant information needed to connect to the repository is retrieved from the environment, removing the need for
complicated keystore setups.
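For reference, this is roughly how the Google auth library resolves Application Default Credentials - from the `GOOGLE_APPLICATION_CREDENTIALS` file, the gcloud user credentials, or the metadata server - without touching the Elasticsearch keystore (a sketch, not the plugin's actual wiring):

```java
import com.google.auth.oauth2.GoogleCredentials;

import java.io.IOException;

class AdcExample {
    static GoogleCredentials loadDefaultCredentials() throws IOException {
        // Walks the standard chain: GOOGLE_APPLICATION_CREDENTIALS, the gcloud
        // application-default credentials file, then the GCE/GKE metadata server.
        return GoogleCredentials.getApplicationDefault();
    }
}
```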
1. Limit the number of blob deletes to execute in parallel to `100` (see the sketch below).
2. Use flat listing when deleting a directory to require fewer listings
in between deletes and keep the code simpler.
Closes #71267
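A simplified sketch of the first point (`deleteOneBlob` below is a placeholder for whatever issues a single asynchronous blob delete): deletes are issued in batches of at most 100, so no more than 100 are in flight at once.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.function.Function;

class BatchedDeletesSketch {
    private static final int MAX_PARALLEL_DELETES = 100;

    static void deleteAll(List<String> blobNames, Function<String, CompletableFuture<Void>> deleteOneBlob) {
        for (int from = 0; from < blobNames.size(); from += MAX_PARALLEL_DELETES) {
            int to = Math.min(from + MAX_PARALLEL_DELETES, blobNames.size());
            List<CompletableFuture<Void>> inFlight = new ArrayList<>();
            for (String blobName : blobNames.subList(from, to)) {
                inFlight.add(deleteOneBlob.apply(blobName)); // at most 100 deletes in flight
            }
            CompletableFuture.allOf(inFlight.toArray(new CompletableFuture<?>[0])).join();
        }
    }
}
```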
Today when creating an internal test cluster, we allow the test to
supply the node settings that are applied. The extension point to
provide these settings has a single integer parameter, indicating the
index (zero-based) of the node being constructed. This allows the test
to make some decisions about the settings to return, but it is too
simplistic. For example, imagine a test that wants to provide a setting,
but some values for that setting are not valid on non-data nodes. Since
the only information the test has about the node being constructed is
its index, it does not have sufficient information to determine if the
node being constructed is a non-data node or not, since this is done by
the test framework externally by overriding the final settings with
specific settings that dictate the roles of the node. This commit changes
the test framework so that the test has information about what settings
are going to be overridden by the test framework after the test provides
its test-specific settings. This allows the test to make informed
decisions about what values it can return to the test framework.
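Illustrative shape of the change (placeholder names, not the real test-framework API): the settings hook now also receives the settings the framework will apply on top, so the test can, for example, avoid returning data-node-only settings for a non-data node.

```java
import java.util.Map;

class NodeSettingsHookSketch {
    // Before: nodeSettings(int nodeOrdinal)
    // After:  nodeSettings(int nodeOrdinal, <settings the framework will override with>)
    static Map<String, String> nodeSettings(int nodeOrdinal, Map<String, String> frameworkOverrides) {
        boolean isDataNode = frameworkOverrides.getOrDefault("node.roles", "data").contains("data");
        if (isDataNode == false) {
            return Map.of(); // skip settings that are invalid on non-data nodes
        }
        return Map.of("some.data.only.setting", "value");
    }
}
```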
Currently metadata fields like `_size` or `_doc_count` cannot be retrieved using
the fields API. With this change, we allow this if the field is explicitly
queried for using its name, but won't include metadata fields when e.g.
requesting all fields via "*".
With this change, not all metadata fields will be retrievable by name,
but support for "_size" and "_doc_count" (which is fetched from source) is
added. Support for other metadata field types will need to be decided case by
case and an appropriate ValueFetcher needs to be supplied.
Relates to #63569
We've had a few bugs in the fields API where it doesn't behave like we'd
expect. Typically this happens because it isn't obvious what we expect. So
we'll try to use randomized testing to ferret out what we want. This adds
a test for most field types that asserts that `fields` works similarly
to `docvalues_fields`. We expect this to be true for most fields.
It does so by forcing all subclasses of `MapperTestCase` to define a
method that makes random values. It declares a few other hooks that
subclasses can override to further randomize the test.
We skip the test for a few field types that don't have doc values:
* `annotated_text`
* `completion`
* `search_as_you_type`
* `text`
We should come up with some way to test these without doc values, even
if it isn't as nice. But that is a problem for another time, I think.
We skip the test for a few more types just because I wanted to cut this
PR in half so we could get to reviewing it earlier. We'll get to those
in a follow up change.
I've filed a few bugs for things that are inconsistent with
`docvalues_fields`. Typically that means that we have to limit the
random values that we generate to those that *do* round trip properly.
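In outline, the hook looks something like this (a sketch with placeholder names, not the exact `MapperTestCase` methods): each field type test supplies a random-value generator, and the shared test asserts that `fields` and doc values agree on it.

```java
import java.util.List;

// Sketch with placeholder names: subclasses generate random sample values for their
// field type, and the shared test checks that the fields API and doc values agree.
abstract class FieldFetchTestSketch {
    /** Every field type test must be able to produce a random, indexable value. */
    protected abstract Object generateRandomInputValue();

    /** Hook for subclasses that need to constrain values so they round-trip cleanly. */
    protected Object normalize(Object value) {
        return value;
    }

    protected abstract List<Object> fetchViaFieldsApi(Object indexedValue);

    protected abstract List<Object> fetchViaDocValues(Object indexedValue);

    public void testFieldsMatchesDocValues() {
        Object value = normalize(generateRandomInputValue());
        List<Object> fromFields = fetchViaFieldsApi(value);
        List<Object> fromDocValues = fetchViaDocValues(value);
        if (fromFields.equals(fromDocValues) == false) {
            throw new AssertionError("fields=" + fromFields + " doc values=" + fromDocValues);
        }
    }
}
```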
REST API specs define the APIs used at the REST level; however, these specs
only define the endpoint and the parameters. We lack definitions for the
body, especially when it comes to rich bodies such as those used in ML.
This change introduces an abstract test case for JSON schema validation. It
allows developers to validate any object that is serializable to JSON - via
`ToXContent` - against a JSON schema. You can use it for REST
inputs and outputs, but also for internal objects (documents) and
`ToXContentFragments`.
As the overall goal is to ensure it validates properly, the test case enforces
strictness. A schema file must specify all properties. This ensures that once
a schema test has been added, it won't go out of sync. Every change to the
POJO requires a schema update, as otherwise the test would fail.
Schemas can load sub-schemas from extra files. That way you can re-use schemas,
e.g. in hierarchies, or re-use a schema for similar but not identical interfaces.
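The workflow the test case enforces, in outline (placeholder names; `validateAgainstSchema` stands in for whatever JSON schema validator is wired up):

```java
// Sketch with placeholder names: serialize the object to JSON via ToXContent,
// then validate it strictly against a schema file that declares every property.
abstract class SchemaValidationTestSketch<T> {
    /** Build a randomized instance of the object under test. */
    protected abstract T createTestInstance();

    /** Serialize the instance to JSON, e.g. through its ToXContent implementation. */
    protected abstract String toJson(T instance);

    /** Path of the schema file; sub-schemas may live in extra files and be referenced. */
    protected abstract String schemaPath();

    /** Placeholder for the actual schema validation, run in strict mode. */
    protected abstract void validateAgainstSchema(String json, String schemaPath);

    public final void testMatchesSchema() {
        String json = toJson(createTestInstance());
        // Strictness: undeclared properties fail validation, so every change to the
        // POJO forces a corresponding schema update.
        validateAgainstSchema(json, schemaPath());
    }
}
```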
The Checkstyle rule that bans unary negation in favour of an explicit
`== false` has a `maximumDepth` of 2 configured, which meant that it
didn't catch all violations. The `maximumDepth` isn't required (actually
it has a really high default), so this change removes the limit and
fixes the resulting violations.
Custom position increments are handled by wrapping analyzers
with a NamedAnalyzer and passing the custom increment through
to its constructor. However, phrase and prefix analyzers use
delegating analyzer wrappers to add extra filtering to their parent
analyzers. Since we can't wrap analyzers multiple times without
wrecking reuse strategies, we unwrap the parent before passing
it to the phrase and prefix builders. This unwrapping means that we
lose the custom position increments; in particular, it means that
we can end up with a position increment gap of -1, which is the
sentinel value for the unset parameter - and that means exceptions
at index time for backwards-moving positions on fields with multiple
values.
This commit removes the sentinel value and uses standard parameter
defaults and the isConfigured() method instead, plus it adds some
more comprehensive testing for position increments when combined
with phrase/prefix index options on text fields.
Fixes #70049
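A rough illustration of the sentinel problem and the shape of the fix (placeholder names, not the real mapper `Parameter` class): the parameter tracks whether it was configured instead of encoding "unset" as -1, so the sentinel can never leak into index-time position calculations.

```java
// Placeholder sketch, not the real mapper Parameter API.
final class IntParameterSketch {
    private final int defaultValue;
    private Integer configuredValue; // null until explicitly configured

    IntParameterSketch(int defaultValue) {
        this.defaultValue = defaultValue;
    }

    void configure(int value) {
        this.configuredValue = value;
    }

    boolean isConfigured() {
        return configuredValue != null;
    }

    int get() {
        return isConfigured() ? configuredValue : defaultValue; // never returns a -1 sentinel
    }
}
```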
Today in tests we often use a utility method that creates and starts a
transport service, and then we start it again in the tests anyway. This
commit removes this unnecessary code and asserts that we only ever call
`TransportService#acceptIncomingRequests` once.
We should not be ignoring and suppressing exceptions on releasing
network resources quietly in these spots.
Co-authored-by: David Turner <david.turner@elastic.co>
With the newly introduced `max_analyzed_offset` the analyzer of
`AnnotatedTextHighlighter` was wrapped twice with the
`LimitTokenOffsetAnalyzer` by mistake.
Follows: #67325
Add a `max_analyzed_offset` query parameter to allow users
to limit the highlighting of text fields to a value less than or equal to the
`index.highlight.max_analyzed_offset`, thus avoiding an exception when
the length of the text field exceeds the limit. The highlighting still takes place,
but stops at the length defined by the new parameter.
Closes: #52155
Today we represent block IDs sent to Azure using the URL-safe base-64
encoding. This makes sense: these IDs appear in URLs. It turns out that
Azure rejects this encoding for block IDs and instead demands that they
are represented using the regular, URL-unsafe, base-64 encoding instead,
then further wrapped in %-encoding to deal with the URL-unsafe
characters that inevitably result.
Relates #66489
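For illustration with plain JDK classes (the general idea, not the exact repository code):

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.util.Base64;

class BlockIdEncodingSketch {
    public static void main(String[] args) {
        byte[] blockId = "block-00000042".getBytes(StandardCharsets.UTF_8);

        // URL-safe alphabet ('-' and '_'): what we used to send, and what Azure rejects.
        String urlSafe = Base64.getUrlEncoder().encodeToString(blockId);

        // Regular alphabet ('+' and '/'): what Azure expects for block IDs...
        String regular = Base64.getEncoder().encodeToString(blockId);
        // ...which must then be %-encoded before appearing in a URL.
        String escaped = URLEncoder.encode(regular, StandardCharsets.UTF_8);

        System.out.println(urlSafe + " | " + regular + " -> " + escaped);
    }
}
```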
Part 9.
We have an in-house rule to compare explicitly against `false` instead
of using the logical not operator (`!`). However, this hasn't
historically been enforced, meaning that there are many violations in
the source at present.
We now have a Checkstyle rule that can detect these cases, but before we
can turn it on, we need to fix the existing violations. This is being
done over a series of PRs, since there are a lot to fix.
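For anyone unfamiliar with the convention, the rule flags the first form below and asks for the second:

```java
class NegationStyleExample {
    // Flagged by the rule: the '!' is easy to miss when reading.
    static boolean flagged(boolean valid) {
        return !valid;
    }

    // Preferred in this codebase: the comparison against false is explicit.
    static boolean preferred(boolean valid) {
        return valid == false;
    }
}
```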
DocumentMapper does not need to implement ToXContent; in fact, it is its inner Mapping that needs to, and already does. Consumers can switch to calling mapping() and invoking toXContent on the result.
As per the new licensing change for Elasticsearch and Kibana this commit
moves existing Apache 2.0 licensed source code to the new dual license
SSPL+Elastic license 2.0. In addition, existing x-pack code now uses
the new version 2.0 of the Elastic license. Full changes include:
- Update LICENSE and NOTICE files throughout the code base, as well
as those packaged in our published artifacts
- Update IDE integration to now use the new license header on newly
created source files
- Remove references to the "OSS" distribution from our documentation
- Update build time verification checks to no longer allow Apache 2.0
license header in Elasticsearch source code
- Replace all existing Apache 2.0 license headers for non-xpack code
with updated header (vendored code with Apache 2.0 headers obviously
remains the same).
- Replace all Elastic license 1.0 headers with new 2.0 header in xpack.
A blob store repository can be put in readonly mode by setting
`readonly: true` in its settings. In the codebase the setting key is
just the literal string `"readonly"` wherever it's used and it takes
some effort to determine what the right setting name is, in particular
to check each time that it's not spelled `"read_only"`.
This commit replaces those literal `"readonly"` strings with the
`BlobStoreRepository#READONLY_SETTING_KEY` constant to reduce this
trappiness.
Record the clusterUUID of the last cluster to write
to a repository in the `RepositoryData` and use it for more
meaningful logging when running into a concurrent modification
issue.
We have an in-house rule to compare explicitly against `false` instead
of using the logical not operator (`!`). However, this hasn't
historically been enforced, meaning that there are many violations in
the source at present.
We now have a Checkstyle rule that can detect these cases, but before we
can turn it on, we need to fix the existing violations. This is being
done over a series of PRs, since there are a lot to fix.
Today a snapshot repository does not have a well-defined identity. It
can be reregistered with a different cluster under a different name, and
can even be registered with multiple clusters in readonly mode.
This presents problems for cases where we need to refer to a specific
snapshot in a globally-unique fashion. Today we rely on the repository
being registered under the same name on every cluster, but this is not a
safe assumption.
This commit adds a UUID that can be used to uniquely identify a
repository. The UUID is stored in the top-level index blob, represented
by `RepositoryData`, and is also usually copied into the
`RepositoryMetadata` that represents the repository in the cluster
state. The repository UUID is exposed in the get-repositories API; other
more meaningful consumers will be added in due course.
This change upgrades to the latest Lucene 8.8.0 snapshot.
It also restores the compression on binary doc values that was lost in the last snapshot upgrade.
The compression is now configurable on binary doc values but we don't expose this functionality yet so this commit ensures that we pick the same compression mode as previous releases (BEST_COMPRESSION).
Similar to #62444 but for the outbound path.
This does not detect slowness in individual transport handler logic,
which is already covered by the inbound handler logging. Instead, it
warns if it takes a long time to hand off the message to the relevant
transport thread and then transfer the message over the wire.
This gives some visibility into the stability of the network
connection itself and into the reasons for slow network
responses (if they are the result of slow networking on the sender).
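A pared-down sketch of the check (placeholder names; the real change hooks into the transport's outbound pipeline): record when a message is queued for sending and warn if the handoff plus wire transfer takes longer than a threshold.

```java
import java.util.concurrent.TimeUnit;
import java.util.logging.Logger;

// Placeholder sketch: time the gap between queuing an outbound message and the
// completion of its transfer, and warn when it exceeds a threshold.
final class OutboundSlownessSketch {
    private static final Logger logger = Logger.getLogger("transport.outbound.slowlog");
    private static final long WARN_THRESHOLD_NANOS = TimeUnit.SECONDS.toNanos(5);

    private final long queuedNanos = System.nanoTime();

    void onTransferCompleted(String action, long messageBytes) {
        long tookNanos = System.nanoTime() - queuedNanos;
        if (tookNanos > WARN_THRESHOLD_NANOS) {
            logger.warning("sending [" + action + "] (" + messageBytes + " bytes) took ["
                + TimeUnit.NANOSECONDS.toMillis(tookNanos) + "ms], which is above the warn threshold");
        }
    }
}
```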
Closes #64824. Introduce the concept of categories to deprecation
logging. Every location where we log a deprecation message must now
include a deprecation category.
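In outline (enum values and method names here are illustrative, not necessarily the exact Elasticsearch API), every call site now passes a category alongside the message key:

```java
// Illustrative sketch only; the real DeprecationLogger/DeprecationCategory API may differ.
class DeprecationCategorySketch {
    enum Category { API, SETTINGS, INDICES, SECURITY }

    static void deprecate(Category category, String key, String message) {
        // The category travels with the message so it can be filtered and aggregated.
        System.err.println("[deprecation][" + category + "][" + key + "] " + message);
    }

    public static void main(String[] args) {
        deprecate(Category.SETTINGS, "node_ml_setting",
            "[node.ml] setting is deprecated, use [node.roles] instead");
    }
}
```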
We decided to rename `QueryShardContext` to clarify that it supports all parts
of search request execution. Before, there was confusion over whether it should
only be used for building queries, or maybe only used in the query phase. This
PR also updates the javadocs.
Closes #64740.
Part of the fixes for #66419, this commit permits nodes to emit the
deprecation warning regarding not specifying `?wait_for_active_shards`
when closing an index in 7.x versions for x ≥ 12. This change is
required on `master` too since the BWC tests encounter these warnings.
Relates #67246, which is the 7.x part of this change.
Eclipse wasn't seeing the special shadow jars we were making for
repository-azure and repository-hdfs so it wasn't able to compile those
plugins. This points Eclipse at the project that we use to build the
shadow jar which gets it compiling. The tests don't pass because we
aren't pointing at the shadow jars but at least we compile.
The error message might change depending on the timing when we try
to read from the stream. Since we already check that we're not able
to read any data, this assertion doesn't add much value.
We were too aggressive with retries, and in certain scenarios (CI) it
was possible that by the time the SDK had retried n times the http handler
still had some pending backlog that didn't account for all the performed
requests.
Closes #66865
This PR fixes the validation of the conversion from an input stream to a flux in the
AzureBlobStore's multipart upload logic, which erroneously checked that the upload
input stream is exhausted after each part's flux is completed.
Recent changes to the way Analyzers and field mappings are managed revealed a bug in the AnnotatedHighlighterAnalyzer class.
Old sequences of calls avoided the issue but under the new scheme a counter reset was required between documents being highlighted.
Closes #66535