228 Commits

Author SHA1 Message Date
Mark Vieira c2eda511de
Add JUnit rule based integration test cluster orchestration framework (#92379)
This commit adds a new test framework for configuring and orchestrating
test clusters for both Java and YAML REST testing. This will eventually
replace the existing "test-clusters" Gradle plugin and the build-time
cluster orchestration.
2022-12-21 15:33:46 -08:00
Ignacio Vera ed564f6e1d
Update to lucene-9.3.0 (#88927) 2022-08-01 07:21:13 +02:00
Ignacio Vera 3b7f393a82
Upgrade to lucene snapshot lucene-9.3.0-snapshot-b8d1fcfd0ec (#88706) 2022-07-22 11:22:39 +02:00
Nhat Nguyen bd69f90fff
Upgrade to Lucene-9.3.0-snapshot-2d05f5c623e (#88284)
To include LUCENE-10620 - which passes Weight to Collector
2022-07-06 16:16:03 -04:00
Armin Braun 02568210ba
Don't extend AbstractIndexComponent in AbstractTokenFilter (#88113)
No need for this extension, we don't make use of the settings or deprecation logger
in production any more. Also, this slows down CS operations that require a
temporary index service which builds quite a bit slower when the loggers
need to be set up via reflective calls.
2022-06-28 12:13:36 +02:00
Nhat Nguyen c2dc6e6ef4
Upgrade to new Lucene snapshot (#87932)
This PR uses a Lucene 9.3 snapshot in Elasticsearch 8.4. Notable changes in this Lucene snapshot:

- Merge-on-refresh (disabled)
- No more pathological merging
- SortedSetDocValues#count for value_count aggs
2022-06-23 12:18:27 -04:00
Alan Woodward 048fa422c2
Update to public lucene 9.2.0 release (#87162) 2022-06-06 10:06:41 +01:00
Alan Woodward 205cfec52f
Upgrade to lucene 9.2.0-RC2 snapshot (#86931)
Only difference from last snapshot is a revert of a change in the behaviour
of PersianAnalyzer
2022-05-20 08:54:35 +01:00
Alan Woodward 0418e8a9d8
Upgrade to lucene snapshot 978eef5459c (#86852)
Final (hopefully!) snapshot before the 9.2.0 release

* Update test to expect persian tokenfilter - will be exposed later
* Fix KnnVectorQueryBuilderTests::doAssertLuceneQuery

Co-authored-by: Mayya Sharipova <mayya.sharipova@elastic.co>
2022-05-17 15:27:52 -07:00
Armin Braun 7b916f2678
AbstractAnalyzerProvider does not need to extend AbstractIndexComponent (#86537)
Remove the inheritance here to make instances smaller and speed up many-shards benchmarks a little.
Did not remove the dead arguments from the constructors in this PR as that would have been a
very noisy change.
2022-05-08 22:34:52 +02:00
Alan Woodward 4d076eee20
Upgrade to Lucene 9.2 snapshot efa5d6f4d43 (#86227)
Notable changes include:

* count implementations for MultiRangeQuery and IndexSortedNumericDocValuesRangeQuery, which may speed up certain aggregations
* more efficient decoding of docids in BKD reader
2022-05-05 15:48:13 +01:00
Ignacio Vera af2fe8ee33
Upgrade Lucene to 9.1.0 release (#85211) 2022-03-22 14:11:53 +01:00
Alan Woodward 0863fb83d5
Upgrade to lucene 9.1.0-snapshot-5b522487ba8 (#85025)
Specifically includes LUCENE-10469 which should address a performance 
regression in EQL.
2022-03-16 14:58:20 +00:00
Julie Tibshirani bba2dfac56
Upgrade Lucene to 9.1.0-snapshot-949752 (#84540)
This PR upgrades Lucene to a newer snapshot `9.1.0-snapshot-9497524cc2d`.

Changes:
* Adapt to `LeafReader#searchNearestVectors` signature change
* Adapt checks in `GeometryIndexerTests`, `SearchServiceTests`, `FiltersAggregatorTests`, `AggregationProfilerIT`
* Address highlighting failures in `MultiPhrasePrefixQuery` and `HasChildQueryBuilder.LateParsingQuery`
2022-03-04 08:36:26 -08:00
Mayya Sharipova 26c3dd6857
Upgrade to lucene-9.1.0-snapshot-1336263051c (#83667)
Lucene issues that resulted in elasticsearch changes:

* LUCENE-9820: Separate logic for reading the BKD index from logic for intersecting it
* LUCENE-10377: Replace 'sortPos' with 'enableSkipping' in SortField.getComparator()
* LUCENE-10301: Make the test-framework a proper module by moving all test classes to org.apache.lucene.tests
* LUCENE-10300: Rewrite how resources are read in the Ukrainian morfologik analyzer
* LUCENE-10054: Make HnswGraph hierarchical
2022-02-22 09:53:20 +01:00
Przemyslaw Gomulka 037261356e
Convert 'id' and '_id' values in REST API tests to strings (#82681)
Follow-up from a comment on #77144 that converts id/_id values to always be strings instead of integers. This makes the type in the Elasticsearch specification only string instead of string | number.

This change was generated using the following command on Ubuntu:
find . -type f -name "*.yml" -print0 | xargs -0 sed -i -r 's/([^a-zA-Z0-9_\.]id|[^a-zA-Z0-9_]_id):(\s*)([0-9]+)/\1:\2"\3"/g'
2022-02-10 09:14:17 +01:00
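The substitution in the commit above can be tried on individual lines before running it over a whole tree. The following is a minimal sketch, assuming an approximate Python port of the quoted sed expression; the helper name `quote_numeric_ids` and the sample lines are hypothetical and do not come from the commit.

```python
import re

# Approximate port of the sed pattern from the commit message:
# an "id" or "_id" key preceded by a non-identifier character,
# then a colon, optional whitespace, and an unquoted integer.
ID_PATTERN = re.compile(r'([^a-zA-Z0-9_.]id|[^a-zA-Z0-9_]_id):(\s*)([0-9]+)')


def quote_numeric_ids(line: str) -> str:
    """Wrap an integer id/_id value in double quotes, leaving the rest of the line alone."""
    return ID_PATTERN.sub(r'\1:\2"\3"', line)


# Hypothetical YAML test lines, for illustration only.
samples = [
    '  - id: 1',       # plain id key: value gets quoted
    '    _id: 42',     # _id key: value gets quoted
    '  doc.id: 3',     # dotted name: left unchanged
    '  user_id: 7',    # longer identifier ending in _id: left unchanged
    '  id: "1"',       # already a string: left unchanged
]

for sample in samples:
    print(repr(sample), '->', repr(quote_numeric_ids(sample)))
```

Because the pattern requires a character before `id` or `_id`, a key that starts at the very first column of a line is not matched, so results are worth reviewing before committing.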
Alan Woodward 33ef38e478
Upgrade to released lucene 9.0.0 (#81426)
This commit makes elasticsearch depend on the released artifacts
of lucene 9.0.0, rather than an internal snapshot.
2021-12-07 14:19:56 +00:00
Mayya Sharipova db0b4ba08a
Upgrade Lucene 9 snapshot cc2a31f2be8 (#80213) 2021-11-02 15:50:33 -04:00
Mark Vieira 12ad399c48
Reformat Elasticsearch source 2021-10-27 08:19:51 -07:00
Alan Woodward dd3227192a
Update lucene 9 snapshot (#79701)
This includes the following changes:

* LUCENE-10180: Avoid using lambdas in SegmentMerger
* LUCENE-10187: Reduce DirectWriter's padding
* LUCENE-10193: Cut over more array access to VarHandles
* LUCENE-10189: Optimize flush of doc-value fields that are effectively single-valued
* LUCENE-10165: Implement Lucene90DocValuesProducer#getMergeInstance
2021-10-25 12:59:43 +01:00
Mayya Sharipova 074b023be5
Upgrade lucene version 8b68bf60c98 (#79461)
This introduces a new 8.10.1 Lucene version for ES 7.16

Also enables bwc, which was disabled in PR #79385
2021-10-19 11:30:41 -04:00
Alan Woodward 13f05358a0
Update to new lucene snapshot (#79138)
Includes the following new commits of interest:

* LUCENE-10150: override readLongs() in ByteBuffersDataInput
* LUCENE-10146: Add VectorSimilarityFunction.COSINE
* LUCENE-10140: Correct minimizing iterator sub-matches
* LUCENE-10103 Make QueryCache respect Accountable queries
* LUCENE-10170: Restore compression speed for LZ4.
2021-10-14 12:25:09 +01:00
Alan Woodward b56d72a468
Upgrade to lucene snapshot ba75dc5e6bf (#78817)
Includes the following lucene changes:

* LUCENE-10145: Speed up byte[] comparisons using VarHandles.
* LUCENE-10143: Delegate primitive writes in RateLimitedIndexOutput
* LUCENE-10182: No longer check dvGen.
* LUCENE-10153: Speed up BKDWriter using VarHandles.
* LUCENE-10150: override ByteBuffersDataInput readLong/readInt/readShort
2021-10-07 12:11:25 +01:00
Alan Woodward 92db6a62c0
Update to new lucene snapshot (#78548)
Includes the following changes:

- Updates to prevent package splits in ES: LUCENE-10118, LUCENE-10132
- Speedups in writing doc values: LUCENE-10127, LUCENE-10123, LUCENE-10125
- Speedups in writing primitives: LUCENE-10125
- Sort-after bugfixes: LUCENE-10126
2021-10-01 12:07:42 +01:00
Nhat Nguyen 7d82dbf9e3
Upgrade to Lucene9-snapshot-94c6e261274 (#78286)
Upgrade to a new Lucene 9 snapshot that includes LUCENE-10119 so we can 
re-enable the sort optimization with points for scroll and search_after
requests.

Relates #78230
2021-09-24 10:46:11 -04:00
Ryan Ernst b0c6ff8c3c
Fix split package in analysis-smartcn plugin (#78045)
The analysis-smartcn plugin reuses the server package name for
analysis. This commit moves the plugin implementation to use a single
package name, o.e.p.analysis.smartcn
2021-09-21 08:10:50 -07:00
Alan Woodward 524d1ea757
Upgrade to lucene-9.0.0-snapshot-32a0a16aff0 (#73324)
This commit upgrades elasticsearch 8.0 to use a snapshot of
Lucene 9.0.0.
2021-09-21 10:48:26 +01:00
Nhat Nguyen 05c25e4b0a
Upgrade to Lucene-8.10.0-snapshot-bf2fcb53079 (#77801)
To include LUCENE-10106.
2021-09-16 12:23:18 -04:00
Rene Groeschke 35ec6f348c
Introduce simple public yaml-rest-test plugin (#76554)
This introduces a basic public yaml rest test plugin that is supposed to be used by external 
elasticsearch plugin authors. This is driven by #76215

- Rename yaml-rest-test to intern-yaml-rest-test
- Use public yaml plugin in example plugins

Co-authored-by: Mark Vieira <portugee@gmail.com>
2021-08-31 08:45:52 +02:00
Nhat Nguyen 174f65ef82
Upgrade to Lucene 8.9.0 (#74729) 2021-06-30 08:49:53 -04:00
Ryan Ernst ab1a2e4a84
Add precommit task for detecting split packages (#73784)
Modularization of the JDK has been ongoing for several years. Recently
in Java 16 the JDK began enforcing module boundaries by default. While
Elasticsearch does not yet use the module system directly, there are
some side effects even for those projects not modularized (eg #73517).
Before we can even begin to think about how to modularize, we must
Prepare The Way by enforcing packages only exist in a single jar file,
since the module system does not allow packages to coexist in multiple
modules.

This commit adds a precommit check to the build which detects split
packages. The expectation is that we will add the existing split
packages to the ignore list so that any new classes will not exacerbate
the problem, and the work to clean up these split packages can be
parallelized.

relates #73525
2021-06-08 15:04:23 -07:00
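The idea behind the split-package check described in the commit above can be sketched as follows. This is only an illustration of the concept, not the Gradle precommit task itself: the function name `find_split_packages`, the jar names, and the class paths are all hypothetical, and the jar contents are passed in as plain lists instead of being read from real archives.

```python
from collections import defaultdict


def find_split_packages(jar_contents):
    """Return {package: sorted jar names} for every package present in more than one jar.

    jar_contents maps a jar name to the class-file paths it contains,
    e.g. "org/example/Foo.class".
    """
    owners = defaultdict(set)
    for jar_name, entries in jar_contents.items():
        for entry in entries:
            # Only class files that live inside a package are relevant.
            if not entry.endswith(".class") or "/" not in entry:
                continue
            package = entry.rsplit("/", 1)[0].replace("/", ".")
            owners[package].add(jar_name)
    return {pkg: sorted(jars) for pkg, jars in owners.items() if len(jars) > 1}


# Hypothetical jar listings, for illustration only.
jars = {
    "server.jar": ["org/example/index/analysis/Analyzer.class"],
    "plugin.jar": [
        "org/example/index/analysis/PluginFilter.class",
        "org/example/plugin/Plugin.class",
    ],
}

split = find_split_packages(jars)
print(split)
```

An ignore list like the one the commit mentions could be applied by filtering known packages out of the returned dictionary, so that only newly introduced split packages fail the check.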
Nhat Nguyen 5efb6eaba6
Update Lucene to 8.9.0-snapshot-ddc238e5df8 (#73568)
Just includes LUCENE-9980, which fixes #39591.

Closes #39591
2021-05-31 13:49:29 -04:00
Nhat Nguyen 1764e8ba15
Upgrade to Lucene-8.9.0-SNAPSHOT-efdc43fee18 (#73130)
Upgrades to Lucene-8.9 snapshot which includes:

- LUCENE-9507: Custom order for leaves (/cc @mayya-sharipova)
- LUCENE-9935: Enable bulk merge for stored fields with index sort
2021-05-17 09:37:20 -04:00
Nhat Nguyen a461597c75
Upgrade to Lucene 8.8.2 on 8.0 (#71587) 2021-04-14 08:52:23 -04:00
Jake Landis b1ef1fd800
Introduce yamlRestCompatTests for :plugins projects (#71440) 2021-04-08 16:11:50 -05:00
Mark Vieira 6339691fe3
Consolidate REST API specifications and publish under Apache 2.0 license (#70036) 2021-03-26 16:20:14 -07:00
Mark Vieira a92a647b9f
Update sources with new SSPL+Elastic-2.0 license headers
As per the new licensing change for Elasticsearch and Kibana this commit
moves existing Apache 2.0 licensed source code to the new dual license
SSPL+Elastic license 2.0. In addition, existing x-pack code now uses
the new version 2.0 of the Elastic license. Full changes include:

 - Updating LICENSE and NOTICE files throughout the code base, as well
   as those packaged in our published artifacts
 - Update IDE integration to now use the new license header on newly
   created source files
 - Remove references to the "OSS" distribution from our documentation
 - Update build time verification checks to no longer allow Apache 2.0
   license header in Elasticsearch source code
 - Replace all existing Apache 2.0 license headers for non-xpack code
   with updated header (vendored code with Apache 2.0 headers obviously
   remains the same).
 - Replace all Elastic license 1.0 headers with new 2.0 header in xpack.
2021-02-02 16:10:53 -08:00
Ignacio Vera 747773d5af
Upgrade to Lucene 8.8.0 (#68272) 2021-02-01 13:36:03 +01:00
Jim Ferenczi e77c523bd9
Upgrade to a new lucene 8.8.0 snapshot (#67691)
This change upgrades to the latest Lucene 8.8.0 snapshot.
It also restores the compression on binary doc values that was lost in the last snapshot upgrade.
The compression is now configurable on binary doc values but we don't expose this functionality yet so this commit ensures that we pick the same compression mode as previous releases (BEST_COMPRESSION).
2021-01-19 13:33:19 +01:00
Ignacio Vera 604ee06a3b
Upgrade to lucene-8.8-snapshot-f73f6b1 (#67228) 2021-01-12 08:03:00 +01:00
Ignacio Vera 4add5cb65d
Upgrade to Lucene-8.7.0 (#64532) 2020-11-03 16:04:26 +01:00
Ignacio Vera 2a1d8c66af
Upgrade to lucene-8.7.0-snapshot-72d8528c3a6 (#63912) 2020-10-20 13:33:32 +02:00
Mayya Sharipova e6aa10ad89
Upgrade 8.7.0-snapshot-5c4168d (#63444)
This disables the sort optimization on _doc, which may still be unstable.
2020-10-08 06:47:37 -04:00
Mayya Sharipova fc6e3dc41e
Upgrade to lucene-8.7.0-snapshot-e914862 (#63395)
This fixes a problem with some special cases of sorting on _doc
2020-10-07 08:48:14 -04:00
Mayya Sharipova 5f9ba13750
Upgrade to lucene-8.7.0-snapshot-66c49a35402 (#63349)
This includes fixing doc iteration during sort optimization
2020-10-06 20:31:51 -04:00
Nhat Nguyen ff43c08443
Upgrade to Lucene-8.7.0-snapshot-77396dbf339 (#63222)
Includes LUCENE-9554, which exposes the pendingNumDocs from IndexWriter.
2020-10-05 09:36:02 -04:00
Mayya Sharipova 1648cf2974
Upgrade lucene to lucene-8.7.0-snapshot-3b59906 (#62970) 2020-09-28 13:57:49 -04:00
Adrien Grand b33dc24a79
Upgrade to lucene-8.7.0-snapshot-830bd186a8d. (#62576) 2020-09-18 07:53:38 +02:00
Adrien Grand 0418e9c09b
Upgrade to lucene-8.7.0-snapshot-9cd3af50f80. (#62450)
This new snapshot contains the following JIRAs that we're interested in:
 - [LUCENE-9525](https://issues.apache.org/jira/browse/LUCENE-9525)
Better handling of small documents. This should improve retrieval times
when documents are less than ~1kB.
 - [LUCENE-9510](https://issues.apache.org/jira/browse/LUCENE-9510)
Faster flushes when index sorting is enabled by not compressing the
temporary files that store stored fields and term vectors.
2020-09-16 17:13:39 +02:00
Adrien Grand 39bde05040
Upgrade to lucene-8.7.0-snapshot-cdfdc1e0851. (#62334)
Upgrade to a new Lucene snapshot that (at least partially) addresses the
indexing rate regression when index sorting is enabled.
2020-09-15 14:19:42 +02:00