Commit Graph

87301 Commits

Author SHA1 Message Date
Luigi Dell'Aquila a79bbffb0b
ES|QL: enable EXPLAIN (snapshot only) (#129526) 2025-06-23 09:55:45 +02:00
Rene Groeschke f1b2c8dd8e
Split up bc and pr upgrade tests (#129756)
Splits up bc and pr upgrade tests as they tend to be a bottle neck in intake and pr builds nowadays.
2025-06-23 09:17:54 +02:00
Sam Xiao e3838a4b9c
Make GeoIp downloader multi-project aware (#128282)
This change makes the GeoIp persistent task executor/downloader multi-project aware. 
- the database downloader persistent task will be at the project level, meaning there will be a downloader instance per project
- persistent task id is prefixed with project id, namely `<project-id>/geoip-downloader` for cluster in MP mode
2025-06-23 15:07:40 +08:00
Martijn van Groningen 41f69810df
Force niofs for fdt tmp file read access when flushing stored fields (#129538)
Due to the way how stored fields get flushed when index sorting is active, it is possible that we encounter significant page cache faults when memory is scarce. In order to mitigate some of the slowness around this, we're planning to no longer mmap the fdt temp file. Initially behind a feature flag, to check for unforeseen side effects.

Typically using always mmap directory is better compared to noifs directory given there is a sufficient memory available to the OS for filesystem caching. However when that isn't the case, then indexing performance can vary a lot (often very slow). This is more true for files tmp files that stored fields create during flushing. These files exist for only a brief moment to sort stored fields in the order of the configured index sorting and are then removed. If these tmp files are mmapped there is risk to trash file system cache.

This change only avoids using mmap for the fdt tmp file. This the file that actually contains the data and can large compared to other files that get flushed. The fdm (metadata) and fdi (stored field index) remain being mmapped.
2025-06-23 07:46:00 +02:00
elasticsearchmachine 13365dd14f Mute org.elasticsearch.index.engine.ThreadPoolMergeExecutorServiceDiskSpaceTests testAbortingOrRunningMergeTaskHoldsUpBudget #129823 2025-06-23 15:20:20 +10:00
Ignacio Vera 5bec44ad58
Reduce data amplification in IVFVectorsWriter (#129698)
With this change we will create first the tmp file and the posting list and once the file is deleted we will 
merge the vectors on the vec file. Therefore we only have two copies of the vector at the same time.
2025-06-23 07:13:22 +02:00
elasticsearchmachine 3c1137688e Mute org.elasticsearch.xpack.esql.qa.single_node.GenerativeIT test #129819 2025-06-23 09:04:21 +10:00
elasticsearchmachine 73146ac304 Mute org.elasticsearch.qa.verify_version_constants.VerifyVersionConstantsIT testLuceneVersionConstant #125638 2025-06-23 08:28:39 +10:00
elasticsearchmachine c2ebe1b535 Mute org.elasticsearch.search.query.RescoreKnnVectorQueryIT testKnnRetriever #129818 2025-06-23 07:34:58 +10:00
Chris Hegarty 1255a64832
Upgrade to Lucene 10.2.2 (#129546)
This commit upgrades to Upgrade to Lucene 10.2.2.

With the release of 10.2.2, we no longer need to workaround the Lucene bug mentioned in 128671.
2025-06-22 13:37:22 +01:00
elasticsearchmachine 6a1a862aea Mute org.elasticsearch.search.query.RescoreKnnVectorQueryIT testKnnQueryRescore #129809 2025-06-21 23:30:20 +10:00
Simon Chase cb451dac0b
datafeed: check remote_cluster_client before cluster aliases in start (#129601)
TransportStartDatafeedAction previously tried to validate remote index cluster
names in datafeed jobs, before checking if the local cluster had
remote_cluster_client role. Because this role enables retrieval of the remote
cluster names, the validation step would always fail with a no-such-cluster
exception. This was confusing. This change moves the remote_cluster_client check
ahead of cluster name validation, and adds a test.

Closes ES-11841
Closes #121149
2025-06-20 16:25:35 -07:00
Nik Everett 0b35acf861
ESQL: Fix misspelling in generated docs (#129789)
Pulled from #128576 so it's easier to review.
2025-06-20 20:42:37 +01:00
Parker Timmins 245dc0775a
Make flattened synthetic source concatenate object keys on scalar/object mismatch (#129600)
There is an issue where for Flattened fields with synthetic source, if there is a key with a scalar value, and a duplicate key with an object value, one of the values will be left out of the produced synthetic source. This fixes the issue by replacing the object with paths to each of its keys. These paths consist of the concatenation of all keys going down to a given scalar, joined by a period. For example, they are of the form foo.bar.baz. This applies recursively, so that every value within the object, no matter how nested, will be accessible through a full specified path.
2025-06-20 14:20:49 -05:00
Lorenzo Dematté 1edf77c1df
Mute testSnapshotRestore in bcUpgradeTest (#129767) 2025-06-20 19:04:09 +01:00
Sam Xiao 706e7f3b6f
Make Watch transport actions project-aware (#129612) 2025-06-21 01:09:40 +08:00
Jonathan Buttner d9b34d43a5
[ML] Custom service add support for input_type, top_n, and return_documents (#129441)
* Making progress on different request parameters

* Working tests

* Adding custom service validator for rerank

* Fixing embedding bug

* Adding transport version check

* Fixing tests

* Fixing license header

* Fixing writeTo

* Moving file and removing commented code

* Fixing test

* Fixing tests

* Refactoring and tests

* Fixing test
2025-06-20 12:23:48 -04:00
Niels Bauman a230165df3
Unmute #112189 (#129773)
This test has been muted for a long time and only seemed to fail on v7
compatability tests. We're unmuting this to see if it's still relevant.

Closes #112189
2025-06-21 01:25:52 +10:00
Ignacio Vera 4ca96c199f
Introduce a vectorize soarDistance function (#129744)
This commit replaces the method #soarResidual with a method call #soarDistance which perfoms better for computing soar distances.
2025-06-20 16:23:50 +02:00
elasticsearchmachine 1d913f3da4 Mute org.elasticsearch.xpack.security.SecurityRolesMultiProjectIT testUpdatingFileBasedRoleAffectsAllProjects #129775 2025-06-20 23:56:55 +10:00
Carlos Delgado 6952cf2b63
Add IVF feature flag to IT tests (#129766) 2025-06-20 23:47:01 +10:00
Luigi Dell'Aquila b0bd718b96
ES|QL: add generative tests for CHANGE_POINT command (#129758) 2025-06-20 15:28:42 +02:00
Luigi Dell'Aquila 539d8d4a8e
ES|QL: Add multi-node generative tests (#129765) 2025-06-20 15:28:30 +02:00
Ievgen Degtiarenko c1046b0704
Remove resolved mutes (#129750) 2025-06-20 15:13:52 +02:00
Iraklis Psaroudakis 2940f9a011
Accommodate hollow engine changes (#129535)
* Field infos calculation method inside Engine
* buildSeqNoStats as static public method

So it can be overriden in stateless if/as needed.

Relates ES-11457
2025-06-20 16:01:55 +03:00
Luke Whiting a952245178
Fix missing feature flag on Streams YAML tests (#129747)
* Fix missing feature flag on YAML tests

* Unmute tests
2025-06-20 21:12:51 +10:00
Ioana Tagirta 0c0bf6bd93
Resolve Keep plan added to FORK branches (#129754) 2025-06-20 13:07:18 +02:00
Luke Whiting 18c1e55eb3
Reserve transport version for streams endpoint backport (#129753) 2025-06-20 20:57:35 +10:00
Moritz Mack 9b2ca99ca4
Tolerate incompatible versions with different build hash (#128589)
Tolerate incompatible versions with different build hash.
I'm keeping the serverless feature flag to not create warnings there.

Relates to ES-11869
2025-06-20 12:16:32 +02:00
Jan Kuipers 586c3b28ca
fix esql categorize csv test for old JDKs (#129746) 2025-06-20 19:57:43 +10:00
elasticsearchmachine b2acda9a9f Mute org.elasticsearch.snapshots.SnapshotShutdownIT testSnapshotShutdownProgressTracker #129752 2025-06-20 19:12:39 +10:00
Ievgen Degtiarenko f2703ce461
Fix toIpInAgg & toIpInSort (#129743) 2025-06-20 11:01:27 +02:00
Antonio 39a3e80b51
[ResponseOps] Granting `kibana_system` role access to the cases analytics indices (#129414)
* Update kibana system user permissions for cases analytics index.

* [CI] Auto commit changes from spotless

* Add permissions for aliases.
Create constants for indexes and aliases.
Update tests.

* [CI] Auto commit changes from spotless

---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2025-06-20 11:00:13 +02:00
Julia Bardi 4aa83e262a
[Fleet] add index privileges to support reroute processor (#129692)
* add index privileges to support reroute processor

* [CI] Auto commit changes from spotless

* add more indices

* update

* fix test

* remove unnecessary indices

---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
2025-06-20 10:41:51 +02:00
Dimitris Rempapis bd4f5fee8d
ShardSearchStatsTests - add missing metrics and methods (#129311)
ShardSearchStatsTests - add missing metrics and methods - complete coverage
2025-06-20 11:32:52 +03:00
Nick Tindall f715f63137
Fix and unmute GetSnapshotsIT (#129741)
Closes: 129740
2025-06-20 18:29:47 +10:00
Ankita Kumar 9e19b85783
Metrics to account for time spent waiting for next chunk (#129469)
This PR addresses ES-12071.

We want to collect metrics for the time that is spent waiting for the next chunk of a bulk request. This can help with diagnosing high bulk latency in case the latency is attributable to external factors such as network connection.

Co-authored-by: Francisco Fernández Castaño <francisco.fernandez.castano@gmail.com>
2025-06-20 08:21:18 +02:00
elasticsearchmachine 73a5ce2071 Mute org.elasticsearch.snapshots.GetSnapshotsIT testFilterByState #129740 2025-06-20 15:27:03 +10:00
Yang Wang 0e83b19de7
Remove obsolete Metadata BWC for repositories (#129685)
When migrating RepositoriesMetadata from cluster custom to project
custom (#125398), we needed temporary BWC handling for clusters running
on a version that is before this change but after the initial MP change.
Such a cluster can only exist in the serverless environment which has
progressed way past any applicable versions. Therefore we no longer need
the BWC handling and this PR removes it.

Relates: #125398
2025-06-20 11:34:48 +10:00
Brendan Cully 4ce06d1aa2
Add deleteByQuery to InternalEngine (#129679) 2025-06-19 15:40:38 -07:00
elasticsearchmachine 0f2806a9c2 Mute org.elasticsearch.streams.StreamsYamlTestSuiteIT test {yaml=streams/logs/10_basic/Check for repeated toggle to same state} #129735 2025-06-20 07:33:59 +10:00
elasticsearchmachine 2e13fca5ee Mute org.elasticsearch.streams.StreamsYamlTestSuiteIT test {yaml=streams/logs/10_basic/Basic toggle of logs state enable to disable and back} #129733 2025-06-20 07:33:50 +10:00
Rene Groeschke 29db3f3464
[Build] Extract logsdb rolling-upgrade tests (#129673)
- introduce separate subproject for testing logsdb rolling-upgrade tests
- should reduce :qa:rolling-upgrade test task durations
2025-06-19 22:04:36 +02:00
Iván Cea Fontenla 2b8d9df0ef
[ESQL] Fix match phrase tests not being deterministic (#129724)
Closes https://github.com/elastic/elasticsearch/issues/129676
2025-06-20 03:13:17 +10:00
Luigi Dell'Aquila 0dba16384e
ES|QL: Fix generative tests (#129717) 2025-06-20 02:13:26 +10:00
Mikhail Berezovskiy eeca493860
Move HTTP content aggregation from Netty into RestController (#129302) 2025-06-19 09:05:17 -07:00
Albert Zaharovits 083326e658
Threadpool merge executor does not block aborted merges (#129613)
This PR addresses a bug where aborted merges are blocked if there's
insufficient disk space.

Previously, the merge disk space estimation did not consider if the
operation has been aborted when/while it was enqueued for execution.
Consequently, aborted merges, for e.g. when closing a shard, were
blocked if their disk space estimation was exceeding the available disk
space threshold. In this case, the shard close operation would itself
block.

This fix estimates a disk space budget of `0` for aborted merges, and it
periodically checks if any enqueued merge tasks have been aborted (more
generally, it checks if the budget estimate for any merge tasks has
changed, and reorders the queue if so). This way aborted merges are
prioritized and are never blocked.

Closes https://github.com/elastic/elasticsearch/issues/129335
2025-06-20 00:51:13 +10:00
Mary Gouseti ee5d652411
Increase node up timeout in AbstractLocalClusterFactory (#129639)
In the last two months a lot of tests were converted to use the newer rest test framework. Some tests start 1 node, other start 3 nodes, others even more, the framework runs tests in parallel but it doesn't know how many nodes its tests needs meaning that running 3 tests in parallel, for example, can be very different when they are single node clusters or 3 node clusters etc. During this execution we saw the 3x more CPU load than what we would want to have ideally.

Currently there is no good solution for this because if dial down the concurrency we will use the nodes inefficiently, but if we keep the concurrency to where it is we risk longer start up times. Considering that the starting time of elasticsearch is not related to this test, we choose to increase the timeout to reduce the noise.
2025-06-19 17:37:37 +03:00
Aurélien FOUCRET 34ccaba56d
[ES|QL] RERANK command - Updating the syntax and behavior (#129488) 2025-06-19 15:46:33 +02:00
elasticsearchmachine a0109bb0fe Mute org.elasticsearch.search.query.RescoreKnnVectorQueryIT testKnnSearchRescore #129713 2025-06-19 23:31:05 +10:00