elasticsearch

Commit Graph

Author	SHA1	Message	Date
Tim Vernum	4ff691f066	Merge revision `7fb6ca447a` into multi-project	2024-12-31 15:41:02 +11:00
Nikolaj Volgushev	257ad517d2	Bring back automaton minimization (#119309 ) The security codebase relies heavily on automata and caching these. The Lucene 10 upgrade removed automaton minimization which can result in a memory usage increase of >5x, esp. for roles with many application privileges. This PR brings back Automaton minimization to avoid the explosion in roles cache size. Relates: ES-10451	2024-12-31 01:52:58 +11:00
Yang Wang	fda1fa19d4	Merge main into multi-project	2024-12-13 12:15:25 +11:00
Kathleen DeRusso	c9a6a2c841	Add match support for semantic_text fields (#117839 ) * Added query name to inference field metadata * Fix build error * Added query builder service * Add query builder service to query rewrite context * Updated match query to support querying semantic text fields * Fix build error * Fix NPE * Update the POC to rewrite to a bool query when combined inference and non-inference fields * Separate clause for each inference index (to avoid inference ID clashes) * Simplify query builder service concept to a single default inference query * Rename QueryBuilderService, remove query name from inference metadata * Fix too many rewrite rounds error by injecting booleans in constructors for match query builder and semantic text * Fix test compilation errors * Fix tests * Add yaml test for semantic match * Add NodeFeature * Fix license headers * Spotless * Updated getClass comparison in MatchQueryBuilder * Cleanup * Add Mock Inference Query Builder Service * Spotless * Cleanup * Update docs/changelog/117839.yaml * Update changelog * Replace the default inference query builder with a query rewrite interceptor * Cleanup * Some more cleanup/renames * Some more cleanup/renames * Spotless * Checkstyle * Convert List<QueryRewriteInterceptor> to Map keyed on query name, error on query name collisions * PR feedback - remove check on QueryRewriteContext class only * PR feedback * Remove intercept flag from MatchQueryBuilder and replace with wrapper * Move feature to test feature * Ensure interception happens only once * Rename InterceptedQueryBuilderWrapper to AbstractQueryBuilderWrapper * Add lenient field to SemanticQueryBuilder * Clean up yaml test * Add TODO comment * Add comment * Spotless * Rename AbstractQueryBuilderWrapper back to InterceptedQueryBuilderWrapper * Spotless * Didn't mean to commit that * Remove static class wrapping the InterceptedQueryBuilderWrapper * Make InterceptedQueryBuilderWrapper part of QueryRewriteInterceptor * Refactor the interceptor to be an internal plugin that cannot be used outside inference plugin * Fix tests * Spotless * Minor cleanup * C'mon spotless * Test spotless * Cleanup InternalQueryRewriter * Change if statement to assert * Simplify template of InterceptedQueryBuilderWrapper * Change constructor of InterceptedQueryBuilderWrapper * Refactor InterceptedQueryBuilderWrapper to extend QueryBuilder * Cleanup * Add test * Spotless * Rename rewrite to interceptAndRewrite in QueryRewriteInterceptor * DOESN'T WORK - for testing * Add comment * Getting closer - match on single typed fields works now * Deleted line by mistake * Checkstyle * Fix over-aggressive IntelliJ Refactor/Rename * And another one * Move SemanticMatchQueryRewriteInterceptor.SEMANTIC_MATCH_QUERY_REWRITE_INTERCEPTION_SUPPORTED to Test feature * PR feedback * Require query name with no default * PR feedback & update test * Add rewrite test * Update server/src/main/java/org/elasticsearch/index/query/InnerHitContextBuilder.java Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co> --------- Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>	2024-12-12 16:55:00 +01:00
Niels Bauman	33f48b728f	Merge main into multi-project	2024-12-10 05:23:29 +00:00
Benjamin Trent	5e859d9301	Even better(er) binary quantization (#117994 ) This measurably improves BBQ by adjusting the underlying algorithm to an optimized per vector scalar quantization. This is a brand new way to quantize vectors. Instead of there being a global set of upper and lower quantile bands, these are optimized and calculated per individual vector. Additionally, vectors are centered on a common centroid. This allows for an almost 32x reduction in memory, and even better recall than before at the cost of slightly increasing indexing time. Additionally, this new approach is easily generalizable to various other bit sizes (e.g. 2 bits, etc.). While not taken advantage of yet, we may update our scalar quantized indices in the future to use this new algorithm, giving significant boosts in recall. The recall gains spread from 2% to almost 10% for certain datasets with an additional 5-10% indexing cost when indexing with HNSW when compared with current BBQ.	2024-12-10 03:06:27 +11:00
Niels Bauman	04da446e42	Merge main into multi-project	2024-12-04 23:18:13 +01:00
Niels Bauman	032b42fcf7	Make TransportLocalClusterStateAction wait for cluster to unblock (#117230 ) This will make `TransportLocalClusterStateAction` wait for a new state that is not blocked. This means we need a timeout (again). For consistency's sake, we're reusing the REST param `master_timeout` for this timeout as well. The only class that was using `TransportLocalClusterStateAction` was `TransportGetAliasesAction`, so its request needed to accept a timeout again as well.	2024-12-04 12:17:13 +01:00
Simon Cooper	73645b2daf	Merge remote-tracking branch 'upstream-main/main' into merge-main-031224	2024-12-03 15:48:16 +00:00
Benjamin Trent	6c2f6071b2	Refactor/bbq format (#117847 ) * Refactor bbq format to be contained in a package * fixing license headers * fixing module * fix style	2024-12-02 16:04:31 -05:00
Yang Wang	92867cdf50	Merge main into multi-project	2024-11-29 08:50:54 +11:00
John Verwolf	8350ff29ba	Extensible Completion Postings Formats (#111494 ) Allows the Completion Postings Format to be extensible by providing an implementation of the CompletionsPostingsFormatExtension SPIs.	2024-11-28 13:25:02 -08:00
Martijn van Groningen	6a4b68d263	Add source mode stats to MappingStats (#117463 )	2024-11-28 10:53:39 +01:00
Tim Vernum	4cfb619448	Merge main into multi-project	2024-11-19 18:22:02 +11:00
Simon Cooper	cc35f1dc6a	Remove transport versions fixup listener and associated code (#116941 )	2024-11-18 16:19:14 +00:00
Simon Cooper	c832572709	Remove some historical features (#116926 ) Historical features are now trivially true on v9 - so we can remove the features, and the check. Historical features do not affect cluster state, so this has no compatibility restrictions.	2024-11-18 14:33:05 +00:00
Niels Bauman	0edb9fa778	Merge remote-tracking branch 'public/main' into merge-main # Conflicts: # server/src/main/java/org/elasticsearch/action/search/TransportSearchShardsAction.java # server/src/main/java/org/elasticsearch/cluster/routing/allocation/AllocationStatsService.java # server/src/main/java/org/elasticsearch/gateway/GatewayMetaState.java # server/src/main/java/org/elasticsearch/plugins/Plugin.java # server/src/test/java/org/elasticsearch/gateway/GatewayMetaStateTests.java # server/src/test/java/org/elasticsearch/ingest/IngestMetadataTests.java	2024-11-18 10:53:12 +01:00
Alexis Charveriat	e0af1238fc	Index stats enhancement: creation date and tier_preference (#116339 ) * Expose tier preference as part of the index stats * Also expose index creation date in index stats * Added test	2024-11-15 09:08:42 +01:00
Tim Vernum	da5da54f3f	Merge main into multi-project	2024-11-06 16:05:33 +11:00
Patrick Doyle	338c0538b7	Dynamic entitlement agent (#116125 ) * Refactor: treat "maybe" JVM options uniformly * WIP * Get entitlement running with bridge all the way through, with qualified exports * Cosmetic changes to SystemJvmOptions * Disable entitlements by default * Bridge module comments * Fixup forbidden APIs * spotless * Rename EntitlementChecker * Fixup InstrumenterTests * exclude recursive dep * Fix some compliance stuff * Rename asm-provider * Stop using bridge in InstrumenterTests * Generalize readme for asm-provider * InstrumenterTests doesn't need EntitlementCheckerHandle * Better javadoc * Call parseBoolean * Add entitlement to internal module list * Docs as requested by Lorenzo * Changes from Jack * Rename ElasticsearchEntitlementChecker * Remove logging javadoc * exportInitializationToAgent should reference EntitlementInitialization, not EntitlementBootstrap. They're currently in the same module, but if that ever changes, this code would have become wrong. * Some suggestions from Mark --------- Co-authored-by: Ryan Ernst <ryan@iernst.net>	2024-11-06 00:07:52 +01:00
Nhat Nguyen	fa6c5296d4	Add num docs and size to logsdb telemetry (#116128 ) Follow-up on #115994 to add telemetry for the total number of documents and size in bytes of logsdb indices. Relates #115994	2024-11-05 08:46:46 -08:00
Tim Vernum	2ba2d2a995	Merge main into multi-project	2024-10-31 11:55:04 +11:00
Ying Mao	4ecdfbb214	[Inference API] Add API to get configuration of inference services (#114862 ) * Adding API to get list of service configurations * Update docs/changelog/114862.yaml * Fixing some configurations * PR feedback -> Stream.of * PR feedback -> singleton * Renaming ServiceConfiguration to SettingsConfiguration. Adding TaskSettingsConfiguration * Adding task type settings configuration to response * PR feedback	2024-10-30 13:29:58 -04:00
Tim Vernum	d4e4b5abb0	Merge main into multi-project	2024-10-22 13:03:12 +11:00
Luca Cavanna	8efd08b019	Upgrade to Lucene 10 (#114741 ) The most relevant ES changes that upgrading to Lucene 10 requires are: - use the appropriate IOContext - Scorer / ScorerSupplier breaking changes - Regex automaton are no longer determinized by default - minimize moved to test classes - introduce Elasticsearch900Codec - adjust slicing code according to the added support for intra-segment concurrency - disable intra-segment concurrency in tests - adjust accessor methods for many Lucene classes that became a record - adapt to breaking changes in the analysis area Co-authored-by: Christoph Büscher <christophbuescher@posteo.de> Co-authored-by: Mayya Sharipova <mayya.sharipova@elastic.co> Co-authored-by: ChrisHegarty <chegar999@gmail.com> Co-authored-by: Brian Seeders <brian.seeders@elastic.co> Co-authored-by: Armin Braun <me@obrown.io> Co-authored-by: Panagiotis Bailis <pmpailis@gmail.com> Co-authored-by: Benjamin Trent <4357155+benwtrent@users.noreply.github.com>	2024-10-21 13:38:23 +02:00
Tim Vernum	586d543918	Merge main into multi-project	2024-10-15 15:08:06 +11:00
Benjamin Trent	6c752abc23	Adding new bbq index types behind a feature flag (#114439 ) new index types of bbq_hnsw and bbq_flat which utilize the better binary quantization formats. A 32x reduction in memory, with nice recall properties.	2024-10-14 20:13:27 -04:00
Simon Cooper	f981d1f9e2	Merge remote-tracking branch 'upstream-main/main' into update-main-10-10-24	2024-10-10 13:27:33 +01:00
Mary Gouseti	0d03207477	Clean up factory retention settings from elasticsearch (#114396 ) This removes the possibility for a plugin to provide factory retention settings. Factory retention settings have been deprecated and completely replaced by #111972. Note: this feature is not in use. If someone wants to set global retention they can use the cluster settings as defined in #111972.	2024-10-10 11:45:46 +03:00
Simon Cooper	09f91cdaec	Merge branch 'main' into main-update-9-10-24	2024-10-09 17:08:05 +01:00
David Turner	07c3acf1c0	Remove cluster state from `/_cluster/reroute` response (#114231 ) Including the cluster state in responses to the `POST _cluster/reroute` API was deprecated in #90399 (v8.6.0) requiring callers to pass `?metric=none` to avoid the deprecation warning. This commit adjusts the behaviour as promised in v9 so that this API never returns the cluster state, and deprecates the `?metric` parameter itself. Closes #88978	2024-10-08 07:59:57 +01:00
Albert Zaharovits	90e9f03171	Merge main into multi-project	2024-10-01 14:03:54 +03:00
Chris Hegarty	32dde26e49	Upgrade to Lucene 9.12.0 (#113333 ) This commit upgrades to Lucene 9.12.0. Co-authored-by: Adrien Grand <jpountz@gmail.com> Co-authored-by: Armin Braun <me@obrown.io> Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com> Co-authored-by: Chris Hegarty <chegar999@gmail.com> Co-authored-by: John Wagster <john.wagster@elastic.co> Co-authored-by: Luca Cavanna <javanna@apache.org> Co-authored-by: Mayya Sharipova <mayya.sharipova@elastic.co>	2024-10-01 08:39:27 +01:00
Albert Zaharovits	71ccf2089f	Merge main into multi-project	2024-09-26 09:57:32 +03:00
David Turner	97db0c2182	Remove `{Indices,}ClusterStateUpdateRequest` (#113483 ) These abstract classes are now unused so this commit removes them.	2024-09-25 08:17:51 +01:00
Tim Vernum	ad6435dede	Merge main into multi-project	2024-09-25 12:49:01 +10:00
Simon Cooper	f9aa6f40cd	Always use CLDR locale on ES v9 (#113184 ) Regardless of JDK version, ES should always use CLDR locale database from 9.0.0. This also removes IsoCalendarDataProvider used to override week-date calculations for the root locale only.	2024-09-23 11:05:08 +01:00
Tim Vernum	f6458344ce	Merge main into multi-project	2024-09-23 11:36:52 +10:00
Ryan Ernst	2ecfb397ad	Remove plugin classloader indirection (#113154 ) Extensible plugins use a custom classloader for other plugin jars. When extensible plugins were first added, the transport client still existed, and elasticsearch plugins did not exist in the transport client (at least not the ones that create classloaders). Yet the transport client still created a PluginsService. An indirection was used to avoid creating separate classloaders when the transport client had created the PluginsService. The transport client was removed in 8.0, but the indirection still exists. This commit removes that indirection layer.	2024-09-20 07:45:40 -07:00
Niels Bauman	c41ed527b3	Merge main into multi-project	2024-09-14 10:52:45 +02:00
Mark Vieira	a59c182f9f	Add AGPLv3 as a supported license	2024-09-13 15:29:46 -07:00
Albert Zaharovits	62e93779d8	Merge main into multi-project	2024-08-29 17:46:47 +03:00
Patrick Doyle	50871a3d28	New injector (#111722 ) * Initial new injector * Allow createComponents to return classes * Downsample injection * Remove more vestiges of subtype handling * Lowercase logger * Respond to code review comments * Only one object per class * Some additional cleanup incl spotless * PR feedback * Missed one * Rename workQueue * Remove Injector.addRecordContents * TelemetryProvider requires us to inject an object using a supertype * Address Simon's comments * Clarify the reason for SuppressForbidden * Make log indentation code less intrusive	2024-08-28 11:13:47 -04:00
Tim Vernum	a100bc3131	Merge main into multi-project	2024-08-28 20:22:59 +10:00
David Turner	f150e2c11d	Add telemetry for repository usage (#112133 ) Adds to the `GET _cluster/stats` endpoint information about the snapshot repositories in use, including their types, whether they are read-only or read-write, and for Azure repositories the kind of credentials in use.	2024-08-27 23:34:02 +10:00
Panagiotis Bailis	b685a436ce	Adding RankDocsRetrieverBuilder and RankDocsQuery (#111709 )	2024-08-26 15:18:47 +03:00
Patrick Doyle	35a375329a	Move Guice to org.elasticsearch.injection.guice (#111723 ) * Move files and fix imports & module exports * Other consequences of moving Guice	2024-08-12 10:47:46 -04:00
Albert Zaharovits	7e73a1ad2c	Initial changes to make ILM go through the ProjectResolver for project metadata (MP-1589) The initial goal of this PR was to make the "put ILM" action go through the project resolver in order to resolve the project-scoped metadata, and hence avoid referring to the whole cluster state. This implies changing some methods to work on the project metadata rather than the whole cluster metadata. It turns out, due to good code reuse, it is hard to only change one specific action to only refer to project-scoped metadata.	2024-08-01 17:32:29 +03:00
Keith Massey	a2814e816b	Adding mapping validation to the simulate ingest API (#110606 )	2024-07-19 08:08:21 -05:00
Joe Gallo	27e7601698	Directly download commercial ip geolocation databases from providers (#110844 ) Co-authored-by: Keith Massey <keith.massey@elastic.co>	2024-07-17 20:55:14 -04:00

150 Commits