This patch copies over existing metrics and adds new consumer group metrics to the new GroupCoordinatorService.
Now that each coordinator is responsible for a topic partition, this patch introduces a GroupCoordinatorMetrics that records gauges for global metrics such as the number of generic groups in the PreparingRebalance state. For GroupCoordinatorShard-specific metrics, GroupCoordinatorMetrics activates new GroupCoordinatorMetricsShards that are responsible for incrementing/decrementing TimelineLong objects; the totals are then aggregated across all shards.
As the CoordinatorRuntime/CoordinatorShard does not care about group metadata, we have introduced CoordinatorMetrics.java/CoordinatorMetricsShard.java so that transaction coordinator metrics can be onboarded in a similar fashion in the future.
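A minimal sketch of the shard aggregation idea (illustrative names; plain counters stand in for the TimelineLong-backed values and the real metrics registration is omitted):
```
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.atomic.AtomicLong;

// Each shard owns its own counters; the global metrics object sums them for gauges.
class MetricsShardSketch {
    final AtomicLong groupsPreparingRebalance = new AtomicLong();

    void incrementPreparingRebalance() { groupsPreparingRebalance.incrementAndGet(); }
    void decrementPreparingRebalance() { groupsPreparingRebalance.decrementAndGet(); }
}

class CoordinatorMetricsSketch {
    private final List<MetricsShardSketch> shards = new CopyOnWriteArrayList<>();

    // Called when a coordinator shard for a __consumer_offsets partition is activated/deactivated.
    void activateShard(MetricsShardSketch shard) { shards.add(shard); }
    void deactivateShard(MetricsShardSketch shard) { shards.remove(shard); }

    // A global gauge reads the sum across all active shards.
    long numGroupsPreparingRebalance() {
        return shards.stream().mapToLong(s -> s.groupsPreparingRebalance.get()).sum();
    }
}
```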
Main files to look at:
GroupCoordinatorMetrics.java
GroupCoordinatorMetricsShard.java
CoordinatorMetrics.java
CoordinatorMetricsShard.java
CoordinatorRuntime.java
Metrics to add after #14408 is merged:
offset deletions sensor (OffsetDeletions); Meter(offset-deletion-rate, offset-deletion-count)
Metrics to add after https://issues.apache.org/jira/browse/KAFKA-14987 is merged:
offset expired sensor (OffsetExpired); Meter(offset-expiration-rate, offset-expiration-count)
Reviewers: Justine Olshan <jolshan@confluent.io>
Instead of only sending failed log directory UUIDs in the heartbeat
request until a successful response is received, the broker sends
the full cumulative set of failed directories since startup time.
This aims to simplify the handling of log directory failures on the
controller side, given the overload-mode handling of heartbeat
requests, which returns an undifferentiated reply.
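A minimal sketch of the cumulative-set approach (hypothetical names): failed directory UUIDs are accumulated for the lifetime of the process, and the full set is included in every heartbeat rather than being dropped once a successful response is received.
```
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

class FailedDirTracker {
    private final Set<String> cumulativeFailedDirs = new HashSet<>();

    synchronized void onLogDirFailure(String dirUuid) {
        cumulativeFailedDirs.add(dirUuid); // accumulated since startup, never cleared
    }

    synchronized List<String> dirsForNextHeartbeat() {
        return new ArrayList<>(cumulativeFailedDirs); // always the full cumulative set
    }
}
```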
Reviewers: Colin P. McCabe <cmccabe@apache.org>, Proven Provenzano <pprovenzano@confluent.io>
This patch contains a few small clean-ups in LogValidator and associated classes:
1. Set shallowOffsetOfMaxTimestamp consistently as the last offset in the
batch for v2 compressed and non-compressed data.
2. Rename `RecordConversionStats` to `RecordValidationStats` since one of its
fields `temporaryMemoryBytes` does not depend on conversion.
3. Rename `batchIndex` to `recordIndex` in loops over the records in each batch
inside `LogValidator`.
Reviewers: Qichao Chu <5326144+ex172000@users.noreply.github.com>, Jun Rao <junrao@gmail.com>
When returning fetched records, the code was not properly honouring max.poll.records.
When records had been fetched for multiple topic-partitions, the intent was to accumulate records up to
max.poll.records in total. However, whenever it accumulated records from a partition, it was prepared to add
up to max.poll.records for that partition regardless of how many records it had already accumulated,
rather than reducing the number of records to add based on the records already added from other partitions.
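A minimal sketch of the corrected accumulation (hypothetical types and names): the number of records taken from each partition is capped by how many slots remain overall, not by max.poll.records per partition.
```
import java.util.ArrayList;
import java.util.List;

class MaxPollRecordsSketch {
    static <T> List<T> collectFetchedRecords(List<List<T>> recordsByPartition, int maxPollRecords) {
        List<T> result = new ArrayList<>(maxPollRecords);
        for (List<T> partitionRecords : recordsByPartition) {
            int remaining = maxPollRecords - result.size(); // slots left across all partitions
            if (remaining <= 0)
                break;
            result.addAll(partitionRecords.subList(0, Math.min(remaining, partitionRecords.size())));
        }
        return result;
    }
}
```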
I added a test case which failed before the code change in the PR and passes afterwards.
Reviewers: Kirk True <ktrue@confluent.io>, Justine Olshan <jolshan@confluent.io>
Building AssignReplicasToDirsRequestData relies on iteration over Map entries, which can produce an inconsistent ordering. The order does not matter to the semantics of the request, but it can cause issues with test assertions. This issue was introduced in #14369.
Reviewers: Divij Vaidya <diviv@amazon.com>, Ismael Juma <ismael@juma.me.uk>, David Jacot <djacot@confluent.io>
Updates for the client state machine and the Heartbeat and Commit managers, to support the states and transitions required as the member interacts with a consumer group based on heartbeat requests.
Includes support for the subscribe and unsubscribe API calls, which drive the full lifecycle of a member as it joins a group, reconciles assignments, commits offsets if needed, handles fencing and fatal errors, and leaves the group.
This also includes support for topic IDs, used only when needed for the reconciliation process.
Reviewers: Kirk True <ktrue@confluent.io>, Philip Nee <pnee@confluent.io>, Andrew Schofield <aschofield@confluent.io>, David Jacot <djacot@confluent.io>
When a node starts or restarts, we send a CREATE_TOPICS request to the controller to create the internal __remote_log_metadata topic.
Topic creation is a costly event handled by the controller. During a rebalance, the controller can have pending requests in its queue, which can lead to CREATE_TOPICS timeouts. Instead of firing the CREATE_TOPICS request when a node restarts, send a METADATA request (a topic describe), which is handled by the least-loaded node, before sending the request to create the topic.
Reviewers: Satish Duggana <satishd@apache.org>, Christo Lolov <lolovc@amazon.com>
The commit covers a few important points:
- Exception handling: We should throw RetriableCommitException when the commit exception is retriable. We should throw FencedIdException on commit and poll, similar to the current implementation. Other errors should be thrown as they are.
- Callback invocation: The callbacks need to be invoked on the main/application thread; however, the future is completed in the background thread. To achieve this, I created an Invoker class with a queue, so that these callbacks can be invoked during consumer.poll().
Note: One change I made is to remove the DefaultOffsetCommit callback. Since the callback is purely for logging, I think it is reasonable to move the logging to the background thread instead of relying on the application thread to trigger it.
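A minimal sketch of the Invoker idea (simplified, illustrative names): the background thread enqueues completed commit results and the application thread drains the queue during poll(), so callbacks always run on the application thread.
```
import java.util.Map;
import java.util.concurrent.ConcurrentLinkedQueue;
import org.apache.kafka.clients.consumer.OffsetAndMetadata;
import org.apache.kafka.clients.consumer.OffsetCommitCallback;
import org.apache.kafka.common.TopicPartition;

class OffsetCommitCallbackInvoker {
    private final ConcurrentLinkedQueue<Runnable> pendingInvocations = new ConcurrentLinkedQueue<>();

    // Called from the background thread when the commit future completes.
    void enqueue(OffsetCommitCallback callback,
                 Map<TopicPartition, OffsetAndMetadata> offsets,
                 Exception exception) {
        pendingInvocations.add(() -> callback.onComplete(offsets, exception));
    }

    // Called from the application thread inside consumer.poll().
    void invokeAll() {
        Runnable invocation;
        while ((invocation = pendingInvocations.poll()) != null) {
            invocation.run();
        }
    }
}
```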
Reviewers: Lucas Brutschy <lbrutschy@confluent.io>
We only move Java classes that have minimal or no dependencies on Scala classes in this PR.
Details:
* Configured `server` module in build files.
* Changed `ControllerRequestCompletionHandler` to be an interface since it has no implementations.
* Cleaned up various import control files.
* Minor build clean-ups for `server-common`.
* Disabled `testAssignmentAggregation` when executed with Java 8, this is an existing issue (see #14794).
For broader context on this change, please check:
* KAFKA-15852: Move server code from `core` to `server` module
Reviewers: Divij Vaidya <diviv@amazon.com>
The topic name was displayed as `Optional<String>` when the topic was created.
```
% bin/kafka-topics.sh --create --topic test --bootstrap-server localhost:9092
Created topic Optional[test].
```
This PR fixes the output to print the topic name as a `String` instead of an `Optional<String>`.
```
% bin/kafka-topics.sh --create --topic test --bootstrap-server localhost:9092
Created topic test.
```
Reviewers: Ismael Juma <ismael@juma.me.uk>
The legacy consumer detects any concurrent operations from different
threads. This was not enforced in the new consumer. To avoid inconsistencies
in the behavior, we enforce the same restriction in the new consumer.
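A minimal sketch of the kind of lightweight guard this refers to (illustrative names, modeled on the legacy consumer's acquire/release pattern):
```
import java.util.ConcurrentModificationException;
import java.util.concurrent.atomic.AtomicLong;

class SingleThreadAccessGuard {
    private static final long NO_THREAD = -1L;
    private final AtomicLong currentThread = new AtomicLong(NO_THREAD);
    private int refCount = 0; // supports re-entrant calls from the owning thread

    void acquire() {
        long threadId = Thread.currentThread().getId();
        if (threadId != currentThread.get() && !currentThread.compareAndSet(NO_THREAD, threadId))
            throw new ConcurrentModificationException("Consumer is not safe for multi-threaded access");
        refCount++;
    }

    void release() {
        if (--refCount == 0)
            currentThread.set(NO_THREAD);
    }
}
```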
Reviewers: Bruno Cadonna <cadonna@apache.org>
The state updater waits on a condition variable if no tasks exist that need to be updated. The condition variable is wrapped in a loop to account for spurious wake-ups, and the check for whether updating tasks exist is done in the condition of the loop. However, only the state updater thread can change whether updating tasks exist, and since the state updater thread is the one waiting on the condition variable, the check will always return the same value for as long as it is waiting. Thus, the check only needs to be done once, before entering the loop.
This commit moves the check before the loop, which also makes the usage of mocks more robust since the processing becomes more deterministic.
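A minimal, illustrative sketch of the resulting structure (hypothetical names, not the actual DefaultStateUpdater code): state that only the waiting thread itself can modify is checked once before the loop, while the loop only re-checks state that other threads can change.
```
import java.util.ArrayDeque;
import java.util.Queue;
import java.util.concurrent.locks.Condition;
import java.util.concurrent.locks.ReentrantLock;

class WaitSketch {
    private final ReentrantLock lock = new ReentrantLock();
    private final Condition tasksAdded = lock.newCondition();
    private final Queue<Object> inputQueue = new ArrayDeque<>(); // filled by other threads
    private int updatingTasks = 0;                               // modified only by this thread
    private volatile boolean running = true;

    void waitIfNoTasksToUpdate() throws InterruptedException {
        lock.lock();
        try {
            if (updatingTasks == 0) {                  // checked once: cannot change while we wait
                while (running && inputQueue.isEmpty()) {
                    tasksAdded.await();                // loop handles spurious wake-ups
                }
            }
        } finally {
            lock.unlock();
        }
    }
}
```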
Reviewers: Lucas Brutschy <lbrutschy@confluent.io>
When the group coordinator does not host any __consumer_offsets partitions, the existing ListGroup implementation won't schedule any operation, thus a `new CompletableFuture<>()` is returned directly and never gets completed. This patch fixes the issue.
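A minimal sketch of the fix (simplified, hypothetical signature): when nothing is scheduled, return an already-completed future instead of one that never completes.
```
import java.util.Collections;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.stream.Collectors;

class ListGroupsSketch {
    static CompletableFuture<List<String>> listGroups(List<CompletableFuture<List<String>>> shardFutures) {
        if (shardFutures.isEmpty()) {
            // Before the fix: return new CompletableFuture<>(); // never completed
            return CompletableFuture.completedFuture(Collections.emptyList());
        }
        return CompletableFuture.allOf(shardFutures.toArray(new CompletableFuture[0]))
            .thenApply(nothing -> shardFutures.stream()
                .flatMap(future -> future.join().stream())
                .collect(Collectors.toList()));
    }
}
```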
Reviewers: David Jacot <djacot@confluent.io>
https://github.com/apache/kafka/pull/14369 introduced a compilation error in ReplicaManagerConcurrencyTest for Scala 2.12.
Reviewers: Lucas Brutschy <lbrutschy@confluent.io>
A new AssignmentsManager accumulates, batches, and sends KIP-858
assignment events to the Controller. Assignments are sent via
AssignReplicasToDirs requests.
Move QuorumTestHarness.formatDirectories into TestUtils so it can be
used in other test contexts.
Fix a bug in ControllerRegistration.java where the wrong version of the
record was being generated in ControllerRegistration.toRecord.
Reviewers: Colin P. McCabe <cmccabe@apache.org>, Proven Provenzano <pprovenzano@confluent.io>, Omnia G H Ibrahim <o.g.h.ibrahim@gmail.com>
This KIP-951 commit was reverted to investigate the org.apache.kafka.tiered.storage.integration.ReassignReplicaShrinkTest test failure (#14738).
A fix for that was merged in #14757, hence unreverting this change.
This reverts commit a98bd7d.
Reviewers: Justine Olshan <jolshan@confluent.io>, Mayank Shekhar Narula <mayanks.narula@gmail.com>
This patch adds the concept of a "Full" UpdateMetadataRequest, similar to what is used in
LeaderAndIsr. A new tagged field is added to UpdateMetadataRequest at version 8 which allows the
KRaft controller to indicate if a UMR contains all the metadata or not. Since UMR is implicitly
treated as incremental by the ZK broker, we needed a way to detect topic deletions when the KRaft
broker sends a metadata snapshot to the ZK broker. By sending a "Full" flag, the broker can now
compare existing topic IDs to incoming topic IDs and calculate which topics should be removed from
the MetadataCache.
This patch only removes deleted topics from the MetadataCache. Partition/log management was
implemented in KAFKA-15605.
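A minimal sketch (with simplified types) of the deletion detection that the "Full" flag enables: any topic currently in the cache that is absent from a full request must have been deleted.
```
import java.util.HashSet;
import java.util.Set;

class FullMetadataDiffSketch {
    static Set<String> topicsToRemove(Set<String> cachedTopicIds, Set<String> topicIdsInFullRequest) {
        Set<String> deleted = new HashSet<>(cachedTopicIds);
        deleted.removeAll(topicIdsInFullRequest);
        return deleted; // remove these from the MetadataCache
    }
}
```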
Reviewers: Colin P. McCabe <cmccabe@apache.org>
A few variables are no longer used but still exist. This commit removes those unused variables.
Co-authored-by: Sagar Rao <sagarrao@Sagars-MacBook-Pro.local>
Reviewers: Lucas Brutschy <lbrutschy@confluent.io>
This patch fixes a bug in the LeaveGroupResponse construction. Basically, when a top-level error is set, no members are expected, but the current check always requires one for versions prior to version 3.
Reviewers: David Jacot <djacot@confluent.io>
The consumer refactoring project introduced another `Consumer` implementation, creating two different, coexisting implementations of the `Consumer` interface:
* `KafkaConsumer` (AKA "existing", "legacy" consumer)
* `PrototypeAsyncConsumer` (AKA "new", "refactored" consumer)
The goal of this task is to refactor the code via the delegation pattern so that we can keep a top-level `KafkaConsumer` but then delegate to another implementation under the covers. There will be two delegates at first:
* `LegacyKafkaConsumer`
* `AsyncKafkaConsumer`
`LegacyKafkaConsumer` is essentially a renamed `KafkaConsumer`. That implementation handles the existing group protocol. `AsyncKafkaConsumer` is renamed from `PrototypeAsyncConsumer` and will implement the new consumer group protocol from KIP-848. Both of those implementations will live in the `internals` sub-package to discourage their use.
This task is part of the work to implement support for the new KIP-848 consumer group protocol.
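A minimal sketch of the delegation pattern described above (greatly simplified: only two methods, and the delegate is injected rather than chosen from configuration as the real constructor would do):
```
import java.time.Duration;
import java.util.Collection;
import org.apache.kafka.clients.consumer.ConsumerRecords;

interface ConsumerDelegate<K, V> {
    void subscribe(Collection<String> topics);
    ConsumerRecords<K, V> poll(Duration timeout);
}

class DelegatingKafkaConsumer<K, V> {
    private final ConsumerDelegate<K, V> delegate;

    DelegatingKafkaConsumer(ConsumerDelegate<K, V> delegate) {
        // In the real code the delegate (legacy vs. async) is selected from configuration.
        this.delegate = delegate;
    }

    public void subscribe(Collection<String> topics) {
        delegate.subscribe(topics); // all public API calls forward to the chosen delegate
    }

    public ConsumerRecords<K, V> poll(Duration timeout) {
        return delegate.poll(timeout);
    }
}
```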
Reviewers: Philip Nee <pnee@confluent.io>, Andrew Schofield <aschofield@confluent.io>, David Jacot <djacot@confluent.io>
In 91fa196, I accidentally removed the action queue parameter that was added in 7d147cf. I also renamed the actionQueue so as not to confuse this in the future.
I don't think this broke anything since we don't use verification for group coordinator commits, but I should fix it to be as it was before.
Reviewers: Artem Livshits <alivshits@confluent.io>, Jason Gustafson <jason@confluent.io>
See the motivation in the JIRA description: https://issues.apache.org/jira/browse/KAFKA-15824
This was discovered when ReassignReplicaShrinkTest started to fail with the KIP-951 changes. The KIP-951 changes have since been reverted (PR) and will be put back once this is in.
Reviewers: Walker Carlson <wcarlson@apache.org>, Andrew Schofield <aschofield@confluent.io>
This test fails because `localhost` resolved to more than one IP on CI. This patch updates the ClusterConnectionStatesTest.testSingleIP test to use a static resolver.
Reviewers: Luke Chen <showuon@gmail.com>
This patch fixes the compilation error for JDK 21 introduced in https://github.com/apache/kafka/pull/14708.
Reviewers: Ismael Juma <ismael@juma.me.uk>, David Jacot <djacot@confluent.io>
This reverts commit f38b0d8.
Trying to find the root cause of org.apache.kafka.tiered.storage.integration.ReassignReplicaShrinkTest failing in CI.
Reviewers: Justine Olshan <jolshan@confluent.io>
This is a follow-up from #14659 that ports the windowed classes to use the StoreFactory abstraction as well. A side benefit is that the materialization code is no longer duplicated for each StreamImpl/CogroupedStreamImpl class.
Reviewers: Anna Sophie Blee-Goldman <ableegoldman@apache.org>, Matthias Sax <mjsax@apache.org>
This patch fixes a compilation error introduced by https://github.com/apache/kafka/pull/14392 for Scala 2.12.
```
> Task :core:compileScala
[Error] /home/jenkins/workspace/Kafka_kafka-pr_PR-14392/core/src/main/scala/kafka/server/BrokerLifecycleManager.scala:305:49: value incl is not a member of scala.collection.immutable.Set[org.apache.kafka.common.Uuid]
```
Reviewers: Luke Chen <showuon@gmail.com>
The latest metadata version is now 3.7. Fix the KRaft upgrade
test to upgrade to that version instead of 3.6.
Change the vagrant setup and gradle dependencies to use 3.3.2 instead of 3.3.1.
Reviewers: David Arthur <mumrah@gmail.com>
BrokerLifecycleManager should send the offline log directories in the BrokerHeartbeatRequests it
sends. Also, when handling BrokerHeartbeatResponses, do so by enqueuing a BrokerLifecycleManager
event, rather than trying to do the handling directly in the callback.
Reviewers: Colin P. McCabe <cmccabe@apache.org>, Proven Provenzano <pprovenzano@confluent.io>
Update BrokerRegistration to use a Builder. This fixes the proliferation of different constructors,
and makes it clear what arguments are being used where.
Reviewers: Colin P. McCabe <cmccabe@confluent.io>
meta.properties files are used by Kafka to identify log directories within the filesystem.
Previously, the code for handling them was in BrokerMetadataCheckpoint.scala. This PR rewrites the
code for handling them as Java and moves it to the apache.kafka.metadata.properties namespace. It
also gets rid of the separate types for v0 and v1 meta.properties objects. Having separate types
wasn't so bad back when we had a strict rule that zk clusters used v0 and kraft clusters used v1.
But ZK migration has blurred the lines. Now, a zk cluster may have either v0 or v1, if it is
migrating, and a kraft cluster may have either v0 or v1, at any time.
The new code distinguishes between an individual meta.properties file, which is represented by
MetaProperties, and a collection of meta.properties files, which is represented by
MetaPropertiesEnsemble. It is useful to have this distinction, because in JBOD mode, even if some
log directories are inaccessible, we can still use the ensemble to extract needed information like
the cluster ID. (Of course, even when not in JBOD mode, KRaft servers have always been able to
configure a metadata log directory separate from the main log directory.)
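A minimal sketch of the ensemble idea (simplified, illustrative names, not the actual MetaPropertiesEnsemble API): load whatever meta.properties files are readable and extract shared information such as the cluster ID from the accessible ones, even if some log directories are inaccessible.
```
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;
import java.util.Properties;

class MetaPropertiesEnsembleSketch {
    private final List<Properties> accessible = new ArrayList<>();

    void addLogDir(File logDir) {
        File metaFile = new File(logDir, "meta.properties");
        try (FileInputStream in = new FileInputStream(metaFile)) {
            Properties props = new Properties();
            props.load(in);
            accessible.add(props);
        } catch (IOException e) {
            // Inaccessible directories are tolerated; the rest of the ensemble is still usable.
        }
    }

    Optional<String> clusterId() {
        return accessible.stream()
            .map(p -> p.getProperty("cluster.id"))
            .filter(id -> id != null)
            .findFirst();
    }
}
```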
Since we recently added a unique directory.id to each meta.properties file, the previous convention
of passing a "canonical" MetaProperties object for the cluster around to various places in the code
needs to be revisited. After all, we can no longer assume all of the meta.properties files are the
same. This PR fixes these parts of the code. For example, it fixes the constructors of
ControllerApis and RaftManager to just take a cluster ID, rather than a MetaProperties object. It
fixes some other parts of the code, like the constructor of SharedServer, to take a
MetaPropertiesEnsemble object.
Another goal of this PR was to centralize meta.properties validation a bit more and make it
unit-testable. For this purpose, the PR adds MetaPropertiesEnsemble.verify, and a few other
verification methods. These enforce invariants like "the metadata directory must be readable," and
so on.
Reviewers: Igor Soarez <soarez@apple.com>, David Arthur <mumrah@gmail.com>, Divij Vaidya <diviv@amazon.com>, Proven Provenzano <pprovenzano@confluent.io>
The task-level commit metrics were removed without deprecation in KIP-447 and the corresponding PR #8218. However, we forgot to update the docs and to remove the code that creates the task-level commit sensor.
This PR removes the task-level commit metrics from the docs and removes the code that creates the task-level commit sensor. The code was effectively dead since it was only used in tests.
Reviewers: Guozhang Wang <wangguoz@gmail.com>, Matthias J. Sax <matthias@confluent.io>
When a task is corrupted, uncorrupted tasks are committed. That is also true for standby tasks. Committing standby tasks actually means that they are checkpointed.
When the state updater is enabled, standbys are owned by the state updater. The stream thread should not checkpoint them when handling corrupted tasks. That is not a big limitation since the state updater periodically checkpoints standbys anyway. Additionally, when handling corrupted tasks, the important thing is to commit active running tasks in order to abort the transaction. Committing standby tasks does not abort any transaction.
Reviewers: Matthias J. Sax <matthias@confluent.io>, Lucas Brutschy <lbrutschy@confluent.io>
With the new callback mechanism we were accidentally passing context with the wrong request local. Now include a RequestLocal as an explicit argument to the callback.
Also make the arguments passed through the callback clearer by separating the method out.
Added a test to ensure we use the request handler's request local and not the one passed in when the callback is executed via the request handler.
Reviewers: Ismael Juma <ismael@juma.me.uk>, Divij Vaidya <diviv@amazon.com>, David Jacot <djacot@confluent.io>, Jason Gustafson <jason@confluent.io>, Artem Livshits <alivshits@confluent.io>, Jun Rao <junrao@gmail.com>
Only advance the HWM for a partition if the ISR size is equal to or above the min ISR config. This patch also sets an upper bound on the min ISR config so it cannot exceed the number of replicas.
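A minimal sketch (hypothetical names) of the two rules described above:
```
final class IsrRules {
    // The high watermark may only advance when the ISR meets the min ISR.
    static boolean canAdvanceHighWatermark(int isrSize, int minIsr) {
        return isrSize >= minIsr;
    }

    // The min ISR config is capped so it never exceeds the number of replicas.
    static int effectiveMinIsr(int configuredMinIsr, int replicationFactor) {
        return Math.min(configuredMinIsr, replicationFactor);
    }
}
```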
Reviewers: David Arthur <mumrah@gmail.com>
Avoid calling cluster.partitionsForTopic(topic) if cluster.availablePartitionsForTopic(topic) is not empty
Reviewers: Divij Vaidya <diviv@amazon.com>, hudeqi <1217150961@qq.com>
This commit fixes a bug in ProduceRequest#partitionSizes() that may cause the method to incorrectly return an empty or incomplete result for a thread when another thread is in the process of initialising it.
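A minimal sketch of a safe lazy initialisation of this kind (illustrative, not the exact Kafka code): the map is fully populated into a local variable and published via a volatile field, so a concurrent reader never observes a partially built result.
```
import java.util.HashMap;
import java.util.Map;

class PartitionSizesSketch {
    private volatile Map<String, Integer> partitionSizes;

    Map<String, Integer> partitionSizes() {
        Map<String, Integer> result = partitionSizes;
        if (result == null) {
            synchronized (this) {
                result = partitionSizes;
                if (result == null) {
                    Map<String, Integer> computed = new HashMap<>();
                    computed.put("topic-0", 42); // placeholder for the real size computation
                    partitionSizes = result = computed; // publish only once fully built
                }
            }
        }
        return result;
    }
}
```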
Reviewers: Divij Vaidya <diviv@amazon.com>, hudeqi <1217150961@qq.com>, vamossagar12 <sagarmeansocean@gmail.com>
--------------------------------
Co-authored-by: fangxiaobing <fangxiaobing@kuaishou.com>
This PR sets up the necessary prerequisites to respect configurations such as dsl.default.store.type and dsl.store.suppliers.class (introduced in #14648) without requiring them to be passed in to StreamBuilder#new(TopologyConfig) (passing them only into new KafkaStreams(...)).
It essentially makes StoreBuilder an external-only API, while internally the StoreFactory equivalent is used. It replaces KeyValueStoreMaterializer with an implementation of StoreFactory that creates the store builder only after configure() is called (in a future PR we will create the missing equivalents for all of the places where the same thing happens for window stores, such as in all the *WindowKStreamImpl classes).
Reviewers: Anna Sophie Blee-Goldman <ableegoldman@apache.org>
TestUtils.createTopicWithAdmin calls waitForAllPartitionsMetadata which waits for partition(s) to be present in each brokers' metadata cache. This is a sufficient check in ZK mode because the controller sends an LISR request before sending an UpdateMetadataRequest which means that the partition in the ReplicaManager will be updated before the metadata cache.
In KRaft mode, the metadata cache is updated first, so the check may return before partitions and other metadata listeners are fully initialized.
Testing:
Insert a Thread.sleep(100) in BrokerMetadataPublisher.onMetadataUpdate after
// Publish the new metadata image to the metadata cache.
metadataCache.setImage(newImage)
and run EdgeCaseRequestTest.testProduceRequestWithNullClientId; the test will fail locally nearly deterministically. After the change(s), the test no longer fails.
Reviewers: Justine Olshan <jolshan@confluent.io>
The patch includes the following changes as part of KIP-966:
* Allow ISR shrink to empty
* Allow leader election with ELR members
* Allow electing the last known leader
Reviewers: Artem Livshits <alivshits@confluent.io>, David Arthur <mumrah@gmail.com>