kafka

Commit Graph

Author	SHA1	Message	Date
Mickael Maison	d183cf9ac1	KAFKA-18172 Move RemoteIndexCacheTest to the storage module (#19469 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-04-15 15:53:41 +08:00
Mickael Maison	5f2a68b150	KAFKA-19119 Move ApiVersionManager/SimpleApiVersionManager to server (#19426 ) Reviewers: Ken Huang <s7133700@gmail.com>, PoAn Yang <payang@apache.org>, Chia-Ping Tsai <chia7712@gmail.com>	2025-04-15 14:32:44 +08:00
Milly	80b209d2a0	MINOR: remove unused parameter from KafkaMetadataLog (#19458 ) 1. Remove unused parameter from KafkaMetadataLog. 2. Give Utils.closeQuietly a meaningful name when closing reader. Reviewers: TengYao Chi <kitingiao@gmail.com>, Ken Huang <s7133700@gmail.com>, Jhen-Yung Hsu <jhenyunghsu@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com> --------- Co-authored-by: TengYao Chi <kitingiao@gmail.com>	2025-04-15 10:26:55 +08:00
PoAn Yang	b18f00b449	KAFKA-19121 Move AddPartitionsToTxnConfig and TransactionStateManagerConfig out of KafkaConfig (#19439 ) Both AddPartitionsToTxnConfig and TransactionStateManagerConfig are static configs and they don't have specific config check. We can move them out of KafkaConfig to simplify KafkaConfig. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-04-15 01:16:30 +08:00
PoAn Yang	8827ce4701	KAFKA-19113: Migrate DelegationTokenManager to server module (#19424 ) 1. Migrate DelegationTokenManager to server module. 2. Rewrite DelegationTokenManager in Java. 3. Move DelegationTokenManagerConfigs out of KafkaConfig. Reviewers: Mickael Maison <mickael.maison@gmail.com>	2025-04-14 16:49:45 +02:00
Azhar Ahmed	4cdd4b617c	KAFKA-19071: Fix doc for remote.storage.enable (#19345 ) As of 3.9, Kafka allows disabling remote storage on a topic after it was enabled. It allows subsequent enabling and disabling too. However the documentation says otherwise and needs to be corrected. Doc: https://kafka.apache.org/39/documentation/#topicconfigs_remote.storage.enable Reviewers: Luke Chen <showuon@gmail.com>, PoAn Yang <payang@apache.org>, Ken Huang <s7133700@gmail.com>	2025-04-14 11:08:49 +08:00
Parker Chang	c8fe551139	KAFKA-19030 Remove metricNamePrefix from RequestChannel (#19374 ) As described in the JIRA ticket, `controlPlaneRequestChannelOpt` was removed from KRaft mode, so there's no need to use the metrics prefix anymore. This change removes `metricNamePrefix` from RequestChannel and the related files. It also removes `DataPlaneAcceptor#MetricPrefix`, since `DataPlaneAcceptor` is the only implementation of `Acceptor`. Since the implementation of KIP-291 is essentially removed, we can also remove `logAndThreadNamePrefix` and `DataPlaneAcceptor#ThreadPrefix`. Reviewers: PoAn Yang <payang@apache.org>, Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-04-12 23:22:40 +08:00
Dmitry Werner	7863b35064	KAFKA-14485: Move LogCleaner to storage module (#19387 ) Move LogCleaner and related classes to storage module and rewrite in Java. Reviewers: Mickael Maison <mickael.maison@gmail.com>, Jun Rao <junrao@gmail.com>	2025-04-11 09:21:05 -07:00
Andrew Schofield	21a080f08c	KAFKA-16894: Define feature to enable share groups (#19293 ) This PR proposes a switch to enable share groups for 4.1 (preview) and 4.2 (GA). * `share.version=1` to indicate that share groups are enabled. This is used as the switch for turning share groups on and off. In 4.1, the default will be `share.version=0`. Then a user wanting to evaluate the preview of KIP-932 would use `bin/kafka-features.sh --bootstrap.server xxxx upgrade --feature share.version=1`. In 4.2, the default will be `share.version=1`. Reviewers: Jun Rao <junrao@gmail.com>	2025-04-11 12:14:38 +01:00
PoAn Yang	34a87d3477	KAFKA-19042 Move TransactionsWithMaxInFlightOneTest to client-integration-tests module (#19289 ) Use Java to rewrite `TransactionsWithMaxInFlightOneTest` by new test infra and move it to client-integration-tests module. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-04-11 12:04:19 +08:00
Ken Huang	588d107ec2	KAFKA-19101 Remove ControllerMutationQuotaManager#throttleTimeMs unused parameter (#19410 ) It seems `timeMs` this parameter never used in Kafka project, the method init commit is `b5f90daf13` Reviewers: Jhen-Yung Hsu <jhenyunghsu@gmail.com>, PoAn Yang <payang@apache.org>, TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-04-11 11:31:08 +08:00
Sushant Mahajan	c3b7aa6e64	KAFKA-18170: Add create and write timestamp fields in share snapshot [1/N] (#19432 ) * We wish to track the time of creation of the `ShareSnapshot` records so that automated jobs could force their creation if a share partition has gone cold (no updates for a specified time interval). * To accomplish this, we have added 2 new fields `CreateTimestamp` and `WriteTimestamp` in the `ShareSnapshot` record. * The former tracks snapshot creation due to regular RPC calls while the latter will track snapshots created by periodic jobs. * In this PR we have made the requisite changes. * This is a first of a series of PRs to create the automated jobs and associated scaffolding. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-04-10 15:56:58 +01:00
TengYao Chi	b649b1ed5d	KAFKA-18935: Ensure brokers do not return null records in FetchResponse (#19167 ) JIRA: KAFKA-18935 This patch ensures the broker will not return null records in FetchResponse. For more details, please refer to the ticket. Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>, Jun Rao <junrao@gmail.com>	2025-04-10 22:21:00 +08:00
Lucas Brutschy	6430fb5d45	MINOR: Add note that streams groups are in early access (#19434 ) Add a note to the group protocol configuration that streams groups are in early access and should not be used in production. Also update an outdated comment related to disabling the protocol. Reviewers: Bruno Cadonna <cadonna@apache.org>	2025-04-10 13:46:31 +02:00
Abhinav Dixit	699ae1b75b	KAFKA-16729: Support isolation level for share consumer (#19261 ) This PR adds the share group dynamic config `share.isolation.level`. Until now, share groups only supported `READ_UNCOMMITTED` isolation level type. With this PR, we aim to support `READ_COMMITTED` isolation type to share groups. Reviewers: Andrew Schofield <aschofield@confluent.io>, Jun Rao <junrao@gmail.com>, Apoorv Mittal <apoorvmittal10@gmail.com>	2025-04-10 09:00:03 +01:00
PoAn Yang	56591d2d07	KAFKA-19090: Move DelayedFuture and DelayedFuturePurgatory to server module (#19390 ) Rewrite these classes in Java and move them to the server module Reviewers: Mickael Maison <mickael.maison@gmail.com>	2025-04-09 11:52:56 +02:00
Chirag Wadhwa	5148174196	KAFKA-16718-2/n: KafkaAdminClient and GroupCoordinator implementation for DeleteShareGroupOffsets RPC (#18976 ) This PR contains the implementation of KafkaAdminClient and GroupCoordinator for DeleteShareGroupOffsets RPC. - Added `deleteShareGroupOffsets` to `KafkaAdminClient` - Added implementation for `handleDeleteShareGroupOffsetsRequest` in `KafkaApis.scala` - Added `deleteShareGroupOffsets` to `GroupCoordinator` as well. internally this makes use of `persister.deleteState` to persist the changes in share coordinator Reviewers: Andrew Schofield <aschofield@confluent.io>, Sushant Mahajan <smahajan@confluent.io>	2025-04-09 07:31:06 +01:00
Nick Guo	43e22ef5d6	KAFKA-19093 Change the "Handler on Broker" to "Handler on Controller" for controller server (#19384 ) > INFO [data-plane Kafka Request Handler on Broker 3000], Resizing request handler thread pool size from 8 to 10 (kafka.server.KafkaRequestHandlerPool) it should be "Controller" rather than "Broker" Reviewers: Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-04-09 14:09:36 +08:00
Ken Huang	2f086d188f	KAFKA-18892: Add KIP-877 support for ClientQuotaCallback (#19068 ) Allow ClientQuotaCallback to implement Monitorable and register metrics. Reviewers: Mickael Maison <mickael.maison@gmail.com>, TaiJuWu <tjwu1217@gmail.com>, Jhen-Yung Hsu <jhenyunghsu@gmail.com>	2025-04-08 16:58:29 +02:00
Xuan-Zhang Gong	375ed19fba	KAFKA-19100: Use ProcessRole instead of String in AclApis (#19406 ) Use the ProcessRole enum instead of hardcoding the role Reviewers: Mickael Maison <mickael.maison@gmail.com>, PoAn Yang <poan.yang@suse.com>, Jhen-Yung Hsu <jhenyunghsu@gmail.com>, Ken Huang <s7133700@gmail.com>	2025-04-08 11:09:55 +02:00
Nick Guo	fcf6da0a0d	KAFKA-19098 Remove `lastOffset` from PartitionResponse (#19398 ) The `lastOffset` is not used actually, so it can be removed. Reviewers: Jhen-Yung Hsu <jhenyunghsu@gmail.com>, Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-04-08 00:06:02 +08:00
Stanislav Kozlovski	b8c095074d	MINOR: Rename RemoteLogStorageManager variable to RemoteStorageManager (#19401 ) This patch renames the KIP-405 Plugin variable from `remoteLogStorageManager` to `remoteStorageManager`. After [writing about it](https://aiven.io/blog/apache-kafka-tiered-storage-in-depth-how-writes-and-metadata-flow), I realized I got swayed by the code and called the component incorrectly - the official name doesn't have `Log` in it. I thought i'd go ahead and change the code so it's consistent with the naming too Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-04-08 00:02:38 +08:00
Sanskar Jhajharia	2ae4ffb5e0	MINOR: Cleanup Core Module (#19372 ) Now that Kafka Brokers support Java 17, this PR makes some changes in core module. The changes in this PR are limited to only the Java files in the Core module. Scala related changes may follow next. The changes mostly include: - Collections.emptyList(), Collections.singletonList() and Arrays.asList() are replaced with List.of() - Collections.emptyMap() and Collections.singletonMap() are replaced with Map.of() - Collections.singleton() is replaced with Set.of() - Some changes to use enhanced switch statement. Reviewers: Andrew Schofield <aschofield@confluent.io>, PoAn Yang <payang@apache.org>, Ken Huang <s7133700@gmail.com>	2025-04-07 16:57:52 +01:00
Hong-Yi Chen	6dd2cc70c3	MINOR: Clean up comments and remove unused code in RecordVersion and CreateTopicsRequestTest (#19342 ) ## Summary This PR updates the `RecordVersion` javadoc for clarity. It removes outdated references to `message.format.version` mentioned in the [Kafka 4.0 upgrade documentation](`48f06981ee/40/upgrade.html (L135)`) and aligns with feedback from a previous discussion in [#19325 ](https://github.com/apache/kafka/pull/19325). ## Changes - Cleaned up javadoc in `RecordVersion` - Removed outdated or deprecated references Reviewers: PoAn Yang <payang@apache.org>, Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-04-07 07:47:06 +08:00
TengYao Chi	6d68f8a82c	MINOR: Move BrokerReconfigurable to the sever-common module (#19383 ) This patch moves `BrokerReconfigurable` to the `server-common module` and decouples the `TransactionLogConfig` and `KafkaConfig` to unblock KAFKA-14485. Reviewers: PoAn Yang <payang@apache.org>, TaiJuWu <tjwu1217@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-04-07 07:39:01 +08:00
Gaurav Narula	3f0e14a3e8	MINOR: rename metric variable name in Processor#accept (#19361 ) `Processor#accept` accepts a metric which tracks the amount of time for which the Acceptor thread was blocked. It's misleading to name it `acceptorIdlePercentMeter` and this change updates its naming to align with the call site. Reviewers: PoAn Yang <payang@apache.org>, Chia-Ping Tsai <chia7712@gmail.com>	2025-04-05 22:54:21 +08:00
PoAn Yang	3d96b20630	KAFKA-19042 Move TransactionsExpirationTest to client-integration-tests module (#19288 ) Use Java to rewrite `TransactionsExpirationTest` by new test infra and move it to client-integration-tests module. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-04-05 20:01:31 +08:00
Mickael Maison	08a93fe12a	KAFKA-14523: Move DelayedRemoteListOffsets to the storage module (#19285 ) Decouple RemoteLogManager and ReplicaManager. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-04-05 19:51:13 +08:00
Abhinav Dixit	30c511d640	KAFKA-19085: SharePartitionManagerTest testMultipleConcurrentShareFetches throws silent exception and works incorrectly (#19370 ) The test `testMultipleConcurrentShareFetches` is throwing a silent exception. `ERROR Error processing delayed share fetch request (kafka.server.share.DelayedShareFetch:225)` This is due to incomplete mocks setup for the test and also requires changes in timeout. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-04-04 20:20:47 +01:00
Andrew Schofield	d4d9f11816	KAFKA-18761: [2/N] List share group offsets with state and auth (#19328 ) This PR approaches completion of Admin.listShareGroupOffsets() and kafka-share-groups.sh --describe --offsets. Prior to this patch, kafka-share-groups.sh was only able to describe the offsets for partitions which were assigned to active members. Now, the Admin.listShareGroupOffsets() uses the persister's knowledge of the share-partitions which have initialised state. Then, it uses this list to obtain a complete set of offset information. The PR also implements the topic-based authorisation checking. If Admin.listShareGroupOffsets() is called with a list of topic-partitions specified, the authz checking is performed on the supplied list, returning errors for any topics to which the client is not authorised. If Admin.listShareGroupOffsets() is called without a list of topic-partitions specified, the list of topics is discovered from the persister as described above, and then the response is filtered down to only show the topics to which the client is authorised. This is consistent with other similar RPCs in the Kafka protocol, such as OffsetFetch. Reviewers: David Arthur <mumrah@gmail.com>, Sushant Mahajan <smahajan@confluent.io>, Apoorv Mittal <apoorvmittal10@gmail.com>	2025-04-04 13:25:19 +01:00
Abhinav Dixit	98c0f3024d	MINOR: Added trace logs to help debug SharePartition (#19358 ) Added `trace` logs to help debug `nextFetchOffset` functionality within SharePartition. We did not have a way to figure out the fetch offsets of a share partition through logs. Forward moving `fetchOffset` confirms that the consumption from a given share partition is happening correctly on the broker. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-04-04 09:52:17 +01:00
TaiJuWu	f1bb29b93a	MINOR: migrate BrokerCompressionTest to storage module (#19277 ) There are two change for this PR. 1. Move `BrokerCompressionTest ` from core to storage 2. Rewrite `BrokerCompressionTest ` from scala to java Reviewers: TengYao Chi <kitingiao@gmail.com>, PoAn Yang <payang@apache.org>, Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-04-03 22:43:42 +08:00
Bruno Cadonna	6ef42d1524	MINOR: Deduplicate topics of a topology for authorization check (#19352 ) With the new Streams rebalance protocol, the Streams client sends the topology with the used topics to the broker for initialization. For the initialization the broker needs to describe the topics in the topology and consequently the Streams application needs to be authorized to describe the topics. The broker checks the authorization by filtering the topics in the topology by authorization. This filtering implicitly deduplicates the topics of the topology if they appear multiple times in the topology send to the brokers. After that the broker compares the size of the authorized topics with the topics in the topology. If the authorized topics are less than the topics in the topology a TOPIC_AUTHORIZATION_FAILED error is returned. In Streams a topology that is sent to the brokers likely has duplicate topics because a repartition topic appears as a sink for one subtopology and as a source for another subtopology. This commit deduplicates the topics of a topology before the verification of the authorization. Reviewers: Lucas Brutschy <lbrutschy@confluent.io>	2025-04-03 13:39:55 +02:00
PoAn Yang	be80e3cb8a	KAFKA-18923: resource leak in RSM fetchIndex inputStream (#19111 ) Fix resource leak in RSM inputStream. Reviewers: Luke Chen <showuon@gmail.com>	2025-04-03 15:18:05 +08:00
PoAn Yang	5c01fd0b76	KAFKA-18949 add consumer protocol to testDeleteRecordsAfterCorruptRecords (#19317 ) The `PlaintextAdminIntegrationTest#testDeleteRecordsAfterCorruptRecords` was only enabled for classic protocol. Add consumer protocol to it. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-04-03 13:24:25 +08:00
Xuan-Zhang Gong	2994e5eff3	KAFKA-19004 Move DelayedDeleteRecords to server-common module (#19226 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-04-03 00:06:27 +08:00
Ismael Juma	ccf2510fdd	MINOR: Remove dead code `maybeWarnIfOversizedRecords` (#19316 ) The `metadataVersionSupplier` is unused after this - remove it. Also remove redundant `metadataVersion.fetchRequestVersion >= 13` check in `RemoteLeaderEndPoint` - the minimum version returned by this method is `13`. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-04-01 06:36:25 -07:00
Chirag Wadhwa	0c97338959	KAFKA-18796-2: Corrected the check for acquisition lock timeout in Sh… (#19338 ) Minor PR to correct the check for the presence of acquisition lock in `assertionFailedMessage` method in `SharePartitionTest` Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-04-01 13:49:47 +01:00
Apoorv Mittal	4aa81204ff	KAFKA-19018,KAFKA-19063: Implement maxRecords and acquisition lock timeout in share fetch request and response resp. (#19334 ) PR add `MaxRecords` to share fetch request and also adds `AcquisitionLockTimeout` to share fetch response. PR also removes internal broker config of `max.fetch.records`. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-04-01 12:23:06 +01:00
David Jacot	d038f44848	MINOR: Small cleanups in ReplicaManager (#19322 ) This is a small follow-up of https://github.com/apache/kafka/pull/19290. The `actionQueue` argument is only used by the `CoordinatorPartitionWriter` so we can remove it from the other methods now. Reviewers: Jeff Kim <jeff.kim@confluent.io>	2025-03-31 07:39:40 -07:00
TengYao Chi	20546930ae	KAFKA-19042 Move ConsumerTopicCreationTest to client-integration-tests module (#19283 ) This patch moves `ConsumerTopicCreationTest` to the `client-integration-tests` and rewrite it as Java. The patch also streamlines the test flow. In the Scala version, there is a producer that produces messages, but this is not the main purpose of the `ConsumerTopicCreationTest`. Reviewers: Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-31 20:15:54 +08:00
Dmitry Werner	4144290335	MINOR: Cleanup metadata module (#18937 ) Removed unused code and fixed IDEA warnings. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-31 17:46:21 +08:00
PoAn Yang	4a5ae144ea	KAFKA-19032 Remove TestInfoUtils.TestWithParameterizedQuorumAndGroupProtocolNames (#19270 ) The zookeeper mode was removed in 4.0. The test cases don't need to specify quorum. Following variable and functions can be replaced: - TestWithParameterizedQuorumAndGroupProtocolNames - getTestQuorumAndGroupProtocolParametersClassicGroupProtocolOnly - getTestQuorumAndGroupProtocolParametersConsumerGroupProtocolOnly - getTestQuorumAndGroupProtocolParametersAll Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-30 02:11:07 +08:00
PoAn Yang	c125cc7dd1	KAFKA-19036 Rewrite LogAppendTimeTest and move it to storage module (#19282 ) Use Java to rewrite `LogAppendTimeTest` by new test infra and move it to storage module. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-29 03:14:53 +08:00
Nick Guo	9292a22606	KAFKA-19049 Remove the `@ExtendWith(ClusterTestExtensions.class)` from code base (#19299 ) jira: https://issues.apache.org/jira/browse/KAFKA-19049 [KAFKA-18617](https://issues.apache.org/jira/browse/KAFKA-18617) introduced the mechanism to inject the cluster test at runtime, so the integration tests don't need to use `@ExtendWith(ClusterTestExtensions.class)` any more. Reviewers: PoAn Yang <payang@apache.org>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-29 02:15:16 +08:00
David Jacot	28de78bcba	MINOR: Refactor GroupCoordinator write path (#19290 ) This patch addresses a weirdness on the GroupCoordinator write path. The `CoordinatorPartitionWriter` uses the `ReplicaManager#appendRecords` method with `acks=1` and it expects it to completes immediately/synchronously. It works because this is effectively what the method does with `acks=1`. The issue is that fundamentally the method is asynchronous so the contract is really fragile. This patch changes it by introducing new method `ReplicaManager.appendRecordsToLeader`, which is synchronous. It also refactors `ReplicaManager#appendRecords` to use `ReplicaManager.appendRecordsToLeader` so we can benefits from all the existing tests. Reviewers: Fred Zheng <fzheng@confluent.io>, Jeff Kim <jeff.kim@confluent.io>	2025-03-27 08:58:47 -07:00
Lucas Brutschy	2267902b40	MINOR: Mark streams RPCs as unstable (#19292 ) Streams groups RPCs are not enabled by default, but they should also be marked as unstable. Reviewers: Bruno Cadonna <cadonna@apache.org>	2025-03-27 14:22:01 +01:00
David Jacot	9e42b76147	MINOR: Some cleanups in group coordinator's intergration tests (#19281 ) This patch applies a few cleanups to uniformize how group coordinator's integration tests are setup. Reviewers: Lianet Magrans <lmagrans@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-27 06:06:36 -07:00
Dmitry Werner	84b8fec089	KAFKA-14486 Move LogCleanerManager to storage module (#19216 ) Move LogCleanerManager and related classes to storage module and rewrite in Java. Reviewers: TengYao Chi <kitingiao@gmail.com>, Jun Rao <junrao@gmail.com>, Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-27 12:35:38 +08:00
Sushant Mahajan	eb88e78373	KAFKA-18827: Initialize share group state group coordinator impl. [3/N] (#19026 ) * This PR adds impl for the initialize share groups call from the Group Coordinator perspective. * The initialize call on persister instance will be invoked by the `GroupCoordinatorService`, based on the response of the `GroupCoordinatorShard.shareGroupHeartbeat`. If there is new topic subscription or member assignment change (topic paritions incremented), the delta share partitions corresponding to the share group in question are returned as an optional initialize request. * The request is then sent to the share coordinator as an encapsulated timer task because we want the heartbeat response to go asynchronously. * Tests have been added for `GroupCoordinatorService` and `GroupMetadataManager`. Existing tests have also been updated. * A new formatter `ShareGroupStatePartitionMetadataFormatter` has been added for debugging. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-03-26 19:40:23 +00:00
Vikas Singh	56d1dc1b6e	MINOR: Use readable interface to parse requests (#19163 ) The generated request data type's constructors take Readable as an input. However, the parse method in the AbstractRequest takes a ByteBuffer as input. So to create the corresponding request data objects, each individual concrete Request classes wraps the ByteBuffer into a ByteBufferAccessor. This is boilerplate code present in all the concrete request classes. This changes AbstractRequest's parse method so that subclasses can simply pass the `Readable` they get directly to request data classes. The same change is made to the serialize method to maintain symmetry. Reviewers: Ismael Juma <ismael@juma.me.uk>, José Armando García Sancio <jsancio@apache.org>, Artem Livshits <alivshits@confluent.io>, Truc Nguyen <trnguyen@confluent.io>	2025-03-26 10:13:13 -04:00
ClarkChen	1547204baa	KAFKA-18914 Migrate ConsumerRebootstrapTest to use new test infra (#19154 ) Migrate ConsumerRebootstrapTest to the new test infra and remove the old Scala test. The PR changed three things. * Migrated `ConsumerRebootstrapTest` to new test infra and removed the old Scala test. * Updated the original test case to cover rebootstrap scenarios. * Integrated `ConsumerRebootstrapTest` into `ClientRebootstrapTest` in the `client-integration-tests` module. * Removed the `RebootstrapTest.scala`. Default `ConsumerRebootstrap` config: > properties.put(CommonClientConfigs.METADATA_RECOVERY_STRATEGY_CONFIG, "rebootstrap"); properties.put(CommonClientConfigs.METADATA_RECOVERY_REBOOTSTRAP_TRIGGER_MS_CONFIG, "300000"); properties.put(CommonClientConfigs.SOCKET_CONNECTION_SETUP_TIMEOUT_MS_CONFIG, "10000"); properties.put(CommonClientConfigs.SOCKET_CONNECTION_SETUP_TIMEOUT_MAX_MS_CONFIG, "30000"); properties.put(CommonClientConfigs.RECONNECT_BACKOFF_MS_CONFIG, "50L"); properties.put(CommonClientConfigs.RECONNECT_BACKOFF_MAX_MS_CONFIG, "1000L"); The test case for the consumer with enabled rebootstrap ![Screenshot 2025-03-22 at 9 48 13 PM](https://github.com/user-attachments/assets/8470549f-a24c-43fa-ae44-789cbf422a63) The test case for the consumer with disabled rebootstrap ![Screenshot 2025-03-22 at 9 47 22 PM](https://github.com/user-attachments/assets/0a183464-6a74-449f-8e71-d641a6ea5bb1) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-26 01:53:42 +08:00
TengYao Chi	80d99ea2ba	KAFKA-18991: FetcherThread should match leader epochs between fetch request and fetch state (#19223 ) This PR fixes a potential issue where the `FetchResponse` returns `divergingEndOffsets` with an older leader epoch. This can lead to committed records being removed from the follower's log, potentially causing data loss. In detail: `processFetchRequest` gets the requested leader epoch of partition data by `topicPartition` and compares it with the leader epoch of the current fetch state. If they don't match, the response is ignored. Reviewers: Jun Rao <junrao@gmail.com>	2025-03-25 09:14:01 -07:00
David Jacot	9db5888609	MINOR: FindCoordinator API does not lookup partition for share partition key correctly (#19273 ) This patch fixes another bug in the FindCoordinator API handling for share partition key. `shareCoordinator.foreach` returns `Unit` so `shareCoordinator.foreach(coordinator => coordinator.partitionFor(SharePartitionKey.getInstance(key)))` does not return the partition for the key. Reviewers: Jhen-Yung Hsu <jhenyunghsu@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-24 19:43:23 +01:00
TengYao Chi	20bad6efb3	KAFKA-18576 Convert ConfigType to Enum (#18711 ) JIRA: KAFKA-18576 After removing ZooKeeper, we no longer need to exclude `client_metrics` and `group` from `ConfigType#ALL`. Since it's a common pattern to provide a mechanism to know all values in enumeration ( Java enum provides ootb), we should convert ConfigType to enum. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-25 01:10:59 +08:00
David Jacot	95ef344940	MINOR: FindCoordinator API should return INVALID_REQUEST when share partition key is invalid (#19272 ) At the moment, the FindCoordinator API returns an `UNKNOWN_SERVER_ERROR` error when the share partition key is invalid. It seems that the aim was to return an `INVALID_REQUEST` error but the code has a small bug preventing it from working as expected. Reviewers: Apoorv Mittal <amittal@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-03-24 08:29:20 -07:00
Chirag Wadhwa	b5f5265864	KAFKA-18796: Added more information to error message when assertion fails for acquisition lock timeout (#19247 ) This PR adds extra information in assertion failed messages for tests in SharePartitionTest revolving around acquisition lock timeouts functionality. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-03-24 16:14:29 +05:30
ClarkChen	fef9aebb19	KAFKA-18276 Migrate ProducerRebootstrapTest to new test infra (#19046 ) The PR changed three things. * Migrated `ProducerRebootstrapTest` to new test infra and removed the old Scala test. * Updated the original test case to cover rebootstrap scenarios. * Integrated `ProducerRebootstrapTest` into `ClientRebootstrapTest` in the `client-integration-tests` module. Default `ProducerRebootstrap` config: > properties.put(CommonClientConfigs.METADATA_RECOVERY_STRATEGY_CONFIG, "rebootstrap"); properties.put(CommonClientConfigs.METADATA_RECOVERY_REBOOTSTRAP_TRIGGER_MS_CONFIG, "300000"); properties.put(CommonClientConfigs.SOCKET_CONNECTION_SETUP_TIMEOUT_MS_CONFIG, "10000"); properties.put(CommonClientConfigs.SOCKET_CONNECTION_SETUP_TIMEOUT_MAX_MS_CONFIG, "30000"); properties.put(CommonClientConfigs.RECONNECT_BACKOFF_MS_CONFIG, "50L"); properties.put(CommonClientConfigs.RECONNECT_BACKOFF_MAX_MS_CONFIG, "1000L"); The test case for the producer with enabled rebootstrap <img width="1549" alt="Screenshot 2025-03-17 at 10 46 03 PM" src="https://github.com/user-attachments/assets/547840a6-d79d-4db4-98c0-9b05ed04cf60" /> The test case for the producer with disabled rebootstrap <img width="1552" alt="Screenshot 2025-03-17 at 10 46 47 PM" src="https://github.com/user-attachments/assets/2248e809-d9d5-4f3b-a24f-ba1aa0fef728" /> Reviewers: TengYao Chi <kitingiao@gmail.com>, Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-24 01:09:17 +08:00
PoAn Yang	d497250c22	KAFKA-18999 Remove BrokerMetadata (#19227 ) * Replace `BrokerMetadata` with `UsableBroker` in KRaftMetadataCache and ReassignPartitionsCommand. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-22 19:30:28 +08:00
David Jacot	ca20e9cd92	KAFKA-18329; [3/3] Delete old group coordinator (KIP-848) (#19255 ) This patch is the third of a series of patches to remove the old group coordinator. With the release of Apache Kafka 4.0, the so-called new group coordinator is the default and only option available now. It removes the old group coordinator and cleans up the `GroupCoordinator` interface. Reviewers: Jeff Kim <jeff.kim@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-21 08:07:42 -07:00
TaiJuWu	79fe1305b6	KAFKA-18893: Add KIP-877 support to ReplicaSelector (#19064 ) ReplicaSelector implementations can implement Monitorable to register their own metrics. Reviewers: Mickael Maison <mickael.maison@gmail.com>, Ken Huang <s7133700@gmail.com>	2025-03-21 15:39:50 +01:00
David Jacot	0c5e5c5d2d	KAFKA-18329; [2/3] Delete old group coordinator (KIP-848) (#19251 ) This patch is the second of a series of patches to remove the old group coordinator. With the release of Apache Kafka 4.0, the so-called new group coordinator is the default and only option available now. The patch removes `group.coordinator.new.enable` (internal config) and all its usages (integration tests, unit tests, etc.). It also cleans up `KafkaApis` to remove logic only used by the old group coordinator. Reviewers: Jeff Kim <jeff.kim@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-21 05:54:41 -07:00
Mickael Maison	121ec2a662	KAFKA-15599 Move MetadataLogConfig to raft module (#19246 ) Rewrite the class in Java and move it to the raft module. Reviewers: PoAn Yang <payang@apache.org>, TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-21 13:44:20 +08:00
Jorge Esteban Quilcate Otoya	f24945b519	KAFKA-15931: Cancel RemoteLogReader gracefully (#19197 ) Reverts commit `2723dbf3a0` and `269e8892ad`. Instead of reopening the transaction index, it cancels the RemoteFetchTask without interrupting it--avoiding to close the TransactionIndex channel. This will lead to complete the execution of the remote fetch but ignoring the results. Given that this is considered a rare case, we could live with this. If it becomes a performance issue, it could be optimized. Reviewers: Jun Rao <junrao@gmail.com>	2025-03-20 10:20:44 -07:00
TengYao Chi	b83a23a4f9	KAFKA-18946 Move BrokerReconfigurable and DynamicProducerStateManagerConfig to server module (#19174 ) This patch is to move `DynamicProducerStateManagerConfig` and `BrokerReconfigurable` to the server module. Reviewers: PoAn Yang <payang@apache.org>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-20 21:30:19 +08:00
Lan Ding	e73719d962	KAFKA-18819 StreamsGroupHeartbeat API and StreamsGroupDescribe API check topic describe (#19183 ) This patch filters out the topic describe unauthorized topics from the StreamsGroupHeartbeat and StreamsGroupDescribe response. Reviewers: Lucas Brutschy <lbrutschy@confluent.io>	2025-03-19 20:42:05 +01:00
PoAn Yang	fcca4056fd	KAFKA-18975 Move clients-integration-test out of core module (#19217 ) Move following tests from core to clients-integration-test module. - ClientTelemetryTest - DeleteTopicTest - DescribeAuthorizedOperationsTest - ConsumerIntegrationTest - CustomQuotaCallbackTest - RackAwareAutoTopicCreationTest Move following tests from core to server module. - BootstrapControllersIntegrationTest - LogManagerIntegrationTest Reviewers: Kirk True <kirk@kirktrue.pro>, Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-20 02:43:19 +08:00
Ritika Reddy	3a3159b01e	KAFKA-18953: [1/N] Add broker side handling for 2 PC (KIP-939) (#19193 ) This patch adds logic to enable and handle two phase commit (2PC) transactions following KIP-939. The changes made are as follows: 1) Add a new broker config called transaction.two.phase.commit.enable which is set to false by default 2) Add new flags enableTwoPCFlag and keepPreparedTxn to handleInitProducerId 3) Return an error if keepPreparedTxn is set to true (for now) Reviewers: Artem Livshits <alivshits@confluent.io>, Justine Olshan <jolshan@confluent.io>	2025-03-19 09:22:00 -07:00
Kevin Wu	a5325e029e	KAFKA-17431: Support invalid static configs for KRaft so long as dynamic configs are valid (#18949 ) During broker startup, attempt to read dynamic configurations from latest local snapshot on disk. This will avoid most situations where the static configuration is not sufficient to start up, but the dynamic configuration would have been. The PR includes an integration test. Reviewers: Colin P. McCabe <cmccabe@apache.org>	2025-03-18 14:23:23 -07:00
Ken Huang	b805877705	KAFKA-18969 Rewrite ShareConsumerTest#setup and move to clients-integration-tests module (#19202 ) Move share consumer to clients-integration-tests module and use `@BeforeEach` to setup Reviewers: TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-18 14:47:38 +08:00
Nick Guo	e9ffe0ba7c	KAFKA-18808 add test to ensure the name=<default> is not equal to default quota (#18966 ) see discussion in [KAFKA-18735](https://issues.apache.org/jira/browse/KAFKA-18735) - the test should include following check. 1. Using name=<default> does not create default quota 2. the returned entity should have name=<default> 2. the filter `ClientQuotaFilterComponent.ofDefaultEntity` should return nothing Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-18 01:57:24 +08:00
TengYao Chi	a6a0ea56d8	KAFKA-17171 Add test cases for `STATIC_BROKER_CONFIG`in kraft mode (#18463 ) Given that the `core` module will be separated into other small modules, this test will not be added to the core module. Instead, I added it to the `clients-integration-tests` module since it focuses on the admin client test. The patch should include following test cases. 1. a topic-related static config is added to quorum controller. The configs from topic creation should include it, but `describeConfigs` does not. 2. a topic-related static config is added to quorum controller. The configs from topic creation should include it, and `describeConfigs` does if admin is using controller.bootstrap 3. a topic-related static config is added to broker. The configs from topic creation should NOT include it, but `describeConfigs` does. 4. a topic-related static config is added to broker. The configs from topic creation should NOT include it, and `describeConfigs` does not also if admin is using controller.bootstrap for another, the docs of `STATIC_BROKER_CONFIG` should remind the impact of "controller.properties" BTW, those test cases should leverage new test infra, since new test infra allow us to define configs to broker/controller individually. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-18 00:30:53 +08:00
PoAn Yang	da46cf6e79	KAFKA-17565 Move MetadataCache interface to metadata module (#18801 ) ### Changes * Move MetadataCache interface to metadata module and change Scala function to Java. * Remove functions `getTopicPartitions`, `getAliveBrokers`, `topicNamesToIds`, `topicIdInfo`, and `getClusterMetadata` from MetadataCache interface, because these functions are only used in test code. ### Performance * ReplicaFetcherThreadBenchmark ``` ./jmh-benchmarks/jmh.sh -f 1 -i 2 -wi 2 org.apache.kafka.jmh.fetcher.ReplicaFetcherThreadBenchmark ``` * trunk ``` Benchmark (partitionCount) Mode Cnt Score Error Units ReplicaFetcherThreadBenchmark.testFetcher 100 avgt 2 4775.490 ns/op ReplicaFetcherThreadBenchmark.testFetcher 500 avgt 2 25730.790 ns/op ReplicaFetcherThreadBenchmark.testFetcher 1000 avgt 2 55334.206 ns/op ReplicaFetcherThreadBenchmark.testFetcher 5000 avgt 2 488427.547 ns/op ``` * branch ``` Benchmark (partitionCount) Mode Cnt Score Error Units ReplicaFetcherThreadBenchmark.testFetcher 100 avgt 2 4825.219 ns/op ReplicaFetcherThreadBenchmark.testFetcher 500 avgt 2 25985.662 ns/op ReplicaFetcherThreadBenchmark.testFetcher 1000 avgt 2 56056.005 ns/op ReplicaFetcherThreadBenchmark.testFetcher 5000 avgt 2 497138.573 ns/op ``` * KRaftMetadataRequestBenchmark ``` ./jmh-benchmarks/jmh.sh -f 1 -i 2 -wi 2 org.apache.kafka.jmh.metadata.KRaftMetadataRequestBenchmark ``` * trunk ``` Benchmark (partitionCount) (topicCount) Mode Cnt Score Error Units KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 10 500 avgt 2 884933.558 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 10 1000 avgt 2 1910054.621 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 10 5000 avgt 2 21778869.337 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 20 500 avgt 2 1537550.670 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 20 1000 avgt 2 3168237.805 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 20 5000 avgt 2 29699652.466 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 50 500 avgt 2 3501483.852 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 50 1000 avgt 2 7405481.182 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 50 5000 avgt 2 55839670.124 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 10 500 avgt 2 333.667 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 10 1000 avgt 2 339.685 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 10 5000 avgt 2 334.293 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 20 500 avgt 2 329.899 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 20 1000 avgt 2 347.537 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 20 5000 avgt 2 332.781 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 50 500 avgt 2 327.085 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 50 1000 avgt 2 325.206 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 50 5000 avgt 2 316.758 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 10 500 avgt 2 7.569 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 10 1000 avgt 2 7.565 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 10 5000 avgt 2 7.574 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 20 500 avgt 2 7.568 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 20 1000 avgt 2 7.557 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 20 5000 avgt 2 7.585 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 50 500 avgt 2 7.560 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 50 1000 avgt 2 7.554 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 50 5000 avgt 2 7.574 ns/op ``` * branch ``` Benchmark (partitionCount) (topicCount) Mode Cnt Score Error Units KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 10 500 avgt 2 910337.770 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 10 1000 avgt 2 1902351.360 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 10 5000 avgt 2 22215893.338 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 20 500 avgt 2 1572683.875 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 20 1000 avgt 2 3188560.081 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 20 5000 avgt 2 29984751.632 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 50 500 avgt 2 3413567.549 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 50 1000 avgt 2 7303174.254 ns/op KRaftMetadataRequestBenchmark.testMetadataRequestForAllTopics 50 5000 avgt 2 54293721.640 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 10 500 avgt 2 318.335 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 10 1000 avgt 2 331.386 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 10 5000 avgt 2 332.944 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 20 500 avgt 2 340.322 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 20 1000 avgt 2 330.294 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 20 5000 avgt 2 342.154 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 50 500 avgt 2 341.053 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 50 1000 avgt 2 335.458 ns/op KRaftMetadataRequestBenchmark.testRequestToJson 50 5000 avgt 2 322.050 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 10 500 avgt 2 7.538 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 10 1000 avgt 2 7.548 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 10 5000 avgt 2 7.545 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 20 500 avgt 2 7.597 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 20 1000 avgt 2 7.567 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 20 5000 avgt 2 7.558 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 50 500 avgt 2 7.559 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 50 1000 avgt 2 7.615 ns/op KRaftMetadataRequestBenchmark.testTopicIdInfo 50 5000 avgt 2 7.562 ns/op ``` * PartitionMakeFollowerBenchmark ``` ./jmh-benchmarks/jmh.sh -f 1 -i 2 -wi 2 org.apache.kafka.jmh.partition.PartitionMakeFollowerBenchmark ``` * trunk ``` Benchmark Mode Cnt Score Error Units PartitionMakeFollowerBenchmark.testMakeFollower avgt 2 158.816 ns/op ``` * branch ``` Benchmark Mode Cnt Score Error Units PartitionMakeFollowerBenchmark.testMakeFollower avgt 2 160.533 ns/op ``` * UpdateFollowerFetchStateBenchmark ``` ./jmh-benchmarks/jmh.sh -f 1 -i 2 -wi 2 org.apache.kafka.jmh.partition.UpdateFollowerFetchStateBenchmark ``` * trunk ``` Benchmark Mode Cnt Score Error Units UpdateFollowerFetchStateBenchmark.updateFollowerFetchStateBench avgt 2 4975.261 ns/op UpdateFollowerFetchStateBenchmark.updateFollowerFetchStateBenchNoChange avgt 2 4880.880 ns/op ``` * branch ``` Benchmark Mode Cnt Score Error Units UpdateFollowerFetchStateBenchmark.updateFollowerFetchStateBench avgt 2 5020.722 ns/op UpdateFollowerFetchStateBenchmark.updateFollowerFetchStateBenchNoChange avgt 2 4878.855 ns/op ``` * CheckpointBench ``` ./jmh-benchmarks/jmh.sh -f 1 -i 2 -wi 2 org.apache.kafka.jmh.server.CheckpointBench ``` * trunk ``` Benchmark (numPartitions) (numTopics) Mode Cnt Score Error Units CheckpointBench.measureCheckpointHighWatermarks 3 100 thrpt 2 0.997 ops/ms CheckpointBench.measureCheckpointHighWatermarks 3 1000 thrpt 2 0.703 ops/ms CheckpointBench.measureCheckpointHighWatermarks 3 2000 thrpt 2 0.486 ops/ms CheckpointBench.measureCheckpointLogStartOffsets 3 100 thrpt 2 1.038 ops/ms CheckpointBench.measureCheckpointLogStartOffsets 3 1000 thrpt 2 0.734 ops/ms CheckpointBench.measureCheckpointLogStartOffsets 3 2000 thrpt 2 0.637 ops/ms ``` * branch ``` Benchmark (numPartitions) (numTopics) Mode Cnt Score Error Units CheckpointBench.measureCheckpointHighWatermarks 3 100 thrpt 2 0.990 ops/ms CheckpointBench.measureCheckpointHighWatermarks 3 1000 thrpt 2 0.659 ops/ms CheckpointBench.measureCheckpointHighWatermarks 3 2000 thrpt 2 0.508 ops/ms CheckpointBench.measureCheckpointLogStartOffsets 3 100 thrpt 2 0.923 ops/ms CheckpointBench.measureCheckpointLogStartOffsets 3 1000 thrpt 2 0.736 ops/ms CheckpointBench.measureCheckpointLogStartOffsets 3 2000 thrpt 2 0.637 ops/ms ``` * PartitionCreationBench ``` ./jmh-benchmarks/jmh.sh -f 1 -i 2 -wi 2 org.apache.kafka.jmh.server.PartitionCreationBench ``` * trunk ``` Benchmark (numPartitions) (useTopicIds) Mode Cnt Score Error Units PartitionCreationBench.makeFollower 20 false avgt 2 5.997 ms/op PartitionCreationBench.makeFollower 20 true avgt 2 6.961 ms/op ``` * branch ``` Benchmark (numPartitions) (useTopicIds) Mode Cnt Score Error Units PartitionCreationBench.makeFollower 20 false avgt 2 6.212 ms/op PartitionCreationBench.makeFollower 20 true avgt 2 7.005 ms/op ``` Reviewers: Ismael Juma <ismael@juma.me.uk>, David Arthur <mumrah@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-17 23:59:11 +08:00
Ming-Yen Chung	f7d07d62d9	KAFKA-18990 Avoid redundant MetricName creation in BaseQuotaTest#produceUntilThrottled (#19215 ) Avoid redundant MetricName creation in BaseQuotaTest#produceUntilThrottled via moving metrics creation out of loop. Reviewers: PoAn Yang <payang@apache.org>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-16 21:06:04 +08:00
Ken Huang	7bff678699	KAFKA-18859 honor the error message of UnregisterBrokerResponse (#19027 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-16 03:06:01 +08:00
ClarkChen	e05b0e68e4	KAFKA-18915 Rewrite AdminClientRebootstrapTest to cover the current scenario (#19187 ) Reviewers: Jhen-Yung Hsu <jhenyunghsu@gmail.com>, TengYao Chi <kitingiao@gmail.com>, Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-16 02:35:41 +08:00
Andrew Schofield	5e7445a6d6	KAFKA-17516 Synonyms for client metrics configs (#17264 ) This PR brings client metrics configuration resources in line with the other config resources in terms of handling synonyms and defaults. Specifically, configs which are not explicitly set take their hard-coded default values, and these are reported by `kafka-configs.sh --describe` and `Kafka-client-metrics.sh --describe`. Previously, they were omitted which means the administrator needed to know the default values. The ConfigHelper was changed so that the handling of client metrics configuration matches that of group configuration. Reviewers: poorv Mittal <apoorvmittal10@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-14 16:05:40 +08:00
Alieh Saeedi	ff785ac251	KAFKA-18651: Add Streams-specific broker configurations (#19176 ) This change implements the broker-side configs proposed in KIP-1071. The configurations implemented by this PR are only those that were specifically aimed to be included in `AK 4.1`. Reviewers: Lucas Brutschy <lbrutschy@confluent.io>	2025-03-13 18:05:24 +01:00
Mickael Maison	759fbbba8b	KAFKA-14484: Move UnifiedLog to storage module (#19030 ) Rewrite UnifiedLog in Java Reviewers: Jun Rao <jun@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-13 10:49:55 +01:00
Mickael Maison	55d65cb3ba	MINOR: Cleanups in CoreUtils (#19175 ) Delete unused methods in CoreUtils and switch to Utils.newInstance(). Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-12 19:43:30 +01:00
TengYao Chi	e1d980a3d1	MINOR: Remove unused ConfigCommandOptions#forceOpt (#19170 ) This field is unused, and we should remove it. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-13 00:04:22 +08:00
Abhinav Dixit	c07c59ad24	KAFKA-18932: Removed usage of partition max bytes from share fetch requests (#19148 ) This PR aims to remove the usage of partition max bytes from share fetch requests. Partition Max Bytes is being defined by `PartitionMaxBytesStrategy` which was added to the broker as part of PR https://github.com/apache/kafka/pull/17870 Reviewers: Andrew Schofield <aschofield@confluent.io>, Apoorv Mittal <apoorvmittal10@gmail.com>	2025-03-12 13:19:19 +00:00
Apoorv Mittal	f3da8f500e	KAFKA-18936: Fix share fetch when records are larger than max bytes (#19145 ) The PR fixes the behaviour when records are fetched which are larger than `fetch.max.bytes` config. The usage of `hardMaxBytesLimit` is in ReplicaManager where it decides whether to fetch a single record or not. The file records get sliced based on the bytes requested. However, if `hardMaxBytesLimit` is false then at least one record is fetched and bytes are adjusted accordingly in `localLog`. Reviewers: Jun Rao <junrao@gmail.com>, Andrew Schofield <aschofield@confluent.io>, Abhinav Dixit <adixit@confluent.io>	2025-03-12 09:03:35 +00:00
David Arthur	701573366f	KAFKA-18933 Add client integration tests module (#19144 ) Adds a new ":clients:integration-test" Gradle module. Relocates one example test from ":core" Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-11 16:36:23 -04:00
Lucas Brutschy	fc2e3dfce9	MINOR: Disallow unused local variables (#18963 ) Recently, we found a regression that could have been detected by static analysis, since a local variable wasn't being passed to a method during a refactoring, and was left unused. It was fixed in [`7a749b5`](`7a749b589f`), but almost slipped into 4.0. Unused variables are typically detected by IDEs, but this is insufficient to prevent these kinds of bugs. This change enables unused local variable detection in checkstyle for Kafka. A few notes on the usage: - There are two situations in which people actually want to have a local variable but not use it. First, there are `for (Type ignored: collection)` loops which have to loop `collection.length` number of times, but that do not use `ignored` in the loop body. These are typically still easier to read than a classical `for` loop. Second, some IDEs detect it if a return value of a function such as `File.delete` is not being used. In this case, people sometimes store the result in an unused local variable to make ignoring the return value explicit and to avoid the squiggly lines. - In Java 22, unsued local variables can be omitted by using a single underscore `_`. This is supported by checkstyle. In pre-22 versions, IntelliJ allows such variables to be named `ignored` to suppress the unused local variable warning. This pattern is often (but not consistently) used in the Kafka codebase. This is, however, not supported by checkstyle. Since we cannot switch to Java 22, yet, and we want to use automated detection using checkstyle, we have to resort to prefixing the unused local variables with `@SuppressWarnings("UnusedLocalVariable")`. We have to apply this in 11 cases across the Kafka codebase. While not being pretty, I'd argue it's worth it to prevent bugs like the one fixed in [`7a749b5`](`7a749b589f`). Reviewers: Andrew Schofield <aschofield@confluent.io>, David Arthur <mumrah@gmail.com>, Matthias J. Sax <matthias@confluent.io>, Bruno Cadonna <cadonna@apache.org>, Kirk True <ktrue@confluent.io>	2025-03-10 09:37:35 +01:00
Azhar Ahmed	832dfa36da	KAFKA-18637: Fix max connections per ip and override reconfigurations (#19099 ) Reviewers: Christo Lolov <lolovc@amazon.com>, TengYao Chi <kitingiao@gmail.com>, Ken Huang <s7133700@gmail.com>	2025-03-10 07:27:48 +00:00
Ken Huang	d5413fdb48	KAFKA-17856 Move ConfigCommandTest and ConfigCommandIntegrationTest to tool module (#17767 ) Reviewers: TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-09 21:05:36 +08:00
PoAn Yang	a5e5e2dcd5	KAFKA-18706 Move AclPublisher to metadata module (#18802 ) Move AclPublisher to org.apache.kafka.metadata.publisher package. Reviewers: Christo Lolov <lolovc@amazon.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-09 21:00:33 +08:00
ClarkChen	1584d49470	KAFKA-18944 Remove unused setters from ClusterConfig (#19166 ) Remove unused `saslServerProperties`, `saslClientProperties`, `adminClientProperties`, `producerProperties`, and `consumerProperties` in ClusterConfig. First, I quickly fixed the unused adminClientProperties, and then I will move on to https://github.com/apache/kafka/pull/19094 to fix the related issues. Pass AdminClientRebootstrapTest <img width="1398" alt="Screenshot 2025-03-09 at 12 54 57 PM" src="https://github.com/user-attachments/assets/73c50376-6602-493d-8abd-0eb2bb304114" /> Pass ClusterConfigTest <img width="1117" alt="Screenshot 2025-03-09 at 12 55 28 PM" src="https://github.com/user-attachments/assets/b4da59da-dfdf-4698-9077-5086854360ab" /> Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-09 17:49:28 +08:00
ClarkChen	2a0dbd8e0b	KAFKA-18909 Move DynamicThreadPool to server module (#19081 ) * Add `DynamicThreadPool.java` to the server module. * Remove the old DynamicThreadPool object in the `DynamicBrokerConfig.scala`. Reviewers: TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-09 17:42:51 +08:00
Colin Patrick McCabe	343bc995f4	KAFKA-18920: The kcontrollers must set kraft.version in ApiVersionsResponse (#19127 ) The kafka controllers need to set kraft.version in their ApiVersionsResponse messages according to the current kraft.version reported by the Raft layer. Instead, currently they always set it to 0. Also remove FeatureControlManager.latestFinalizedFeatures. It is not needed and it does a lot of copying. Reviewers: Jun Rao <junrao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-07 13:46:46 -08:00
Dániel Urbán	40db001588	KAFKA-18929: Log a warning when time based segment delete is blocked by a future timestamp (#19137 ) When producers send future timestamps, time retention based log segments may get blocked from removal for an extended period of time. Log cleaning should should warn in the logs when this scenario occurs. Reviewers: Viktor Somogyi-Vass <viktorsomogyi@gmail.com>	2025-03-07 14:31:22 +01:00
ClarkChen	870db5d811	KAFKA-18915: Migrate AdminClientRebootstrapTest to use new test infra (#19094 ) Migrate AdminClientRebootstrapTest to the new test infra and remove the old Scala test. Reviewers: TengYao Chi <kitingiao@gmail.com>, David Arthur <mumrah@gmail.com>	2025-03-06 16:05:51 -05:00
Andrew Schofield	1da30bdedf	KAFKA-18900: Experimental share consumer acknowledge mode config (#19113 ) User testing of the `KafkaShareConsumer` interface has revealed some areas which confuse people. One of these is that way that it decides whether you want to use implicit or explicit acknowledgement of records by observing which calls the application issues. We are taking the opportunity to refine the interface before it is finalised. This PR introduces an experimental configuration called `internal.share.acknowledgement.mode` which can be used to make the application declare which kind of acknowledgement it wishes to use. We plan to try out the configuration, assess whether it has helped, and then create a proper consumer configuration that makes this area better. That would require a lot of change in the tests, which explains why this initial PR only has a small number of tests. Reviewers: David Arthur <mumrah@gmail.com>	2025-03-06 17:57:11 +00:00
Ken Huang	041d8019d6	KAFKA-18910 Remove kafka.utils.json (#19112 ) Reviewers: TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-06 14:11:20 +08:00
co63oc	3d7ac0c3d1	MINOR: Fix typos in multiple files (#19102 ) Fix typos in multiple files Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-03-05 14:27:32 +00:00
Xuan-Zhang Gong	18eca0229d	KAFKA-18882 Remove BaseKey, TxnKey, and UnknownKey (#19054 ) Reviewers: Ken Huang <s7133700@gmail.com>, TengYao Chi <kitingiao@gmail.com>, PoAn Yang <payang@apache.org>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-05 21:16:18 +08:00
Lan Ding	69ff5d1e70	KAFKA-18817: ShareGroupHeartbeat and ShareGroupDescribe API must check topic describe (#19083 ) Reviewers: Christo Lolov <lolovc@amazon.com>	2025-03-05 11:25:08 +00:00
Kuan-Po Tseng	cbd72cc216	KAFKA-14121: AlterPartitionReassignments API should allow callers to specify the option of preserving the replication factor (#18983 ) Reviewers: Christo Lolov <lolovc@amazon.com>, Chia-Ping Tsai <chia7712@gmail.com>, TengYao Chi <kitingiao@gmail.com>	2025-03-05 11:23:12 +00:00
Logan Zhu	011f256c86	KAFKA-18886 add behavior change of CreateTopicPolicy and AlterConfigPolicy to zk2kraft (#19087 ) 1. Updated JavaDoc to reflect that CreateTopicPolicy and AlterConfigPolicy run on the controller in KRaft mode. 2. Modified Behavioral Change Reference in the HTML docs to include this change. 3. add warning message to KafkaConfig if the config of broker node has policy configs Reviewers: TengYao Chi <kitingiao@gmail.com>, Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-05 15:15:03 +08:00
co63oc	e4ece37dbf	Fix typos in multiple files (#19086 ) Fix typos in multiple files Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-03-04 16:05:51 +00:00
Apoorv Mittal	c1fc59fc23	KAFKA-18918: Correcting releasing of locks on exception (#19091 ) The PR corrects the way the locks are released on exception. As `partitionsAcquired` can be a reference to `topicPartitionData`, hence the locks should released prior clearing `partitionsAcquired`. Reviewers: Abhinav Dixit <adixit@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-03-04 16:04:45 +00:00
David Jacot	1df4a42b40	KAFKA-18916; Resolved regular expressions must update the group by topics data structure (#19088 ) When regular expressions are resolved, they do not update the group by topics data structure. Hence, topic changes (e.g. deletion) do not trigger a rebalance of the group. Reviewers: Lucas Brutschy <lbrutschy@confluent.io>	2025-03-04 06:31:08 -08:00
Nick Guo	101e15bb1c	KAFKA-18867 add tests to describe topic configs with empty name (#19075 ) Reviewers: TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-04 14:56:25 +08:00
Mahsa Seifikar	2154e55abf	MINOR: Prevent broker fencing by adjusting resendExponentialBackoff in BrokerLifecycleManager (#19061 ) This PR reduces `maxInterval` for `resendExponentialBackoff` in `BrokerLifecycleManager` class from `broker.session.timeout.ms` to half of its value. Setting `maxInterval` to `broker.session.timeout.ms` caused brokers to be fenced if a resend attempt occurred near the timeout threshold, leading to unnecessary broker fencing. Reviewers: Colin P. McCabe <cmccabe@apache.org>	2025-03-03 12:03:15 -08:00
Apoorv Mittal	a6c53d0c37	KAFKA-18878: Added share session cache and delayed share fetch metrics (KIP-1103) (#19059 ) The PR implements the ShareSessionCache and DelayedShareFetchMetrics as defined in KIP-1103. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-03-03 16:44:34 +00:00
Lucas Brutschy	a04dd21f26	KAFKA-18613: Auto-creation of internal topics in streams group heartbeat (#18981 ) Implements auto-topic creation when handling the streams group heartbeat. Inside KafkaApis, the handler for streamsGroupHeartbeat uses the result of the streams group heartbeat inside the group coordinator to attempt to create all missing internal topics using AutoTopicCreationManager. CREATE TOPIC ACLs are checked. The unit tests class AutoTopicCreationManagerTest is brought back (it was recently deleted during a ZK removal PR), but testing only the kraft-related functionality. Reviewers: Bruno Cadonna <bruno@confluent.io> ### Committer Checklist (excluded from commit message) - [ ] Verify design and implementation - [ ] Verify test coverage and CI build status - [ ] Verify documentation (including upgrade notes)	2025-03-03 08:48:00 +01:00
Xuan-Zhang Gong	ceac4f0a1d	KAFKA-18880 Remove kafka.cluster.Broker and BrokerEndPointNotAvailableException (#19047 ) Remove kafka.cluster.Broker and BrokerEndPointNotAvailableException as they were used by zk path. Reviewers: TengYao Chi <kitingiao@gmail.com>, Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-02 10:54:32 +08:00
TengYao Chi	e0c77140b2	KAFKA-17039 KIP-919 supports for unregisterBroker (#19063 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-03-01 23:55:35 +08:00
Nick Guo	98bb79e732	KAFKA-17981 add Integration test for ConfigCommand to add config `key=[val1,val2]` (#17771 ) Reviewers: Jhen-Yung Hsu <jhenyunghsu@gmail.com>, TaiJuWu <tjwu1217@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-03-01 13:15:25 +08:00
Apoorv Mittal	8cf969e00a	KAFKA-18734: Implemented share partition metrics (KIP-1103) (#19045 ) The PR implements the SharePartitionMetrics as defined in KIP-1103, with one change. The metric `FetchLockRatio` is defined as `Meter` in KIP but is implemented as `HIstogram`. There was a discussion about same on KIP-1103 discussion where we thought that `FetchLockRatio` is pre-aggregated but while implemeting the rate from `Meter` can go above 100 as `Meter` defines rate per time period. Hence it makes more sense to implement metric `FetchLockRatio` as `Histogram`. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-28 14:22:27 +00:00
Apoorv Mittal	8b605bd362	MINOR: Removing share partition manager flaky annotation (#19053 ) There isn't any flaky test for SharePartitionManager in last 7 days, removing flaky annotation. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-28 08:49:59 +00:00
Dongnuo Lyu	36f19057e1	KAFKA-18813: ConsumerGroupHeartbeat API and ConsumerGroupDescribe API must check topic describe (#18989 ) This patch filters out the topic describe unauthorized topics from the ConsumerGroupHeartbeat and ConsumerGroupDescribe response. In ConsumerGroupHeartbeat, - if the request has `subscribedTopicNames` set, we directly check the authz in `KafkaApis` and return a topic auth failure in the response if any of the topics is denied. - Otherwise, we check the authz only if a regex refresh is triggered and we do it based on the acl of the consumer that triggered the refresh. If any of the topic is denied, we filter it out from the resolved subscription. In ConsumerGroupDescribe, we check the authz of the coordinator response. If any of the topic in the group is denied, we remove the described info and add a topic auth failure to the described group. (similar to the group auth failure) Reviewers: David Jacot <djacot@confluent.io>, Lianet Magrans <lmagrans@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>, Chia-Ping Tsai <chia7712@gmail.com>, TaiJuWu <tjwu1217@gmail.com>, TengYao Chi <kitingiao@gmail.com>	2025-02-26 13:05:36 -05:00
Lucas Brutschy	cb7c54ccd3	KAFKA-18614, KAFKA-18613: Add streams group request plumbing (#18979 ) This change implements the basic RPC handling StreamsGroupHeartbeat and StreamsGroupDescribe. This includes: - Adding an option to enable streams groups on the broker - Passing describe and heartbeats to the right shard of the group coordinator - The handler inside the GroupMetadatManager for StreamsGroupDescribe is fairly trivial, and is included directly in this PR. - The handler for StreamsGroupHeartbeat is complex and not included in this PR yet. Instead, a UnsupportedOperationException is thrown. However, the interface is already defined: The result of a streamsGroupHeartbeat is a response, together with a list of internal topics to be created. The heartbeat implementation inside the `GroupMetadataManager`, which actually implements the assignment / reconciliation logic, will come in a follow-up PR. Also, automatic creation of internal topics will be created in a follow-up PR. Reviewers: Bill Bejeck <bill@confluent.io>	2025-02-26 16:33:26 +01:00
Abhinav Dixit	4b5a16bf6f	KAFKA-18757: Create full-function SimpleAssignor to match KIP-932 description (#18864 ) ### About The current `SimpleAssignor` in AK assigned all subscribed topic partitions to all the share group members. This does not match the description given in [KIP-932](https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=255070434#KIP932:QueuesforKafka-TheSimpleAssignor). Here are the rules as mentioned in the KIP by which the assignment should happen. We have changed the step 3 implementation here due to the reasons [described](https://github.com/apache/kafka/pull/18864#issuecomment-2659266502) - 1. The assignor hashes the member IDs of the members and maps the partitions assigned to the members based on the hash. This gives approximately even balance. 2. If any partitions were not assigned any members by (1) and do not have members already assigned in the current assignment, members are assigned round-robin until each partition has at least one member assigned to it. 3. We combine the current and new assignment. (Original rule - If any partitions were assigned members by (1) and also have members in the current assignment assigned by (2), the members assigned by (2) are removed.) ### Tests The added code has been verified with unit tests and the already present integration tests. Reviewers: Andrew Schofield <aschofield@confluent.io>, Apoorv Mittal <apoorvmittal10@gmail.com>, TaiJuWu <tjwu1217@gmail.com>	2025-02-26 11:02:23 +00:00
José Armando García Sancio	4a8a0637e0	KAFKA-18723; Better handle invalid records during replication (#18852 ) For the KRaft implementation there is a race between the network thread, which read bytes in the log segments, and the KRaft driver thread, which truncates the log and appends records to the log. This race can cause the network thread to send corrupted records or inconsistent records. The corrupted records case is handle by catching and logging the CorruptRecordException. The inconsistent records case is handle by only appending record batches who's partition leader epoch is less than or equal to the fetching replica's epoch and the epoch didn't change between the request and response. For the ISR implementation there is also a race between the network thread and the replica fetcher thread, which truncates the log and appends records to the log. This race can cause the network thread send corrupted records or inconsistent records. The replica fetcher thread already handles the corrupted record case. The inconsistent records case is handle by only appending record batches who's partition leader epoch is less than or equal to the leader epoch in the FETCH request. Reviewers: Jun Rao <junrao@apache.org>, Alyssa Huang <ahuang@confluent.io>, Chia-Ping Tsai <chia7712@apache.org>	2025-02-25 20:09:19 -05:00
Apoorv Mittal	df5839a9f4	KAFKA-17351: Improved handling of compacted topics in share partition (2/N) (#19010 ) The PR handles fetch for `compacted` topics. The fix was required only when complete batch disappears from the topic log, and same batch is marked re-available in Share Partition state cache. Subsequent log reads will not result the disappeared batch in read response hence respective batch will be left as available in the state cache. The PR checks for the first fetched/read batch base offset and if it's greater than the position from where the read occurred (fetch offset) then if there exists any `available` batches in the state cache then they will be archived. Reviewers: Andrew Schofield <aschofield@confluent.io>, Abhinav Dixit <adixit@confluent.io>	2025-02-25 14:11:39 +00:00
xijiu	1edc30bf30	KAFKA-17836 Move RackAwareTest to server module (#19021 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-25 18:15:34 +08:00
TaiJuWu	1c82b89b4c	KAFKA-18712 Move Endpoint to server module (#18803 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Mickael Maison <mickael.maison@gmail.com>, Christo Lolov <lolovc@amazon.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-25 14:02:51 +08:00
PoAn Yang	10873e4210	KAFKA-18281: Kafka is improperly validating non-advertised listeners for routable controller addresses (#18387 ) When a cluster is configured with a dynamic controller quorum, KRaft replica's endpoint are computed using the advertised.listeners property and not the quorum.controller.voters property. This change in the configuration makes it difficult to keeping all previous node configurations compatible with the new endpoint discovery functionality. The least intrusive solution is to rely on Kafka's reverse hostname lookup when the hostname is not specified. The effective advertised controller listener now remove '0.0.0.0' hostname if the endpoint came from the listener configuration and not the advertised.listener configuration. Reviewers: José Armando García Sancio <jsancio@apache.org>, Alyssa Huang <ahuang@confluent.io>	2025-02-24 21:51:28 -05:00
Nick Guo	d23a61738a	KAFKA-17937 Cleanup AbstractFetcherThreadTest (#18900 ) - Remove AbstractFetcherThreadWithIbp26Test as it tests unsupported IBP - cleanup AbstractFetcherThreadTest to remove unreachable paths, variables, and code Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-25 07:45:47 +08:00
Apoorv Mittal	48a506b7b8	KAFKA-18522: Slice records for share fetch (#18804 ) The PR handles slicing of fetched records based on acquire response for share fetch. There could be additional bytes fetched from log but acquired offsets can be a subset, typically with `max fetch records` configuration. Rather sending additional bytes of fetched data to client we should slice the file and wire only needed batches. Note: If the acquired offsets are within a batch then we need to send the entire batch within the file record. Hence rather checking for individual batches, PR finds the first and last acquired offset, and trims the file for all batches between (inclusive) these two offsets. Reviewers: Christo Lolov <lolovc@amazon.com>, Andrew Schofield <aschofield@confluent.io>, Jun Rao <junrao@gmail.com>	2025-02-24 09:55:24 -08:00
mingdaoy	289e958c39	MINOR: Fix validateResourceNameIsNodeId's exception message (#19017 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-24 09:30:02 +08:00
Ismael Juma	13cb87c2d0	MINOR: Remove request log space added inadvertently (#19011 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-23 11:30:19 +08:00
Apoorv Mittal	6e45ab7d84	KAFKA-17351: Update tests and acquire API to allow discard batches from compacted topics (1/N) (#18978 ) The PR does following: 1. Adds `fetchOffset` to `acquire` API in SharePartition. 2. Adds a ShareFetchPartitionData class efficiently handle the propagation of fetchOffset information. 3. Updates SharePartitionTests to make common code so such improvements does not require all tests changes for future PRs. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-22 16:14:09 +00:00
Sushant Mahajan	4f28973bd1	KAFKA-18827: Initialize share state, share coordinator impl. [1/N] (#18968 ) In this PR, we have added the share coordinator and KafkaApis side impl of the intialize share group state RPC. ref: https://cwiki.apache.org/confluence/display/KAFKA/KIP-932%3A+Queues+for+Kafka#KIP932:QueuesforKafka-InitializeShareGroupStateAPI Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-22 16:12:08 +00:00
Apoorv Mittal	f543eac4fe	KAFKA-18733: Implemented fetch ratio and partition acquire time metrics (3/N) (#18959 ) PR implements the final set of ShareGroupMetrics, RequestTopicPartitionsFetchRatio and TopicPartitionsAcquireTimeMs, as defined in KIP-1103: https://cwiki.apache.org/confluence/display/KAFKA/KIP-1103%3A+Additional+metrics+for+cooperative+consumption Note: Metric `RequestTopicPartitionsFetchRatio` is calculated as percentage as Histogram API doesn't record double. Reviewers: Andrew Schofield <aschofield@confluent.io>, Abhinav Dixit <adixit@confluent.io>	2025-02-21 17:01:39 +00:00
Calvin Liu	8f13e7c207	MINOR: Move the ELR default version to 4.1 (#18954 ) - ELR is enabled (ELRV_1) by default if the cluster is created with its bootstrap metadata version >= IBP_4_1_IV0. - ELRV_1 can be manually enabled iff the metadata version is >= IBP_4_0_IV1. Reviewers: Ismael Juma <ismael@juma.me.uk>, Colin P. McCabe <cmccabe@apache.org>, David Jacot <djacot@confluent.io>	2025-02-21 16:13:11 +01:00
Shivsundar R	7da1a6cbff	KAFKA-18033: Remove flaky tag in ShareConsumerTest (#18995 ) 3 tests which were marked flaky in ShareConsumerTest do not have any failure on trunk since the test was converted to use `ClusterTestExtensions`. Reviewers: Sushant Mahajan <smahajan@confluent.io>, Apoorv Mittal <apoorvmittal10@gmail.com>, Andrew Schofield <aschofield@confluent.io>	2025-02-21 13:50:08 +00:00
TengYao Chi	767a62ade6	KAFKA-18737 KafkaDockerWrapper setup functions fails due to storage format command (#18844 ) The current Docker Hub documentation for Kafka is based on the use of static voters. Since Kafka 4.0 utilizes dynamic voters, users following the doc of docker hub may encounter unexpected behavior. Due to the limited time available for the 4.0.0 release, a simple and quick solution is to revert to using static voters within the Docker image. This can be achieved by adding a configuration file with static voter definitions to the kafka/docker folder, keeping it separate from the main kafka/config directory. This approach allows us to encourage the use of dynamic voters in typical deployments while maintaining compatibility within the Docker image. Reviewers: Vedarth Sharma <142404391+VedarthConfluent@users.noreply.github.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-21 20:43:41 +08:00
TengYao Chi	d31cbf59de	KAFKA-18831 Migrating to log4j2 introduce behavior changes of adjusting level dynamically (#18969 ) fix the following behavior changes. 1) in log4j 1, users can't change the logger by parent if the logger is declared by properties explicitly. For example, `org.apache.kafka.controller` has level explicitly in the properties. Hence, we can't use "org.apache.kafka=INFO" to change the level of `org.apache.kafka.controller` to INFO. By contrast, log4j2 allows us to change all child loggers by the parent logger. 2) in log4j2, we can change the level of root to impact all loggers' level. By contrast, log4j 1 can't. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-21 16:12:58 +08:00
Calvin Liu	1eecd02ce8	MINOR: Deflake EligibleLeaderReplicasIntegrationTest (#18923 ) Make sure to give enough time for the partition ISR updates. Reviewers: David Jacot <djacot@confluent.io>	2025-02-20 05:14:15 -08:00
Matthias J. Sax	538a60e1b3	MINOR: disallow rawtypes and fail build (#18877 ) Cleanup code to avoid rawtype, and add suppressions where necessary. Change the build to fail on rawtype warning. Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Andrew Schofield <aschofield@confluent.io>	2025-02-19 13:11:49 -08:00
Ismael Juma	3a59a526d9	MIINOR: Remove redundant quorum parameter from *AdminIntegrationTest classes (#18965 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>	2025-02-19 15:57:47 -05:00
Shivsundar R	3603c8fe35	KAFKA-18829: Added check before converting to IMPLICIT mode (#18964 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-19 17:34:28 +00:00
Ismael Juma	3dba3125e9	KAFKA-18601: Assume a baseline of 3.3 for server protocol versions (#18845 ) 3.3.0 was the first KRaft release that was deemed production-ready and also when KIP-778 (KRaft to KRaft upgrades) landed. Given that, it's reasonable for 4.x to only support upgrades from 3.3.0 or newer (the metadata version also needs to be set to "3.3" or newer before upgrading). Noteworthy changes: 1. `AlterPartition` no longer includes topic names, which makes it possible to simplify `AlterParitionManager` logic. 2. Metadata versions older than `IBP_3_3_IV3` have been removed and `IBP_3_3_IV3` is now the minimum version. 3. `MINIMUM_BOOTSTRAP_VERSION` has been removed. 4. Removed `isLeaderRecoverySupported`, `isNoOpsRecordSupported`, `isKRaftSupported`, `isBrokerRegistrationChangeRecordSupported` and `isInControlledShutdownStateSupported` - these are always `true` now. Also removed related conditional code. 5. Removed default metadata version or metadata version fallbacks in multiple places - we now fail-fast instead of potentially using an incorrect metadata version. 6. Update `MetadataBatchLoader.resetToImage` to set `hasSeenRecord` based on whether image is empty - this was a previously existing issue that became more apparent after the changes in this PR. 7. Remove `ibp` parameter from `BootstrapDirectory` 8. A number of tests were not useful anymore and have been removed. I will update the upgrade notes via a separate PR as there are a few things that need changing and it would be easier to do so that way. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Jun Rao <junrao@gmail.com>, David Arthur <mumrah@gmail.com>, Colin P. McCabe <cmccabe@apache.org>, Justine Olshan <jolshan@confluen.io>, Ken Huang <s7133700@gmail.com>	2025-02-19 05:35:42 -08:00
xijiu	4c4458c17a	KAFKA-18799 Remove AdminUtils (#18946 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-19 06:25:43 +08:00
PoAn Yang	1132f08c57	KAFKA-18773 Migrate the log4j1 config to log4j 2 for native image and README (#18872 ) - update reflection-config.json and resource-config.json to include log4j2 and jackson - remove unused jackson scala library - fix the incorrect path of log4j2.yaml - adopt workaround (--standalone) to make this PR work and it will be fixed by KAFKA-18737) Reviewers: TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-19 00:48:46 +08:00
TaiJuWu	934b0159bb	KAFKA-18089: Upgrade Caffeine lib to 3.1.8 (#18004 ) - Fixed the RemoteIndexCacheTest that fails with caffeine > 3.1.1 Reviewers: Luke Chen <showuon@gmail.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	2025-02-18 21:51:38 +05:30
Parker Chang	ed366e6b89	MINOR: Align assertFutureThrows method signature with JUnit conventions (#18825 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Andrew Schofield <aschofield@confluent.io>	2025-02-18 15:56:42 +00:00
Mickael Maison	0a2fab9310	KAFKA-14484: Decouple UnifiedLog and RemoteLogManager (#18460 ) Reviewers: Kamal Chandraprakash <kamal.chandraprakash@gmail.com>, Ismael Juma <ismael@juma.me.uk>	2025-02-18 15:10:31 +01:00
Andrew Schofield	6c14f64245	MINOR: Rename NoOpShareStatePersister for consistency (#18933 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-18 14:07:59 +00:00
Chirag Wadhwa	63229a768c	KAFKA-16718 [1/n]: Added DeleteShareGroupOffsets request and response schema (#18927 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-18 14:06:24 +00:00
Andrew Schofield	385b7ad355	MINOR: Align share group admin authz with consumer group (#18936 ) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	2025-02-18 09:12:07 +00:00
Kamal Chandraprakash	da3643c6b4	KAFKA-18787: RemoteIndexCache fails to delete invalid files on init (#18888 ) The stale/invalid files that ends-with ".deleted" and ".tmp" should be cleaned when the broker gets restarted. - fix the remote-index-cache test to use the logDir instead of topicDir - fix the flaky test Reviewers: Luke Chen <showuon@gmail.com>	2025-02-18 12:56:03 +05:30
Apoorv Mittal	06ce3e890b	KAFKA-18733: Updating share group record acks metric (2/N) (#18924 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-17 18:12:58 +00:00
PoAn Yang	2b6e868538	KAFKA-18784 Fix ConsumerWithLegacyMessageFormatIntegrationTest (#18889 ) In PR #18267, we removed old message format for cases in ConsumerWithLegacyMessageFormatIntegrationTest. Although test cases can pass, they don't fulfill original purpose. We can't send old message format since 4.0, so I change cases to append old records by ReplicaManager directly. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-17 20:43:29 +08:00
Andrew Schofield	9b7ad6ec32	MINOR: Mark testQuotaOverrideDelete as flaky (#18925 ) Reviewers: poorv Mittal <apoorvmittal10@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-17 15:20:35 +08:00
TengYao Chi	5cbe00e375	MINOR: Remove unused member in DynamicBrokerConfig (#18915 ) Reviewers: Jhen-Yung Hsu <jhenyunghsu@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-17 04:46:25 +08:00
Ming-Yen Chung	e828767062	KAFKA-18790 Fix testCustomQuotaCallback (#18906 ) Frequently updating the trust store can cause unexpected termination of the AsyncConsumer background thread. 1. To resolve this issue, reuse the same AdminClient instead of recreating it. 2. Add error logging when fail to initialize resources for the consumer network thread. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-15 03:07:59 +08:00
Jimmy Wang	6a6b80215d	KAFKA-16717 [1/2]: Add AdminClient.alterShareGroupOffsets (#18819 ) KAFKA-16720 aims to add the support for the AlterShareGroupOffsets AdminClient. Key Changes in the PR: 1. Added handing of alterShareGroupOffsets() in KafkaAdminClient and introduce AlterShareGroupOffsetRequest/AlterShareGroupOffsetResponse/AlterShareGroupOffsetsOptions classes. 2. Corresponding test in KafkaAdminClientTest. 3. Added ALTER_SHARE_GROUP_OFFSETS API (will finish it in next PR and the share coordinator pieces) Reviewers: poorv Mittal <apoorvmittal10@gmail.com>, Andrew Schofield <aschofield@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-15 02:35:46 +08:00
Apoorv Mittal	53543bcf63	KAFKA-18733: Updating share group metrics (1/N) (#18826 ) Reviewers: Sushant Mahajan <smahajan@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-02-14 08:48:41 +00:00
陳昱霖(Yu-Lin Chen)	2bbd25841e	KAFKA-18298 Fix flaky testConsumerGroupsDeprecatedConsumerGroupState and testConsumerGroups in PlaintextAdminIntegrationTest (#18513 ) It's related to KAFKA-18298 and KAFKA-18297. The root cause of the flaky tests is member rejoin after member removal. To prevent members from rejoining after being removed, before removing group members, calling `consumers.close` in ConsumerThread . This fix also extract the flaky member removal test to new test `testConsumerGroupWithMemberRemoval`. Flow of member removal test: 1. Set 2 static consumer + 1 dynamic consumer 2. Close all consumers. 3. remove one static member 4. remove remaining members Before KIP-1092, the member count is different between ClassicConsumer/AsyncConsumer. (AsyncConsumer will remove dynamic member after consumer closed.) To get more details, please refer to the discussion under KAFKA-18297 and this PR: - discussion : [Link](https://issues.apache.org/jira/browse/KAFKA-18297?focusedCommentId=17912537&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-17912537) - review: https://github.com/apache/kafka/pull/18513#pullrequestreview-2589110367 This PR fixed below flaky errors: 1. PlaintextAdminIntegrationTest#testConsumerGroups a. `org.opentest4j.AssertionFailedError: expected: <2> but was: <3>` ([Report](https://ge.apache.org/s/lt3lpviv45cns/tests/task/:core:test/details/kafka.api.PlaintextAdminIntegrationTest/testConsumerGroups(String%2C%20String)%5B1%5D?top-execution=1)) b. `org.opentest4j.AssertionFailedError: expected: <true> but was: <false>` ([Report](https://ge.apache.org/s/jlxo446xalpoa/tests/task/:core:test/details/kafka.api.PlaintextAdminIntegrationTest/testConsumerGroups(String%2C%20String)%5B1%5D?top-execution=1)) 2. PlaintextAdminIntegrationTest#testConsumerGroupsDeprecatedConsumerGroupState a. `org.opentest4j.AssertionFailedError: expected: <2> but was: <3>` ([Report](https://ge.apache.org/s/ndoj6s2stb446/tests/task/:core:test/details/kafka.api.PlaintextAdminIntegrationTest/testConsumerGroupsDeprecatedConsumerGroupState(String%2C%20String)%5B1%5D?top-execution=1)) b. `org.opentest4j.AssertionFailedError: expected: <true> but was: <false>` ([Report](https://ge.apache.org/s/kh3jze2tc5qeu/tests/task/:core:test/details/kafka.api.PlaintextAdminIntegrationTest/testConsumerGroupsDeprecatedConsumerGroupState(String%2C%20String)%5B1%5D?top-execution=1)) Reviewers: David Jacot <djacot@confluent.io>, TaiJuWu <tjwu1217@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-14 07:28:45 +08:00
Andrew Schofield	952113e8e0	KAFKA-16720: Support multiple groups in DescribeShareGroupOffsets RPC (#18834 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>	2025-02-13 18:27:05 +00:00
Calvin Liu	9cb271f1e1	KAFKA-18654[2/2]: Transction V2 retry add partitions on the server side when handling produce request. (#18810 ) During the transaction commit phase, it is normal to hit CONCURRENT_TRANSACTION error before the transaction markers are fully propagated. Instead of letting the client to retry the produce request, it is better to retry on the server side. Reviewers: Artem Livshits <alivshits@confluent.io>, Justine Olshan <jolshan@confluent.io>	2025-02-13 09:30:58 -08:00
Apoorv Mittal	a13d815a0d	MINOR: Updated share partition manager tests to close and other fixes (#18862 ) Reviewers: Abhinav Dixit <adixit@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-02-13 13:37:37 +00:00
Ken Huang	9494bebee6	KAFKA-18728 Move ListOffsetsPartitionStatus to server module (#18807 ) Reviewers: Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	2025-02-13 10:36:46 +05:30
Jhen-Yung Hsu	b0e5cdfc57	KAFKA-18777 add `PartitionsWithLateTransactionsCount` to BrokerMetricNamesTest (#18869 ) Rewrite BrokerMetricNamesTest using ReplicaManager.MetricNames, ensuring that all metrics are always included. This helps prevent issues like PartitionsWithLateTransactionsCount not being correctly included in the test before. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-12 22:09:42 +08:00
PoAn Yang	63fc9b3cb8	KAFKA-18771: fix Flaky test KRaftClusterTest .testDescribeQuorumRequestToControllers (#18859 ) The case testDescribeQuorumRequestToControllers shutdowns raft client but not the controller. This makes client has chance to send a request to the controller and get NOT_LEADER_OR_FOLLOWER error. However, if the raft client finishes shutdown before handling the request, the request will not be handled. Shutdown the controller before doing KafkaFuture#get for the client request, so we can make sure the request is handled by another controller eventually. Signed-off-by: PoAn Yang <payang@apache.org> Reviewers: Luke Chen <showuon@gmail.com>	2025-02-12 16:16:43 +08:00
Justine Olshan	400363b7e2	KAFKA-18035: TransactionsTest testBumpTransactionalEpochWithTV2Disabled failed on trunk (#18451 ) Sometimes we didn't get into abortable state before aborting, so the epoch didn't get bumped. Now we force abortable state with an attempt to send before aborting so the epoch bump occurs as expected. Reviewers: Jeff Kim <jeff.kim@confluent.io>	2025-02-11 14:01:43 -08:00
Edoardo Comar	7e405ccc65	KAFKA-18758: NullPointerException in shutdown following InvalidConfigurationException (#18833 ) * KAFKA-18758: NullPointerException in shutdown following InvalidConfigurationException Add checks for null in shutdown as BrokerLifecycleManager is not instantiaited if LogManager constructor throws an Exception	2025-02-11 10:06:55 +00:00
Sushant Mahajan	675a0889de	KAFKA-18764: Throttle on share state RPCs auth failure. (#18855 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-11 09:54:24 +00:00
Mickael Maison	ece91e9247	KAFKA-14484: Move UnifiedLog static methods to storage (#18039 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 09:55:32 +01:00
TengYao Chi	f5dd661cb5	KAFKA-18396: Migrate log4j1 configuration to log4j2 in KafkaDockerWrapper (#18394 ) After log4j migration, we need to update the logging configuration in KafkaDockerWrapper from log4j1 to log4j2. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	2025-02-11 13:25:23 +05:30
TaiJuWu	9fc7500684	KAFKA-18770 close the RM created by testDelayedShareFetchPurgatoryOperationExpiration (#18853 ) it's crucial to utilize a try-finally block to ensure proper closure of the ReplicaManager. Failing to do so can result in an unreleased thread from the purgatory, potentially leading to errors in subsequent integration tests that incorporate thread leak detection. Reviewers: poorv Mittal <apoorvmittal10@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 07:35:13 +08:00
Ken Huang	581e94840f	KAFKA-18366 Remove KafkaConfig.interBrokerProtocolVersion (#18820 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 06:18:02 +08:00
Jhen-Yung Hsu	4e36368d08	KAFKA-18743 Remove leader.imbalance.per.broker.percentage as it is not supported by Kraft (#18821 ) Remove `leader.imbalance.per.broker.percentage` from config. Add `leader.imbalance.per.broker.percentage` to release note Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 04:01:57 +08:00
Ken Huang	70adf746c4	KAFKA-18225 ClientQuotaCallback#updateClusterMetadata is unsupported by kraft (#18196 ) This commit ensures that the ClientQuotaCallback#updateClusterMetadata method is executed in KRaft mode. This method is triggered whenever a topic or cluster metadata change occurs. However, in KRaft mode, the current implementation of the updateClusterMetadata API is inefficient due to the requirement of creating a full Cluster object. To address this, a follow-up issue (KAFKA-18239) has been created to explore more efficient mechanisms for providing cluster information to the ClientQuotaCallback without incurring the overhead of a full Cluster object creation. Reviewers: Mickael Maison <mickael.maison@gmail.com>, TaiJuWu <tjwu1217@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 01:03:02 +08:00
PoAn Yang	b22c7d5b5c	KAFKA-17833: Convert DescribeAuthorizedOperationsTest to use KRaft (#18252 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	2025-02-07 15:44:27 +01:00
Piotr P. Karwasz	666571216b	KAFKA-18483 Disable `Log4jController` and `Loggers` if Log4j Core absent (#18496 ) If Log4j Core is absent, most calls to Log4jController and Loggers will end up with a NoClassDefFoundError. This changeset: - Profits from the major version bump to rename k.util.Log4jController to LoggingController. - Removes o.a.l.l.Level from the signature of public methods of o.a.k.connect.runtime.Loggers and replaces it with String. - Provides an additional no-op implementation of k.util.LoggingController and o.a.k.connect.runtime.Loggers: if Log4j Core is not present on the runtime classpath the no-op implementation will be used. Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-07 00:04:33 +08:00
Colin Patrick McCabe	b2b2408692	KAFKA-18360 Remove zookeeper configurations (#18566 ) Remove broker.id.generation.enable and reserved.broker.max.id, which are not used in KRaft mode. Remove inter.broker.protocol.version, which is not used in KRaft mode. Reviewers: PoAn Yang <payang@apache.org>, Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-06 22:22:11 +08:00
Ken Huang	a3d9d881e1	KAFKA-18530 Remove ZooKeeperInternals (#18641 ) Since zk has been removed in 4.0, config handlers no longer need to handle the "<default>" value. This PR streamlines the config update process by eliminating the unnecessary string checks for "<default>" Reviewers: Christo Lolov <lolovc@amazon.com>, Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-06 17:48:17 +08:00
Ming-Yen Chung	34e7136b7a	MINOR: Fix wrong config property in KafkaConfigTest (#18815 ) Reviewers: Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-06 17:09:52 +08:00
Kuan-Po Tseng	b99be961b8	KAFKA-18206: EmbeddedKafkaCluster must set features (#18189 ) related to KAFKA-18206, set features in EmbeddedKafkaCluster in both streams and connect module, note that this PR also fix potential transaction with empty records in sendPrivileged method as transaction version 2 doesn't allow this kind of scenario. Reviewers: Justine Olshan <jolshan@confluent.io>	2025-02-05 09:14:36 -08:00
Chirag Wadhwa	01587d09d8	KAFKA-18494-3: solution for the bug relating to gaps in the share partition cachedStates post initialization (#18696 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Abhinav Dixit <adixit@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-02-05 15:16:25 +00:00
Sanskar Jhajharia	7dbed2f6e8	[KAFKA-16720] AdminClient Support for ListShareGroupOffsets (2/2) (#18671 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Sushant Mahajan <smahajan@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-02-05 14:38:09 +00:00
TengYao Chi	66363160c5	KAFKA-18645: New consumer should align close timeout handling with classic consumer (#18702 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>, Kirk True <ktrue@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-05 09:08:51 -05:00
PoAn Yang	21645ebf0b	KAFKA-18705: Move ConfigRepository to metadata module (#18784 ) Reviewers: TengYao Chi <kitingiao@gmail.com>, Christo Lolov <lolovc@amazon.com>	2025-02-05 10:13:36 +00:00
Justine Olshan	00dddee347	MINOR: Add missing test tag to UnifiedLogTest.scala (#18794 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-04 13:56:14 -08:00
Sean Quah	42e7cbb67e	KAFKA-18690: Keep leader metadata for RE2J-assigned partitions (#18777 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>	2025-02-04 13:22:28 -05:00
Justine Olshan	822b8ab3d7	KAFKA-18691: Flaky test testFencingOnTransactionExpiration (#18793 ) It appears this test was failing because the transaction was never aborting and the concurrent transactions errors would not go away. `ccab9eb` introduced the test failure because it requires the transaction to complete, but I suspect the lack of completion was happening before the change. The timeout for the write is based on the transactional timeout, and 100ms seemed too small -- thus the requests to update the state would often repeatedly time out. Also removed the loop since it was not necessary. Reviewers: Jeff Kim <jeff.kim@confluent.io>, Calvin Liu <caliu@confluent.io>	2025-02-04 08:45:34 -08:00
Luke Chen	612e1299e4	KAFKA-18230: Handle not controller or not leader error in admin client (#18165 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-04 16:51:24 +01:00
Calvin Liu	ad031b99d3	KAFKA-18635: reenable the unclean shutdown detection (#18277 ) We need to re-enable the unclean shutdown detection when in ELR mode, which was inadvertently removed during the development process. Reviewers: David Mao <dmao@confluent.io>, Jun Rao <junrao@gmail.com>	2025-02-03 22:26:57 -08:00
Ming-Yen Chung	9f78771a1f	KAFKA-18693 Remove PasswordEncoder (#18790 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-04 13:18:41 +08:00
Justine Olshan	ab8ef87c7f	KAFKA-18654 [1/2]: Transaction Version 2 performance regression due to early return (#18720 ) https://issues.apache.org/jira/browse/KAFKA-18575 solved a critical race condition by returning with CONCURRENT_TRANSACTIONS early when the transaction was still completing. In testing, it was discovered that this early return could cause performance regressions. Prior to KIP-890 the addpartitions call was a separate call from the producer. There was a previous change https://issues.apache.org/jira/browse/KAFKA-5477 that decreased the retry backoff to 20ms. With KIP-890 and making the call through the produce path, we go back to the default retry backoff which takes longer. Prior to 18575 we introduce a slight delay when sending to the coordinator, so prior to 18575, we are less likely to return quickly and get stuck in this backoff. However, based on results from produce benchmarks, we can still run into the default backoff in some scenarios. This PR reverts KAFKA-18575, and doesn't return early and wait until the coordinator for checking if a transaction is ongoing. Instead, it will fix the handling with the verification guard so we don't hit the edge condition. Also cleans up some of the verification text that was unclear. Reviewers: Jeff Kim <jeff.kim@confluent.io>, Artem Livshits <alivshits@confluent.io>	2025-02-03 15:24:34 -08:00
Ken Huang	272d947f96	KAFKA-18545: Remove Zookeeper logic from LogManager (#18592 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Mickael Maison <mickael.maison@gmail.com>	2025-02-03 17:16:35 +00:00
Ken Huang	7fdd11295c	KAFKA-18685: Cleanup DynamicLogConfig constructor (#18764 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Christo Lolov <lolovc@amazon.com>	2025-02-03 15:38:05 +00:00
PoAn Yang	f6f41dc5eb	KAFKA-17631 Convert SaslApiVersionsRequestTest to kraft (#18330 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-03 21:01:38 +08:00
Jhen-Yung Hsu	9ba2621620	MINOR: Remove the test for ZooKeeper metrics used by ZooKeeperClient (#18775 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-03 20:06:01 +08:00
David Jacot	bf05d2c914	KAFKA-18672; CoordinatorRecordSerde must validate value version (#18749 ) CoordinatorRecordSerde does not validate the version of the value to check whether the version is supported by the current version of the software. This is problematic if a future and unsupported version of the record is read by an older version of the software because it would misinterpret the bytes. Hence CoordinatorRecordSerde must throw an error if the version is unknown. This is also consistent with the handling in the old coordinator. Reviewers: Jeff Kim <jeff.kim@confluent.io>	2025-02-03 02:19:27 -08:00
Ismael Juma	78aff4fede	KAFKA-18659: librdkafka compressed produce fails unless api versions returns produce v0 (#18727 ) Return produce v0-v2 as supported versions in `ApiVersionsResponse`, but disable support for it everywhere else. Since clients pick the highest supported version by both client and broker during version negotiation, this solves the problem with minimal tech debt (even though it's not ideal that `ApiVersionsResponse` becomes inconsistent with the actual protocol support). Add one test for the socket server handling (in `ProcessorTest`) and one test for the client behavior (in `ProduceRequestTest`). Adjust a couple of api versions tests to verify the new behavior. Finally, include a few clean-ups in `ApiKeys`, `Protocol`, `ProduceRequest`, `ProduceRequestTest` and `BrokerApiVersionsCommandTest`. Reference to related librdkafka issue: https://github.com/confluentinc/librdkafka/issues/4956 Reviewers: Jun Rao <junrao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>	2025-02-01 16:08:54 -08:00
kevin-wu24	184b891871	KAFKA-16524; Metrics for KIP-853 (#18304 ) This change implement some of the metrics enumerated in KIP-853. The KafkaRaftMetrics object now exposes number-of-voters, number-of-observers and uncommitted-voter-change. The number-of-observers and uncommitted-voter-change metrics are only present on the active controller or leader, since it does not make sense for other replicas to report these metrics. In order to make these two metrics thread-safe, KafkaRaftMetrics needs to be passed into LeaderState, and therefore QuorumState. This introduces a circularity since the KafkaRaftMetrics constructor takes in QuorumState. To break the circularity for now, the logic using QuorumState will be moved to the KafkaRaftMetrics#initialize method. The BrokerServerMetrics object now exposes ignored-static-voters. The ControllerServerMetrics object now exposes IgnoredStaticVoters. To implement both metrics for "ignored static voters", this PR introduces the ExternalKRaftMetrics interface, which allows for higher layer metrics objects to be accessible within the raft module. Reviewers: José Armando García Sancio <jsancio@apache.org>	2025-01-30 18:35:01 -05:00
Justine Olshan	ccab9eb8b4	KAFKA-18660: Transactions Version 2 doesn't handle epoch overflow correctly (#18730 ) Fixed the typo that used the wrong producer ID and epoch when returning so that we handle epoch overflow correctly. We also had to rearrange the concurrent transaction handling so that we don't self-fence when we start the new transaction with the new producer ID. I also tested this with a modified version of the code where epoch overflow happens on the first epoch bump (every request has a new producer id) Reviewers: Artem Livshits <alivshits@confluent.io>, Jeff Kim <jeff.kim@confluent.io>	2025-01-30 13:42:10 -08:00
Ken Huang	4b29fd6383	KAFKA-18034: CommitRequestManager should fail pending requests on fatal coordinator errors (#18548 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>, Kirk True <ktrue@confluent.io>	2025-01-30 11:22:54 -05:00
Sushant Mahajan	be96807ac8	MINOR: Refactor share coord cache helper to share package. (#18743 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-01-30 13:33:42 +00:00
TengYao Chi	9dd73d43b0	KAFKA-18569: New consumer close may wait on unneeded FindCoordinator (#18590 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>, Kirk True <ktrue@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-29 14:15:56 -05:00
PoAn Yang	4dd0bcbde8	KAFKA-18383 Remove reserved.broker.max.id and broker.id.generation.enable (#18478 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-30 02:55:09 +08:00
Calvin Liu	a3b34c1315	KAFKA-18662: Return CONCURRENT_TRANSACTIONS on produce request in TV2 (#18733 ) While testing, it was found that the not_enough_replicas error was super common and could be easily confused. Since we are already bumping the request, we can signify that the produce request may return this error and new clients can handle it (Note, the java client should be able to handle this already as a retriable error, but other client libraries may need to implement this change) Reviewers: Justine Olshan <jolshan@confluent.io>	2025-01-29 10:15:48 -08:00
Sushant Mahajan	632aedcf4f	KAFKA-18632: Multibroker test improvements. (#18718 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-01-29 17:03:43 +00:00
Abhinav Dixit	dd1f2b8aab	KAFKA-18653: Fix mocks and potential thread leak issues causing silent RejectedExecutionException in share group broker tests (#18725 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Andrew Schofield <aschofield@confluent.io>	2025-01-29 16:24:30 +00:00
Ismael Juma	ca5d2cf76d	KAFKA-18646: Null records in fetch response breaks librdkafka (#18726 ) Ensure we always return empty records (including cases where an error is returned). We also remove `nullable` from `records` since it is effectively expected to be non-null by a large percentage of clients in the wild. This behavior regressed in `fe56fc9` (KAFKA-18269). Empty records were previously set via `FetchResponse.recordsOrFail(partitionData)` in the now-removed `maybeConvertFetchedData` method. Added an integration test that fails without this fix and also update many tests to set `records` to `empty` instead of leaving them as `null`. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, David Arthur <mumrah@gmail.com>	2025-01-29 07:04:12 -08:00
TengYao Chi	97a228070e	KAFKA-18619: New consumer topic metadata events should set requireMetadata flag (#18668 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>	2025-01-29 08:36:05 -05:00
Ismael Juma	e6d72c9e60	KAFKA-18648: Add back support for metadata version 0-3 (#18716 ) During testing, we identified that kafka-python (and aiokafka) relies on metadata request v0 and hence we need to add these back to comply with the premise of KIP-896 - i.e. it should not break the clients listed within it. I reverted the changes from #18218 related to the removal of metadata versions 0-3. I will submit a separate PR to undeprecate these API versions on the relevant 3.x branches. kafka-python (and aiokafka) work correctly (produce & consume) with this change on top of the 4.0 branch. Reviewers: David Arthur <mumrah@gmail.com>	2025-01-28 18:35:33 -08:00
Apoorv Mittal	c7619ef8d1	KAFKA-17951: Share parition rotate strategy (#18651 ) Reviewers: Andrew Schofield <aschofield@confluent.io>, Abhinav Dixit <adixit@confluent.io>	2025-01-28 11:44:48 +00:00
Sushant Mahajan	f32932cc25	KAFKA-18629: Delete share group state impl [1/N] (#18712 ) Reviewers: Christo Lolov <lolovc@amazon.com>, Andrew Schofield <aschofield@confluent.io>	2025-01-28 11:43:01 +00:00
Ken Huang	5631be20a6	MINOR: Remove ZooKeeper mentions in comments (#18646 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	2025-01-28 12:35:46 +01:00
Apoorv Mittal	04567cdb22	KAFKA-18657: Fixing SharePartitionManager flaky test (#18710 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Andrew Schofield <aschofield@confluent.io>	2025-01-28 08:06:58 +00:00
TaiJuWu	e89b30d14e	KAFKA-18528: MultipleListenersWithSameSecurityProtocolBaseTest and GssapiAuthenticationTest should run for async consumer (#18555 ) Reviewers: Kirk True <ktrue@confluent.io>, Lianet Magrans <lmagrans@confluent.io>	2025-01-27 15:49:44 -05:00
Sushant Mahajan	b92cd9d236	KAFKA-18632: Added few share consumer multibroker tests. (#18679 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-01-27 12:56:56 +00:00
Chung, Ming-Yen	a8f6fc9cc4	KAFKA-18631 Remove ZkConfigs (#18693 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-26 04:37:49 +08:00
PoAn Yang	be7415cb8b	KAFKA-18555 Avoid casting MetadataCache to KRaftMetadataCache (#18632 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-25 23:02:28 +08:00
Ken Huang	c40e7a1341	KAFKA-18533 Remove KafkaConfig zookeeper related logic (#18547 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-25 22:52:21 +08:00
Chung, Ming-Yen	43af241b50	KAFKA-18639 Enable the @Flaky annotation for some flaky tests (#18701 ) The following tests were previously reported as flaky but were only annotated with a comment in pull request #18558 due to module dependency limitations: testAdminClientApisAuthenticationFailure testOutdatedCoordinatorAssignment testThrottledProducerConsumer With the introduction of the new test infrastructure #18602 , which allows all modules to use the @Flaky annotation, these tests should now be updated to include the @Flaky annotation. Reviewers: TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-25 22:44:35 +08:00
mingdaoy	c23d4a0d73	KAFKA-18499 Clean up zookeeper from LogConfig (#18583 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-25 22:31:46 +08:00
TaiJuWu	023f9c26e6	KAFKA-18529: ConsumerRebootstrapTest should run for async consumer (#18554 ) Reviewers: Kirk True <ktrue@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>, Lianet Magrans <lmagrans@confluent.io>	2025-01-24 20:33:20 +01:00
Apoorv Mittal	70eab7778d	KAFKA-17894: Implemented broker topic metrics for Share Group 1/N (KIP-1103) (#18444 ) The PR implements the BrokerTopicMetrics defined in KIP-1103. The PR also corrected the share-acknowledgement-rate and share-acknowledgement-count metrics defined in KIP-932 as they are moved to BrokerTopicMetrics, necessary changes to KIP-932 broker metrics will be done once we complete KIP-1103. Reviewers: Andrew Schofield <aschofield@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>, Jun Rao <junrao@gmail.com>	2025-01-24 09:34:54 -08:00
TengYao Chi	2f1bf2f2ab	KAFKA-18630: Clean ReplicaManagerBuilder (#18687 ) Reviewers: Christo Lolov <lolovc@amazon.com>	2025-01-24 17:23:48 +00:00
David Arthur	8c0a0e07ce	KAFKA-17587 Refactor test infrastructure (#18602 ) This patch reorganizes our test infrastructure into three Gradle modules: ":test-common:test-common-internal-api" is now a minimal dependency which exposes interfaces and annotations only. It has one project dependency on server-common to expose commonly used data classes (MetadataVersion, Feature, etc). Since this pulls in server-common, this module is Java 17+. It cannot be used by ":clients" or other Java 11 modules. ":test-common:test-common-util" includes the auto-quarantined JUnit extension. The @Flaky annotation has been moved here. Since this module has no project dependencies, we can add it to the Java 11 list so that ":clients" and others can utilize the @Flaky annotation ":test-common:test-common-runtime" now includes all of the test infrastructure code (TestKitNodes, etc). This module carries heavy dependencies (core, etc) and so it should not normally be included as a compile-time dependency. In addition to this reorganization, this patch leverages JUnit SPI service discovery so that modules can utilize the integration test framework without depending on ":core". This will allow us to start moving integration tests out of core and into the appropriate sub-module. This is done by adding ":test-common:test-common-runtime" as a testRuntimeOnly dependency rather than as a testImplementation dependency. A trivial example was added to QuorumControllerTest to illustrate this. Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 09:03:43 -05:00
Ken Huang	0c9df75295	KAFKA-18474: Remove zkBroker listener (#18477 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>, PoAn Yang <payang@apache.org>	2025-01-24 05:53:32 -08:00
David Jacot	80d2a8a42d	KAFKA-18616; Refactor DumpLogSegments's MessageParsers (#18688 ) All the work that we have done to automate and to simplify the coordinator records allows us to simplify the related MessageParsers in DumpLogSegments. They can all share the same based implementation. This is nice because it ensures that we handle all those records similarly. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 04:59:30 -08:00
TengYao Chi	5d81fe20c8	KAFKA-18590 Cleanup DelegationTokenManager (#18618 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 20:12:03 +08:00
TengYao Chi	fa2df3bca7	KAFKA-18559 Cleanup FinalizedFeatures (#18593 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 19:39:01 +08:00
Logan Zhu	356f0d815c	KAFKA-18597 Fix max-buffer-utilization-percent is always 0 (#18627 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 18:21:34 +08:00
TengYao Chi	66868fc1fa	KAFKA-18620: Remove UnifiedLog#legacyFetchOffsetsBefore (#18686 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	2025-01-24 11:11:05 +01:00
TengYao Chi	40890faa1b	KAFKA-18592 Cleanup ReplicaManager (#18621 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Christo Lolov <lolovc@amazon.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 01:34:36 +08:00
TaiJuWu	ce4eeaa379	MINOR: restore `testGetAllTopicMetadataShouldNotCreateTopicOrReturnUnknownTopicPartition` (#18633 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 01:27:18 +08:00
Sushant Mahajan	01afba8fdb	MINOR: Refactor ShareConsumerTest to use ClusterTestExtensions. (#18656 ) Reviewers: ShivsundarR <shr@confluent.io>, Apoorv Mittal <apoorvmittal10@gmail.com>, Andrew Schofield <aschofield@confluent.io>	2025-01-23 16:35:33 +00:00
Ken Huang	7e46087570	MINOR: rename `resendBrokerRegistrationUnlessZkMode` to `resendBrokerRegistration` (#18645 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 00:33:05 +08:00
TengYao Chi	bdc92fd5a1	MINOR: Cleanup zk condition in TransactionsTest, QuorumTestHarness and PlaintextConsumerAssignorsTest (#18639 ) Reviewers: Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-23 19:53:10 +08:00
David Jacot	bc807083fb	KAFKA-18486; [1/2] Update LocalLeaderEndPointTest (#18666 ) This patch is a first step towards removing `ReplicaManager#becomeLeaderOrFollower`. It updates the `LocalLeaderEndPointTest` tests. Reviewers: Christo Lolov <lolovc@amazon.com>, Ismael Juma <ismael@juma.me.uk>	2025-01-23 10:49:16 +01:00
Justine Olshan	94a1bfb128	KAFKA-18575: Transaction Version 2 doesn't correctly handle race condition with completing and new transaction(#18604 ) There is a subtle race condition with transactions V2 if a transaction is still completing when checking if we need to add a partition, but it completes when the request reaches the coordinator. One approach was to remove the verification for TV2 and just check the epoch on write, but a simpler one is to simply return concurrent transactions from the partition leader (before attempting to add the partition). I've done this and added a test for this behavior. Locally, I reproduced the race but adding a 1 second sleep when handling the WriteTxnMarkersRequest and a 2 second delay before adding the partition to the AddPartitionsToTxnManager. Without this change, the race happened on every second transaction as the first one completed. With this change, the error went away. As a followup, we may want to clean up some of the code and comments with respect to verification as the code is used by both TV0 + verification and TV2. But that doesn't need to complete for 4.0. This does :) Reviewers: Jeff Kim <jeff.kim@confluent.io>, Artem Livshits <alivshits@confluent.io>, Calvin Liu <caliu@confluent.io>	2025-01-22 13:44:08 -08:00
Lianet Magrans	410065a65d	KAFKA-18517: Enable ConsumerBounceTest to run for new async consumer (#18532 ) Reviewers: Andrew Schofield <aschofield@confluent.io>, Kirk True <ktrue@confluent.io>	2025-01-22 18:02:38 +01:00
Xiaobing Fang	f4d90398cc	MINOR: Fix `LogCleanerManagerTest.testLogsUnderCleanupIneligibleForCompaction()` for `LogMessageTimestampType = "LogAppendTime"` (#12333 ) While setting Defaults.LogMessageTimestampType to "LogAppendTime", `LogCleanerManagerTest.testLogsUnderCleanupIneligibleForCompaction()` fails with a InvalidTimestampException. This PR fixes this by regenerating the records instead of previous approach of re-using same records in the test. Reviewers: Divij Vaidya <diviv@amazon.com>, Kvicii <kvicii.yu@gmail.com> --------- Co-authored-by: fangxiaobing <fangxiaobing@kuaishou.com>	2025-01-22 17:50:39 +01:00
Ken Huang	341e535942	KAFKA-18519: Remove Json.scala, cleanup AclEntry.scala (#18614 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-22 16:12:06 +01:00
Chung, Ming-Yen	084fcbd327	KAFKA-18599: Remove Optional wrapping for forwardingManager in ApiVersionManager (#18630 ) `forwardingManager` is always present now. Reviewers: Ismael Juma <ismael@juma.me.uk>	2025-01-22 06:50:16 -08:00
TengYao Chi	a3da6bbb0c	MINOR: Cleanup ControllerCOntext and StateChangeLogger (#18588 ) These methods were previously invoked by ZK components, but we have just removed them. Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-21 18:41:17 -08:00
David Jacot	b368c38684	KAFKA-18302; Update CoordinatorRecord (#18512 ) This patch does a few things: 1) Replace ApiMessageAndVersion by ApiMessage in CoordinatorRecord for the key 2) Leverage the fact that ApiMessage exposes the apiKey. Hence we don't need to specify the key anymore. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-01-21 18:11:26 +01:00
David Jacot	256ccd0c0d	KAFKA-18487; Remove ReplicaManager#stopReplicas (#18647 ) This patch removes `ReplicaManager#stopReplicas`. I have ensured that removed unit tests are covered by other existing tests or are updated to use kraft. Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-21 11:47:16 +01:00
Dimitar Dimitrov	31d8e68ed1	KAFKA-18583; Fix getPartitionReplicaEndpoints for KRaft (#18635 ) Although `MetadataCache`'s `getPartitionReplicaEndpoints` takes a single topic-partition, the `KRaftMetadataCache` implementation iterates over all partitions of the matching topic. This is not necessary and can cause significant performance degradation when the topic has a relatively high number of partitions. Note that this is not a recent regression - it has been a part of `KRaftMetadataCache` since its creation. Reviewers: Ismael Juma <ismael@juma.me.uk>, David Jacot <djacot@confluent.io>	2025-01-21 10:51:59 +01:00
David Jacot	76bf38a4fd	KAFKA-18604; Update transaction coordinator (#18636 ) This patch updates the transaction coordinator record to use the new coordinator record definition. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-01-21 08:36:23 +01:00
Ismael Juma	87b37a4065	KAFKA-14552: Assume a baseline of 3.0 for server protocol versions (#18497 ) Kafka 4.0 will remove support for zk mode and will require conversion to kraft before upgrading to 4.0. The minimum kraft version is 3.0 (aka 3.0-IV1). This provides an opportunity to remove exclusively server side protocols versions that only exist to allow direct upgrades from versions older than 3.0 or that are used only by zk mode. Since KRaft became production ready in 3.3, we should consider setting the baseline to 3.3. But that requires more discussion and it can be done via a separate change (KAFKA-18601). Protocol changes: * Remove RequestHeader v0 (only used by ControlledShutdown v0) * Remove WriteTxnMarkers v0 * Remove all versions of ControlledShutdown, LeaderAndIsr, StopReplica, UpdateMetadata In order to remove all versions safely, extend generator to support setting "versions" to "none". In this case, we no longer generate the `*Data` classes, but we still reserve the id for the relevant protocol api (so it doesn't get accidentally used for something else). The protocol documentation is correct after these changes. We kept a simplified version of `LeaderAndIsr{Request\|Response}` because it's used by many tests that are still relevant in kraft mode. Once KAFKA-18486 is done, it may be possible to remove it (I left a comment on the ticket). Similarly, KAFKA-18487 may make it possible to remove the introduced `StopReplicaPartitionState` (left a comment on that ticket too). There are a number of places that were adjusted to include an `ApiKeys.hasValidVersion` check. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-20 13:51:44 -08:00
TengYao Chi	837fb1ed02	MINOR: Remove unused QuotaConfgHandler (#18617 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-21 03:02:42 +08:00
TengYao Chi	f1ee0557f8	MINOR: Remove zk related statement from ControllerConfigurationValidator (#18637 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	2025-01-20 17:10:24 +01:00
Ken Huang	892a446789	KAFKA-18594: Cleanup BrokerLifecycleManager (#18626 ) Reviewers: Christo Lolov <lolovc@amazon.com>	2025-01-20 15:17:57 +00:00
Ken Huang	71495a2013	KAFKA-18568: Fix flaky test ClientIdQuotaTest (#18612 ) The reason for flakiness is PR #18080 which modifies the linger.ms config from 0 to 5. ClientIdQuotaTest are testing "Low enough quota that a producer sending a small payload in a tight loop should get throttled", thus this config change Influence this test scenario. This commits uses the older value of 0ms for linger.ms for ClientIdQuotaTest tests. Reviewers: Ismael Juma <ismael@juma.me.uk>, TaiJuWu <tjwu1217@gmail.com>, Divij Vaidya <diviv@amazon.com>	2025-01-20 16:05:47 +01:00
TengYao Chi	a842c02b88	KAFKA-18553: Update javadoc and comments of ConfigType (#18567 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Andrew Schofield <aschofield@confluent.io>, Apoorv Mittal <amittal@confluent.io>	2025-01-20 15:20:36 +01:00
Sanskar Jhajharia	bcbc72e29b	[KAFKA-16720] AdminClient Support for ListShareGroupOffsets (1/n) (#18571 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Andrew Schofield <aschofield@confluent.io>	2025-01-20 07:47:14 +00:00
Ken Huang	96499029b7	KAFKA-18588 Remove TopicKey.scala (#18624 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-20 10:30:14 +08:00
TaiJuWu	20e616ecc1	KAFKA-18578: Remove `UpdateMetadataRequest` from `MetadataCacheTest` (#18628 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	2025-01-19 12:18:43 -08:00
Ken Huang	c044eb61a1	KAFKA-18593 Remove ZkCachedControllerId In MetadataCache (#18625 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	2025-01-19 10:15:06 -08:00

... 3 4 5 6 7 ...

5943 Commits