kafka

Commit Graph

Author	SHA1	Message	Date
Apoorv Mittal	a13d815a0d	MINOR: Updated share partition manager tests to close and other fixes (#18862 ) Reviewers: Abhinav Dixit <adixit@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-02-13 13:37:37 +00:00
Ken Huang	9494bebee6	KAFKA-18728 Move ListOffsetsPartitionStatus to server module (#18807 ) Reviewers: Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	2025-02-13 10:36:46 +05:30
Jhen-Yung Hsu	b0e5cdfc57	KAFKA-18777 add `PartitionsWithLateTransactionsCount` to BrokerMetricNamesTest (#18869 ) Rewrite BrokerMetricNamesTest using ReplicaManager.MetricNames, ensuring that all metrics are always included. This helps prevent issues like PartitionsWithLateTransactionsCount not being correctly included in the test before. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-12 22:09:42 +08:00
PoAn Yang	63fc9b3cb8	KAFKA-18771: fix Flaky test KRaftClusterTest .testDescribeQuorumRequestToControllers (#18859 ) The case testDescribeQuorumRequestToControllers shutdowns raft client but not the controller. This makes client has chance to send a request to the controller and get NOT_LEADER_OR_FOLLOWER error. However, if the raft client finishes shutdown before handling the request, the request will not be handled. Shutdown the controller before doing KafkaFuture#get for the client request, so we can make sure the request is handled by another controller eventually. Signed-off-by: PoAn Yang <payang@apache.org> Reviewers: Luke Chen <showuon@gmail.com>	2025-02-12 16:16:43 +08:00
Justine Olshan	400363b7e2	KAFKA-18035: TransactionsTest testBumpTransactionalEpochWithTV2Disabled failed on trunk (#18451 ) Sometimes we didn't get into abortable state before aborting, so the epoch didn't get bumped. Now we force abortable state with an attempt to send before aborting so the epoch bump occurs as expected. Reviewers: Jeff Kim <jeff.kim@confluent.io>	2025-02-11 14:01:43 -08:00
Edoardo Comar	7e405ccc65	KAFKA-18758: NullPointerException in shutdown following InvalidConfigurationException (#18833 ) * KAFKA-18758: NullPointerException in shutdown following InvalidConfigurationException Add checks for null in shutdown as BrokerLifecycleManager is not instantiaited if LogManager constructor throws an Exception	2025-02-11 10:06:55 +00:00
Sushant Mahajan	675a0889de	KAFKA-18764: Throttle on share state RPCs auth failure. (#18855 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-11 09:54:24 +00:00
Mickael Maison	ece91e9247	KAFKA-14484: Move UnifiedLog static methods to storage (#18039 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 09:55:32 +01:00
TengYao Chi	f5dd661cb5	KAFKA-18396: Migrate log4j1 configuration to log4j2 in KafkaDockerWrapper (#18394 ) After log4j migration, we need to update the logging configuration in KafkaDockerWrapper from log4j1 to log4j2. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	2025-02-11 13:25:23 +05:30
TaiJuWu	9fc7500684	KAFKA-18770 close the RM created by testDelayedShareFetchPurgatoryOperationExpiration (#18853 ) it's crucial to utilize a try-finally block to ensure proper closure of the ReplicaManager. Failing to do so can result in an unreleased thread from the purgatory, potentially leading to errors in subsequent integration tests that incorporate thread leak detection. Reviewers: poorv Mittal <apoorvmittal10@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 07:35:13 +08:00
Ken Huang	581e94840f	KAFKA-18366 Remove KafkaConfig.interBrokerProtocolVersion (#18820 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 06:18:02 +08:00
Jhen-Yung Hsu	4e36368d08	KAFKA-18743 Remove leader.imbalance.per.broker.percentage as it is not supported by Kraft (#18821 ) Remove `leader.imbalance.per.broker.percentage` from config. Add `leader.imbalance.per.broker.percentage` to release note Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 04:01:57 +08:00
Ken Huang	70adf746c4	KAFKA-18225 ClientQuotaCallback#updateClusterMetadata is unsupported by kraft (#18196 ) This commit ensures that the ClientQuotaCallback#updateClusterMetadata method is executed in KRaft mode. This method is triggered whenever a topic or cluster metadata change occurs. However, in KRaft mode, the current implementation of the updateClusterMetadata API is inefficient due to the requirement of creating a full Cluster object. To address this, a follow-up issue (KAFKA-18239) has been created to explore more efficient mechanisms for providing cluster information to the ClientQuotaCallback without incurring the overhead of a full Cluster object creation. Reviewers: Mickael Maison <mickael.maison@gmail.com>, TaiJuWu <tjwu1217@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 01:03:02 +08:00
PoAn Yang	b22c7d5b5c	KAFKA-17833: Convert DescribeAuthorizedOperationsTest to use KRaft (#18252 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	2025-02-07 15:44:27 +01:00
Piotr P. Karwasz	666571216b	KAFKA-18483 Disable `Log4jController` and `Loggers` if Log4j Core absent (#18496 ) If Log4j Core is absent, most calls to Log4jController and Loggers will end up with a NoClassDefFoundError. This changeset: - Profits from the major version bump to rename k.util.Log4jController to LoggingController. - Removes o.a.l.l.Level from the signature of public methods of o.a.k.connect.runtime.Loggers and replaces it with String. - Provides an additional no-op implementation of k.util.LoggingController and o.a.k.connect.runtime.Loggers: if Log4j Core is not present on the runtime classpath the no-op implementation will be used. Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-07 00:04:33 +08:00
Colin Patrick McCabe	b2b2408692	KAFKA-18360 Remove zookeeper configurations (#18566 ) Remove broker.id.generation.enable and reserved.broker.max.id, which are not used in KRaft mode. Remove inter.broker.protocol.version, which is not used in KRaft mode. Reviewers: PoAn Yang <payang@apache.org>, Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-06 22:22:11 +08:00
Ken Huang	a3d9d881e1	KAFKA-18530 Remove ZooKeeperInternals (#18641 ) Since zk has been removed in 4.0, config handlers no longer need to handle the "<default>" value. This PR streamlines the config update process by eliminating the unnecessary string checks for "<default>" Reviewers: Christo Lolov <lolovc@amazon.com>, Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-06 17:48:17 +08:00
Ming-Yen Chung	34e7136b7a	MINOR: Fix wrong config property in KafkaConfigTest (#18815 ) Reviewers: Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-06 17:09:52 +08:00
Kuan-Po Tseng	b99be961b8	KAFKA-18206: EmbeddedKafkaCluster must set features (#18189 ) related to KAFKA-18206, set features in EmbeddedKafkaCluster in both streams and connect module, note that this PR also fix potential transaction with empty records in sendPrivileged method as transaction version 2 doesn't allow this kind of scenario. Reviewers: Justine Olshan <jolshan@confluent.io>	2025-02-05 09:14:36 -08:00
Chirag Wadhwa	01587d09d8	KAFKA-18494-3: solution for the bug relating to gaps in the share partition cachedStates post initialization (#18696 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Abhinav Dixit <adixit@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-02-05 15:16:25 +00:00
Sanskar Jhajharia	7dbed2f6e8	[KAFKA-16720] AdminClient Support for ListShareGroupOffsets (2/2) (#18671 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Sushant Mahajan <smahajan@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-02-05 14:38:09 +00:00
TengYao Chi	66363160c5	KAFKA-18645: New consumer should align close timeout handling with classic consumer (#18702 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>, Kirk True <ktrue@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-05 09:08:51 -05:00
PoAn Yang	21645ebf0b	KAFKA-18705: Move ConfigRepository to metadata module (#18784 ) Reviewers: TengYao Chi <kitingiao@gmail.com>, Christo Lolov <lolovc@amazon.com>	2025-02-05 10:13:36 +00:00
Justine Olshan	00dddee347	MINOR: Add missing test tag to UnifiedLogTest.scala (#18794 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-04 13:56:14 -08:00
Sean Quah	42e7cbb67e	KAFKA-18690: Keep leader metadata for RE2J-assigned partitions (#18777 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>	2025-02-04 13:22:28 -05:00
Justine Olshan	822b8ab3d7	KAFKA-18691: Flaky test testFencingOnTransactionExpiration (#18793 ) It appears this test was failing because the transaction was never aborting and the concurrent transactions errors would not go away. `ccab9eb` introduced the test failure because it requires the transaction to complete, but I suspect the lack of completion was happening before the change. The timeout for the write is based on the transactional timeout, and 100ms seemed too small -- thus the requests to update the state would often repeatedly time out. Also removed the loop since it was not necessary. Reviewers: Jeff Kim <jeff.kim@confluent.io>, Calvin Liu <caliu@confluent.io>	2025-02-04 08:45:34 -08:00
Luke Chen	612e1299e4	KAFKA-18230: Handle not controller or not leader error in admin client (#18165 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-04 16:51:24 +01:00
Calvin Liu	ad031b99d3	KAFKA-18635: reenable the unclean shutdown detection (#18277 ) We need to re-enable the unclean shutdown detection when in ELR mode, which was inadvertently removed during the development process. Reviewers: David Mao <dmao@confluent.io>, Jun Rao <junrao@gmail.com>	2025-02-03 22:26:57 -08:00
Ming-Yen Chung	9f78771a1f	KAFKA-18693 Remove PasswordEncoder (#18790 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-04 13:18:41 +08:00
Justine Olshan	ab8ef87c7f	KAFKA-18654 [1/2]: Transaction Version 2 performance regression due to early return (#18720 ) https://issues.apache.org/jira/browse/KAFKA-18575 solved a critical race condition by returning with CONCURRENT_TRANSACTIONS early when the transaction was still completing. In testing, it was discovered that this early return could cause performance regressions. Prior to KIP-890 the addpartitions call was a separate call from the producer. There was a previous change https://issues.apache.org/jira/browse/KAFKA-5477 that decreased the retry backoff to 20ms. With KIP-890 and making the call through the produce path, we go back to the default retry backoff which takes longer. Prior to 18575 we introduce a slight delay when sending to the coordinator, so prior to 18575, we are less likely to return quickly and get stuck in this backoff. However, based on results from produce benchmarks, we can still run into the default backoff in some scenarios. This PR reverts KAFKA-18575, and doesn't return early and wait until the coordinator for checking if a transaction is ongoing. Instead, it will fix the handling with the verification guard so we don't hit the edge condition. Also cleans up some of the verification text that was unclear. Reviewers: Jeff Kim <jeff.kim@confluent.io>, Artem Livshits <alivshits@confluent.io>	2025-02-03 15:24:34 -08:00
Ken Huang	272d947f96	KAFKA-18545: Remove Zookeeper logic from LogManager (#18592 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Mickael Maison <mickael.maison@gmail.com>	2025-02-03 17:16:35 +00:00
Ken Huang	7fdd11295c	KAFKA-18685: Cleanup DynamicLogConfig constructor (#18764 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Christo Lolov <lolovc@amazon.com>	2025-02-03 15:38:05 +00:00
PoAn Yang	f6f41dc5eb	KAFKA-17631 Convert SaslApiVersionsRequestTest to kraft (#18330 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-03 21:01:38 +08:00
Jhen-Yung Hsu	9ba2621620	MINOR: Remove the test for ZooKeeper metrics used by ZooKeeperClient (#18775 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-03 20:06:01 +08:00
David Jacot	bf05d2c914	KAFKA-18672; CoordinatorRecordSerde must validate value version (#18749 ) CoordinatorRecordSerde does not validate the version of the value to check whether the version is supported by the current version of the software. This is problematic if a future and unsupported version of the record is read by an older version of the software because it would misinterpret the bytes. Hence CoordinatorRecordSerde must throw an error if the version is unknown. This is also consistent with the handling in the old coordinator. Reviewers: Jeff Kim <jeff.kim@confluent.io>	2025-02-03 02:19:27 -08:00
Ismael Juma	78aff4fede	KAFKA-18659: librdkafka compressed produce fails unless api versions returns produce v0 (#18727 ) Return produce v0-v2 as supported versions in `ApiVersionsResponse`, but disable support for it everywhere else. Since clients pick the highest supported version by both client and broker during version negotiation, this solves the problem with minimal tech debt (even though it's not ideal that `ApiVersionsResponse` becomes inconsistent with the actual protocol support). Add one test for the socket server handling (in `ProcessorTest`) and one test for the client behavior (in `ProduceRequestTest`). Adjust a couple of api versions tests to verify the new behavior. Finally, include a few clean-ups in `ApiKeys`, `Protocol`, `ProduceRequest`, `ProduceRequestTest` and `BrokerApiVersionsCommandTest`. Reference to related librdkafka issue: https://github.com/confluentinc/librdkafka/issues/4956 Reviewers: Jun Rao <junrao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>	2025-02-01 16:08:54 -08:00
kevin-wu24	184b891871	KAFKA-16524; Metrics for KIP-853 (#18304 ) This change implement some of the metrics enumerated in KIP-853. The KafkaRaftMetrics object now exposes number-of-voters, number-of-observers and uncommitted-voter-change. The number-of-observers and uncommitted-voter-change metrics are only present on the active controller or leader, since it does not make sense for other replicas to report these metrics. In order to make these two metrics thread-safe, KafkaRaftMetrics needs to be passed into LeaderState, and therefore QuorumState. This introduces a circularity since the KafkaRaftMetrics constructor takes in QuorumState. To break the circularity for now, the logic using QuorumState will be moved to the KafkaRaftMetrics#initialize method. The BrokerServerMetrics object now exposes ignored-static-voters. The ControllerServerMetrics object now exposes IgnoredStaticVoters. To implement both metrics for "ignored static voters", this PR introduces the ExternalKRaftMetrics interface, which allows for higher layer metrics objects to be accessible within the raft module. Reviewers: José Armando García Sancio <jsancio@apache.org>	2025-01-30 18:35:01 -05:00
Justine Olshan	ccab9eb8b4	KAFKA-18660: Transactions Version 2 doesn't handle epoch overflow correctly (#18730 ) Fixed the typo that used the wrong producer ID and epoch when returning so that we handle epoch overflow correctly. We also had to rearrange the concurrent transaction handling so that we don't self-fence when we start the new transaction with the new producer ID. I also tested this with a modified version of the code where epoch overflow happens on the first epoch bump (every request has a new producer id) Reviewers: Artem Livshits <alivshits@confluent.io>, Jeff Kim <jeff.kim@confluent.io>	2025-01-30 13:42:10 -08:00
Ken Huang	4b29fd6383	KAFKA-18034: CommitRequestManager should fail pending requests on fatal coordinator errors (#18548 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>, Kirk True <ktrue@confluent.io>	2025-01-30 11:22:54 -05:00
Sushant Mahajan	be96807ac8	MINOR: Refactor share coord cache helper to share package. (#18743 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-01-30 13:33:42 +00:00
TengYao Chi	9dd73d43b0	KAFKA-18569: New consumer close may wait on unneeded FindCoordinator (#18590 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>, Kirk True <ktrue@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-29 14:15:56 -05:00
PoAn Yang	4dd0bcbde8	KAFKA-18383 Remove reserved.broker.max.id and broker.id.generation.enable (#18478 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-30 02:55:09 +08:00
Calvin Liu	a3b34c1315	KAFKA-18662: Return CONCURRENT_TRANSACTIONS on produce request in TV2 (#18733 ) While testing, it was found that the not_enough_replicas error was super common and could be easily confused. Since we are already bumping the request, we can signify that the produce request may return this error and new clients can handle it (Note, the java client should be able to handle this already as a retriable error, but other client libraries may need to implement this change) Reviewers: Justine Olshan <jolshan@confluent.io>	2025-01-29 10:15:48 -08:00
Sushant Mahajan	632aedcf4f	KAFKA-18632: Multibroker test improvements. (#18718 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-01-29 17:03:43 +00:00
Abhinav Dixit	dd1f2b8aab	KAFKA-18653: Fix mocks and potential thread leak issues causing silent RejectedExecutionException in share group broker tests (#18725 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Andrew Schofield <aschofield@confluent.io>	2025-01-29 16:24:30 +00:00
Ismael Juma	ca5d2cf76d	KAFKA-18646: Null records in fetch response breaks librdkafka (#18726 ) Ensure we always return empty records (including cases where an error is returned). We also remove `nullable` from `records` since it is effectively expected to be non-null by a large percentage of clients in the wild. This behavior regressed in `fe56fc9` (KAFKA-18269). Empty records were previously set via `FetchResponse.recordsOrFail(partitionData)` in the now-removed `maybeConvertFetchedData` method. Added an integration test that fails without this fix and also update many tests to set `records` to `empty` instead of leaving them as `null`. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, David Arthur <mumrah@gmail.com>	2025-01-29 07:04:12 -08:00
TengYao Chi	97a228070e	KAFKA-18619: New consumer topic metadata events should set requireMetadata flag (#18668 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>	2025-01-29 08:36:05 -05:00
Ismael Juma	e6d72c9e60	KAFKA-18648: Add back support for metadata version 0-3 (#18716 ) During testing, we identified that kafka-python (and aiokafka) relies on metadata request v0 and hence we need to add these back to comply with the premise of KIP-896 - i.e. it should not break the clients listed within it. I reverted the changes from #18218 related to the removal of metadata versions 0-3. I will submit a separate PR to undeprecate these API versions on the relevant 3.x branches. kafka-python (and aiokafka) work correctly (produce & consume) with this change on top of the 4.0 branch. Reviewers: David Arthur <mumrah@gmail.com>	2025-01-28 18:35:33 -08:00
Apoorv Mittal	c7619ef8d1	KAFKA-17951: Share parition rotate strategy (#18651 ) Reviewers: Andrew Schofield <aschofield@confluent.io>, Abhinav Dixit <adixit@confluent.io>	2025-01-28 11:44:48 +00:00
Sushant Mahajan	f32932cc25	KAFKA-18629: Delete share group state impl [1/N] (#18712 ) Reviewers: Christo Lolov <lolovc@amazon.com>, Andrew Schofield <aschofield@confluent.io>	2025-01-28 11:43:01 +00:00
Ken Huang	5631be20a6	MINOR: Remove ZooKeeper mentions in comments (#18646 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	2025-01-28 12:35:46 +01:00
Apoorv Mittal	04567cdb22	KAFKA-18657: Fixing SharePartitionManager flaky test (#18710 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Andrew Schofield <aschofield@confluent.io>	2025-01-28 08:06:58 +00:00
TaiJuWu	e89b30d14e	KAFKA-18528: MultipleListenersWithSameSecurityProtocolBaseTest and GssapiAuthenticationTest should run for async consumer (#18555 ) Reviewers: Kirk True <ktrue@confluent.io>, Lianet Magrans <lmagrans@confluent.io>	2025-01-27 15:49:44 -05:00
Sushant Mahajan	b92cd9d236	KAFKA-18632: Added few share consumer multibroker tests. (#18679 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-01-27 12:56:56 +00:00
Chung, Ming-Yen	a8f6fc9cc4	KAFKA-18631 Remove ZkConfigs (#18693 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-26 04:37:49 +08:00
PoAn Yang	be7415cb8b	KAFKA-18555 Avoid casting MetadataCache to KRaftMetadataCache (#18632 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-25 23:02:28 +08:00
Ken Huang	c40e7a1341	KAFKA-18533 Remove KafkaConfig zookeeper related logic (#18547 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-25 22:52:21 +08:00
Chung, Ming-Yen	43af241b50	KAFKA-18639 Enable the @Flaky annotation for some flaky tests (#18701 ) The following tests were previously reported as flaky but were only annotated with a comment in pull request #18558 due to module dependency limitations: testAdminClientApisAuthenticationFailure testOutdatedCoordinatorAssignment testThrottledProducerConsumer With the introduction of the new test infrastructure #18602 , which allows all modules to use the @Flaky annotation, these tests should now be updated to include the @Flaky annotation. Reviewers: TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-25 22:44:35 +08:00
mingdaoy	c23d4a0d73	KAFKA-18499 Clean up zookeeper from LogConfig (#18583 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-25 22:31:46 +08:00
TaiJuWu	023f9c26e6	KAFKA-18529: ConsumerRebootstrapTest should run for async consumer (#18554 ) Reviewers: Kirk True <ktrue@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>, Lianet Magrans <lmagrans@confluent.io>	2025-01-24 20:33:20 +01:00
Apoorv Mittal	70eab7778d	KAFKA-17894: Implemented broker topic metrics for Share Group 1/N (KIP-1103) (#18444 ) The PR implements the BrokerTopicMetrics defined in KIP-1103. The PR also corrected the share-acknowledgement-rate and share-acknowledgement-count metrics defined in KIP-932 as they are moved to BrokerTopicMetrics, necessary changes to KIP-932 broker metrics will be done once we complete KIP-1103. Reviewers: Andrew Schofield <aschofield@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>, Jun Rao <junrao@gmail.com>	2025-01-24 09:34:54 -08:00
TengYao Chi	2f1bf2f2ab	KAFKA-18630: Clean ReplicaManagerBuilder (#18687 ) Reviewers: Christo Lolov <lolovc@amazon.com>	2025-01-24 17:23:48 +00:00
David Arthur	8c0a0e07ce	KAFKA-17587 Refactor test infrastructure (#18602 ) This patch reorganizes our test infrastructure into three Gradle modules: ":test-common:test-common-internal-api" is now a minimal dependency which exposes interfaces and annotations only. It has one project dependency on server-common to expose commonly used data classes (MetadataVersion, Feature, etc). Since this pulls in server-common, this module is Java 17+. It cannot be used by ":clients" or other Java 11 modules. ":test-common:test-common-util" includes the auto-quarantined JUnit extension. The @Flaky annotation has been moved here. Since this module has no project dependencies, we can add it to the Java 11 list so that ":clients" and others can utilize the @Flaky annotation ":test-common:test-common-runtime" now includes all of the test infrastructure code (TestKitNodes, etc). This module carries heavy dependencies (core, etc) and so it should not normally be included as a compile-time dependency. In addition to this reorganization, this patch leverages JUnit SPI service discovery so that modules can utilize the integration test framework without depending on ":core". This will allow us to start moving integration tests out of core and into the appropriate sub-module. This is done by adding ":test-common:test-common-runtime" as a testRuntimeOnly dependency rather than as a testImplementation dependency. A trivial example was added to QuorumControllerTest to illustrate this. Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 09:03:43 -05:00
Ken Huang	0c9df75295	KAFKA-18474: Remove zkBroker listener (#18477 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>, PoAn Yang <payang@apache.org>	2025-01-24 05:53:32 -08:00
David Jacot	80d2a8a42d	KAFKA-18616; Refactor DumpLogSegments's MessageParsers (#18688 ) All the work that we have done to automate and to simplify the coordinator records allows us to simplify the related MessageParsers in DumpLogSegments. They can all share the same based implementation. This is nice because it ensures that we handle all those records similarly. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 04:59:30 -08:00
TengYao Chi	5d81fe20c8	KAFKA-18590 Cleanup DelegationTokenManager (#18618 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 20:12:03 +08:00
TengYao Chi	fa2df3bca7	KAFKA-18559 Cleanup FinalizedFeatures (#18593 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 19:39:01 +08:00
Logan Zhu	356f0d815c	KAFKA-18597 Fix max-buffer-utilization-percent is always 0 (#18627 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 18:21:34 +08:00
TengYao Chi	66868fc1fa	KAFKA-18620: Remove UnifiedLog#legacyFetchOffsetsBefore (#18686 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	2025-01-24 11:11:05 +01:00
TengYao Chi	40890faa1b	KAFKA-18592 Cleanup ReplicaManager (#18621 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Christo Lolov <lolovc@amazon.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 01:34:36 +08:00
TaiJuWu	ce4eeaa379	MINOR: restore `testGetAllTopicMetadataShouldNotCreateTopicOrReturnUnknownTopicPartition` (#18633 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 01:27:18 +08:00
Sushant Mahajan	01afba8fdb	MINOR: Refactor ShareConsumerTest to use ClusterTestExtensions. (#18656 ) Reviewers: ShivsundarR <shr@confluent.io>, Apoorv Mittal <apoorvmittal10@gmail.com>, Andrew Schofield <aschofield@confluent.io>	2025-01-23 16:35:33 +00:00
Ken Huang	7e46087570	MINOR: rename `resendBrokerRegistrationUnlessZkMode` to `resendBrokerRegistration` (#18645 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-24 00:33:05 +08:00
TengYao Chi	bdc92fd5a1	MINOR: Cleanup zk condition in TransactionsTest, QuorumTestHarness and PlaintextConsumerAssignorsTest (#18639 ) Reviewers: Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-23 19:53:10 +08:00
David Jacot	bc807083fb	KAFKA-18486; [1/2] Update LocalLeaderEndPointTest (#18666 ) This patch is a first step towards removing `ReplicaManager#becomeLeaderOrFollower`. It updates the `LocalLeaderEndPointTest` tests. Reviewers: Christo Lolov <lolovc@amazon.com>, Ismael Juma <ismael@juma.me.uk>	2025-01-23 10:49:16 +01:00
Justine Olshan	94a1bfb128	KAFKA-18575: Transaction Version 2 doesn't correctly handle race condition with completing and new transaction(#18604 ) There is a subtle race condition with transactions V2 if a transaction is still completing when checking if we need to add a partition, but it completes when the request reaches the coordinator. One approach was to remove the verification for TV2 and just check the epoch on write, but a simpler one is to simply return concurrent transactions from the partition leader (before attempting to add the partition). I've done this and added a test for this behavior. Locally, I reproduced the race but adding a 1 second sleep when handling the WriteTxnMarkersRequest and a 2 second delay before adding the partition to the AddPartitionsToTxnManager. Without this change, the race happened on every second transaction as the first one completed. With this change, the error went away. As a followup, we may want to clean up some of the code and comments with respect to verification as the code is used by both TV0 + verification and TV2. But that doesn't need to complete for 4.0. This does :) Reviewers: Jeff Kim <jeff.kim@confluent.io>, Artem Livshits <alivshits@confluent.io>, Calvin Liu <caliu@confluent.io>	2025-01-22 13:44:08 -08:00
Lianet Magrans	410065a65d	KAFKA-18517: Enable ConsumerBounceTest to run for new async consumer (#18532 ) Reviewers: Andrew Schofield <aschofield@confluent.io>, Kirk True <ktrue@confluent.io>	2025-01-22 18:02:38 +01:00
Xiaobing Fang	f4d90398cc	MINOR: Fix `LogCleanerManagerTest.testLogsUnderCleanupIneligibleForCompaction()` for `LogMessageTimestampType = "LogAppendTime"` (#12333 ) While setting Defaults.LogMessageTimestampType to "LogAppendTime", `LogCleanerManagerTest.testLogsUnderCleanupIneligibleForCompaction()` fails with a InvalidTimestampException. This PR fixes this by regenerating the records instead of previous approach of re-using same records in the test. Reviewers: Divij Vaidya <diviv@amazon.com>, Kvicii <kvicii.yu@gmail.com> --------- Co-authored-by: fangxiaobing <fangxiaobing@kuaishou.com>	2025-01-22 17:50:39 +01:00
Ken Huang	341e535942	KAFKA-18519: Remove Json.scala, cleanup AclEntry.scala (#18614 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-22 16:12:06 +01:00
Chung, Ming-Yen	084fcbd327	KAFKA-18599: Remove Optional wrapping for forwardingManager in ApiVersionManager (#18630 ) `forwardingManager` is always present now. Reviewers: Ismael Juma <ismael@juma.me.uk>	2025-01-22 06:50:16 -08:00
TengYao Chi	a3da6bbb0c	MINOR: Cleanup ControllerCOntext and StateChangeLogger (#18588 ) These methods were previously invoked by ZK components, but we have just removed them. Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-21 18:41:17 -08:00
David Jacot	b368c38684	KAFKA-18302; Update CoordinatorRecord (#18512 ) This patch does a few things: 1) Replace ApiMessageAndVersion by ApiMessage in CoordinatorRecord for the key 2) Leverage the fact that ApiMessage exposes the apiKey. Hence we don't need to specify the key anymore. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-01-21 18:11:26 +01:00
David Jacot	256ccd0c0d	KAFKA-18487; Remove ReplicaManager#stopReplicas (#18647 ) This patch removes `ReplicaManager#stopReplicas`. I have ensured that removed unit tests are covered by other existing tests or are updated to use kraft. Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-21 11:47:16 +01:00
Dimitar Dimitrov	31d8e68ed1	KAFKA-18583; Fix getPartitionReplicaEndpoints for KRaft (#18635 ) Although `MetadataCache`'s `getPartitionReplicaEndpoints` takes a single topic-partition, the `KRaftMetadataCache` implementation iterates over all partitions of the matching topic. This is not necessary and can cause significant performance degradation when the topic has a relatively high number of partitions. Note that this is not a recent regression - it has been a part of `KRaftMetadataCache` since its creation. Reviewers: Ismael Juma <ismael@juma.me.uk>, David Jacot <djacot@confluent.io>	2025-01-21 10:51:59 +01:00
David Jacot	76bf38a4fd	KAFKA-18604; Update transaction coordinator (#18636 ) This patch updates the transaction coordinator record to use the new coordinator record definition. Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-01-21 08:36:23 +01:00
Ismael Juma	87b37a4065	KAFKA-14552: Assume a baseline of 3.0 for server protocol versions (#18497 ) Kafka 4.0 will remove support for zk mode and will require conversion to kraft before upgrading to 4.0. The minimum kraft version is 3.0 (aka 3.0-IV1). This provides an opportunity to remove exclusively server side protocols versions that only exist to allow direct upgrades from versions older than 3.0 or that are used only by zk mode. Since KRaft became production ready in 3.3, we should consider setting the baseline to 3.3. But that requires more discussion and it can be done via a separate change (KAFKA-18601). Protocol changes: * Remove RequestHeader v0 (only used by ControlledShutdown v0) * Remove WriteTxnMarkers v0 * Remove all versions of ControlledShutdown, LeaderAndIsr, StopReplica, UpdateMetadata In order to remove all versions safely, extend generator to support setting "versions" to "none". In this case, we no longer generate the `*Data` classes, but we still reserve the id for the relevant protocol api (so it doesn't get accidentally used for something else). The protocol documentation is correct after these changes. We kept a simplified version of `LeaderAndIsr{Request\|Response}` because it's used by many tests that are still relevant in kraft mode. Once KAFKA-18486 is done, it may be possible to remove it (I left a comment on the ticket). Similarly, KAFKA-18487 may make it possible to remove the introduced `StopReplicaPartitionState` (left a comment on that ticket too). There are a number of places that were adjusted to include an `ApiKeys.hasValidVersion` check. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-20 13:51:44 -08:00
TengYao Chi	837fb1ed02	MINOR: Remove unused QuotaConfgHandler (#18617 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-21 03:02:42 +08:00
TengYao Chi	f1ee0557f8	MINOR: Remove zk related statement from ControllerConfigurationValidator (#18637 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	2025-01-20 17:10:24 +01:00
Ken Huang	892a446789	KAFKA-18594: Cleanup BrokerLifecycleManager (#18626 ) Reviewers: Christo Lolov <lolovc@amazon.com>	2025-01-20 15:17:57 +00:00
Ken Huang	71495a2013	KAFKA-18568: Fix flaky test ClientIdQuotaTest (#18612 ) The reason for flakiness is PR #18080 which modifies the linger.ms config from 0 to 5. ClientIdQuotaTest are testing "Low enough quota that a producer sending a small payload in a tight loop should get throttled", thus this config change Influence this test scenario. This commits uses the older value of 0ms for linger.ms for ClientIdQuotaTest tests. Reviewers: Ismael Juma <ismael@juma.me.uk>, TaiJuWu <tjwu1217@gmail.com>, Divij Vaidya <diviv@amazon.com>	2025-01-20 16:05:47 +01:00
TengYao Chi	a842c02b88	KAFKA-18553: Update javadoc and comments of ConfigType (#18567 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Andrew Schofield <aschofield@confluent.io>, Apoorv Mittal <amittal@confluent.io>	2025-01-20 15:20:36 +01:00
Sanskar Jhajharia	bcbc72e29b	[KAFKA-16720] AdminClient Support for ListShareGroupOffsets (1/n) (#18571 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Andrew Schofield <aschofield@confluent.io>	2025-01-20 07:47:14 +00:00
Ken Huang	96499029b7	KAFKA-18588 Remove TopicKey.scala (#18624 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-20 10:30:14 +08:00
TaiJuWu	20e616ecc1	KAFKA-18578: Remove `UpdateMetadataRequest` from `MetadataCacheTest` (#18628 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	2025-01-19 12:18:43 -08:00
Ken Huang	c044eb61a1	KAFKA-18593 Remove ZkCachedControllerId In MetadataCache (#18625 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	2025-01-19 10:15:06 -08:00
TengYao Chi	697cf641bc	KAFKA-17668: Clean-up LogCleaner#maxOverCleanerThreads and LogCleanerManager#maintainUncleanablePartitions (#17390 ) Since maxOverCleanerThreads does not have a corresponding unit test, I have added a unit test for it. maintainUncleanablePartitions has been thoroughly tested in tests, so I simply replaced the old implementation with the new one. Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-19 07:21:52 -08:00
Ken Huang	3082a85e0e	MINOR: Remove KafkaApis unused method (#18620 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-19 22:45:22 +08:00
Ken Huang	516d5240b9	KAFKA-18429 Remove ZkFinalizedFeatureCache and StateChangeFailedException (#18616 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-19 19:13:47 +08:00
TengYao Chi	485d36d3b3	KAFKA-18589 Remove unused interBrokerProtocolVersion from GroupMetadataManager (#18619 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-19 18:59:24 +08:00
Ken Huang	eb1c8419aa	KAFKA-18516 Remove RackAwareMode (#18598 ) Reviewers: TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-01-19 18:49:12 +08:00

1 2 3 4 5 ...

5638 Commits