kafka

Commit Graph

Author	SHA1	Message	Date
Shivsundar R	3603c8fe35	KAFKA-18829: Added check before converting to IMPLICIT mode (#18964 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-19 17:34:28 +00:00
Ismael Juma	3dba3125e9	KAFKA-18601: Assume a baseline of 3.3 for server protocol versions (#18845 ) 3.3.0 was the first KRaft release that was deemed production-ready and also when KIP-778 (KRaft to KRaft upgrades) landed. Given that, it's reasonable for 4.x to only support upgrades from 3.3.0 or newer (the metadata version also needs to be set to "3.3" or newer before upgrading). Noteworthy changes: 1. `AlterPartition` no longer includes topic names, which makes it possible to simplify `AlterPartitionManager` logic. 2. Metadata versions older than `IBP_3_3_IV3` have been removed and `IBP_3_3_IV3` is now the minimum version. 3. `MINIMUM_BOOTSTRAP_VERSION` has been removed. 4. Removed `isLeaderRecoverySupported`, `isNoOpsRecordSupported`, `isKRaftSupported`, `isBrokerRegistrationChangeRecordSupported` and `isInControlledShutdownStateSupported` - these are always `true` now. Also removed related conditional code. 5. Removed default metadata version or metadata version fallbacks in multiple places - we now fail-fast instead of potentially using an incorrect metadata version. 6. Update `MetadataBatchLoader.resetToImage` to set `hasSeenRecord` based on whether image is empty - this was a previously existing issue that became more apparent after the changes in this PR. 7. Remove `ibp` parameter from `BootstrapDirectory`. 8. A number of tests were not useful anymore and have been removed. I will update the upgrade notes via a separate PR as there are a few things that need changing and it would be easier to do so that way. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Jun Rao <junrao@gmail.com>, David Arthur <mumrah@gmail.com>, Colin P. McCabe <cmccabe@apache.org>, Justine Olshan <jolshan@confluent.io>, Ken Huang <s7133700@gmail.com>	2025-02-19 05:35:42 -08:00
xijiu	4c4458c17a	KAFKA-18799 Remove AdminUtils (#18946 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-19 06:25:43 +08:00
PoAn Yang	1132f08c57	KAFKA-18773 Migrate the log4j1 config to log4j 2 for native image and README (#18872 ) - update reflection-config.json and resource-config.json to include log4j2 and jackson - remove unused jackson scala library - fix the incorrect path of log4j2.yaml - adopt workaround (--standalone) to make this PR work (it will be fixed by KAFKA-18737) Reviewers: TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-19 00:48:46 +08:00
TaiJuWu	934b0159bb	KAFKA-18089: Upgrade Caffeine lib to 3.1.8 (#18004 ) - Fixed the RemoteIndexCacheTest that fails with caffeine > 3.1.1 Reviewers: Luke Chen <showuon@gmail.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	2025-02-18 21:51:38 +05:30
Parker Chang	ed366e6b89	MINOR: Align assertFutureThrows method signature with JUnit conventions (#18825 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Andrew Schofield <aschofield@confluent.io>	2025-02-18 15:56:42 +00:00
Mickael Maison	0a2fab9310	KAFKA-14484: Decouple UnifiedLog and RemoteLogManager (#18460 ) Reviewers: Kamal Chandraprakash <kamal.chandraprakash@gmail.com>, Ismael Juma <ismael@juma.me.uk>	2025-02-18 15:10:31 +01:00
Andrew Schofield	6c14f64245	MINOR: Rename NoOpShareStatePersister for consistency (#18933 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-18 14:07:59 +00:00
Chirag Wadhwa	63229a768c	KAFKA-16718 [1/n]: Added DeleteShareGroupOffsets request and response schema (#18927 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-18 14:06:24 +00:00
Andrew Schofield	385b7ad355	MINOR: Align share group admin authz with consumer group (#18936 ) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	2025-02-18 09:12:07 +00:00
Kamal Chandraprakash	da3643c6b4	KAFKA-18787: RemoteIndexCache fails to delete invalid files on init (#18888 ) The stale/invalid files that end with ".deleted" and ".tmp" should be cleaned up when the broker is restarted. - fix the remote-index-cache test to use the logDir instead of topicDir - fix the flaky test Reviewers: Luke Chen <showuon@gmail.com>	2025-02-18 12:56:03 +05:30
Apoorv Mittal	06ce3e890b	KAFKA-18733: Updating share group record acks metric (2/N) (#18924 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-17 18:12:58 +00:00
PoAn Yang	2b6e868538	KAFKA-18784 Fix ConsumerWithLegacyMessageFormatIntegrationTest (#18889 ) In PR #18267, we removed old message format for cases in ConsumerWithLegacyMessageFormatIntegrationTest. Although test cases can pass, they don't fulfill original purpose. We can't send old message format since 4.0, so I change cases to append old records by ReplicaManager directly. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-17 20:43:29 +08:00
Andrew Schofield	9b7ad6ec32	MINOR: Mark testQuotaOverrideDelete as flaky (#18925 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-17 15:20:35 +08:00
TengYao Chi	5cbe00e375	MINOR: Remove unused member in DynamicBrokerConfig (#18915 ) Reviewers: Jhen-Yung Hsu <jhenyunghsu@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-17 04:46:25 +08:00
Ming-Yen Chung	e828767062	KAFKA-18790 Fix testCustomQuotaCallback (#18906 ) Frequently updating the trust store can cause unexpected termination of the AsyncConsumer background thread. 1. To resolve this issue, reuse the same AdminClient instead of recreating it. 2. Add error logging when resources for the consumer network thread fail to initialize. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-15 03:07:59 +08:00
Jimmy Wang	6a6b80215d	KAFKA-16717 [1/2]: Add AdminClient.alterShareGroupOffsets (#18819 ) KAFKA-16720 aims to add the support for the AlterShareGroupOffsets AdminClient. Key Changes in the PR: 1. Added handling of alterShareGroupOffsets() in KafkaAdminClient and introduced AlterShareGroupOffsetRequest/AlterShareGroupOffsetResponse/AlterShareGroupOffsetsOptions classes. 2. Corresponding test in KafkaAdminClientTest. 3. Added ALTER_SHARE_GROUP_OFFSETS API (will finish it in next PR and the share coordinator pieces) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Andrew Schofield <aschofield@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-15 02:35:46 +08:00
Apoorv Mittal	53543bcf63	KAFKA-18733: Updating share group metrics (1/N) (#18826 ) Reviewers: Sushant Mahajan <smahajan@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-02-14 08:48:41 +00:00
陳昱霖(Yu-Lin Chen)	2bbd25841e	KAFKA-18298 Fix flaky testConsumerGroupsDeprecatedConsumerGroupState and testConsumerGroups in PlaintextAdminIntegrationTest (#18513 ) It's related to KAFKA-18298 and KAFKA-18297. The root cause of the flaky tests is member rejoin after member removal. To prevent members from rejoining after being removed, call `consumers.close` in ConsumerThread before removing group members. This fix also extracts the flaky member removal test into a new test, `testConsumerGroupWithMemberRemoval`. Flow of member removal test: 1. Set 2 static consumer + 1 dynamic consumer 2. Close all consumers. 3. remove one static member 4. remove remaining members Before KIP-1092, the member count is different between ClassicConsumer/AsyncConsumer. (AsyncConsumer will remove dynamic member after consumer closed.) To get more details, please refer to the discussion under KAFKA-18297 and this PR: - discussion : [Link](https://issues.apache.org/jira/browse/KAFKA-18297?focusedCommentId=17912537&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-17912537) - review: https://github.com/apache/kafka/pull/18513#pullrequestreview-2589110367 This PR fixed below flaky errors: 1. PlaintextAdminIntegrationTest#testConsumerGroups a. `org.opentest4j.AssertionFailedError: expected: <2> but was: <3>` ([Report](https://ge.apache.org/s/lt3lpviv45cns/tests/task/:core:test/details/kafka.api.PlaintextAdminIntegrationTest/testConsumerGroups(String%2C%20String)%5B1%5D?top-execution=1)) b. `org.opentest4j.AssertionFailedError: expected: <true> but was: <false>` ([Report](https://ge.apache.org/s/jlxo446xalpoa/tests/task/:core:test/details/kafka.api.PlaintextAdminIntegrationTest/testConsumerGroups(String%2C%20String)%5B1%5D?top-execution=1)) 2. PlaintextAdminIntegrationTest#testConsumerGroupsDeprecatedConsumerGroupState a. `org.opentest4j.AssertionFailedError: expected: <2> but was: <3>` ([Report](https://ge.apache.org/s/ndoj6s2stb446/tests/task/:core:test/details/kafka.api.PlaintextAdminIntegrationTest/testConsumerGroupsDeprecatedConsumerGroupState(String%2C%20String)%5B1%5D?top-execution=1)) b. `org.opentest4j.AssertionFailedError: expected: <true> but was: <false>` ([Report](https://ge.apache.org/s/kh3jze2tc5qeu/tests/task/:core:test/details/kafka.api.PlaintextAdminIntegrationTest/testConsumerGroupsDeprecatedConsumerGroupState(String%2C%20String)%5B1%5D?top-execution=1)) Reviewers: David Jacot <djacot@confluent.io>, TaiJuWu <tjwu1217@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-14 07:28:45 +08:00
Andrew Schofield	952113e8e0	KAFKA-16720: Support multiple groups in DescribeShareGroupOffsets RPC (#18834 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>	2025-02-13 18:27:05 +00:00
Calvin Liu	9cb271f1e1	KAFKA-18654[2/2]: Transaction V2 retry add partitions on the server side when handling produce request. (#18810 ) During the transaction commit phase, it is normal to hit CONCURRENT_TRANSACTION error before the transaction markers are fully propagated. Instead of letting the client retry the produce request, it is better to retry on the server side. Reviewers: Artem Livshits <alivshits@confluent.io>, Justine Olshan <jolshan@confluent.io>	2025-02-13 09:30:58 -08:00
Apoorv Mittal	a13d815a0d	MINOR: Updated share partition manager tests to close and other fixes (#18862 ) Reviewers: Abhinav Dixit <adixit@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-02-13 13:37:37 +00:00
Ken Huang	9494bebee6	KAFKA-18728 Move ListOffsetsPartitionStatus to server module (#18807 ) Reviewers: Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	2025-02-13 10:36:46 +05:30
Jhen-Yung Hsu	b0e5cdfc57	KAFKA-18777 add `PartitionsWithLateTransactionsCount` to BrokerMetricNamesTest (#18869 ) Rewrite BrokerMetricNamesTest using ReplicaManager.MetricNames, ensuring that all metrics are always included. This helps prevent issues like PartitionsWithLateTransactionsCount not being correctly included in the test before. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-12 22:09:42 +08:00
PoAn Yang	63fc9b3cb8	KAFKA-18771: fix Flaky test KRaftClusterTest.testDescribeQuorumRequestToControllers (#18859 ) The test testDescribeQuorumRequestToControllers shuts down the raft client but not the controller. This gives the client a chance to send a request to that controller and get a NOT_LEADER_OR_FOLLOWER error. However, if the raft client finishes shutting down before handling the request, the request is never handled. Shut down the controller before calling KafkaFuture#get for the client request, so the request is eventually handled by another controller. Signed-off-by: PoAn Yang <payang@apache.org> Reviewers: Luke Chen <showuon@gmail.com>	2025-02-12 16:16:43 +08:00
Justine Olshan	400363b7e2	KAFKA-18035: TransactionsTest testBumpTransactionalEpochWithTV2Disabled failed on trunk (#18451 ) Sometimes we didn't get into abortable state before aborting, so the epoch didn't get bumped. Now we force abortable state with an attempt to send before aborting so the epoch bump occurs as expected. Reviewers: Jeff Kim <jeff.kim@confluent.io>	2025-02-11 14:01:43 -08:00
Edoardo Comar	7e405ccc65	KAFKA-18758: NullPointerException in shutdown following InvalidConfigurationException (#18833 ) Add checks for null in shutdown, as BrokerLifecycleManager is not instantiated if the LogManager constructor throws an Exception	2025-02-11 10:06:55 +00:00
Sushant Mahajan	675a0889de	KAFKA-18764: Throttle on share state RPCs auth failure. (#18855 ) Reviewers: Andrew Schofield <aschofield@confluent.io>	2025-02-11 09:54:24 +00:00
Mickael Maison	ece91e9247	KAFKA-14484: Move UnifiedLog static methods to storage (#18039 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 09:55:32 +01:00
TengYao Chi	f5dd661cb5	KAFKA-18396: Migrate log4j1 configuration to log4j2 in KafkaDockerWrapper (#18394 ) After log4j migration, we need to update the logging configuration in KafkaDockerWrapper from log4j1 to log4j2. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	2025-02-11 13:25:23 +05:30
TaiJuWu	9fc7500684	KAFKA-18770 close the RM created by testDelayedShareFetchPurgatoryOperationExpiration (#18853 ) It's crucial to use a try-finally block to ensure the ReplicaManager is closed properly. Failing to do so can leave a purgatory thread unreleased, potentially causing errors in subsequent integration tests that use thread leak detection. Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 07:35:13 +08:00
Ken Huang	581e94840f	KAFKA-18366 Remove KafkaConfig.interBrokerProtocolVersion (#18820 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 06:18:02 +08:00
Jhen-Yung Hsu	4e36368d08	KAFKA-18743 Remove leader.imbalance.per.broker.percentage as it is not supported by Kraft (#18821 ) Remove `leader.imbalance.per.broker.percentage` from config. Add `leader.imbalance.per.broker.percentage` to release note Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 04:01:57 +08:00
Ken Huang	70adf746c4	KAFKA-18225 ClientQuotaCallback#updateClusterMetadata is unsupported by kraft (#18196 ) This commit ensures that the ClientQuotaCallback#updateClusterMetadata method is executed in KRaft mode. This method is triggered whenever a topic or cluster metadata change occurs. However, in KRaft mode, the current implementation of the updateClusterMetadata API is inefficient due to the requirement of creating a full Cluster object. To address this, a follow-up issue (KAFKA-18239) has been created to explore more efficient mechanisms for providing cluster information to the ClientQuotaCallback without incurring the overhead of a full Cluster object creation. Reviewers: Mickael Maison <mickael.maison@gmail.com>, TaiJuWu <tjwu1217@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-11 01:03:02 +08:00
PoAn Yang	b22c7d5b5c	KAFKA-17833: Convert DescribeAuthorizedOperationsTest to use KRaft (#18252 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	2025-02-07 15:44:27 +01:00
Piotr P. Karwasz	666571216b	KAFKA-18483 Disable `Log4jController` and `Loggers` if Log4j Core absent (#18496 ) If Log4j Core is absent, most calls to Log4jController and Loggers will end up with a NoClassDefFoundError. This changeset: - Profits from the major version bump to rename k.util.Log4jController to LoggingController. - Removes o.a.l.l.Level from the signature of public methods of o.a.k.connect.runtime.Loggers and replaces it with String. - Provides an additional no-op implementation of k.util.LoggingController and o.a.k.connect.runtime.Loggers: if Log4j Core is not present on the runtime classpath the no-op implementation will be used. Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-07 00:04:33 +08:00
Colin Patrick McCabe	b2b2408692	KAFKA-18360 Remove zookeeper configurations (#18566 ) Remove broker.id.generation.enable and reserved.broker.max.id, which are not used in KRaft mode. Remove inter.broker.protocol.version, which is not used in KRaft mode. Reviewers: PoAn Yang <payang@apache.org>, Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-06 22:22:11 +08:00
Ken Huang	a3d9d881e1	KAFKA-18530 Remove ZooKeeperInternals (#18641 ) Since zk has been removed in 4.0, config handlers no longer need to handle the "<default>" value. This PR streamlines the config update process by eliminating the unnecessary string checks for "<default>" Reviewers: Christo Lolov <lolovc@amazon.com>, Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-06 17:48:17 +08:00
Ming-Yen Chung	34e7136b7a	MINOR: Fix wrong config property in KafkaConfigTest (#18815 ) Reviewers: Ken Huang <s7133700@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-06 17:09:52 +08:00
Kuan-Po Tseng	b99be961b8	KAFKA-18206: EmbeddedKafkaCluster must set features (#18189 ) Related to KAFKA-18206: set features in EmbeddedKafkaCluster in both the streams and connect modules. This PR also fixes a potential transaction with empty records in the sendPrivileged method, as transaction version 2 doesn't allow this scenario. Reviewers: Justine Olshan <jolshan@confluent.io>	2025-02-05 09:14:36 -08:00
Chirag Wadhwa	01587d09d8	KAFKA-18494-3: solution for the bug relating to gaps in the share partition cachedStates post initialization (#18696 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Abhinav Dixit <adixit@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-02-05 15:16:25 +00:00
Sanskar Jhajharia	7dbed2f6e8	[KAFKA-16720] AdminClient Support for ListShareGroupOffsets (2/2) (#18671 ) Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Sushant Mahajan <smahajan@confluent.io>, Andrew Schofield <aschofield@confluent.io>	2025-02-05 14:38:09 +00:00
TengYao Chi	66363160c5	KAFKA-18645: New consumer should align close timeout handling with classic consumer (#18702 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>, Kirk True <ktrue@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-05 09:08:51 -05:00
PoAn Yang	21645ebf0b	KAFKA-18705: Move ConfigRepository to metadata module (#18784 ) Reviewers: TengYao Chi <kitingiao@gmail.com>, Christo Lolov <lolovc@amazon.com>	2025-02-05 10:13:36 +00:00
Justine Olshan	00dddee347	MINOR: Add missing test tag to UnifiedLogTest.scala (#18794 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-04 13:56:14 -08:00
Sean Quah	42e7cbb67e	KAFKA-18690: Keep leader metadata for RE2J-assigned partitions (#18777 ) Reviewers: Lianet Magrans <lmagrans@confluent.io>	2025-02-04 13:22:28 -05:00
Justine Olshan	822b8ab3d7	KAFKA-18691: Flaky test testFencingOnTransactionExpiration (#18793 ) It appears this test was failing because the transaction was never aborting and the concurrent transactions errors would not go away. `ccab9eb` introduced the test failure because it requires the transaction to complete, but I suspect the lack of completion was happening before the change. The timeout for the write is based on the transactional timeout, and 100ms seemed too small -- thus the requests to update the state would often repeatedly time out. Also removed the loop since it was not necessary. Reviewers: Jeff Kim <jeff.kim@confluent.io>, Calvin Liu <caliu@confluent.io>	2025-02-04 08:45:34 -08:00
Luke Chen	612e1299e4	KAFKA-18230: Handle not controller or not leader error in admin client (#18165 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2025-02-04 16:51:24 +01:00
Calvin Liu	ad031b99d3	KAFKA-18635: reenable the unclean shutdown detection (#18277 ) We need to re-enable the unclean shutdown detection when in ELR mode, which was inadvertently removed during the development process. Reviewers: David Mao <dmao@confluent.io>, Jun Rao <junrao@gmail.com>	2025-02-03 22:26:57 -08:00
Ming-Yen Chung	9f78771a1f	KAFKA-18693 Remove PasswordEncoder (#18790 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-02-04 13:18:41 +08:00

5609 Commits