kafka

Commit Graph

Author	SHA1	Message	Date
Ken Huang	d874aa42f3	KAFKA-18368 Remove TestUtils#MockZkConnect and remove zkConnect from TestUtils#createBrokerConfig (#18352 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2025-01-07 21:03:13 +08:00
Ismael Juma	d6f24d3665	Use `instanceof` pattern to avoid explicit cast (#18373 ) This feature was introduced in Java 16. Reviewers: David Arthur <mumrah@gmail.com>, Apoorv Mittal <apoorvmittal10@gmail.com>	2025-01-02 09:32:51 -08:00
Ismael Juma	fe56fc98fa	KAFKA-18269: Remove deprecated protocol APIs support (KIP-896, KIP-724) (#18218 ) Included in this change: 1. Remove deprecated protocol api versions from json files. 3. Remove fields that are no longer used from json files (affects ListOffsets, OffsetCommit, DescribeConfigs). 4. Remove record down-conversion support from KafkaApis. 5. No longer return `Errors.UNSUPPORTED_COMPRESSION_TYPE` on the fetch path[1]. 6. Deprecate `TopicConfig. MESSAGE_DOWNCONVERSION_ENABLE_CONFIG` and made the relevant configs (`message.downconversion.enable` and `log.message.downcoversion.enable`) no-ops since down-conversion is no longer supported. It was an oversight not to deprecate this via KIP-724. 7. Fix `shouldRetainsBufferReference` to handle null request schemas for a given version. 8. Simplify producer logic since it only supports the v2 record format now. 9. Fix tests so they don't exercise protocol api versions that have been removed. 10. Add upgrade note. Testing: 1. System tests have a lot of failures, but those tests fail for trunk too and I didn't see any issues specific to this change - it's hard to be sure given the number of failing tests, but let's not block on that given the other testing that has been done (see below). 3. Java producers and consumers with version 0.9-0.10.1 don't have api versions support and hence they fail in an ungraceful manner: the broker disconnects and the clients reconnect until the relevant timeout is triggered. 4. Same thing seems to happen for the console producer 0.10.2 although it's unclear why since api versions should be supported. I will look into this separately, it's unlikely to be related to this PR. 5. Console consumer 0.10.2 fails with the expected error and a reasonable message[2]. 6. Console producer and consumer 0.11.0 works fine, newer versions should naturally also work fine. 7. kcat 1.5.0 (based on librdkafka 1.1.0) produce and consume fail with a reasonable message[3][4]. 8. kcat 1.6.0-1.7.0 (based on librdkafka 1.5.0 and 1.7.0 respectively) consume fails with a reasonable message[5]. 9. kcat 1.6.0-1.7.0 produce works fine. 10. kcat 1.7.1 (based on librdkafka 1.8.2) works fine for consumer and produce. 11. confluent-go-client (librdkafka based) 1.8.2 works fine for consumer and produce. 12. I will test more clients, but I don't think we need to block the PR on that. Note that this also completes part of KIP-724: produce v2 and lower as well as fetch v3 and lower are no longer supported. Future PRs will remove conditional code that is no longer needed (some of that has been done in KafkaApis, but only what was required due to the schema changes). We can probably do that in master only as it does not change behavior. Note that I did not touch `ignorable` fields even though some of them could have been changed. The reasoning is that this could result in incompatible changes for clients that use new protocol versions without setting such fields _if_ we don't manually validate their presence. I will file a JIRA ticket to look into this carefully for each case (i.e. if we do validate their presence for the appropriate versions, we can set them to ignorable=false in the json file). [1] We would return this error if a fetch < v10 was used and the compression topic config was set to zstd, but we would not do the same for the case where zstd was compressed at the producer level (the most common case). Since there is no efficient way to do the check for the common case, I made it consistent for both by having no checks. [2] ```org.apache.kafka.common.errors.UnsupportedVersionException: The broker is too new to support JOIN_GROUP version 1``` [3]```METADATA\|rdkafka#producer-1\| [thrd:main]: localhost:9092/bootstrap: Metadata request failed: connected: Local: Required feature not supported by broker (0ms): Permanent``` [4]```METADATA\|rdkafka#consumer-1\| [thrd:main]: localhost:9092/bootstrap: Metadata request failed: connected: Local: Required feature not supported by broker (0ms): Permanent``` [5] `ERROR: Topic test-topic [0] error: Failed to query logical offset END: Local: Required feature not supported by broker` Reviewers: David Arthur <mumrah@gmail.com>	2024-12-20 19:52:00 -08:00
TengYao Chi	772aa241b2	KAFKA-18136: Remove zk migration from code base (#18016 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2024-12-12 18:34:29 +01:00
Calvin Liu	755adf8a56	KAFKA-14563: RemoveClient-Side AddPartitionsToTxn Requests (#17698 ) Removes the client side AddPartitionsToTxn/AddOffsetsToTxn calls so that the partition is implicitly added as part of KIP-890 part 2. This change also requires updating the valid state transitions. The client side can not know for certain if a partition has been added server side when the request times out (partial completion). Thus for TV2, the transition to PrepareAbort is now valid for Empty, CompleteCommit, and CompleteAbort. For readability, the V1 and V2 endTransaction methods have been separated. Reviewers: Artem Livshits <alivshits@confluent.io>, Justine Olshan <jolshan@confluent.io>, Ritika Reddy <rreddy@confluent.io>	2024-12-06 09:00:04 -08:00
David Jacot	24dd11d693	KAFKA-17593; [8/N] Resolve regular expressions (#17864 ) This patch introduces the asynchronous resolution of regular expressions. Let me unpack a few details about the implementations: 1) I have decided to finally update all the regular expressions within a consumer group together. My assumption is that the number of regular expressions in a group will be generally small but the number of topics in a cluster is large. Hence grouping has two benefits. Firstly, it allows to go through the list of topics once for all the regular expressions. Secondly, it reduces the number of potential rebalances because all the regular expressions are updated at the same time. 2) An update is triggered when the group is subscribed to at least one regular expressions. 3) An update is triggered when there is no ongoing update. 4) An update is triggered only of the previous one is older than 10s. 5) An update is triggered when the group has unresolved regular expressions. 6) An update is triggered when the metadata image has new topics. Reviewers: Jeff Kim <jeff.kim@confluent.io>	2024-11-26 08:56:25 -08:00
TengYao Chi	0e4d8b3e86	KAFKA-17569 Rewrite TestLinearWriteSpeed by Java (#17736 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-11-26 23:43:01 +08:00
Manikumar Reddy	3268435fd6	KAFKA-18013: Add AutoOffsetResetStrategy internal class (#17858 ) - Deprecates OffsetResetStrategy enum - Adds new internal class AutoOffsetResetStrategy - Replaces all OffsetResetStrategy enum usages with AutoOffsetResetStrategy - Deprecate old/Add new constructors to MockConsumer Reviewers: Andrew Schofield <aschofield@confluent.io>, Matthias J. Sax <matthias@confluent.io>	2024-11-25 19:11:12 +05:30
Joao Pedro Fonseca Dantas	e9ccc2d6f5	KAFKA-16041: Replace Afterburn module with Blackbird (#17884 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	2024-11-21 14:52:45 +01:00
David Jacot	a802865aad	KAFKA-17593; [5/N] Include resolved regular expressions into target assignment computation (#17750 ) This patch does a few things: * Refactors the `TargetAssignmentBuilder` to use inheritance to differentiate Consumer and Share groups. * Introduces `UnionSet` to lazily aggregate the subscriptions for a given member. * Wires the resolved regular expressions in the `GroupMetadataManager`. At the moment, they are only used when the target assignment is computed. Reviewers: Sean Quah <squah@confluent.io>, Jeff Kim <jeff.kim@confluent.io>, Lianet Magrans <lmagrans@confluent.io>	2024-11-13 06:59:52 -08:00
TengYao Chi	4e3a3d398d	KAFKA-17570 Rewrite StressTestLog by Java (#17249 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-11-09 14:24:32 +08:00
Mickael Maison	0049b967e5	KAFKA-17890: Move DelayedOperationPurgatory to server-common (#17636 ) Reviewers: Jun Rao <jun@confluent.io>, Apoorv Mittal <amittal@confluent.io>	2024-11-08 09:55:09 +01:00
PoAn Yang	7fb6e9ec1c	KAFKA-17840 Move ReplicationQuotaManager, ClientRequestQuotaManager and QuotaFactory to server module (#17609 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-10-30 21:18:28 +08:00
Said Boudjelda	57053ef47d	MINOR: Remove never thrown exception in ByteUtilsBenchmark (#17532 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2024-10-24 11:51:23 +02:00
Ken Huang	2ff13976ab	KAFKA-17568 Rewrite TestPurgatoryPerformance by Java (#17246 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-10-24 02:44:37 +08:00
Apoorv Mittal	25a3590dc2	KAFKA-17813: Moving broker endpoint class and common server connection id (#17519 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Kuan-Po Tseng <brandboat@gmail.com>, Jun Rao <junrao@gmail.com>	2024-10-22 11:58:28 -07:00
Ken Huang	76a9df47ca	KAFKA-17639 Add Java 23 to CI build matrix (#17409 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-10-20 23:55:19 +08:00
Abhinav Dixit	cb3b03377d	KAFKA-17742: Move DelayedShareFetchPurgatory declaration to ReplicaManager (#17437 ) Declare the delayed share fetch purgatory inside ReplicaManager along with the existing purgatories. Check the share fetch purgatory when a replica becomes the follower or a replica is deleted from a broker through ReplicaManager. Perform a checkAndComplete for share fetch when HWM is updated. Reviewers: Andrew Schofield <aschofield@confluent.io>, Apoorv Mittal <apoorvmittal10@gmail.com>, Jun Rao <junrao@gmail.com>	2024-10-17 13:58:10 -07:00
TengYao Chi	582bb48e88	KAFKA-17748 Remove scala-java8-compat (#17497 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-10-15 13:34:21 +08:00
Linsiyuan9	76a1af984b	KAFKA-17746 Replace JavaConverters with CollectionConverters (#17451 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-10-14 17:13:20 +08:00
Mickael Maison	07cafdd9df	KAFKA-17729: Remove ZK from AuthorizerBenchmark, CheckpointBench and PartitionCreationBench (#17415 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-10-09 11:07:15 +08:00
Ken Huang	10a0905628	KAFKA-17564 Move BrokerFeatures to server module (#17228 ) Reviewers: TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2024-10-07 15:16:48 +08:00
TengYao Chi	0e4eebe9c0	KAFKA-12895 Drop support for Scala 2.12 in Kafka 4.0 (#17313 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-10-07 01:34:38 +08:00
Colin Patrick McCabe	85bfdf4127	KAFKA-17613: Remove ZK migration code (#17293 ) Remove the controller machinery for doing ZK migration in Kafka 4.0. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, David Arthur <mumrah@gmail.com>	2024-10-03 12:01:14 -07:00
Sean Quah	99e1d8fbb3	MINOR: Cache topic resolution in TopicIds set (#17285 ) Looking up topics in a TopicsImage is relatively slow. Cache the results in TopicIds to improve assignor performance. In benchmarks, we see a noticeable improvement in performance in the heterogeneous case. Before ``` Benchmark (assignmentType) (assignorType) (isRackAware) (memberCount) (partitionsToMemberRatio) (subscriptionType) (topicCount) Mode Cnt Score Error Units ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 10000 10 HOMOGENEOUS 1000 avgt 5 36.400 ± 3.004 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 10000 10 HETEROGENEOUS 1000 avgt 5 158.340 ± 0.825 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 10000 10 HOMOGENEOUS 1000 avgt 5 1.329 ± 0.041 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 10000 10 HETEROGENEOUS 1000 avgt 5 382.901 ± 6.203 ms/op ``` After ``` Benchmark (assignmentType) (assignorType) (isRackAware) (memberCount) (partitionsToMemberRatio) (subscriptionType) (topicCount) Mode Cnt Score Error Units ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 10000 10 HOMOGENEOUS 1000 avgt 5 36.465 ± 1.954 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 10000 10 HETEROGENEOUS 1000 avgt 5 114.043 ± 1.424 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 10000 10 HOMOGENEOUS 1000 avgt 5 1.454 ± 0.019 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 10000 10 HETEROGENEOUS 1000 avgt 5 342.840 ± 2.744 ms/op ``` --- Based heavily on https://github.com/apache/kafka/pull/16527. Reviewers: David Arthur <mumrah@gmail.com>, David Jacot <djacot@confluent.io>	2024-10-03 00:40:25 -07:00
Dimitar Dimitrov	bc47ce1a53	MINOR: Fix a race and add JMH bench for HdrHistogram (#17221 )	2024-09-27 23:49:10 +09:00
xijiu	18340c9733	KAFKA-17563 Move `RequestConvertToJson` to server module (#17223 ) Reviewers: Chia-Ping Tsai <chia7712@apache.org>	2024-09-27 02:19:47 +08:00
Sean Quah	236f3d422f	KAFKA-17496: Add heterogeneous case to TargetAssignmentBuilderBenchmark (#17277 ) Bring the homogeneous case from ServerSideAssignorBenchmark to TargetAssignmentBuilderBenchmark. Reviewers: David Jacot <djacot@confluent.io>	2024-09-25 23:59:38 -07:00
PoAn Yang	bb97d63d41	KAFKA-17578: Remove partitionRacks from TopicMetadata (#17233 ) The ModernGroup#subscribedTopicMetadata takes too much memory due to partitionRacks. This is not being used at the moment as the consumer protocol does not support rack aware assignments. A heap dump from a group with 500 members, 2K subscribed topic partitions shows 654,400 bytes used for partitionRacks. The rest of the ConsumerGroup object holds 822,860 bytes. Reviewers: David Jacot <djacot@confluent.io>	2024-09-25 00:48:48 -07:00
Sean Quah	9352faa8fc	KAFKA-17495: Factor out assignor benchmark code into utils class (#17133 ) ServerSideAssignorBenchmark and TargetAssignmentBuilderBenchmark have the same topic and member subscription setup for the most part. Factor out the commonality so that it's easier to share new setups between both benchmarks. Reviewers: David Jacot <djacot@confluent.io>	2024-09-23 07:55:54 -07:00
Dmitry Werner	5fd7ce2ace	KAFKA-17414 Move RequestLocal to server-common module (#16986 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-09-04 16:12:20 +08:00
Mickael Maison	c30615e6d7	KAFKA-17430: Move RequestChannel.Metrics/RequestMetrics to server module (#17015 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-09-03 10:11:47 +02:00
Mickael Maison	b9fe9f532f	KAFKA-16972: Move BrokerTopicStats to storage module (#17003 ) Reviewers: Luke Chen <showuon@gmail.com>	2024-08-27 11:39:37 +02:00
TengYao Chi	d67c18b4ae	KAFKA-17331 Set correct version for EarliestLocalSpec and LatestTieredSpec (#16876 ) Add the version check to client side when building ListOffsetRequest for the specific timestamp: 1) the version must be >=8 if timestamp=-4L (EARLIEST_LOCAL_TIMESTAMP) 2) the version must be >=9 if timestamp=-5L (LATEST_TIERED_TIMESTAMP) Reviewers: PoAn Yang <payang@apache.org>, Chia-Ping Tsai <chia7712@gmail.com>	2024-08-25 17:39:28 +08:00
Mickael Maison	e23172a48a	MINOR: Move OffsetCheckpointFile to storage module (#16917 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-08-20 16:29:24 +02:00
David Schlosnagle	050edfaf00	KAFKA-14336: MetadataResponse#convertToNodeArray uses iteration (#12782 ) Avoids stream allocation on hot code path in Admin#listOffsets This patch avoids allocating the stream reference pipeline & spliterator for this case by explicitly allocating the pre-sized Node[] and using a for loop with int induction over the specified IDs List argument. Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Kirk True <kirk@kirktrue.pro>, David Arthur <mumrah@gmail.com>	2024-08-19 19:46:51 -04:00
Josep Prat	4e862c0903	KAFKA-15875: Stops leak Snapshot in public methods (#16807 ) * KAFKA-15875: Stops leak Snapshot in public methods The Snapshot class is package protected but it's returned in several public methods in SnapshotRegistry. To prevent this accidental leakage, these methods are made package protected as well. For getOrCreateSnapshot a new method called IdempotentCreateSnapshot is created that returns void. * Make builer package protected, replace <br> with <p> Reviewers: Greg Harris <greg.harris@aiven.io>	2024-08-08 20:05:47 +02:00
Chirag Wadhwa	1db84c1a11	KAFKA-16745: Implemented handleShareFetchRequest RPC including unit tests (#16456 ) Implemented handleShareFetch request RPC in KafkaApis.scala. This method is called whenever the client sends a Share Fetch request to the broker. Although Share Fetch request support acknowledgements, since the logic for acknowledging records is not completely implemented in SharePartitionManager.java class, this method currently includes placeholder code for acknowledging, which will be replaced by the actual functionality in the upcoming PRs. Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Abhinav Dixit <adixit@confluent.io>, Jun Rao <junrao@gmail.com>	2024-08-06 07:59:04 -07:00
PoAn Yang	6e324487fa	KAFKA-16480: ListOffsets change should have an associated API/IBP version update (#16781 ) 1. Use oldestAllowedVersion as 9 if using ListOffsetsRequest#EARLIEST_LOCAL_TIMESTAMP or ListOffsetsRequest#LATEST_TIERED_TIMESTAMP. 2. Add test cases to ListOffsetsRequestTest#testListOffsetsRequestOldestVersion to make sure requireTieredStorageTimestamp return 9 as minVersion. 3. Add EarliestLocalSpec and LatestTierSpec to OffsetSpec. 4. Add more cases to KafkaAdminClient#getOffsetFromSpec. 5. Add testListOffsetsEarliestLocalSpecMinVersion and testListOffsetsLatestTierSpecSpecMinVersion to KafkaAdminClientTest to make sure request builder has oldestAllowedVersion as 9. Signed-off-by: PoAn Yang <payang@apache.org> Reviewers: Luke Chen <showuon@gmail.com>	2024-08-03 14:27:27 +08:00
Colin Patrick McCabe	4d3e366bc2	KAFKA-16772: Introduce kraft.version to support KIP-853 (#16230 ) Introduce the KRaftVersion enum to describe the current value of kraft.version. Change a bunch of places in the code that were using raw shorts over to using this new enum. In BrokerServer.scala, fix a bug that could cause null pointer exceptions during shutdown if we tried to shut down before fully coming up. Do not send finalized features that are finalized as level 0, since it is a no-op. Reviewers: dengziming <dengziming1993@gmail.com>, José Armando García Sancio <jsancio@apache.org>	2024-07-16 09:31:10 -07:00
Ritika Reddy	42f267a853	KAFKA-16944; Rewrite Range Assignor (#16504 ) The server side range assignor was made to be sticky i.e. partitions from the existing assignment are retained as much as possible. During a rebalance, the expected behavior is to achieve co-partitioning for members that are subscribed to the same set of topics with equal number of partitions. However, there are cases where this cannot be achieved efficiently with the current algorithm. There is no easy way to implement stickiness and co-partitioning and hence we have resorted to recomputing the target assignment every time. In case of static membership, instanceIds are leveraged to ensure some form of stickiness. ``` Benchmark (assignmentType) (assignorType) (isRackAware) (memberCount) (partitionsToMemberRatio) (subscriptionType) (topicCount) Mode Cnt Score Error Units ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 100 10 HOMOGENEOUS 100 avgt 5 0.052 ± 0.001 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 100 10 HOMOGENEOUS 1000 avgt 5 0.454 ± 0.003 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 1000 10 HOMOGENEOUS 100 avgt 5 0.476 ± 0.046 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 1000 10 HOMOGENEOUS 1000 avgt 5 3.102 ± 0.055 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 10000 10 HOMOGENEOUS 100 avgt 5 5.640 ± 0.223 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 10000 10 HOMOGENEOUS 1000 avgt 5 37.947 ± 1.000 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 100 10 HETEROGENEOUS 100 avgt 5 0.172 ± 0.001 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 100 10 HETEROGENEOUS 1000 avgt 5 1.882 ± 0.006 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 1000 10 HETEROGENEOUS 100 avgt 5 1.730 ± 0.036 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 1000 10 HETEROGENEOUS 1000 avgt 5 17.654 ± 1.160 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 10000 10 HETEROGENEOUS 100 avgt 5 18.595 ± 0.316 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 10000 10 HETEROGENEOUS 1000 avgt 5 172.398 ± 2.251 ms/op JMH benchmarks done Benchmark (memberCount) (partitionsToMemberRatio) (topicCount) Mode Cnt Score Error Units TargetAssignmentBuilderBenchmark.build 100 10 100 avgt 5 0.071 ± 0.004 ms/op TargetAssignmentBuilderBenchmark.build 100 10 1000 avgt 5 0.428 ± 0.026 ms/op TargetAssignmentBuilderBenchmark.build 1000 10 100 avgt 5 0.659 ± 0.028 ms/op TargetAssignmentBuilderBenchmark.build 1000 10 1000 avgt 5 3.346 ± 0.102 ms/op TargetAssignmentBuilderBenchmark.build 10000 10 100 avgt 5 8.947 ± 0.386 ms/op TargetAssignmentBuilderBenchmark.build 10000 10 1000 avgt 5 40.240 ± 3.113 ms/op JMH benchmarks done ``` Reviewers: David Jacot <djacot@confluent.io>	2024-07-04 10:33:09 -07:00
Apoorv Mittal	f2dbc55d24	KAFKA-17047: Refactored group coordinator classes to modern package (KIP-932) (#16474 ) Following the discussion and suggestion by @dajac, https://github.com/apache/kafka/pull/16054#discussion_r1613638293, the PR refactors the common classes to build TargetAssignment in `modern` package. `consumer` package has been moved inside `modern` package with classes exclusive to `consumer group`. This PR completes the refactoring and base to introduce `share` package inside `modern`. The subsequent PRs will define the implementation specific to Share Groups while re-using the common functionality from `modern` package classes. Reviewers: Andrew Schofield <aschofield@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>, David Jacot <djacot@confluent.io>	2024-07-03 00:16:40 -07:00
Apoorv Mittal	60114a46a7	KAFKA-16822: Abstract consumer group to share functionality with share group (KIP-932) (#16054 ) Abstracted code for 2 classes `ConsumerGroup` and `ConsumerGroupMember` to `ModernGroup` and `ModernGroupMember` respectively. The new abstract classes are created to share common functionality with `ShareGroup` and `ShareGroupMember` which are being introduced with KIP-932. The patch is majorly code refactoring from existing classes to abstract classes. Also created a new package called `modern` where `MemberState` class is moved, in upcoming patches, I will move common classes for `Share` and `Consumer` Group in `modern` package itself. Reviewers: Lianet Magrans <lianetmr@gmail.com>, Andrew Schofield <aschofield@confluent.io>, David Jacot <djacot@confluent.io>	2024-06-27 05:42:58 -07:00
Kuan-Po (Cooper) Tseng	888a177603	KAFKA-12708 Rewrite org.apache.kafka.test.Microbenchmarks by JMH (#16231 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-06-14 16:47:34 +08:00
gongxuanzhang	596b945072	KAFKA-16643 Add ModifierOrder checkstyle rule (#15890 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-06-13 15:39:32 +08:00
gongxuanzhang	46eb0814f6	KAFKA-10787 Apply spotless to log4j-appender, trogdor, jmh-benchmarks, examples, shell and generator (#16296 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-06-12 22:23:39 +08:00
David Jacot	049cfeac02	MINOR: Rename uniform assignor's internal builders (#16233 ) This patch renames the uniform assignor's builders to match the `SubscriptionType` which is used to determine which one is called. It removes the abstract class `AbstractUniformAssignmentBuilder` which is not necessary anymore. It also applies minor refactoring. Reviewers: Ritika Reddy <rreddy@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2024-06-10 05:26:56 -07:00
David Jacot	7d832cf74f	KAFKA-14701; Move `PartitionAssignor` to new `group-coordinator-api` module (#16198 ) This patch moves the `PartitionAssignor` interface and all the related classes to a newly created `group-coordinator/api` module, following the pattern used by the storage and tools modules. Reviewers: Ritika Reddy <rreddy@confluent.io>, Jeff Kim <jeff.kim@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>	2024-06-06 12:19:20 -07:00
Ritika Reddy	078dd9a311	KAFKA-16821; Member Subscription Spec Interface (#16068 ) This patch reworks the `PartitionAssignor` interface to use interfaces instead of POJOs. It mainly introduces the `MemberSubscriptionSpec` interface that represents a member subscription and changes the `GroupSpec` interfaces to expose the subscriptions and the assignments via different methods. The patch does not change the performance. before: ``` Benchmark (memberCount) (partitionsToMemberRatio) (topicCount) Mode Cnt Score Error Units TargetAssignmentBuilderBenchmark.build 10000 10 100 avgt 5 3.462 ± 0.687 ms/op TargetAssignmentBuilderBenchmark.build 10000 10 1000 avgt 5 3.626 ± 0.412 ms/op JMH benchmarks done ``` after: ``` Benchmark (memberCount) (partitionsToMemberRatio) (topicCount) Mode Cnt Score Error Units TargetAssignmentBuilderBenchmark.build 10000 10 100 avgt 5 3.677 ± 0.683 ms/op TargetAssignmentBuilderBenchmark.build 10000 10 1000 avgt 5 3.991 ± 0.065 ms/op JMH benchmarks done ``` Reviewers: David Jacot <djacot@confluent.io>	2024-06-04 06:44:37 -07:00
David Jacot	fb566e48bf	KAFKA-16864; Optimize uniform (homogenous) assignor (#16088 ) This patch optimizes uniform (homogenous) assignor by avoiding creating a copy of all the assignments. Instead, the assignor creates a copy only if the assignment is updated. It is a sort of copy-on-write. This change reduces the overhead of the TargetAssignmentBuilder when ran with the uniform (homogenous) assignor. Trunk: ``` Benchmark (memberCount) (partitionsToMemberRatio) (topicCount) Mode Cnt Score Error Units TargetAssignmentBuilderBenchmark.build 10000 10 100 avgt 5 24.535 ± 1.583 ms/op TargetAssignmentBuilderBenchmark.build 10000 10 1000 avgt 5 24.094 ± 0.223 ms/op JMH benchmarks done ``` ``` Benchmark (assignmentType) (assignorType) (isRackAware) (memberCount) (partitionsToMemberRatio) (subscriptionType) (topicCount) Mode Cnt Score Error Units ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 10000 10 HOMOGENEOUS 100 avgt 5 14.697 ± 0.133 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 10000 10 HOMOGENEOUS 1000 avgt 5 15.073 ± 0.135 ms/op JMH benchmarks done ``` Patch: ``` Benchmark (memberCount) (partitionsToMemberRatio) (topicCount) Mode Cnt Score Error Units TargetAssignmentBuilderBenchmark.build 10000 10 100 avgt 5 3.376 ± 0.577 ms/op TargetAssignmentBuilderBenchmark.build 10000 10 1000 avgt 5 3.731 ± 0.359 ms/op JMH benchmarks done ``` ``` Benchmark (assignmentType) (assignorType) (isRackAware) (memberCount) (partitionsToMemberRatio) (subscriptionType) (topicCount) Mode Cnt Score Error Units ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 10000 10 HOMOGENEOUS 100 avgt 5 1.975 ± 0.086 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 10000 10 HOMOGENEOUS 1000 avgt 5 2.026 ± 0.190 ms/op JMH benchmarks done ``` Reviewers: Ritika Reddy <rreddy@confluent.io>, Jeff Kim <jeff.kim@confluent.io>, Justine Olshan <jolshan@confluent.io>	2024-05-31 13:17:59 -07:00
Mickael Maison	8068a086a3	MINOR: Remove KafkaConfig dependency in KafkaRequestHandler (#16108 ) Reviewers: Luke Chen <showuon@gmail.com>, Apoorv Mittal <amittal@confluent.io>	2024-05-30 11:51:24 +02:00
Calvin Liu	c8af740bd4	Improve producer ID expiration performance (#16075 ) Skip using stream when expiring the producer ID. This can improve the performance significantly when the count is high. Before Benchmark (numProducerIds) Mode Cnt Score Error Units ProducerStateManagerBench.testDeleteExpiringIds 10000 avgt 3 101.253 ± 28.031 us/op ProducerStateManagerBench.testDeleteExpiringIds 100000 avgt 3 2297.219 ± 1690.486 us/op ProducerStateManagerBench.testDeleteExpiringIds 1000000 avgt 3 30688.865 ± 16348.768 us/op After Benchmark (numProducerIds) Mode Cnt Score Error Units ProducerStateManagerBench.testDeleteExpiringIds 10000 avgt 3 39.122 ± 1.151 us/op ProducerStateManagerBench.testDeleteExpiringIds 100000 avgt 3 464.363 ± 98.857 us/op ProducerStateManagerBench.testDeleteExpiringIds 1000000 avgt 3 5731.169 ± 674.380 us/op Also, made a change to the JMH testing which excludes the producer ID populating from the testing. Reviewers: Artem Livshits <alivshits@confluent.io>, Justine Olshan <jolshan@confluent.io>	2024-05-29 16:49:55 -07:00
Justine Olshan	5e3df22095	KAFKA-16308 [1/N]: Create FeatureVersion interface and add `--feature` flag and handling to StorageTool (#15685 ) As part of KIP-1022, I have created an interface for all the new features to be used when parsing the command line arguments, doing validations, getting default versions, etc. I've also added the --feature flag to the storage tool to show how it will be used. Created a TestFeatureVersion to show an implementation of the interface (besides MetadataVersion which is unique) and added tests using this new test feature. I will add the unstable config and tests in a followup. Reviewers: David Mao <dmao@confluent.io>, David Jacot <djacot@confluent.io>, Artem Livshits <alivshits@confluent.io>, Jun Rao <junrao@apache.org>	2024-05-29 16:36:06 -07:00
Omnia Ibrahim	64f699aeea	KAFKA-15853: Move general configs out of KafkaConfig (#16040 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2024-05-28 16:22:54 +02:00
Ritika Reddy	a8d166c00e	KAFKA-16625; Reverse lookup map from topic partitions to members (#15974 ) This patch speeds up the computation of the unassigned partitions by exposing the inverted target assignment. It allows the assignor to check whether a partition is assigned or not. Reviewers: Jeff Kim <jeff.kim@confluent.io>, David Jacot <djacot@confluent.io>	2024-05-25 09:06:15 -07:00
Colin P. McCabe	4f55786a8a	KAFKA-16515: Fix the ZK Metadata cache confusion between brokers and controllers ZkMetadataCache could theoretically return KRaft controller information from a call to ZkMetadataCache.getAliveBrokerNode, which doesn't make sense. KRaft controllers are not part of the set of brokers. The only use-case for this functionality was in MetadataCacheControllerNodeProvider during ZK migration, where it allowed ZK brokers in migration mode to forward requests to kcontrollers when appropriate. This PR changes MetadataCacheControllerNodeProvider to simply delegate to quorumControllerNodeProvider in this case. Reviewers: José Armando García Sancio <jsancio@apache.org>	2024-05-24 10:16:59 -07:00
Jeff Kim	520aa8665c	KAFKA-16626; Lazily convert subscribed topic names to topic ids (#15970 ) This patch aims to remove the data structure that stores the conversion from topic names to topic ids which was taking time similar to the actual assignment computation. Instead, we reuse the already existing ConsumerGroupMember.subscribedTopicNames() and do the conversion to topic ids when the iterator is requested. Reviewers: David Jacot <djacot@confluent.io>	2024-05-24 00:51:50 -07:00
Greg Harris	11ad5e8bca	MINOR: Refactor Values class to fix checkstyle, add benchmark, optimize exceptions (#15469 ) Signed-off-by: Greg Harris <greg.harris@aiven.io> Reviewers: Mickael Maison <mickael.maison@gmail.com>	2024-05-23 13:23:18 -07:00
Mickael Maison	affe8da54c	KAFKA-7632: Support Compression Levels (KIP-390) (#15516 ) Reviewers: Jun Rao <jun@confluent.io>, Luke Chen <showuon@gmail.com> Co-authored-by: Lee Dongjin <dongjin@apache.org>	2024-05-21 17:58:49 +02:00
David Jacot	1e427c029e	MINOR: Fix TargetAssignmentBuilderBenchmark (#15950 ) Reviewers: Jeff Kim <kimkb2011@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2024-05-15 12:53:17 +08:00
Ritika Reddy	ee16eee5de	KAFKA-16587: Add subscription model information to group state (#15785 ) This patch introduces the SubscriptionType to the group state and passes it along to the partition assignor. A group is "homogeneous" when all the members are subscribed to the same topics; or it is "heterogeneous" otherwise. This mainly helps the uniform assignor because it does not have to re-compute this information to determine which algorithm to use. trunk: Benchmark (assignmentType) (assignorType) (isRackAware) (memberCount) (partitionsToMemberRatio) (subscriptionModel) (topicCount) Mode Cnt Score Error Units ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 100 10 HOMOGENEOUS 100 avgt 5 0.136 ± 0.001 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 100 10 HOMOGENEOUS 1000 avgt 5 0.198 ± 0.002 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 1000 10 HOMOGENEOUS 100 avgt 5 1.767 ± 0.138 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 1000 10 HOMOGENEOUS 1000 avgt 5 1.540 ± 0.020 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 10000 10 HOMOGENEOUS 100 avgt 5 32.419 ± 7.173 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 10000 10 HOMOGENEOUS 1000 avgt 5 26.731 ± 1.985 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 100 10 HOMOGENEOUS 100 avgt 5 0.242 ± 0.006 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 100 10 HOMOGENEOUS 1000 avgt 5 1.002 ± 0.006 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 1000 10 HOMOGENEOUS 100 avgt 5 2.544 ± 0.168 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 1000 10 HOMOGENEOUS 1000 avgt 5 10.749 ± 0.207 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 10000 10 HOMOGENEOUS 100 avgt 5 26.832 ± 0.154 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 10000 10 HOMOGENEOUS 1000 avgt 5 106.209 ± 0.301 ms/op JMH benchmarks done patch: Benchmark (assignmentType) (assignorType) (isRackAware) (memberCount) (partitionsToMemberRatio) (subscriptionType) (topicCount) Mode Cnt Score Error Units ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 100 10 HOMOGENEOUS 100 avgt 5 0.131 ± 0.001 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 100 10 HOMOGENEOUS 1000 avgt 5 0.185 ± 0.004 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 1000 10 HOMOGENEOUS 100 avgt 5 1.943 ± 0.091 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 1000 10 HOMOGENEOUS 1000 avgt 5 1.450 ± 0.139 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 10000 10 HOMOGENEOUS 100 avgt 5 30.803 ± 2.644 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL RANGE false 10000 10 HOMOGENEOUS 1000 avgt 5 24.251 ± 1.230 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 100 10 HOMOGENEOUS 100 avgt 5 0.155 ± 0.004 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 100 10 HOMOGENEOUS 1000 avgt 5 0.235 ± 0.010 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 1000 10 HOMOGENEOUS 100 avgt 5 1.602 ± 0.046 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 1000 10 HOMOGENEOUS 1000 avgt 5 1.901 ± 0.174 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 10000 10 HOMOGENEOUS 100 avgt 5 16.098 ± 1.905 ms/op ServerSideAssignorBenchmark.doAssignment INCREMENTAL UNIFORM false 10000 10 HOMOGENEOUS 1000 avgt 5 17.681 ± 0.174 ms/op JMH benchmarks done Reviewers: David Jacot <djacot@confluent.io>	2024-05-13 02:19:05 -07:00
David Arthur	fe8ccbc92c	KAFKA-16539 Fix IncrementalAlterConfigs during ZK migration (#15744 ) This patch fixes two issues with IncrementalAlterConfigs and the ZK migration. First, it changes the handling of IncrementalAlterConfigs to check if the controller is ZK vs KRaft and only forward for KRaft. Second, it adds a check in KafkaZkClient#setOrCreateEntityConfigs to ensure a ZK broker is not directly modifying configs in ZK if there is a KRaft controller. This closes the race condition between KRaft taking over as the active controller and the ZK brokers learning about this. Forwarding During the ZK migration, there is a time when the ZK brokers are running with migrations enabled, but KRaft has yet to take over as the controller. Prior to KRaft taking over as the controller, the ZK brokers in migration mode were unconditionally forwarding IncrementalAlterConfigs (IAC) to the ZK controller. This works for some config types, but breaks when setting BROKER and BROKER_LOGGER configs for a specific broker. The behavior in KafkaApis for IAC was to always forward if the forwarding manager was defined. Since ZK brokers in migration mode have forwarding enabled, the forwarding would happen, and the special logic for BROKER and BROKER_LOGGER would be missed, causing the request to fail. With this fix, the IAC handler will check if the controller is KRaft or ZK and only forward for KRaft. Protected ZK Writes As part of KIP-500, we moved most (but not all) ZK mutations to the ZK controller. One of the things we did not move fully to the controller was entity configs. This is because there was some special logic that needed to run on the broker for certain config updates. If a broker-specific config was set, AdminClient would route the request to the proper broker. In KRaft, we have a different mechanism for handling broker-specific config updates. Leaving this ZK update on the broker side would be okay if we were guarding writes on the controller epoch, but it turns out KafkaZkClient#setOrCreateEntityConfigs does unprotected "last writer wins" updates to ZK. This means a ZK broker could update the contents of ZK after the metadata had been migrated to KRaft. No good! To fix this, this patch adds a check on the controller epoch to KafkaZkClient#setOrCreateEntityConfigs but also adds logic to fail the update if the controller is a KRaft controller. The new logic in setOrCreateEntityConfigs adds STALE_CONTROLLER_EPOCH as a new exception that can be thrown while updating configs. Reviewers: Luke Chen <showuon@gmail.com>, Akhilesh Chaganti <akhileshchg@users.noreply.github.com>, Chia-Ping Tsai <chia7712@gmail.com>	2024-05-07 08:29:57 +08:00
Omnia Ibrahim	d88c15fc3e	KAFKA-15853 Move KRAFT configs out of KafkaConfig (#15775 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-04-27 07:02:31 +08:00
Ritika Reddy	8013657f5d	KAFKA-16568: JMH Benchmarks for Server Side Rebalances (#15717 ) This patch add three benchmarks for the client assignors, the server assignors and the target assignment builder. Reviewers: David Jacot <djacot@confluent.io>	2024-04-25 07:46:45 -07:00
Kuan-Po (Cooper) Tseng	ced79ee12f	KAFKA-16552 Create an internal config to control InitialTaskDelayMs in LogManager to speed up tests (#15719 ) Reviewers: Luke Chen <showuon@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2024-04-20 20:34:02 +08:00
Omnia Ibrahim	e798bed198	KAFKA-16234: Log directory failure re-creates partitions in another logdir automatically (#15335 ) This pr fixes the bug created by #15263 which caused topic partition to be recreated whenever the original log dir is offline: Log directory failure re-creates partitions in another logdir automatically Reviewers: Luke Chen <showuon@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>, Igor Soarez <soarez@apple.com>, Gaurav Narula <gaurav_narula2@apple.com>, Proven Provenzano <pprovenzano@confluent.io>	2024-04-06 14:36:26 +08:00
Nikolay	d8673b26bf	KAFKA-15899 [1/2] Move kafka.security package from core to server module (#15572 ) 1) This PR moves kafka.security classes from core to server module. 2) AclAuthorizer not moved, because it has heavy dependencies on core classes that not rewrited from scala at the moment. 3) AclAuthorizer will be deleted as part of ZK removal Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-03-30 11:54:22 +08:00
Nikolay	355873aa54	MINOR: Use CONFIG suffix in ZkConfigs (#15614 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>, Omnia Ibrahim <o.g.h.ibrahim@gmail.com> Co-authored-by: n.izhikov <n.izhikov@vk.team>	2024-03-28 15:52:34 +01:00
Nikolay	6f38fe5e0a	KAFKA-14588 ZK configuration moved to ZkConfig (#15075 ) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-03-27 22:37:01 +08:00
Jorge Esteban Quilcate Otoya	b25c96a915	KAFKA-16229: Fix slow expired producer id deletion (#15324 ) Expiration of ProducerIds is implemented with a slow removal of map keys: producers.keySet().removeAll(keys); Unnecessarily going through all producer ids and then throw all expired keys to be removed. This leads to exponential time on worst case when most/all keys need to be removed: Benchmark (numProducerIds) Mode Cnt Score Error Units ProducerStateManagerBench.testDeleteExpiringIds 100 avgt 3 9164.043 ± 10647.877 ns/op ProducerStateManagerBench.testDeleteExpiringIds 1000 avgt 3 341561.093 ± 20283.211 ns/op ProducerStateManagerBench.testDeleteExpiringIds 10000 avgt 3 44957983.550 ± 9389011.290 ns/op ProducerStateManagerBench.testDeleteExpiringIds 100000 avgt 3 5683374164.167 ± 1446242131.466 ns/op A simple fix is to use map#remove(key) instead, leading to a more linear growth: Benchmark (numProducerIds) Mode Cnt Score Error Units ProducerStateManagerBench.testDeleteExpiringIds 100 avgt 3 5779.056 ± 651.389 ns/op ProducerStateManagerBench.testDeleteExpiringIds 1000 avgt 3 61430.530 ± 21875.644 ns/op ProducerStateManagerBench.testDeleteExpiringIds 10000 avgt 3 643887.031 ± 600475.302 ns/op ProducerStateManagerBench.testDeleteExpiringIds 100000 avgt 3 7741689.539 ± 3218317.079 ns/op Flamegraph of the CPU usage at dealing with expiration when producers ids ~1Million: Reviewers: Justine Olshan <jolshan@confluent.io>	2024-02-09 17:17:17 -08:00
David Arthur	7bf7fd99a5	KAFKA-16078: Be more consistent about getting the latest MetadataVersion This PR creates MetadataVersion.latestTesting to represent the highest metadata version (which may be unstable) and MetadataVersion.latestProduction to represent the latest version that should be used in production. It fixes a few cases where the broker was advertising that it supported the testing versions even when unstable metadata versions had not been configured. Reviewers: Colin P. McCabe <cmccabe@apache.org>, Ismael Juma <ismael@juma.me.uk>	2024-01-17 14:59:22 -08:00
Divij Vaidya	65424ab484	MINOR: New year code cleanup - include final keyword (#15072 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Sagar Rao <sagarmeansocean@gmail.com>	2024-01-11 17:53:35 +01:00
Fiore Mario Vitale	314de9f23c	KAFKA-15996: Improve JsonConverter performance (#14992 ) Improve JsonConverter performance by using afterBurnModule of Jackson library. Reviewers: Divij Vaidya <diviv@amazon.com>, Mickael Maison <mickael.maison@gmail.com>	2023-12-24 21:47:12 +01:00
Omnia Ibrahim	07490b929b	KAFKA-15365: Broker-side replica management changes (#14881 ) Reviewers: Igor Soarez <soarez@apple.com>, Ron Dagostino <rndgstn@gmail.com>, Proven Provenzano <pprovenzano@confluent.io>	2023-12-11 09:34:22 -05:00
David Arthur	a8622faf47	KAFKA-15799 Handle full metadata updates on ZK brokers (#14719 ) This patch adds the concept of a "Full" UpdateMetadataRequest, similar to what is used in LeaderAndIsr. A new tagged field is added to UpdateMetadataRequest at version 8 which allows the KRaft controller to indicate if a UMR contains all the metadata or not. Since UMR is implicitly treated as incremental by the ZK broker, we needed a way to detect topic deletions when the KRaft broker sends a metadata snapshot to the ZK broker. By sending a "Full" flag, the broker can now compare existing topic IDs to incoming topic IDs and calculate which topics should be removed from the MetadataCache. This patch only removes deleted topics from the MetadataCache. Partition/log management was implemented in KAFKA-15605. Reviewers: Colin P. McCabe <cmccabe@apache.org>	2023-11-16 14:38:44 -08:00
hudeqi	9911fab1a1	KAFKA-15432: RLM Stop partitions should not be invoked for non-tiered storage topics (#14667 ) Reviewers: Christo Lolov <lolovc@amazon.com>, Divij Vaidya <diviv@amazon.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	2023-11-02 10:00:15 +01:00
Calvin Liu	14029e2ddd	KAFKA-15582: Identify clean shutdown broker (#14465 ) The PR includes: * Added a new class of CleanShutdownFile which helps write and read from a clean shutdown file. * Updated the BrokerRegistration API. * Client side handling for the broker epoch. * Minimum work on the controller side. Reviewers: Jun Rao <junrao@gmail.com>	2023-10-19 10:25:23 -07:00
Matthias J. Sax	9b468fb278	MINOR: Do not end Javadoc comments with `**/` (#14540 ) Reviewers: Bruno Cadonna <bruno@confluent.io>, Bill Bejeck <bill@confluent.io>, Hao Li <hli@confluent.io>, Josep Prat <josep.prat@aiven.io>	2023-10-17 21:11:04 -07:00
Mehari Beyene	25b128de81	KAFKA-14991: KIP-937-Improve message timestamp validation (#14135 ) This implementation introduces two new configurations `log.message.timestamp.before.max.ms` and `log.message.timestamp.after.max.ms` and deprecates `log.message.timestamp.difference.max.ms`. The default value for all these three configs is maintained to be Long.MAX_VALUE for backward compatibility but with the newly added configurations we can have a finer control when validating message timestamps that are in the past and the future compared to the broker's timestamp. To maintain backward compatibility if the default value of `log.message.timestamp.before.max.ms` is not changed, we are assuming users are still using the deprecated config `log.message.timestamp.difference.max.ms` and validation is done using its value. This ensures that existing customers who have customized the value of `log.message.timestamp.difference.max.ms` will continue to see no change in behavior. Reviewers: Divij Vaidya <diviv@amazon.com>, Christo Lolov <lolovc@amazon.com>	2023-08-24 12:04:55 +02:00
Luke Chen	748175ce62	KAFKA-15189: only init remote topic metrics when enabled (#14133 ) Only initialize remote topic metrics when system-wise remote storage is enabled to avoid impacting performance for existing brokers. Also add tests. Reviewers: Divij Vaidya <diviv@amazon.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	2023-08-05 13:00:16 +08:00
Luke Chen	27ea025e33	KAFKA-15176: add tests for tiered storage metrics (#13999 ) Added tests for metrics: 1. RemoteLogReaderTaskQueueSize 2. RemoteLogReaderAvgIdlePercent 3. RemoteLogManagerTasksAvgIdlePercent Also, added tests for OffsetOutOfRangeException will be thrown while reading logs Reviewers: Christo Lolov <christololov@gmail.com>, Satish Duggana <satishd@apache.org>	2023-07-21 10:30:33 +08:00
Justine Olshan	ea0bb00126	KAFKA-14884: Include check transaction is still ongoing right before append (take 2) (#13787 ) Introduced extra mapping to track verification state. When verifying, there is a race condition that the add partitions verification response returns that the partition is in the ongoing transaction, but an abort marker is written before we get to append. Therefore, we track any given transaction we are verifying with an object unique to that transaction. We check this unique state upon the first append to the log. After that, we can rely on currentTransactionFirstOffset. We remove the verification state on appending to the log with a transactional data record or marker. We will also clean up lingering verification state entries via the producer state entry expiration mechanism. We do not update the the timestamp on retrying a verification for a transaction, so each entry must be verified before producer.id.expiration.ms. There were a few other fixes: - Moved the transaction manager handling for failed batch into the future completed exceptionally block to avoid processing it twice (this caused issues in unit tests) - handle interrupted exceptions encountered when callback thread encountered them - change handling to throw error if we try to set verification state and leaderLogIfLocal is None. Reviewers: David Jacot <djacot@confluent.io>, Artem Livshits <alivshits@confluent.io>, Jason Gustafson <jason@confluent.io>	2023-07-14 15:18:11 -07:00
Colin P. McCabe	cd3c0ab1a3	KAFKA-15060: fix the ApiVersionManager interface This PR expands the scope of ApiVersionManager a bit to include returning the current MetadataVersion and features that are in effect. This is useful in general because that information needs to be returned in an ApiVersionsResponse. It also allows us to fix the ApiVersionManager interface so that all subclasses implement all methods of the interface. Having subclasses that don't implement some methods is dangerous because they could cause exceptions at runtime in unexpected scenarios. On the KRaft controller, we were previously performing a read operation in the QuorumController thread to get the current metadata version and features. With this PR, we now read a volatile variable maintained by a separate MetadataVersionContextPublisher object. This will improve performance and simplify the code. It should not change the guarantees we are providing; in both the old and new scenarios, we need to be robust against version skew scenarios during updates. Add a Features class which just has a 3-tuple of metadata version, features, and feature epoch. Remove MetadataCache.FinalizedFeaturesAndEpoch, since it just duplicates the Features class. (There are some additional feature-related classes that can be consolidated in in a follow-on PR.) Create a java class, EndpointReadyFutures, for managing the futures associated with individual authorizer endpoints. This avoids code duplication between ControllerServer and BrokerServer and makes this code unit-testable. Reviewers: David Arthur <mumrah@gmail.com>, dengziming <dengziming1993@gmail.com>, Luke Chen <showuon@gmail.com>	2023-06-19 16:46:44 -07:00
David Jacot	7eea2a3908	MINOR: Move MockTime to server-common (#13823 ) This patch rewrite `MockTime` in Java and moves it to `server-common` module. This is a prerequisite to move `MockTimer` later on to `server-common` as well. Reviewers: David Arthur <mumrah@gmail.com>	2023-06-09 08:54:25 +02:00
Divij Vaidya	fe6a827e20	KAFKA-14633: Reduce data copy & buffer allocation during decompression (#13135 ) After this change, For broker side decompression: JMH benchmark RecordBatchIterationBenchmark demonstrates 20-70% improvement in throughput (see results for RecordBatchIterationBenchmark.measureSkipIteratorForVariableBatchSize). For consumer side decompression: JMH benchmark RecordBatchIterationBenchmark a mix bag of single digit regression for some compression type to 10-50% improvement for Zstd (see results for RecordBatchIterationBenchmark.measureStreamingIteratorForVariableBatchSize). Reviewers: Luke Chen <showuon@gmail.com>, Manyanda Chitimbo <manyanda.chitimbo@gmail.com>, Ismael Juma <mail@ismaeljuma.com>	2023-06-05 15:04:49 +08:00
Yash Mayya	9bb2f78d53	KAFKA-15034: Improve performance of the ReplaceField SMT; add JMH benchmark (#13776 ) Reviewers: Chris Egerton <chrise@aiven.io>	2023-06-01 15:14:31 -04:00
Justine Olshan	9edf2ec5cc	MINOR: Add transaction verification config to producerStateManager config (#13770 ) I have moved this config into producer state manager so it can be checked easily under the log lock when we are about to append. Only a few test files currently use the validation and those have been verified to work via running the tests. Reviews: David Jacot <djacot@confluent.io>	2023-05-30 13:46:17 -07:00
Divij Vaidya	6bcc497c36	KAFKA-14766: Improve performance of VarInt encoding and decoding (#13312 ) Motivation Reading/writing the protocol buffer varInt32 and varInt64 (also called varLong in our code base) is in the hot path of data plane code in Apache Kafka. We read multiple varInt in a record and in long. Hence, even a minor change in performance could extrapolate to larger performance benefit. In this PR, we only update varInt32 encoding/decoding. Changes This change uses loop unrolling and reduces the amount of repetition of calculations. Based on the empirical results from the benchmark, the code has been modified to pick up the best implementation. Results Performance has been evaluated using JMH benchmarks on JDK 17.0.6. Various implementations have been added in the benchmark and benchmarking has been done for different sizes of varints and varlongs. The benchmark for various implementations have been added at ByteUtilsBenchmark.java Reviewers: Ismael Juma <mlists@juma.me.uk>, Luke Chen <showuon@gmail.com>, Alexandre Dupriez <alexandre.dupriez@gmail.com>	2023-05-05 20:05:20 +08:00
Luke Chen	d796480fe8	KAFKA-14909: check zkMigrationReady tag before migration (#13631 ) 1. add ZkMigrationReady in apiVersionsResponse 2. check all nodes if ZkMigrationReady are ready before moving to next migration state Reviewers: David Arthur <mumrah@gmail.com>, dengziming <dengziming1993@gmail.com>	2023-04-28 14:35:12 +08:00
Purshotam Chauhan	df13775254	KAFKA-14828: Remove R/W locks using persistent data structures (#13437 ) Currently, StandardAuthorizer uses a R/W lock for maintaining the consistency of data. For the clusters with very high traffic, we will typically see an increase in latencies whenever a write operation comes. The intent of this PR is to get rid of the R/W lock with the help of immutable or persistent collections. Basically, new object references are used to hold the intermediate state of the write operation. After the completion of the operation, the main reference to the cache is changed to point to the new object. Also, for the read operation, the code is changed such that all accesses to the cache for a single read operation are done to a particular cache object only. In the PR description, you can find the performance of various libraries at the time of both read and write. Read performance is checked with the existing AuthorizerBenchmark. For write performance, a new AuthorizerUpdateBenchmark has been added which evaluates the performance of the addAcl operation. Reviewers: Ron Dagostino <rndgstn@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>, Divij Vaidya <diviv@amazon.com>	2023-04-21 14:08:23 +05:30
Dimitar Dimitrov	e14dd8024a	KAFKA-14821 Implement the listOffsets API with AdminApiDriver (#13432 ) We are handling complex workflows ListOffsets by chaining together MetadataCall instances and ListOffsetsCall instances, there are many complex and error-prone logic. In this PR we rewrote it with the `AdminApiDriver` infra, notable changes better than old logic: 1. Retry lookup stage on receiving `NOT_LEADER_OR_FOLLOWER` and `LEADER_NOT_AVAILABLE`, whereas in the past we failed the partition directly without retry. 2. Removing class field `supportsMaxTimestamp` and calculating it on the fly to avoid the mutable state, this won't change any behavior of the client. 3. Retry fulfillment stage on `RetriableException`, whereas in the past we just retry fulfillment stage on `InvalidMetadataException`, this means we will retry on `TimeoutException` and other `RetriableException`. We also `handleUnsupportedVersionException` to `AdminApiHandler` and `AdminApiLookupStrategy`, they are used to keep consistency with old logic, and we can continue improvise them. Reviewers: Ziming Deng <dengziming1993@gmail.com>, David Jacot <djacot@confluent.io>	2023-04-20 11:29:27 +08:00
Ron Dagostino	e27926f92b	KAFKA-14735: Improve KRaft metadata image change performance at high … (#13280 ) topic counts. Introduces the use of persistent data structures in the KRaft metadata image to avoid copying the entire TopicsImage upon every change. Performance that was O(<number of topics in the cluster>) is now O(<number of topics changing>), which has dramatic time and GC improvements for the most common topic-related metadata events. We abstract away the chosen underlying persistent collection library via ImmutableMap<> and ImmutableSet<> interfaces and static factory methods. Reviewers: Luke Chen <showuon@gmail.com>, Colin P. McCabe <cmccabe@apache.org>, Ismael Juma <ismael@juma.me.uk>, Purshotam Chauhan <pchauhan@confluent.io>	2023-04-17 17:52:28 -04:00
Calvin Liu	d5e216d618	KAFKA-14617: Fill broker epochs to the AlterPartitionRequest (#13489 ) As the third part of the KIP-903, it fills the broker epochs from the Fetch request into the AlterPartitionRequest. Also, before generating the alterPartitionRequest, the partition will check whether the broker epoch from the FetchRequest matches with the broker epoch recorded in the metadata cache. If not, the ISR change will be delayed. Reviewers: Jun Rao <junrao@gmail.com>	2023-04-07 09:09:29 -07:00
Purshotam Chauhan	f3e4dd9229	KAFKA-14827: Support for StandardAuthorizer benchmark (#13423 ) * KAFKA-14827: Support for StandardAuthorizer benchmark Co-authored-by: Purshotam Chauhan <purshotam.r.chauhan@gmail.com> * reverting unintentional change --------- Co-authored-by: David Arthur <mumrah@gmail.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	2023-03-28 14:14:50 +05:30
Calvin Liu	79b5f7f1ce	KAFKA-14617: Add ReplicaState to FetchRequest (KIP-903) (#13323 ) This patch is the first part of KIP-903. It updates the FetchRequest to include the new tagged ReplicaState field which replaces the now deprecated ReplicaId field. The FetchRequest version is bumped to version 15 and the MetadataVersion to 3.5-IV1. Reviewers: David Jacot <djacot@confluent.io>	2023-03-16 14:04:34 +01:00
Kowshik Prakasam	9f55945270	MINOR: Introduce OffsetAndEpoch in LeaderEndpoint interface return values (#13268 ) Reviewers: Satish Duggana <satishd@apache.org>, Alexandre Dupriez <alexandre.dupriez@gmail.com>, Jun Rao <junrao@gmail.com>	2023-02-23 17:29:32 -08:00
Satish Duggana	069ce59e1e	KAFKA 14714: Move/Rewrite RollParams, LogAppendInfo, and LeaderHwChange to storage module. (#13255 ) Reviewers: Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk>	2023-02-22 23:12:04 +05:30
David Jacot	3be7f7d611	KAFKA-14391; Add ConsumerGroupHeartbeat API (#12972 ) This patch does a few things: 1) It introduces a new flag to the request spec: `latestVersionUnstable`. It signifies that the last version of the API is considered unstable (or still in development). As such, the last API version is not exposed by the server unless specified otherwise with the new internal `unstable.api.versions.enable`. This allows us to commit new APIs which are still in development. 3) It adds the ConsumerGroupHeartbeat API, part of KIP-848, and marks it as unreleased for now. 4) It adds the new error codes required by the new ConsumerGroupHeartbeat API. Reviewers: Justine Olshan <jolshan@confluent.io>, Jeff Kim <jeff.kim@confluent.io>, Jason Gustafson <jason@confluent.io>	2023-02-09 09:13:31 +01:00
Satish Duggana	1d3fb76092	KAFKA-14688 Move package org.apache.kafka.server.log.internals to org.apache.kafka.storage.internals.log (#13213 ) Reviewers: Ismael Juma <ismael@juma.me.uk>	2023-02-08 09:22:42 +05:30
David Jacot	2e0a005dd4	KAFKA-14367; Add internal APIs to the new `GroupCoordinator` interface (#13112 ) This patch migrates all the internal APIs of the current group coordinator to the new `GroupCoordinator` interface. It also makes the current implementation package private to ensure that it is not used anymore. Reviewers: Justine Olshan <jolshan@confluent.io>	2023-01-20 08:38:21 +01:00

1 2 3 4 5 ...

259 Commits