Use the standard org.apache.kafka.common.KafkaException instead of kafka.common.KafkaException.
Reviewers: Colin P. McCabe <cmccabe@apache.org>, Ismael Juma <ismael@confluent.io>
Show the LeaderRecoveryState in MetadataShell.
Fix a case where we were comparing a Byte type with an enum type.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
Implement auto leader rebalance for KRaft by keeping track of the set of topic partitions which have a leader that is not the preferred replica. If this set is non-empty then schedule a leader balance event for the replica control manager.
When applying PartitionRecords and PartitionChangeRecords to the ReplicationControlManager, if the elected leader is not the preferred replica then remember this topic partition in the set of imbalancedPartitions.
Any time the quorum controller processes a ControllerWriteEvent, it schedules a rebalance operation if there are no pending rebalance operations, the feature is enabled, and there are imbalanced partitions.
This KRaft implementation only supports the configuration properties auto.leader.rebalance.enable and leader.imbalance.check.interval.seconds. The configuration property leader.imbalance.per.broker.percentage is not supported and is ignored.
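A minimal sketch of the bookkeeping described above (class and method names are illustrative, not the actual ReplicationControlManager API):
```java
import java.util.HashSet;
import java.util.Set;

// Illustrative sketch: track partitions whose current leader is not the preferred
// (first) replica so that a rebalance event can be scheduled when appropriate.
class ImbalancedPartitionTracker {
    record TopicPartition(String topic, int partition) {}

    private final Set<TopicPartition> imbalancedPartitions = new HashSet<>();

    // Called when a PartitionRecord or PartitionChangeRecord is applied.
    void onPartitionChange(TopicPartition tp, int leader, int[] replicas) {
        if (leader != replicas[0]) {     // replicas[0] is the preferred replica
            imbalancedPartitions.add(tp);
        } else {
            imbalancedPartitions.remove(tp);
        }
    }

    // Consulted after each ControllerWriteEvent is processed.
    boolean shouldScheduleRebalance(boolean rebalanceEnabled, boolean rebalancePending) {
        return rebalanceEnabled && !rebalancePending && !imbalancedPartitions.isEmpty();
    }
}
```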
Reviewers: Jun Rao <junrao@gmail.com>, David Arthur <mumrah@gmail.com>
Implementation of the protocol for starting and stopping leader recovery after an unclean leader election. This includes the management of state in the controllers (legacy and KRaft) and propagating this information to the brokers. This change doesn't implement log recovery after an unclean leader election.
Protocol Changes
================
For the topic partition state znode, the new field "leader_recovery_state" was added. If the field is missing the value is assumed to be RECOVERED.
ALTER_PARTITION was renamed from ALTER_ISR. The CurrentIsrVersion field was renamed to PartitionEpoch. The new field LeaderRecoveryState was added.
The new field LeaderRecoveryState was added to the LEADER_AND_ISR request. The inter-broker protocol version is used to determine which version to send to the brokers.
A new tagged field for LeaderRecoveryState was added to both the PartitionRecord and PartitionChangeRecord.
Controller
==========
For both the KRaft and legacy controllers, the LeaderRecoveryState is set to RECOVERING if the leader was elected from outside the ISR, also known as unclean leader election. The controller sets the state back to RECOVERED after receiving an ALTER_PARTITION request with version 0, or with version 1 and the LeaderRecoveryState set to RECOVERED.
Both controllers preserve the leader recovery state even if the unclean leader goes offline and comes back online before a RECOVERED ALTER_PARTITION is sent.
The controllers reply with INVALID_REQUEST if the ALTER_PARTITION request either (see the sketch after this list):
1. Attempts to increase the ISR while the partition is still RECOVERING
2. Attempts to change the leader recovery state to RECOVERING from a RECOVERED state.
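A minimal sketch of these two rules (hypothetical types and names, not the actual controller code):
```java
import java.util.Optional;

enum LeaderRecoveryState { RECOVERED, RECOVERING }

// Illustrative only: reject an ALTER_PARTITION request that grows the ISR while the
// partition is still RECOVERING, or that moves a RECOVERED partition back to RECOVERING.
final class AlterPartitionValidator {
    static Optional<String> validate(int currentIsrSize, LeaderRecoveryState currentState,
                                     int requestedIsrSize, LeaderRecoveryState requestedState) {
        if (currentState == LeaderRecoveryState.RECOVERING && requestedIsrSize > currentIsrSize) {
            return Optional.of("INVALID_REQUEST: cannot expand the ISR while RECOVERING");
        }
        if (currentState == LeaderRecoveryState.RECOVERED
                && requestedState == LeaderRecoveryState.RECOVERING) {
            return Optional.of("INVALID_REQUEST: cannot change RECOVERED back to RECOVERING");
        }
        return Optional.empty();
    }
}
```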
Topic Partition Leader
======================
The topic partition leader doesn't implement any log recovery in this change. The topic partition leader immediately marks the partition as RECOVERED and sends that state in the next ALTER_PARTITION request.
Reviewers: Jason Gustafson <jason@confluent.io>
It is possible to clean a segment partially if the offset map is filled before reaching the end of the segment. The highest offset that is reached becomes the new dirty offset after the cleaning completes. The data above this offset is nevertheless copied over to the new partially cleaned segment. Hence we need to ensure that the transaction index reflects aborted transactions from both the cleaned and uncleaned portion of the segment. Prior to this patch, this was not the case. We only collected the aborted transactions from the cleaned portion, which means that the reconstructed index could be incomplete. This can cause the aborted data to become effectively committed. It can also cause the deletion of the abort marker before the corresponding data has been removed (i.e. the aborted transaction becomes hanging).
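A simplified illustration of the fix's intent (hypothetical types, not the LogCleaner internals): the transaction index for the new segment must cover the full offset range that is copied over, not just the cleaned portion below the new dirty offset.
```java
import java.util.List;
import java.util.stream.Collectors;

final class AbortedTxnIndexSketch {
    record AbortedTxn(long producerId, long firstOffset, long lastOffset) {}

    // Collect aborted transactions overlapping [segmentBaseOffset, segmentEndOffset],
    // i.e. both the cleaned range and the uncleaned range copied into the new segment.
    static List<AbortedTxn> abortedTxnsForNewSegment(long segmentBaseOffset,
                                                     long segmentEndOffset,
                                                     List<AbortedTxn> allAbortedTxns) {
        return allAbortedTxns.stream()
                .filter(t -> t.lastOffset() >= segmentBaseOffset
                          && t.firstOffset() <= segmentEndOffset)
                .collect(Collectors.toList());
    }
}
```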
Reviewers: Jun Rao <junrao@gmail.com>
I collected a list of the most flaky tests observed lately, checked / created their corresponding tickets, and marked them as ignored for now. Many of these failing tests are:
0. Failing very frequently in the past (at least in my observations).
1. not investigated for some time.
2. have a PR for review (mostly thanks to @showuon !), but not reviewed for some time.
Because of 0), these test failures are hindering our development; and from 1) and 2) above, people are either too busy to look after them, or honestly the tests are not considered as providing value, since otherwise people would care enough to panic and try to resolve them. So I think it's reasonable to disable all these tests for now. If we later learn our lesson the hard way, it would motivate us to tackle flaky tests more diligently as well.
I'm only disabling tests that have been failing for a while; if no one has been looking into them for that long, I'm concerned that just gossiping about the flakiness would not bring people's attention to them either. So my psychological motivation is: "if people do not care about those failed tests for weeks (which is not a good thing! :P), let's teach ourselves the lesson the hard way when it indeed buries a bug that bites us, or not learn the lesson at all --- which would indicate those tests are indeed not valuable". Tests that I only saw fail very recently, I did not disable.
Reviewers: John Roesler <vvcephei@apache.org>, Matthias J. Sax <mjsax@apache.org>, Luke Chen <showuon@gmail.com>, Randall Hauch <rhauch@gmail.com>
When a socket is closed, the corresponding channel should be retained only if there are complete buffered requests.
Reviewers: David Jacot <djacot@confluent.io>
Create KafkaConfigSchema to encapsulate the concept of determining the types of configuration keys.
This is useful in the controller because we can't import KafkaConfig, which is part of core. Also
introduce the TimelineObject class, which is a more generic version of TimelineInteger /
TimelineLong.
Reviewers: David Arthur <mumrah@gmail.com>
Issue:
Imagine a scenario where two threads T1 and T2 are inside UnifiedLog.flush() concurrently:
KafkaScheduler thread T1 -> The periodic work calls LogManager.flushDirtyLogs() which in turn calls UnifiedLog.flush(). For example, this can happen due to log.flush.scheduler.interval.ms here.
KafkaScheduler thread T2 -> A UnifiedLog.flush() call is triggered asynchronously during segment roll here.
Suppose thread T1 advances the recovery point beyond the flush offset of thread T2; this could trip the check within LogSegments.values() here for thread T2, when it is called from LocalLog.flush() here. The exception causes the KafkaScheduler thread to die, which is not desirable.
Fix:
We fix this by ensuring that LocalLog.flush() is immune to the case where the recoveryPoint advances beyond the flush offset.
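A minimal sketch of the idea, assuming a simplified flush path (not the actual LocalLog/UnifiedLog code): the flush becomes a no-op when another thread has already advanced the recovery point past the requested offset, instead of tripping a segment range check.
```java
final class FlushSketch {
    private volatile long recoveryPoint = 0L;

    // Illustrative only: flush segments in [recoveryPoint, offset) and advance the
    // recovery point, tolerating a recovery point advanced concurrently by another thread.
    void flush(long offset) {
        long from = recoveryPoint;
        if (offset <= from) {
            return; // another flush already covered this range; nothing to do
        }
        flushSegmentsInRange(from, offset);
        recoveryPoint = Math.max(recoveryPoint, offset);
    }

    private void flushSegmentsInRange(long fromOffset, long toOffset) {
        // fsync the segments overlapping [fromOffset, toOffset) -- elided
    }
}
```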
Reviewers: Jun Rao <junrao@gmail.com>
There seemed to be a little sloppiness in the integration tests in regard to admin client creation. Not only was there duplicated logic, but it wasn't always clear which listener the admin client was targeting. This made it difficult to tell in the context of authorization tests whether we were indeed testing with the right principal. As an example, we had a method in TestUtils which was using the inter-broker listener implicitly. This meant that the test was using the broker principal which had super user privilege. This was intentional, but I think it would be clearer to make the dependence on this listener explicit. This patch attempts to clean this up a bit by consolidating some of the admin creation logic and making the reliance on the listener clearer.
Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>
There are a few integration tests for the forwarding logic which were added prior to kraft being ready for integration testing. Now that we have enabled kraft in integration tests, these tests are redundant and can be removed.
Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>
In `KafkaServer`, `ZkConfigRepository` is just a wrapper around `zkClient`, so we don't need to create a new one.
Reviewers: Jason Gustafson <jason@confluent.io>
This patch enables `ApiVersionsTest` to test both kraft brokers and controllers. It also fixes a minor bug in how the `Envelope` API is expected to be exposed through `ApiVersions` requests to the kraft broker.
Reviewers: Jason Gustafson <jason@confluent.io>
The current naming of the fields in `ProducerIdsRecord` is a little confusing in regard to whether the block range was inclusive or exclusive. This patch tries to improve naming to make this clearer. In the record class, instead of `ProducerIdsEnd`, we use `NextProducerId`. We have also updated related classes such as `ProducerIdsBlock.java` with similar changes.
Reviewers: dengziming <dengziming1993@gmail.com>, David Arthur <mumrah@gmail.com>
In #11649, we fixed one permission inconsistency between kraft and zk authorization for the `CreatePartitions` request. Previously kraft was requiring `CREATE` permission on the `Topic` resource when it should have required `ALTER`. A second inconsistency is that kraft was also allowing `CREATE` on the `Cluster` resource, which is not supported in zk clusters and was not documented in KIP-195: https://cwiki.apache.org/confluence/display/KAFKA/KIP-195%3A+AdminClient.createPartitions. This patch fixes this inconsistency and adds additional test coverage for both cases.
Reviewers: José Armando García Sancio <jsancio@gmail.com>
This PR follows #11629 to enable `CreateTopicsRequestWithForwardingTest` and `CreateTopicsRequestWithPolicyTest` in KRaft mode.
Reviewers: Jason Gustafson <jason@confluent.io>
This patch adds missing `equals` and `hashCode` implementations for `RawMetaProperties`. This is relied on by the storage tool for detecting when two log directories have different `meta.properties` files.
Reproduce current issue:
```shell
$ sed -i 's|log.dirs=/tmp/kraft-combined-logs|log.dirs=/tmp/kraft-combined-logs,/tmp/kraft-combined-logs2|' ./config/kraft/server.properties
$ ./bin/kafka-storage.sh format -t R19xNyxMQvqQRGlkGDi2cg -c ./config/kraft/server.properties
Formatting /tmp/kraft-combined-logs
Formatting /tmp/kraft-combined-logs2
$ ./bin/kafka-storage.sh info -c ./config/kraft/server.properties
Found log directories:
/tmp/kraft-combined-logs
/tmp/kraft-combined-logs2
Found metadata: {cluster.id=R19xNyxMQvqQRGlkGDi2cg, node.id=1, version=1}
Found problem:
Metadata for /tmp/kraft-combined-logs2/meta.properties was {cluster.id=R19xNyxMQvqQRGlkGDi2cg, node.id=1, version=1}, but other directories featured {cluster.id=R19xNyxMQvqQRGlkGDi2cg, node.id=1, version=1}
```
It's reporting that identical metadata is not the same...
With this fix:
```shell
$ ./bin/kafka-storage.sh info -c ./config/kraft/server.properties
Found log directories:
/tmp/kraft-combined-logs
/tmp/kraft-combined-logs2
Found metadata: {cluster.id=R19xNyxMQvqQRGlkGDi2cg, node.id=1, version=1}
```
Reviewers: Igor Soarez <soarez@apple.com>, Jason Gustafson <jason@confluent.io>
This patch ensures that the committed offsets are not expired while the group is rebalancing. The issue is that we can't rely on the subscribed topics if the group is not stable.
Reviewers: David Jacot <djacot@confluent.io>
Currently, when using KRaft mode, users still have to have an Apache ZooKeeper instance if they want to use AclAuthorizer. We should have a built-in Authorizer for KRaft mode that does not depend on ZooKeeper. This PR introduces such an authorizer, called StandardAuthorizer. See KIP-801 for a full description of the new Authorizer design.
Authorizer.java: add aclCount API as described in KIP-801. StandardAuthorizer is currently the only authorizer that implements it, but eventually we may implement it for AclAuthorizer and others as well.
ControllerApis.scala: fix a bug where createPartitions was authorized using CREATE on the topic resource rather than ALTER on the topic resource as it should have been.
QuorumTestHarness: rename the controller endpoint to CONTROLLER for consistency (the brokers already called it that). This is relevant in AuthorizerIntegrationTest where we are examining endpoint names. Also add the controllerServers call.
TestUtils.scala: adapt the ACL functions to be usable from KRaft, by ensuring that they use the Authorizer from the current active controller.
BrokerMetadataPublisher.scala: add broker-side ACL application logic.
Controller.java: add ACL APIs. Also add a findAllTopicIds API in order to make junit tests that use KafkaServerTestHarness#getTopicNames and KafkaServerTestHarness#getTopicIds work smoothly.
AuthorizerIntegrationTest.scala: convert over testAuthorizationWithTopicExisting (more to come soon)
QuorumController.java: add logic for replaying ACL-based records. This means storing them in the new AclControlManager object, and integrating them into controller snapshots. It also means applying the changes in the Authorizer, if one is configured. In renounce, when reverting to a snapshot, also set newBytesSinceLastSnapshot to 0.
Reviewers: YeonCheol Jang <YeonCheolGit@users.noreply.github.com>, Jason Gustafson <jason@confluent.io>
Title: KafkaConsumer cannot jump out of the poll method, and CPU and traffic on the broker side increase sharply
Description: The local test passed, and the problem described in the JIRA is resolved.
JIRA link: https://issues.apache.org/jira/browse/KAFKA-13310
Reviewers: Luke Chen <showuon@gmail.com>, Guozhang Wang <wangguoz@gmail.com>
After KAFKA-10793, we clear the findCoordinatorFuture in 2 places:
1. heartbeat thread
2. AbstractCoordinator#ensureCoordinatorReady
But in non-consumer-group mode with a group id provided (for offset commits, so a ConsumerCoordinator is created), there is no (1) heartbeat thread, and we only call (2) AbstractCoordinator#ensureCoordinatorReady the first time the consumer wants to fetch the committed offset position. That is, after the 2nd lookupCoordinator call, we have no chance to clear the findCoordinatorFuture, which causes the offset commit to never succeed.
To avoid the race condition mentioned in KAFKA-10793, it's not safe to clear the findCoordinatorFuture in the future listener. So I think we can fix this issue by calling AbstractCoordinator#ensureCoordinatorReady when the coordinator is unknown in the non-consumer-group case, on each ConsumerCoordinator#poll.
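A rough sketch of the proposed flow (method names only approximate the consumer internals):
```java
// Illustrative only: when a group id is configured but group management (and therefore
// the heartbeat thread) is not in use, poll() itself must make sure the coordinator is
// known, since nothing else will clear a completed find-coordinator future.
final class CoordinatorPollSketch {
    private final boolean usesGroupManagement;
    private boolean coordinatorKnown = false;

    CoordinatorPollSketch(boolean usesGroupManagement) {
        this.usesGroupManagement = usesGroupManagement;
    }

    void poll() {
        if (!usesGroupManagement && !coordinatorKnown) {
            ensureCoordinatorReady();
        }
        // ... fetch committed offsets, send offset commits, etc.
    }

    private void ensureCoordinatorReady() {
        // Looks up the coordinator and clears any completed find-coordinator future.
        coordinatorKnown = true;
    }
}
```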
Reviewers: Guozhang Wang <wangguoz@gmail.com>
We allowed `maxProducerIdExpirationMs` and `time` to be optional in the `ProducerStateManager` constructor. We generally frown on optional arguments since it is too easy to overlook them. In this case, it was especially dangerous because the recently added `maxTransactionTimeoutMs` argument used the same type as `maxProducerIdExpirationMs`.
Reviewers: David Jacot <david.jacot@gmail.com>, Ismael Juma <ismael@juma.me.uk>
In v3.0, we changed the default value for `enable.idempotence` to true, but we didn't adjust the validator and the `idempotence` enabled check method. So if a user didn't explicitly enable idempotence, this feature won't be turned on. This patch addresses the problem, cleans up associated logic, and fixes tests that broke as a result of properly applying the new default. Specifically it does the following:
1. fix the `ProducerConfig#idempotenceEnabled` method, to make it correctly detect whether idempotence is enabled (a minimal sketch of the corrected check follows this list)
2. remove some unnecessary config overrides and checks, since we already default the `acks`, `retries` and `enable.idempotence` configs.
3. move the config validator for the idempotent producer from `KafkaProducer` into `ProducerConfig`. The config validation should be the responsibility of the `ProducerConfig` class.
4. add an `AbstractConfig#hasKeyInOriginals` method, to avoid copying the `originals` configs when we only want to check for the existence of a key.
5. fix many broken tests. As mentioned, we didn't actually enable idempotence in v3.0, so after this PR some tests break due to behavioral differences between the idempotent and non-idempotent producer.
6. add additional tests to validate configuration behavior
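A minimal sketch of the corrected check referenced in item 1, assuming simplified config handling (not the actual ProducerConfig code):
```java
import java.util.Map;

// Illustrative only: idempotence now defaults to true, so the check must read the
// effective value rather than requiring the user to have set it explicitly.
final class IdempotenceCheckSketch {
    static boolean idempotenceEnabled(Map<String, Object> effectiveConfigs) {
        Object value = effectiveConfigs.getOrDefault("enable.idempotence", true);
        return Boolean.parseBoolean(String.valueOf(value));
    }
}
```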
Reviewers: Kirk True <kirk@mustardgrain.com>, Ismael Juma <ismael@juma.me.uk>, Mickael Maison <mimaison@users.noreply.github.com>, Jason Gustafson <jason@confluent.io>
This PR updates the documentation and tooling to match https://github.com/apache/kafka/pull/10914, which added support for encoding `deleteHorizonMs` in the record batch schema. The changes include adding the new attribute and updating field names. We have also updated stale references to the old `FirstTimestamp` field in the code and comments. Finally, in the `DumpLogSegments` tool, when record batch information is printed, it will also include the value of `deleteHorizonMs` (e.g. `OptionalLong.empty` or `OptionalLong[123456]`).
Reviewers: Vincent Jiang <84371940+vincent81jiang@users.noreply.github.com>, Kvicii <42023367+Kvicii@users.noreply.github.com>, Jason Gustafson <jason@confluent.io>
The current error message suggests that controller.listener.names is a replacement for
control.plane.listener.name. This is incorrect since these configurations have very different
functions. This PR deletes the incorrect message.
Reviewers: David Jacot <david.jacot@gmail.com>, Kvicii
`zookeeper.sync.time.ms` was previously used with the old Scala consumer, which
was removed in Apache Kafka 2.0.0. Remove the config definition from `KafkaConfig`
and documentation.
Reviewers: Luke Chen <showuon@gmail.com>, Ismael Juma <ismael@juma.me.uk>
With ddb6959c62, `Consumer::poll` will return an empty record batch when the position advances due to aborted transactions or control records. This makes the `ConsoleConsumer` exit because it assumes that `poll` returned due to the timeout being reached. This patch fixes this by explicitly tracking the timeout.
Reviewers: Jason Gustafson <jason@confluent.io>
Within a LogSegment, the TimeIndex and OffsetIndex are lazy indices that don't get created on disk until they are accessed for the first time. However, log recovery logic expects the presence of an offset index file on disk for each segment; otherwise, the segment is considered corrupted.
This PR introduces a forceFlushActiveSegment boolean for the log.flush function to allow the shutdown process to flush the empty active segment, which makes sure the offset index file exists.
Co-authored-by: Kowshik Prakasam <kowshik@gmail.com>
Reviewers: Jason Gustafson <jason@confluent.io>, Jun Rao <junrao@gmail.com>
In the fetch path, we check shouldLeaderThrottle regardless of whether the read is coming from a consumer or follower broker. This results in replication quota being applied to consumer fetches. This patch ensures that it is only applied to followers.
Reviewers: David Jacot <djacot@confluent.io>
The logic for log loading is encapsulated in `LogLoader`. Currently all the methods are static and we pass the parameters through a separate object `LogLoaderParams`. This patch simplifies this structure by turning `LogLoader` into a normal object and get rid of `LogLoaderParams`.
Reviewers: David Jacot <djacot@confluent.io>
Augments existing shutdown tests for KRaft. Adds the ability to update configs in KRaft tests,
and in both the ZK and KRaft cases to be able to update configs without losing the server's
log directory and data.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
The issue was quite subtle. It was due to a race for the `partitionMapLock` lock. `assertFetcherHasTopicId` would only succeed if it could acquire the lock before `processFetchRequest`. This PR refactors the test in order to make it more stable.
Reviewers: Justine Olshan <jolshan@confluent.io>, Jason Gustafson <jason@confluent.io>
Currently, KRaft does not support setting BROKER_LOGGER configs (it always fails.) Additionally,
there are several bugs in the handling of BROKER configs. They are not properly validated on the
forwarding broker, and the way we apply them is buggy as well. This PR fixes those issues.
KafkaApis: add support for doing validation and log4j processing on the forwarding broker. This
involves breaking the config request apart and forwarding only part of it. Adjust KafkaApisTest to
test the new behavior, rather than expecting forwarding of the full request.
MetadataSupport: remove MetadataSupport#controllerId since it duplicates the functionality of
MetadataCache#controllerId. Add support for getResourceConfig and maybeForward.
ControllerApis: log an error message if the handler throws an exception, just like we do in
KafkaApis.
ControllerConfigurationValidator: add JavaDoc.
Move some functions that don't involve ZK from ZkAdminManager to DynamicConfigManager. Move some
validation out of ZkAdminManager and into a new class, ConfigAdminManager, which is not tied to ZK.
ForwardingManager: add support for sending new requests, rather than just forwarding existing
requests.
BrokerMetadataPublisher: do not try to apply dynamic configurations for brokers other than the
current one. Log an INFO message when applying a new dynamic config, like we do in ZK mode. Also,
invoke reloadUpdatedFilesWithoutConfigChange when applying a new non-default BROKER config.
QuorumController: fix a bug in ConfigResourceExistenceChecker which prevented cluster configs from
being set. Add a test for this class.
Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>
The issue is that when `zkClient.getTopicIdsForTopics(Set(tp.topic)).get(tp.topic)` is called after the new controller is brought up, there is no guarantee that the controller has already written the topic id to the topic znode.
Reviewers: Jason Gustafson <jason@confluent.io>
If the user's `initTransactions` call times out, the user is expected to retry. However, the producer will continue retrying the `InitProducerId` request in the background. If it happens to return before the user retry of `initTransactions`, then the producer will raise an exception about an invalid state transition.
The patch fixes the issue by tracking the pending state transition until the user has acknowledged the operation's result. In the case of `initTransactions`, even if the `InitProducerId` returns in the background and the state changes, we can still retry the `initTransactions` call to obtain the result.
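For example, an application can safely retry a timed-out `initTransactions` call along these lines (illustrative snippet using the public producer API; the broker address and transactional id are placeholders):
```java
import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerConfig;
import org.apache.kafka.common.errors.TimeoutException;
import org.apache.kafka.common.serialization.StringSerializer;

public class InitTransactionsRetryExample {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        props.put(ProducerConfig.TRANSACTIONAL_ID_CONFIG, "my-transactional-id");
        props.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());
        props.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());

        try (KafkaProducer<String, String> producer = new KafkaProducer<>(props)) {
            while (true) {
                try {
                    producer.initTransactions();
                    break; // succeeded
                } catch (TimeoutException e) {
                    // With this patch, retrying no longer races with the background
                    // InitProducerId completion, so the retry is safe.
                }
            }
        }
    }
}
```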
Reviewers: David Jacot <djacot@confluent.io>
Implements KIP-788. The number of network threads can be set per listener using the following syntax:
listener.name.<listener>.num.network.threads=<num>
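For example, with hypothetical listener names, the override might look like this (shown as broker `Properties` purely for illustration):
```java
import java.util.Properties;

public class PerListenerNetworkThreadsExample {
    public static void main(String[] args) {
        Properties brokerProps = new Properties();
        brokerProps.put("listeners", "INTERNAL://:9092,EXTERNAL://:9093");
        brokerProps.put("num.network.threads", "3");                        // default for all listeners
        brokerProps.put("listener.name.external.num.network.threads", "8"); // override for EXTERNAL only
        brokerProps.forEach((k, v) -> System.out.println(k + "=" + v));
    }
}
```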
Reviewers: Tom Bentley <tbentley@redhat.com>, Andrew Eugene Choi <andrew.choi@uwaterloo.ca>, David Jacot <djacot@confluent.io>
The KRaft controller should validate that the clusterID matches before allowing a broker to register in
the cluster.
Reviewers: José Armando García Sancio <jsancio@gmail.com>
Updated: This PR will reset the generation ID on an ILLEGAL_GENERATION error, since the member ID is still valid.
=====
Call resetStateAndRejoin on a REBALANCE_IN_PROGRESS error in sync group, to avoid an out-of-date ownedPartition.
== JIRA description ==
In KAFKA-13406, we found that a user got stuck in rebalancing with the cooperative sticky assignor. The reason is that the "ownedPartition" is out of date, and it failed the cooperative assignment validation.
Investigating deeper, I found the root cause is that we didn't reset the generation and state after a sync group failure. In KAFKA-12983, we fixed the issue that onJoinPrepare was not called in the resetStateAndRejoin method, which caused the ownedPartition to not get cleared. But there's another case where the ownedPartition will be out of date. Here's an example:
consumer A joined and synced group successfully with generation 1
A new rebalance started with generation 2; consumer A joined successfully, but somehow consumer A didn't send out its sync group request immediately
Other consumers completed the sync group successfully in generation 2, except consumer A.
After consumer A sent out its sync group request, a new rebalance started with generation 3, so consumer A got a REBALANCE_IN_PROGRESS error in the sync group response
On receiving REBALANCE_IN_PROGRESS, we re-join the group with generation 3, but with the assignment (ownedPartition) from generation 1.
So now we have sent an out-of-date ownedPartition, with unexpected results
We might want to call resetStateAndRejoin when a RebalanceInProgressException happens in sync group, because getting a sync group error means the join group passed, and the other consumers (and the leader) might have already completed this round of rebalance. The assignment distribution this consumer has is already out of date.
Reviewers: David Jacot <djacot@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
Allow the leader epoch to be re-assigned to the new value from the Metadata response if `oldTopicId` is not present in the cache. This is needed because `oldTopicId` is removed from the cache if the topic gets deleted but the leader epoch is not removed. Hence, metadata for the newly recreated topic won't be accepted unless we allow `oldTopicId` to be null.
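A condensed sketch of the relaxed acceptance rule (simplified; not the actual metadata cache code):
```java
// Illustrative only: accept the leader epoch from the Metadata response when no old
// topic id is cached (e.g. the topic was deleted and recreated) or the topic id changed;
// otherwise require the epoch to be monotonically non-decreasing.
final class EpochUpdateSketch {
    static boolean shouldAcceptNewEpoch(String oldTopicId, String newTopicId,
                                        int currentEpoch, int newEpoch) {
        if (oldTopicId == null || !oldTopicId.equals(newTopicId)) {
            return true;
        }
        return newEpoch >= currentEpoch;
    }
}
```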
Reviewers: Jason Gustafson <jason@confluent.io>, David Jacot <djacot@confluent.io>
This patch tightens the configuration checks related to KRaft configs by adding the following constraints (a hypothetical configuration consistent with them is sketched after the list):
* `control.plane.listener.name` is confirmed to be empty in KRaft mode whenever a config object is created as opposed to later when the broker is given the config and tries to start.
* `controller.listener.names` is required to be empty for the non-KRaft (i.e. ZooKeeper) case. A ZooKeeper-based cluster that sets this config will fail to restart until this config is removed.
* There must be no advertised listeners when running just a KRaft controller (i.e. when `process.roles=controller`). This means neither `listeners` nor `advertised.listeners` (if the latter is explicitly defined) can contain a listener that does not also appear in `controller.listener.names`.
* When running a KRaft broker (i.e. when `process.roles=broker` or `process.roles=broker,controller`), advertised listeners (which was already checked to be non-empty via the check that the inter-broker listener appear there) must not include any listeners appearing in `controller.listener.names`.
* When running a KRaft controller (i.e. when `process.roles=controller` or `process.roles=broker,controller`) `controller.listener.names` must be non-empty and every one must appear in `listeners`
* When running just a KRaft broker (i.e. when `process.roles=broker`) `controller.listener.names` must be non-empty and none of them can appear in `listeners`. This was indirectly checked previously, but the indirect checks did not catch all cases.
* When running just a KRaft broker we log a warning if more than one entry appears in `controller.listener.names` because only the first entry is used.
* We also map configured controller listener names to the `PLAINTEXT` security protocol by default provided that the security mapping is empty and no other security protocols are in use.
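A hypothetical broker-only configuration consistent with these constraints (values are placeholders, shown as `Properties` purely for illustration): the controller listener is named but neither bound nor advertised by the broker.
```java
import java.util.Properties;

public class KraftBrokerListenersExample {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("process.roles", "broker");
        props.put("node.id", "2");
        props.put("controller.quorum.voters", "1@localhost:9093");
        props.put("listeners", "PLAINTEXT://localhost:9092");
        props.put("advertised.listeners", "PLAINTEXT://localhost:9092");
        // Named here, but intentionally absent from listeners/advertised.listeners;
        // with an empty security map it is mapped to PLAINTEXT by default.
        props.put("controller.listener.names", "CONTROLLER");
        props.forEach((k, v) -> System.out.println(k + "=" + v));
    }
}
```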
Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, Jason Gustafson <jason@confluent.io>
Require that topics exist before topic configurations can be created for them.
Merge the code from ConfigurationControlManager#checkConfigResource into
ControllerConfigurationValidator to avoid duplication.
Add KRaft support to DynamicConfigChangeTest.
Split out tests in DynamicConfigChangeTest that don't require a cluster into
DynamicConfigChangeUnitTest to save test time.
Reviewers: David Arthur <mumrah@gmail.com>
Kafka prints duplicate configuration information in the logs during startup; this repeated printing can confuse users. It is better to add log messages before and after the repeated configuration information.
Reviewers: Guozhang Wang <wangguoz@gmail.com>
For CreateTopics, fix a bug where if one topic in a batch failed to be created, they would all fail with
the same error code. Make the error message for TOPIC_ALREADY_EXISTS consistent with the ZK-based
code by including the topic name.
For IncrementalAlterConfigs, before we allow topic configurations to be set, we should check that
they are valid. (This also applies to newly created topics.) IncrementalAlterConfigs should ignore
non-null payloads for DELETE operations. Previously we would return an error in these cases.
However, this is not compatible with the old ZK-based code, which ignores the payload in these
cases.
Reviewers: José Armando García Sancio <jsancio@gmail.com>, Jason Gustafson <jason@confluent.io>
If the JAAS configuration does not contain a Client section for ZK clients, an auth failure event is generated. If this occurs after the connection is set up in the controller, we schedule reinitialize(), which causes the controller to resign. In the case where SASL is not mandatory and the connection is alive, the controller maintains the current session and doesn't register its watchers, leaving it in a bad state.
Reviewers: Jun Rao <junrao@gmail.com>
Change the snapshot API so that SnapshotWriter and SnapshotReader are interfaces. Rename the existing SnapshotWriter and SnapshotReader types and have them implement the interfaces introduced by this commit.
Co-authored-by: loboxu <loboxu@tencent.com>
Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>
We now use 2MB as with the other test harnesses.
Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, Colin Patrick McCabe <cmccabe@confluent.io>, Luke Chen <showuon@gmail.com>
Author: Colin P. McCabe <cmccabe@confluent.io>
Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, Sherzod Mamadaliev <mamadaliev@yahoo.com>
Closes #11457 from cmccabe/guard_against_exit
When creating snapshots, controllers generate a ProducerIdsRecord indicating the highest producer ID
that has been used so far. Brokers should generate the same record, so that the snapshots can be
compared.
Also, fix a bug in MetadataDelta#finishSnapshot. The current logic will produce the wrong result if
all objects of a certain type are completely removed in the snapshot. The fix is to unconditionally
create each delta object.
Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>
This patch ensures that SocketChannel in Acceptor#accept is closed if an IOException is thrown while the socket is configured.
Reviewers: Luke Chen <showuon@gmail.com>, David Jacot <djacot@confluent.io>
Leader election and resignation logic for the Group Coordinator and Transaction Coordinator is the
same. Share this logic by refactoring this code into a method.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
The KRaft brokers should not list the names in `controller.listener.names` in `listeners` because brokers do not bind to those endpoints. This commit also removes the extra changes to the security protocol map because the `PLAINTEXT` protocol doesn't require additional configuration.
To fully support all of the security protocol configuration additional changes to `QuorumTestHarness` are needed. Those changes can be made when migrating integration tests that need this functionality.
Reviewers: Ron Dagostino <rdagostino@confluent.io>, Jason Gustafson <jason@confluent.io>
With KAFKA-13102, we added topic IDs to the InitialFetchState and the PartitionFetchState in order to send fetch requests using topic IDs when IBP is 3.1.
However, there are some cases where we could initially send topic IDs from the controller and then no longer do so (the controller changes to an IBP < 2.8). If we do not remove the topic IDs from the PartitionFetchState and one broker is still on IBP 3.1, it will try to send a version 13 fetch request to brokers that no longer have topic IDs in the metadata cache. This could leave the cluster in a state where it is unable to fetch from these partitions.
This patch removes the topic IDs from the PartitionFetchState if the log contains a topic ID but the request does not. This means that we will always handle a LeaderAndIsr request if there is no ID in the request but an ID in the log.
Such a state should be transient because we are either
* upgrading the cluster and somehow switched between a new IBP controller and an old one --> and will eventually have all new IBP controllers/brokers.
* downgrading the cluster --> will eventually have all old IBP controllers/brokers and will restart the broker/delete the partition metadata file for them.
Reviewers: David Jacot <djacot@confluent.io>
The sasl.oauthbearer.jwks.endpoint.retry.backoff.ms and sasl.oauthbearer.jwks.endpoint.retry.backoff.max.ms configuration options were added to the SaslConfig class but their default values were not added to KafkaConfig. As a result, when the OAuth validation feature is enabled in the broker and those two configuration values aren't explicitly provided by the user, the broker exits. This patch fixes the issue by defining them in the KafkaConfig class.
Reviewers: David Jacot <djacot@confluent.io>
With the changes for topic IDs, we have a different flow. When a broker receives a request, it uses a map to convert the topic ID to topic names. If the topic ID is not found in the map, we return a top level error and close the session. This decision was motivated by the difficulty of storing “unresolved” partitions in the session. In earlier iterations we stored an “unresolved” partition object in the cache, but it was somewhat hard to reason about and required extra logic to try to resolve the topic ID on each incremental request and add it to the session. It also required extra logic to forget the topic (either by topic ID if the topic name was never known, or by topic name if it was finally resolved when we wanted to remove it from the session).
One helpful simplifying factor is that we only allow one type of request (uses topic ID or does not use topic ID) in the session. That means we can rely on a session continuing to have the same information. We don’t have to worry about converting topics only known by name to topic ID for a response and we won’t need to convert topics only known by ID to name for a response.
This PR introduces a change to store the "unresolved partitions" in the cached partition object. If a version 13+ request is sent with a topic ID that is unknown, a cached partition will be created with that fetch request data and a null topic name. On subsequent incremental requests, unresolved partitions may be resolved with the new IDs found in the metadata cache. When handling the request, getting all partitions will return a TopicIdPartition object that will be used to handle the request and build the response. Since we can rely on only one type of request (with IDs or without), the cached partitions map will have different keys depending on what fetch request version is being used.
This PR involves changes both in FetchSessionHandler and FetchSession. Some major changes are outlined below.
1. FetchSessionHandler: Forgetting a topic and adding a new topic with the same name - We may have a case where there is a topic foo with ID 1 in the session. Upon a subsequent metadata update, we may have topic foo with ID 2. This means that topic foo has been deleted and recreated. When sending fetch requests version 13+ we will send a request to add foo ID 2 to the session and remove foo ID 1. Otherwise, we will fall back to the same behavior for versions 12 and below
2. FetchSession: Resolving in Incremental Sessions - Incremental sessions contain two distinct sets of partitions. Partitions that are sent in the latest request that are new/updates/forgotten partitions and the partitions already in the session. If we want to resolve unknown topic IDs we will need to handle both cases.
* Partitions in the request - These partitions are either new or updating/forgetting previous partitions in the session. The new partitions are trivial. We either have a resolved partition or create a partition that is unresolved. For the other cases, we need to be a bit more careful.
* For updated partitions we have a few cases – keep in mind, we may not programmatically know if a partition is an update:
1. partition in session is resolved, update is resolved: trivial
2. partition in session is unresolved, update is unresolved: in code, this is equivalent to the case above, so trivial as well
3. partition in session is unresolved, update is resolved: this means the partition in the session does not have a name, but the metadata cache now contains the name – to fix this we can check if there exists a cached partition with the given ID and update it both with the partition update and with the topic name.
4. partition in session is resolved, update is unresolved: this means the partition in the session has a name, but the update was unable to be resolved (ie, the topic is deleted) – this is the odd case. We will look up the partition using the ID. We will find the old version with a name but will not replace the name. This will lead to an UNKNOWN_TOPIC_OR_PARTITION or INCONSISTENT_TOPIC_ID error which will be handled with a metadata update. Likely a future request will forget the partition, and we will be able to do so by ID.
5. Two partitions in the session have IDs, but they are different: only one topic ID should exist in the metadata at a time, so likely only one topic ID is in the fetch set. The other one should be in the toForget. We will be able to remove this partition from the session. If for some reason, we don't try to forget this partition — one of the partitions in the session will cause an inconsistent topic ID error and the metadata for this partition will be refreshed — this should result in the old ID being removed from the session. This should not happen if the FetchSessionHandler is correctly in sync.
* For the forgotten partitions we have the same cases:
1. partition in session is resolved, forgotten is resolved: trivial
2. partition in session is unresolved, forgotten is unresolved: in code, this is equivalent to the case above, so trivial as well
3. partition in session is unresolved, forgotten is resolved: this means the partition in the session does not have a name, but the metadata cache now contains the name – to fix this we can check if there exists a cached partition with the given ID and try to forget it before we check the resolved name case.
4. partition in session is resolved, forgotten is unresolved: this means the partition in the session has a name, but the forgotten partition was unable to be resolved (i.e., the topic is deleted). We will look up the partition using the ID. We will find the old version with a name and be able to delete it.
5. both partitions in the session have IDs, but they are different: This should be the same case as described above. If we somehow do not have the ID in the session, no partition will be removed. This should not happen unless the Fetch Session Handler is out of sync.
* Partitions in the session - there may be some partitions in the session already that are unresolved. We can resolve them in forEachPartition using a method that checks if the partition is unresolved and tries to resolve it using a topicName map from the request. The partition will be resolved before the function using the cached partition is applied.
Reviewers: David Jacot <djacot@confluent.io>
This test was disabled in af8100b94f. The reason the test was failing is that it assumes that the reference to `servers` can be mutated directly. The implementation in `IntegrationTestHarness` is intended to allow this by returning a mutable buffer, but the implementation actually returns a copy of the underlying collection. This caused the test case to create multiple `KafkaServer` instances instead of one as intended because it was modifying the copy. This led to the broker registration failure.
Reviewers: David Jacot <djacot@confluent.io>
This patch fixes a bug in `DynamicBrokerConfig` which causes some configuration changes to be ignored. In particular, the bug is the result of the reference to the old configuration getting indirectly mutated prior to the call to `BrokerReconfigurable.reconfigure`. This causes the first dynamic configuration update to pass effectively the same configuration as both `oldConfig` and `newConfig`. In cases such as in `DynamicThreadPool`, the update is ignored because the old configuration value matches the new configuration value.
This bug only affects KRaft. It is protected in the zk broker by the call to `DynamicBrokerConfig.initialize()`, which overwrites the stored reference to the original configuration. The patch fixes the problem by ensuring that `initialize()` is also invoked in KRaft when `BrokerServer` starts up.
Reviewers: David Jacot <djacot@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>
Change ZooKeeperTestHarness to QuorumTestHarness so that integration tests which inherit from
this class can test Kafka in both ZK and KRaft mode. Test cases which do this can specify the
modes they support by including a ParameterizedTest annotation before each test case, like the
following:
@ParameterizedTest
@ValueSource(strings = Array("zk", "kraft"))
def testValidCreateTopicsRequests(quorum: String): Unit = { ... }
For each value that is specified here (zk, kraft), the test case will be run once in the
appropriate mode. So the test shown above is run twice. This allows integration tests to be
incrementally converted over to support KRaft mode, rather than rewritten to support it. For
now, test cases which do not specify a quorum argument will continue to run only in ZK mode.
JUnit5 makes the quorum annotation visible in the TestInfo object which each @BeforeEach
function in a test can optionally take. Therefore, this PR converts over the setUp function of
the quorum base class, plus every derived class, to take a TestInfo argument. The TestInfo
object gets "passed up the chain" to the base class, where it determines which quorum type we
create (ZK or KRaft). In a few cases, I discovered test cases inheriting from the test harness
that had more than one @BeforeEach function. Because the JUnit5 framework does not define the
order in which @BeforeEach hooks are run, I changed these to overload setUp() instead, to avoid
undefined behavior.
The general approach taken here is to make as much as possible work with KRaft, but to leave some
things as ZK-only when appropriate. For example, a test that explicitly requests an AdminZkClient
object will get an exception if it is running in KRaft mode. Similarly, tests which explicitly
request KafkaServer rather than KafkaBroker will get an exception when running in KRaft mode.
As a proof of concept, this PR converts over kafka.api.MetricsTest to support KRaft.
This PR also renames the quorum controller event handler thread to include the text
"QuorumControllerEventHandler". This allows QuorumTestHarness to check for hanging quorum
controller threads, as it does for hanging ZK-based controller threads.
Finally, ConsumerBounceTest#testRollingBrokerRestartsWithSmallerMaxGroupSizeConfigDisruptsBigGroup
caused many failing test runs. Therefore, I disabled it here and filed KAFKA-13421 to fix the
test logic to be more reliable.
Reviewers: Jason Gustafson <jason@confluent.io>, Igor Soarez <soarez@apple.com>
This task is to provide a concrete implementation of the interfaces defined in KIP-255 to allow Kafka to connect to an OAuth/OIDC identity provider for authentication and token retrieval. While KIP-255 provides an unsecured JWT example for development, this will fill in the gap and provide a production-grade implementation.
The OAuth/OIDC work will allow out-of-the-box configuration by any Apache Kafka users to connect to an external identity provider service (e.g. Okta, Auth0, Azure, etc.). The code will implement the standard OAuth client credentials grant type.
The proposed change is largely composed of a pair of AuthenticateCallbackHandler implementations: one to login on the client and one to validate on the broker.
See the following for more detail:
KIP-768
KAFKA-13202
Reviewers: Yi Ding <dingyi.zj@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>
Put ZkMetadataCache in the kafka.server.metadata package rather than the kafka.server package, so
that its package is consistent with its position in the source directory hierarchy.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
Make TestUtils usable for KRaft mode by using KafkaBroker instead of KafkaServer where appropriate,
and adding some alternate functions that use AdminClient instead of ZooKeeper.
Reviewers: Jason Gustafson <jason@confluent.io>
When loading a snapshot, the broker's BrokerMetadataListener was using the batch's append time, offset
and epoch. These are not the same as the append time, offset and epoch from the log. This PR fixes
it to instead use the lastContainedLogTimeStamp, lastContainedLogOffset and lastContainedLogEpoch
from the SnapshotReader.
This PR refactors the MetadataImage and MetadataDelta to include an offset and epoch. It also swaps
the order of the arguments for ReplicaManager.applyDelta, in order to be more consistent with
MetadataPublisher.publish.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
Avoid O(N) behavior in KRaftMetadataCache#topicNamesToIds and
KRaftMetadataCache#topicIdsToNames by returning a map subclass that
exposes the TopicsImage data structures without copying them.
Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>
Previously, we used the metadata cache to determine whether or not to use topic IDs. Unfortunately, metadata cache updates with ZK controllers are in a separate request and may be too slow for the fetcher thread. This results in switching between topic names and topic IDs for topics that could just use IDs.
This patch adds topic IDs to FetcherState created in LeaderAndIsr requests. It also supports updating this state for follower threads as soon as a LeaderAndIsr request provides a topic ID.
We've opted to only update replica fetcher threads. AlterLogDir threads will use either topic name or topic ID depending on what was present when they were created.
Reviewers: David Jacot <djacot@confluent.io>
This also fixes KAFKA-13070.
We have seen a problem caused by shutting down the scheduler before shutting down LogManager.
When LogManager was closing partitions one by one, the scheduler called to delete old segments due to retention. However, the old segments could have been closed by the LogManager, which caused an exception and subsequently marked the logdir as offline. As a result, the broker didn't flush the remaining partitions and didn't write the clean shutdown marker. Ultimately the broker took hours to recover the log during restart.
This PR essentially reverts #10538
Reviewers: Ismael Juma <ismael@juma.me.uk>, Kowshik Prakasam <kprakasam@confluent.io>, Jun Rao <junrao@gmail.com>
The `LastTimestamp` field is useful because its value is present even when there are no data batches written by a given producerId.
Reviewers: David Jacot <djacot@confluent.io>
Add support for CreateTopicsPolicy and AlterConfigsPolicy when running in KRaft mode.
Reviewers: David Arthur <mumrah@gmail.com>, Niket Goel <ngoel@confluent.io>
Internal topic configs with default values are not included in the response of CreateTopic/DescribeTopic. However, if they are explicitly set, they will be included in the response.
Reviewers: Jun Rao <junrao@gmail.com>
This patch fixes a deadlock when incrementing the high watermark after the synchronous zk ISR modification happens. The main difference is that we prevent the callback from executing while under the leader and ISR lock. The deadlock bug was introduced in https://github.com/apache/kafka/pull/11245.
Reviewers: David Jacot <djacot@confluent.io>
The ReplicaManager, LogManager, and KafkaApis class all have many
constructor parameters. It is often difficult to add or remove a
parameter, since there are so many locations that need to be updated. In
order to address this problem, we should use named parameters when
constructing these objects from Scala code. This will make it easy to
add new optional parameters without modifying many test cases. It will
also make it easier to read git diffs and PRs, since the parameters will
have names next to them. Since Java does not support named parameters,
this PR adds several Builder classes which can be used to achieve the
same effect.
ReplicaManager also had a secondary constructor, which this PR removes.
The function of the secondary constructor was just to provide some
default parameters for the main constructor. However, it is simpler just
to actually use default parameters.
Reviewers: David Arthur <mumrah@gmail.com>
This PR aims to remove tombstones that persist indefinitely due to low throughput. Previously, deleteHorizon was calculated from the segment's last modified time.
In this PR, the deleteHorizon will now be tracked in the baseTimestamp of RecordBatches. After the first cleaning pass that finds a record batch with tombstones, the record batch is recopied with the deleteHorizon flag set and a new baseTimestamp that is the deleteHorizonMs. The records in the batch are rebuilt with relative timestamps based on the deleteHorizonMs that is recorded. Later cleaning passes will be able to remove tombstones more accurately on their deleteHorizon due to the individual time tracking on record batches.
KIP 534: https://cwiki.apache.org/confluence/display/KAFKA/KIP-534%3A+Retain+tombstones+and+transaction+markers+for+approximately+delete.retention.ms+milliseconds
Co-authored-by: Ted Yu <yuzhihong@gmail.com>
Co-authored-by: Richard Yu <yohan.richard.yu@gmail.com>
This patch fixes several problems with the `ElectLeaders` API in KRaft:
- `KafkaApis` did not properly forward this request type to the controller.
- `ControllerApis` did not handle the request type.
- `ElectLeadersRequest.getErrorResponse` may raise NPE when `TopicPartitions` is null.
- Controller should not do preferred election if `ElectLeaders` specifies `UNCLEAN` election.
- Controller should not do unclean election if `ElectLeaders` specifies `PREFERRED` election.
- Controller should use proper error codes to handle cases when desired leader is unavailable or when no election is needed because a desired leader is already elected.
- When election for all partitions is requested (indicated with null `TopicPartitions` field), the response should not return partitions for which no election was necessary.
In addition to extending the unit test coverage in `ReplicationControlManagerTest`, I have also converted `LeaderElectionCommandTest` to use KRaft.
Reviewers: dengziming <swzmdeng@163.com>, José Armando García Sancio <jsancio@users.noreply.github.com>, David Arthur <mumrah@gmail.com>
Some plugins make use of KafkaConfig#originals rather than the
KafkaConfig object. We should ensure that these plugins see the
correct value for broker.id if the broker is running in KRaft mode and
node.id has been configured, but not broker.id.
This PR does this by ensuring that both node.id and broker.id are set in
the originals map if either one is set. We also check that they are set
to the same value in KafkaConfig#validateValues.
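A simplified sketch of that propagation (not the actual KafkaConfig code):
```java
import java.util.HashMap;
import java.util.Map;

// Illustrative only: make sure plugins reading the originals map see both keys, and
// reject configurations where the two ids disagree.
final class BrokerIdPropagationSketch {
    static Map<String, Object> withBrokerIdAndNodeId(Map<String, Object> originals) {
        Map<String, Object> props = new HashMap<>(originals);
        Object nodeId = props.get("node.id");
        Object brokerId = props.get("broker.id");
        if (nodeId != null && brokerId == null) {
            props.put("broker.id", nodeId);
        } else if (brokerId != null && nodeId == null) {
            props.put("node.id", brokerId);
        } else if (nodeId != null && !String.valueOf(nodeId).equals(String.valueOf(brokerId))) {
            throw new IllegalArgumentException("broker.id and node.id must be set to the same value");
        }
        return props;
    }
}
```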
Co-author: Ron Dagostino <rdagostino@confluent.io>
Avoid using the non-public API KafkaFutureImpl in the Admin client's `*Result` class constructors.
This is particularly problematic for `DescribeConsumerGroupsResult` which currently has a
public constructor. For the other classes the rationale is simply consistency with the majority of
the `*Result` classes.
Reviewers: Ismael Juma <ismael@juma.me.uk>, David Jacot <djacot@confluent.io>, Luke Chen <showuon@gmail.com>
This patch adds the `ActiveBrokerCount` and the `FencedBrokerCount` metrics to the ZK controller. Note that `FencedBrokerCount` is always set to zero in the ZK controller.
Reviewers: Jason Gustafson <jason@confluent.io>
`ReplicationTest.test_replication_with_broker_failure` in KRaft mode sometimes fails with the following error in the log:
```
[2021-08-31 11:31:25,092] ERROR [ReplicaFetcher replicaId=1, leaderId=2, fetcherId=0] Unexpected error occurred while processing data for partition __consumer_offsets-1 at offset 31727 (kafka.server.ReplicaFetcherThread)
java.lang.IllegalStateException: Offset mismatch for partition __consumer_offsets-1: fetched offset = 31727, log end offset = 31728.
    at kafka.server.ReplicaFetcherThread.processPartitionData(ReplicaFetcherThread.scala:194)
    at kafka.server.AbstractFetcherThread.$anonfun$processFetchRequest$8(AbstractFetcherThread.scala:545)
    at scala.Option.foreach(Option.scala:437)
    at kafka.server.AbstractFetcherThread.$anonfun$processFetchRequest$7(AbstractFetcherThread.scala:533)
    at kafka.server.AbstractFetcherThread.$anonfun$processFetchRequest$7$adapted(AbstractFetcherThread.scala:532)
    at kafka.utils.Implicits$MapExtensionMethods$.$anonfun$forKeyValue$1(Implicits.scala:62)
    at scala.collection.convert.JavaCollectionWrappers$JMapWrapperLike.foreachEntry(JavaCollectionWrappers.scala:359)
    at scala.collection.convert.JavaCollectionWrappers$JMapWrapperLike.foreachEntry$(JavaCollectionWrappers.scala:355)
    at scala.collection.convert.JavaCollectionWrappers$AbstractJMapWrapper.foreachEntry(JavaCollectionWrappers.scala:309)
    at kafka.server.AbstractFetcherThread.processFetchRequest(AbstractFetcherThread.scala:532)
    at kafka.server.AbstractFetcherThread.$anonfun$maybeFetch$3(AbstractFetcherThread.scala:216)
    at kafka.server.AbstractFetcherThread.$anonfun$maybeFetch$3$adapted(AbstractFetcherThread.scala:215)
    at scala.Option.foreach(Option.scala:437)
    at kafka.server.AbstractFetcherThread.maybeFetch(AbstractFetcherThread.scala:215)
    at kafka.server.AbstractFetcherThread.doWork(AbstractFetcherThread.scala:197)
    at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:99)
[2021-08-31 11:31:25,093] WARN [ReplicaFetcher replicaId=1, leaderId=2, fetcherId=0] Partition __consumer_offsets-1 marked as failed (kafka.server.ReplicaFetcherThread)
```
The issue is due to a race condition in `ReplicaManager#applyLocalFollowersDelta`. The `InitialFetchState` is created and populated before the partition is removed from the fetcher threads. This means that the fetch offset of the `InitialFetchState` could be outdated when the fetcher threads are re-started because the fetcher threads could have incremented the log end offset in between.
The patch fixes the issue by removing the partitions from the replica fetcher threads before creating the `InitialFetchState` for them.
Reviewers: Jason Gustafson <jason@confluent.io>
We restore the 3.4.x/3.5.x behavior unless the caller has set the property (note that ZKConfig
auto configures itself if certain system properties have been set).
I added a unit test that fails without the change and passes with it.
I also refactored the code to streamline the way we handle parameters passed to
KafkaZkClient and ZooKeeperClient.
See https://github.com/apache/zookeeper/pull/1129 for the details on why the behavior
changed in 3.6.0.
Credit to @rondagostino for finding and reporting this issue.
Reviewers: David Jacot <djacot@confluent.io>
The controller can skip sending updateMetadataRequest during the broker failure callback if there are offline partitions and the deleted brokers don't host any partitions.
Reviewers: Jun Rao <junrao@gmail.com>
After a topic is deleted, the topic is marked for deletion, and creating a topic with the same name throws a "topic already exists" exception. The error should instead indicate that the topic is marked for deletion.
Reviewers: Guozhang Wang <wangguoz@gmail.com>
MINOR: Refactored the existing CheckpointFile in the core module and moved it to the server-common module.
CheckpointFile was refactored into a Java class in the server-common module and is reused by LeaderCheckpointFile and OffsetCheckpointFile.
This will be used by CommittedOffsetsFile which checkpoints remote log metadata partitions with respective offsets in the default RemoteLogMetadataManager implementation.
Existing tests are available for LeaderCheckpointFile, OffsetCheckpointFile.
Reviewers: Jun Rao <junrao@gmail.com>
The original code uses a RemoteLogManagerConfig class to store KIP-405 configs and adds three configs to LogConfig. This makes the code complicated and developers may be confused.
This PR allows us to access RemoteLogManagerConfig from KafkaConfig and do the same for LogConfig. Kafka developers will see the same interface for the KIP-405 configs. After this change, if we want to read remoteStorageEnable we should use LogConfig.tieredLogConfig.remoteStorageEnable instead of LogConfig.remoteStorageEnable. The same for localRetentionMs and localRetentionBytes. If we want to read configs in RemoteLogManagerConfig, we should use KafkaConfig.tieredKafkaConfig.xxx.
Reviewers: Satish Duggana <satishd@apache.org>, Kowshik Prakasam <kprakasam@confluent.io>, Jun Rao <junrao@gmail.com>
When debugging issues with partition state, it's very useful to know the zkVersion that was written. This patch adds the zkVersion of LeaderAndIsr to logging in a few more places.
This patch refactors `ReplicaManager#becomeLeaderOrFollower` to avoid having to re-iterate over all the partitions to determine which ones should become leaders and which ones should become followers.
The patch also refactors how partitions are marked as offline when the log can't be created. Before the patch, we were iterating over all the partitions in the request or in the delta to mark them as offline if the log was not present. Now, we mark them as failed directly if the log cannot be created.
Reviewers: Luke Chen <showuon@gmail.com>, Jason Gustafson <jason@confluent.io>
A few small logging improvements:
- Only print error when it is not NONE
- The full list of remaining partitions is printed only at debug level
- Only backoff and print retry logging if there are remaining retries
Reviewers: Luke Chen <showuon@gmail.com>, David Jacot <djacot@confluent.io>
Small locking improvement to drop the group metadata lock before invoking the response callback in `GroupCoordinator#handleHeartbeat`.
Reviewers: David Jacot <djacot@confluent.io>
This patch ensures that ongoing compaction is aborted when `compact` is removed from the `cleanup.policy` of a topic.
Reviewers: Lucas Bradstreet <lucas@confluent.io>, Jun Rao <junrao@gmail.com>
Removes assertion added in #10471. It's unsafe to assert that
there are partition movements ongoing for some of the tests in
the suite because partitions in some of the tests have 0 data,
which may complete reassignment before `verify` can run.
Tests pass locally.
Reviewers: Luke Chen <showuon@gmail.com>, Ismael Juma <ismael@juma.me.uk>
The BrokerState metric always has a value of 0, for NOT_RUNNING, in KRaft
clusters. This patch fixes it and adds a test.
Reviewers: Ismael Juma <ismael@juma.me.uk>
After we have shrunk the ISR, we have an opportunity to advance the high watermark. We do this currently in `maybeShrinkIsr` after the synchronous update through ZK. For the `AlterIsr` path, however, we cannot rely on this call since the request is sent asynchronously. Instead we should attempt to advance the high watermark in the callback when the `AlterIsr` response returns successfully.
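Roughly, the intent is the following; the types and method names below are simplified stand-ins, not the actual `Partition` API:
```scala
object AlterIsrCallbackSketch {
  // Hypothetical, simplified partition stand-in; the real Partition class is far richer.
  class Partition {
    def applyIsrShrink(newIsr: Set[Int]): Unit = ()
    def maybeIncrementHighWatermark(): Unit = ()
  }

  // On a successful AlterIsr response, apply the shrunken ISR and then try to
  // advance the high watermark, since the smaller ISR may already satisfy it.
  def onAlterIsrResponse(partition: Partition, error: Option[Throwable], newIsr: Set[Int]): Unit =
    error match {
      case None =>
        partition.applyIsrShrink(newIsr)
        partition.maybeIncrementHighWatermark()
      case Some(_) =>
        () // leave state unchanged; the ISR change will be retried
    }
}
```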
Reviewers: David Jacot <djacot@confluent.io>
Stop the replica and resign the coordinators when a replica gets reassigned away from a topic partition.
1. Implement localChanges in TopicsDelta and TopicDelta to return all of the partitions that were deleted, became leader and became follower for the given broker id.
2. Add tests for TopicsDelta::localChanges
3. Resign coordinators that were moved away from the consumer offset and transaction topic partitions.
4. Add replica manager tests for testing reassignment of replicas and removal of topic.
5. Add a new type LocalReplicaChanges that encapsulates the topic partitions that were deleted, became leader, or became follower.
Reviewers: Jun Rao <junrao@gmail.com>
AbstractFetcherThread#truncateOnFetchResponse is used with IBP 2.7 and above to truncate partitions based on diverging epoch returned in fetch responses. Truncation should only be performed for partitions that are still owned by the fetcher and this check should be done while holding partitionMapLock to ensure that any partitions removed from the fetcher thread are not truncated. Truncation will be performed by any new fetcher that owns the partition when it restarts fetching.
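A simplified sketch of the locking discipline, with hypothetical stand-ins for the fetcher's partition state:
```scala
import java.util.concurrent.locks.ReentrantLock
import scala.collection.mutable

object TruncateOnFetchSketch {
  case class EpochEndOffset(leaderEpoch: Int, endOffset: Long)

  // Hypothetical stand-ins for the fetcher thread's state.
  val partitionMapLock = new ReentrantLock()
  val partitionStates = mutable.Map.empty[String, Long] // partition -> fetch offset

  def truncate(tp: String, divergingEpoch: EpochEndOffset): Unit = ()

  // Only truncate partitions the fetcher still owns, and do it while holding
  // partitionMapLock so a concurrent removal cannot race with the truncation.
  def truncateOnFetchResponse(divergingEpochs: Map[String, EpochEndOffset]): Unit = {
    partitionMapLock.lock()
    try {
      divergingEpochs.foreach { case (tp, epochEndOffset) =>
        if (partitionStates.contains(tp))
          truncate(tp, epochEndOffset)
      }
    } finally {
      partitionMapLock.unlock()
    }
  }
}
```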
Reviewers: David Jacot <djacot@confluent.io>, Jason Gustafson <jason@confluent.io>
When the high watermark is contained in a non-active segment, we are not correctly bounding it by the hwm. This means that uncommitted records may overwrite committed data. I've separated out the bounding point tests to check the hwm case in addition to the existing active segment case.
Reviewers: Jun Rao <junrao@gmail.com>
In this PR, I've renamed kafka.log.Log to kafka.log.UnifiedLog. With the advent of KIP-405, going forward the existing Log class would present a unified view of local and tiered log segments, so we rename it to UnifiedLog. The motivation for this PR is also the same as outlined in this design document: https://docs.google.com/document/d/1dQJL4MCwqQJSPmZkVmVzshFZKuFy_bCPtubav4wBfHQ/edit.
This PR is a follow-up to #10280 where we had refactored the Log layer introducing a new kafka.log.LocalLog class.
Note: the Log class name had to be hardcoded to ensure metrics are defined under the Log class (for backwards compatibility). Please refer to the newly introduced UnifiedLog.metricName() method.
Reviewers: Cong Ding <cong@ccding.com>, Satish Duggana <satishd@apache.org>, Jun Rao <junrao@gmail.com>
When processing the topics delta, make sure that the replica manager partition state and replica fetcher state matches the information included in the topic delta. Also ensure that delayed operations are processed after the follower state change has been made since that is what allows them to be completed.
Reviewers: Jason Gustafson <jason@confluent.io>
Validate that KRaft controllers are members of the KRaft quorum, and non-controllers are not.
This validation assumes that controllers and brokers have the same ID only when they are
co-located.
Reviewers: David Arthur <mumrah@gmail.com>, José Armando García Sancio <jsancio@gmail.com>, Luke Chen <showuon@gmail.com>
Most of [KAFKA-13132](https://issues.apache.org/jira/browse/KAFKA-13132) has been resolved, but there is one part of one case not covered.
From the ticket:
`2. We only assign the topic ID when we are associating the log with the partition in replicamanager for the first time`
We covered the case where the log already exists when the leader epoch is _equal_ (i.e., no updates besides the topic ID), but we don't cover the update case where the leader epoch is bumped and we already have the log associated with the partition.
This PR ensures we correctly assign topic ID in the makeLeaders/Followers path when the log already exists.
I've also added a test for the bumped leader epoch scenario.
Reviewers: Jason Gustafson <jason@confluent.io>
When dealing with the default resource, BrokerMetadataPublisher should translate its name from the empty
string (KRaft convention) to "<default>" (ZK convention). In the long term, we should move from
using a string for this to using an Option[String].
Reviewers: Colin P. McCabe <cmccabe@apache.org>
Controlled shutdown in KRaft is signaled through a heartbeat request with the `shouldShutDown` flag set to true. When we begin controlled shutdown, we should immediately schedule the next heartbeat instead of waiting for the next periodic heartbeat. This allows the broker to shutdown more quickly.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
This patch improves the return type for `scheduleAppend` and `scheduleAtomicAppend`. Previously we were using a `Long` value and using both `null` and `Long.MaxValue` to distinguish between different error cases. In this PR, we change the return type to `long` and only return a value if the append was accepted. For the error cases, we instead throw an exception. For this purpose, the patch introduces a couple new exception types: `BufferAllocationException` and `NotLeaderException`.
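A sketch of what caller-side handling could look like under the new contract; the trait and exception classes below are simplified stand-ins for the real raft client API:
```scala
object ScheduleAppendSketch {
  // Hypothetical stand-ins for the raft client API described above.
  class NotLeaderException(msg: String) extends RuntimeException(msg)
  class BufferAllocationException(msg: String) extends RuntimeException(msg)

  trait RaftClient[T] {
    /** Returns the expected offset of the appended records, or throws on error. */
    def scheduleAppend(epoch: Int, records: java.util.List[T]): Long
  }

  def appendOrHandle[T](client: RaftClient[T], epoch: Int, records: java.util.List[T]): Option[Long] =
    try {
      Some(client.scheduleAppend(epoch, records)) // accepted: a real offset, no sentinel values
    } catch {
      case _: NotLeaderException       => None // lost leadership; caller should resign or retry elsewhere
      case _: BufferAllocationException => None // no buffer space; caller may back off and retry
    }
}
```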
Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, Jason Gustafson <jason@confluent.io>
RaftClient's scheduleAppend may split the list of records into multiple
batches. This means that it is possible for the active controller to
see a committed offset for which it doesn't have an in-memory snapshot.
If the active controller needs to renounce and it is missing an
in-memory snapshot, then revert the state and reregister the Raft
listener. This will cause the controller to replay the entire metadata
partition.
Reviewers: Jason Gustafson <jason@confluent.io>
This patch improves logging around follower truncation. Specifically, the log message includes the end epoch state obtained from the leader and the resulting truncation state on the follower. An example log message is given below:
```
Truncating partition topic1-0 with TruncationState(offset=5, completed=true) due to leader epoch and offset EpochEndOffset(errorCode=0, partition=0, leaderEpoch=0, endOffset=5)
```
Reviewers: Jason Gustafson <jason@confluent.io>
The configs `alter.config.policy.class.name` and `create.topic.policy.class.name` are not yet supported by KRaft. KRaft servers should fail startup if any of these are configured.
Reviewers: Luke Chen <showuon@gmail.com>, David Arthur <mumrah@gmail.com>, Jason Gustafson <jason@confluent.io>
In 3.0, there was a change that resulted in no longer assigning topic IDs to the log and the partition.metadata file in certain upgrade scenarios, specifically when upgrading from IBP 2.7 or below to 3.0. In this case, there may not be a bump to the leader epoch when the topicId is assigned by the controller, so the LeaderAndIsr request from the controller would be ignored by the replica. This PR fixes the problem by adding a check for whether we need to handle the LeaderAndIsr request given a new topic ID when one is not yet assigned in the log and code to assign a topic ID when the log is already associated to a partition in ReplicaManager.
Reviewers: Jason Gustafson <jason@confluent.io>
This PR removes the `METADATA` API from the KRaft controller as the controller does not yet implement the metadata fetch functionality completely.
Without the change (as per the JIRA https://issues.apache.org/jira/browse/KAFKA-13143), the API would return an empty list of topics making the caller incorrectly think that there were no topics in the cluster which could be confusing. After this change the describe and list topic APIs timeout on the controller endpoint when using the `kafka-topics` CLI (which is the same behavior as create_topic).
Reviewers: Luke Chen <showuon@gmail.com>, José Armando García Sancio <jsancio@users.noreply.github.com>, Jason Gustafson <jason@confluent.io>
When expiring transactionalIds, we group the tombstones together into batches. Currently there is no limit on the size of these batches, which can lead to `MESSAGE_TOO_LARGE` errors when a bunch of transactionalIds need to be expired at the same time. This patch fixes the problem by ensuring that the batch size respects the configured limit. Any transactionalIds which are eligible for expiration and cannot be fit into the batch are postponed until the next periodic check.
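A rough sketch of the batching idea, assuming a hypothetical `Tombstone` type with a precomputed serialized size:
```scala
object ExpirationBatchingSketch {
  // Hypothetical tombstone with a precomputed serialized size.
  case class Tombstone(transactionalId: String, sizeInBytes: Int)

  // Pack a prefix of the expired ids into one batch without exceeding maxBatchSize;
  // everything that does not fit is postponed to the next periodic expiration check.
  def splitForBatch(expired: List[Tombstone], maxBatchSize: Int): (List[Tombstone], List[Tombstone]) = {
    var bytes = 0
    val batch = expired.takeWhile { t => bytes += t.sizeInBytes; bytes <= maxBatchSize }
    (batch, expired.drop(batch.size))
  }
}
```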
Reviewers: David Jacot <djacot@confluent.io>, Guozhang Wang <wangguoz@gmail.com>
In 2.8, the dump log output regressed to print batch level information for each record, which makes the output much noisier. This patch changes the output to what it was in 2.7 and previous versions. We only print batch metadata at the batch level.
Reviewers: David Arthur <mumrah@gmail.com>, Ismael Juma <ismael@juma.me.uk>
This patch fixes BrokerMetadataPublisher.findGhostReplicas (renamed to findStrayPartitions)
so that it returns the stray partitions. Previously it was returning the non-stray partitions. This
caused all of these partitions to get deleted on startup by mistake.
Reviewers: Colin P. McCabe <cmccabe@apache.org>, José Armando García Sancio <jsancio@gmail.com>
These failures were caused by a46b82bea9. Details for each test:
* message_format_change_test: use IBP 2.8 so that we can write in older message
formats.
* compatibility_test_new_broker_test_failures: fix down-conversion path to handle
empty record batches correctly. The record scan in the old code ensured that
empty record batches were never down-converted, which hid this bug.
* upgrade_test: set the IBP 2.8 when message format is < 0.11 to ensure we are
actually writing with the old message format even though the test was passing
without the change.
Verified with ducker that some variants of these tests failed without these changes
and passed with them. Also added a unit test for the down-conversion bug fix.
Reviewers: Jason Gustafson <jason@confluent.io>
When the replica fetcher receives a top-level error in the fetch response, it marks all partitions as failed and adds a backoff delay before resuming fetching from them. In addition to this, there is an additional backoff enforced after the top-level error is handled, so we end up waiting twice the configured backoff time before resuming. This patch removes this extra backoff.
Reviewers: Jason Gustafson <jason@confluent.io>
In `deleteLogs`, we use a consistent value for `fileDeleteDelayMs`
for the whole method. In `DynamicBrokerConfig.reconfigure`, it's
a minor readability improvement, but there should be no change in
behavior.
Reviewers: David Arthur <mumrah@gmail.com>
This patch closes a test gap where we do not check ReplicaManager metrics remain as expected. There
was a bug in 2.8 where the metrics moved under a different class name for the KRaft case. Having
such tests would have helped identify the bug.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
Checking the documentation, we must still support the `--zookeeper` option in 3 places (alter and describe):
1. user configs where the config is a SCRAM mechanism name (i.e. a SCRAM credential for a user)
2. update broker configs for a particular broker when that broker is down
3. broker default configs when all brokers are down
Reference:
1. [config SCRAM Credentials](https://kafka.apache.org/documentation/#security_sasl_scram_credentials)
2. [Update config before broker started](https://kafka.apache.org/documentation/#dynamicbrokerconfigs)
So, after this PR, we only support `--zookeeper` for the `users` and `brokers` entity types, and we add the corresponding argument parsing rules and tests.
Reviewers: Ron Dagostino <rdagostino@confluent.io>, Ismael Juma <ismael@juma.me.uk>
Also:
* Deprecate `log.message.format.version` and `message.format.version`.
* Log broker warning if the deprecated config values are ignored due to
the inter-broker protocol version.
* Log warning if `message.format.version` is set via `ConfigCommand`.
* Always down-convert if fetch version is v3 or lower.
* Add tests to verify new message format version based on the
inter-broker protocol version.
* Adjust existing tests that create topics with an older message format to
have the inter-broker protocol set to 2.8.
* Add upgrade note.
Note that the log compaction change to always write new segments with
record format v2 if the IBP is 3.0 or higher will be done as part of
KAFKA-13093 (with Kafka 3.1 as the target release version).
Reviewers: David Jacot <djacot@confluent.io>, David Arthur <mumrah@gmail.com>, Jason Gustafson <jason@confluent.io>
Check and verify generated snapshots for the controllers and the
brokers. Assert reader state when reading last log append time.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
Fix a bug where if a snapshot file is deleted while we're running snapshot recovery,
a NoSuchFileException will be thrown and snapshot recovery will fail.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
After noticing increased LISR times, we discovered a lot of time was spent synchronously flushing the partition metadata file. This PR changes the code so we asynchronously flush the files.
We ensure files are flushed before appending, renaming or closing the log to ensure we have the partition metadata information on disk. Three new tests have been added to address these cases.
Reviewers: Lucas Bradstreet <lucas@confluent.io>, Jun Rao <junrao@gmail.com>
Support the KIP-455 reassignment API when in KRaft mode. Reassignments
which merely rearrange replicas complete immediately. Those that only
remove replicas complete immediately if the ISR would be non-empty
after the specified removals. Reassignments that add one or more
replicas follow the KIP-455 pattern of adding all the adding replicas
to the replica set, and then waiting for the ISR to include all the new
replicas before completing. Changes to the replica sets are
accomplished via PartitionChangeRecord.
Reviewers: Jun Rao <junrao@gmail.com>
TL;DR:
This PR implements the details of the Log layer refactor, as outlined in this document: https://docs.google.com/document/d/1dQJL4MCwqQJSPmZkVmVzshFZKuFy_bCPtubav4wBfHQ/edit. A few details may differ from the doc, but it is more or less the same.
STRATEGY:
In this PR, I've extracted a new class called LocalLog out of Log. Currently LocalLog is purely an implementation detail that's not exposed outside the Log class (except for tests). The encapsulation is that each Log instance wraps around a LocalLog instance.
This new LocalLog class attempts to encompass most of the local log responsibilities surrounding the segments map, which were previously present in Log. Note that not all local log responsibilities have been moved over to this new class (yet). The criterion I used was to preserve (for now), in the existing Log class, any logic that is mingled in a complex manner with the logStartOffset, the LeaderEpochCache, or the ProducerStateManager.
Reviewers: Ismael Juma <ismael@juma.me.uk>, Satish Duggana <satishd@apache.org>, Jun Rao <junrao@gmail.com>
Fix a simulation test failure by:
1. Relaxing the validation of the snapshot id against the log start
   offset when the state machine attempts to create a new snapshot. It
   is safe to just ignore the request instead of throwing an exception
   when the snapshot id is less than the log start offset.
2. Fixing the MockLog implementation so that it uses startOffset both
externally and internally.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
Add an internal configuration in order to facilitate system and integration tests that need a smaller
log segment size. Since this is not intended for use in production, log an ERROR message if it is
set to a non-default value.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
This patch adds a check to ensure that principal builder implementations implement `KafkaPrincipalSerde` as specified in KIP-590: https://cwiki.apache.org/confluence/display/KAFKA/KIP-590%3A+Redirect+Zookeeper+Mutation+Protocols+to+The+Controller. This patch also changes the default value of `principal.builder.class` to `DefaultKafkaPrincipalBuilder`, which was already the implicit behavior when no principal builder was specified.
Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>
In KRaft mode, Apache Kafka 2.8.0 advertises the socket port instead of the configured advertised port.
A broker with the following configuration:
listeners=PUBLIC://0.0.0.0:19092,REPLICATION://0.0.0.0:9091
advertised.listeners=PUBLIC://envoy-kafka-broker:9091,REPLICATION://kafka-broker1:9091
advertises envoy-kafka-broker:19092 on the PUBLIC listener; however, I would expect
envoy-kafka-broker:9091 to be advertised. In ZooKeeper mode it works as expected. This PR
changes the BrokerServer class so that in KRaft mode the configured advertised port is
registered as expected.
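A rough sketch of the intended registration logic; the types and the port-0 fallback rule are illustrative assumptions rather than the exact BrokerServer code:
```scala
object AdvertisedListenerSketch {
  // Hypothetical, simplified endpoint description.
  case class Endpoint(listenerName: String, host: String, port: Int)

  // Register the configured advertised port; only fall back to the bound socket
  // port when no explicit port was configured (modelled here as port 0).
  def effectiveAdvertisedPort(advertised: Endpoint, boundSocketPort: Int): Int =
    if (advertised.port != 0) advertised.port else boundSocketPort
}
```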
Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>
In KRaft mode, the flow of a request from a client to the controller is as follows:
1. The client sends the request to a random controller (e.g. controller A).
2. Controller A forwards the request to the active controller (e.g. controller B) to handle it.
3. After the active controller B completes the request, controller A receives the response and checks it:
3.1. If the response has a "disconnected" or "NOT_CONTROLLER" error, the cached active controller has changed, so clear the cached active controller and wait for the next retry to fetch the updated active controller from `controllerNodeProvider`.
3.2. Otherwise, complete the request and respond to the client.
This bug involves 2 issues:
1. The "NOT_CONTROLLER" error is not correctly sent back to the requester; instead, `UNKNOWN_SERVER_ERROR` is returned. The reason is that the `NotControllerException` is wrapped in a `CompletionException` when the `Future` is completed exceptionally, and the `CompletionException` does not match any of the Errors we defined, so `UNKNOWN_SERVER_ERROR` is returned. Even if we don't want to return the `NotControllerException` to the client, we still need to see it so we can react to it.
fix 1: unwrap the `CompletionException` before encoding the exception to an error.
2. Fixing the 1st bug is not enough. After that fix, the client can successfully get `NotControllerException` and keeps retrying until it times out. So why doesn't it hit flow `3.1` above, given that it has `NotControllerException`? The reason is that we wrapped the original request in an `EnvelopeRequest` and forwarded it to the active controller. After the active controller completes the request and responds with `NotControllerException`, the response is wrapped in an `EnvelopeResponse` **with no error** and sent back. That is, in flow `3.1`, we only see "no error" on the `EnvelopeResponse`, not the `NotControllerException` inside.
fix 2: Make the envelope response return `NotControllerException` if the controller response contains one, so that we can catch the `NotControllerException` on the envelope response and update the cached active controller.
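A minimal sketch of fix 1, the unwrapping step, with hypothetical stand-ins for Kafka's error mapping:
```scala
import java.util.concurrent.CompletionException

object ForwardingErrorSketch {
  // Hypothetical stand-ins for Kafka's NotControllerException / Errors mapping.
  class NotControllerException(msg: String) extends RuntimeException(msg)

  sealed trait ErrorCode
  case object NotController extends ErrorCode
  case object UnknownServerError extends ErrorCode

  // Fix 1: unwrap CompletionException so the real cause (e.g. NOT_CONTROLLER)
  // is mapped to the right error code instead of UNKNOWN_SERVER_ERROR.
  def toErrorCode(t: Throwable): ErrorCode = {
    val cause = t match {
      case ce: CompletionException if ce.getCause != null => ce.getCause
      case other => other
    }
    cause match {
      case _: NotControllerException => NotController
      case _ => UnknownServerError
    }
  }
}
```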
Reviewers: wenbingshen <oliver.shen999@gmail.com>, Ismael Juma <ismael@juma.me.uk>, dengziming <dengziming1993@gmail.com>, Jason Gustafson <jason@confluent.io>
When a node is serving as both broker and controller, we should only rely on the controller to write new snapshots.
Reviewers: Luke Chen <showuon@gmail.com>, Colin P. McCabe <cmccabe@apache.org>
This patch fixes a few request listener specs. We were missing "broker" for many APIs which are now implemented in KRaft and there were a couple cases where we had unnecessarily exposed a controller-only API on the broker.
Reviewers: Jason Gustafson <jason@confluent.io>
The broker should trigger a snapshot once
metadata.log.max.record.bytes.between.snapshots has been exceeded.
Reviewers: Jason Gustafson <jason@confluent.io>
The sliding window + takeWhile behavior over a sequence seems somewhat
different between Scala 2.12 and Scala 2.13. This PR works around the
difference by using foreach with an early return.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
Do not update the commit sensor if the commit failed. The patch also adds 2 unit tests: the first for the `OFFSET_METADATA_TOO_LARGE` error, and the second to cover the case where one offset is committed and the other fails with `OFFSET_METADATA_TOO_LARGE`. Both of these cases were previously uncovered.
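A small sketch of the intended sensor behavior, using a hypothetical sensor stand-in:
```scala
object CommitSensorSketch {
  // Hypothetical sensor stand-in; the real code uses Kafka's metrics Sensor.
  class Sensor { def record(value: Double): Unit = () }

  val offsetCommitsSensor = new Sensor

  // Only record successfully committed offsets; failures such as
  // OFFSET_METADATA_TOO_LARGE must not inflate the commit rate.
  def onCommitComplete(results: Map[String, Option[Throwable]]): Unit = {
    val successes = results.count { case (_, error) => error.isEmpty }
    if (successes > 0)
      offsetCommitsSensor.record(successes.toDouble)
  }
}
```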
Reviewers: Jason Gustafson <jason@confluent.io>
Updated FetchRequest and FetchResponse to use topic IDs rather than topic names.
Some of the complicated code is found in FetchSession and FetchSessionHandler.
We need to be able to store topic IDs and maintain a cache on the broker for IDs that may not have been resolved. On incremental fetch requests, we will try to resolve them, or remove them if they are in toForget.
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Chia-Ping Tsai <chia7712@gmail.com>, Jun Rao <junrao@gmail.com>
Fix NPE from addingReplicas and removingReplicas. Make addingReplicas and
removingReplicas in PartitionRecord non-nullable as described in KIP-746.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
This implements the request and response portion of KIP-709. It updates the OffsetFetch request and response to support fetching offsets for multiple consumer groups at a time. If the broker does not support the new OffsetFetch version, clients can revert to the previous behaviour and use a request for each coordinator.
Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Konstantine Karantasis <konstantine@confluent.io>
Set the default assignor to ["range", "cooperative-sticky"] to make it easier for users to switch over to cooperative rebalancing by using only a single rolling bounce.
Reviewers: Anna Sophie Blee-Goldman <ableegoldman@apache.org>
This PR implements broker-side KRaft snapshots, including both saving and
loading. The code for triggering a periodic broker-side snapshot will come in a
follow-on PR. Loading should work with just this PR. It also implements
reloading broker snapshots after initialization.
In order to facilitate snapshots, this PR introduces the concept of
MetadataImage and MetadataDelta. MetadataImage represents the metadata state
retained in memory. It is basically a generalization of MetadataCache that
includes a few things that MetadataCache does not (such as features and client
quotas.) KRaftMetadataCache is now an accessor for the data stored in this object.
Similarly, MetadataImage replaces CachedConfigRepository and ClientQuotaCache.
It also subsumes kafka.server.metadata.MetadataImage and related classes.
MetadataDelta represents a change to a MetadataImage. When a KRaft snapshot is
loaded, we will accumulate all the changes into a MetadataDelta first, prior to
applying it. If we must reload a snapshot because we fell too far behind while
consuming metadata, the resulting MetadataDelta will contain all the changes
needed to catch us up. During normal operation, MetadataDelta is also used to
accumulate the changes of each incoming batch of metadata records. These
incremental deltas should be relatively small.
I have removed the logic for updating the various manager objects from
BrokerMetadataListener and placed it into BrokerMetadataPublisher. This makes
it easier to unit test BrokerMetadataListener.
Reviewers: David Arthur <mumrah@gmail.com>, Jason Gustafson <jason@confluent.io>
This PR includes changes to KafkaRaftClient and KafkaMetadataLog to support periodic
cleaning of old log segments and snapshots.
Four new public config keys are introduced: metadata.log.segment.bytes,
metadata.log.segment.ms, metadata.max.retention.bytes, and
metadata.max.retention.ms.
These are used to configure the log layer as well as the snapshot cleaning logic. Snapshot
and log cleaning is performed based on two criteria: total metadata log + snapshot size
(metadata.max.retention.bytes), and max age of a snapshot (metadata.max.retention.ms).
Since we have a requirement that the log start offset must always align with a snapshot,
we perform the cleaning on snapshots first and then clean what logs we can.
The cleaning algorithm follows:
1. Delete the oldest snapshot.
2. Advance the log start offset to the new oldest snapshot.
3. Request that the log layer clean any segments prior to the new log start offset
4. Repeat this until the retention size or time is no longer violated, or only a single
snapshot remains.
The cleaning process is triggered every 60 seconds from the KafkaRaftClient polling
thread.
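A rough sketch of that cleaning loop, with heavily simplified snapshot and log-layer stand-ins:
```scala
object MetadataLogCleaningSketch {
  // Heavily simplified stand-ins: a snapshot is described only by its end offset,
  // size, and age; the real code works with snapshot ids and the log layer.
  case class Snapshot(endOffset: Long, sizeBytes: Long, ageMs: Long)

  def deleteSnapshot(s: Snapshot): Unit = ()
  def advanceLogStartOffset(offset: Long): Unit = ()
  def cleanSegmentsBefore(offset: Long): Unit = ()

  // Repeatedly drop the oldest snapshot, advance the log start offset to the new
  // oldest snapshot, and clean older segments, until retention is satisfied or
  // only a single snapshot remains.
  def clean(snapshots: List[Snapshot], totalSizeBytes: () => Long,
            retentionBytes: Long, retentionMs: Long): Unit = {
    var remaining = snapshots.sortBy(_.endOffset)
    def retentionViolated: Boolean =
      totalSizeBytes() > retentionBytes || remaining.headOption.exists(_.ageMs > retentionMs)

    while (remaining.size > 1 && retentionViolated) {
      deleteSnapshot(remaining.head)
      remaining = remaining.tail
      advanceLogStartOffset(remaining.head.endOffset)
      cleanSegmentsBefore(remaining.head.endOffset)
    }
  }
}
```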
Reviewers: José Armando García Sancio <jsancio@gmail.com>, dengziming <dengziming1993@gmail.com>, Colin P. McCabe <cmccabe@apache.org>
This patch ensures that `maxTimestampSoFar` and `offsetOfMaxTimestampSoFar` are consistent with each other. It does so by storing them together. It relates to KIP-734 which exposes them via the admin client.
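One way to guarantee the pair stays consistent is to store both values in a single immutable object, roughly as sketched below (names are illustrative, not the actual patch):
```scala
object MaxTimestampSketch {
  // Keep the max timestamp and the offset where it occurred in a single immutable
  // value so readers can never observe one updated without the other.
  case class TimestampAndOffset(timestamp: Long, offset: Long)

  @volatile private var maxTimestampAndOffsetSoFar = TimestampAndOffset(-1L, -1L)

  def maybeUpdate(timestamp: Long, offset: Long): Unit =
    if (timestamp > maxTimestampAndOffsetSoFar.timestamp)
      maxTimestampAndOffsetSoFar = TimestampAndOffset(timestamp, offset)
}
```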
Reviewers: Ismael Juma <ismael@juma.me.uk>, David Jacot <djacot@confluent.io>
Add the record append time to Batch. Change SnapshotReader to set this time to the
time of the last log in the last batch. Fix the QuorumController to remember the last
committed batch append time and to store it in the generated snapshot.
Reviewers: David Arthur <mumrah@gmail.com>, Luke Chen <showuon@gmail.com>, Colin P. McCabe <cmccabe@apache.org>
Segment and index files are currently renamed with a .deleted
suffix prior to async deletion. This serves two purposes, to
resume deletion on broker failure and also protect against
deletion of new segments during truncation (due to deletion
being async).
We should do the same for snapshot files. While they are not subject
to issues around resuming deletion due to the stray snapshot
scanning which is performed on log initialization, we can end up
with situations where truncation queues snapshots for deletion, but
prior to deletion new segments with the same snapshot file name are
created. Async deletion can then delete these new snapshots.
This patch offers a two-stage snapshot deletion which first renames
and removes the segments in question from the ProducerStateManager,
allowing the Log to asynchronously delete them.
Credit to Kowshik Prakasam <kowshik@gmail.com> for finding this issue
and creating the test demonstrating the failure.
Co-authored-by: Kowshik Prakasam <kowshik@gmail.com>
Reviewers: Kowshik Prakasam <kprakasam@confluent.io>, Jun Rao <junrao@gmail.com>
In getListOffsetsCalls, we rebuild the cluster snapshot for every topic partition. Instead, we should reuse a single snapshot.
For manual testing (using AK 2.8), I've passed a map of 6K topic partitions to listOffsets.
Without snapshot reuse:
duration of building futures from metadata response: **15582** milliseconds
total duration of listOffsets: **15743** milliseconds
With reuse:
duration of building futures from metadata response: **24** milliseconds
total duration of listOffsets: **235** milliseconds
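A minimal sketch of the reuse idea, with hypothetical simplified metadata types:
```scala
object ListOffsetsSnapshotSketch {
  // Hypothetical, simplified metadata types standing in for the admin client internals.
  case class Cluster(leaders: Map[String, Int])
  case class MetadataResponse(leaders: Map[String, Int]) {
    def buildCluster(): Cluster = Cluster(leaders) // expensive in the real code
  }

  // Build the cluster snapshot once and reuse it for every partition, instead of
  // rebuilding it inside the per-partition loop.
  def leadersForPartitions(response: MetadataResponse, partitions: Seq[String]): Map[String, Option[Int]] = {
    val cluster = response.buildCluster() // hoisted out of the loop
    partitions.map(tp => tp -> cluster.leaders.get(tp)).toMap
  }
}
```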
Reviewers: Luke Chen <showuon@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>
This implements KIP-699: https://cwiki.apache.org/confluence/display/KAFKA/KIP-699%3A+Update+FindCoordinator+to+resolve+multiple+Coordinators+at+a+time
It updates FindCoordinator request and response to support resolving multiple coordinators at a time. If a broker does not support the new FindCoordinator version, clients can revert to the previous behaviour and use a request for each coordinator.
Reviewers: David Jacot <djacot@confluent.io>, Tom Bentley <tbentley@redhat.com>, Sanjana Kaundinya <skaundinya@gmail.com>
This patch adds two new apis to support topic deletion using topic IDs or names. It uses a new class `TopicCollection` to keep a collection of topics defined either by names or IDs. Finally, it modifies `DeleteTopicsResult` to support both names and IDs and deprecates the old methods which have become ambiguous. Eventually we will want to deprecate the old `deleteTopics` apis as well, but this patch does not do so.
Reviewers: Jason Gustafson <jason@confluent.io>
Add the ability for KRaft controllers to generate snapshots based on the number of new record bytes that have
been applied since the last snapshot. Add a new configuration key to control this parameter. For now, it
defaults to being off, although we will change that in a follow-on PR. Also, fix LocalLogManager so that
snapshot loading is only triggered when the listener is not the leader.
Reviewers: Colin P. McCabe <cmccabe@apache.org>
Fix the JavaDoc for the ClientQuotaManagerConfig#throttle function to
refer to the correct parameter name.
BrokerEndPointTest#testHashAndEquals should test the BrokerEndPoint
class, rather than the MetadataBroker class.
TopicConfigHandler: make the kafkaController argument optional, since we won't
have it when in KRaft mode.
Remove the unnecessary ConfigRepository argument for the Partition class.
Remove the unused TestUtils#deleteBrokersInZk function.
Reviewers: Jason Gustafson <jason@confluent.io>
Remove getNonExistingTopics, which was not necessary. MetadataCache
already lets callers check for the existence of topics by calling
MetadataCache#contains.
Add MetadataCache#getAliveBrokerNode and getAliveBrokerNodes. This
simplifies the calling code, which always wants a Node.
Fix a case where we were calling getAliveBrokers and filtering by id,
rather than simply calling getAliveBroker(id) and making use of the hash
map.
Reviewers: Jason Gustafson <jason@confluent.io>, Jose Sancio <jsancio@gmail.com>
When we find a .swap file on startup, we typically want to rename and replace it as .log, .index, .timeindex, etc. as a way to complete any ongoing replace operations. These swap files are usually known to have been flushed to disk before the replace operation begins.
One flaw in the current logic is that we recover these swap files on startup and as part of that, end up truncating the producer state and rebuild it from scratch. This is unneeded as the replace operation does not mutate the producer state by itself. It is only meant to replace the .log file along with corresponding indices. Because of this unneeded producer state rebuild operation, we have seen multi-hour startup times for clusters that have large compacted topics.
This patch fixes the issue. With ext4 in ordered mode, metadata writes are ordered regardless of whether the shutdown was clean or unclean. As a result, we rework the recovery workflow as follows:
1. If there are any .cleaned files, we delete all .swap files with higher or equal offsets (due to KAFKA-6264), and we also delete the .cleaned files. If there is no .cleaned file, this step does nothing.
2. Any .log.swap files left after step 1, together with their index files, must have been fully renamed from .cleaned and are therefore complete (renaming from .cleaned to .swap is done in reverse offset order). We rename these .log.swap files and their corresponding index files to regular files, while deleting the original files from compaction or segment split if they haven't been deleted yet.
3. Do log splitting for legacy log segments with offset overflow (KAFKA-6264).
4. If there are any other index swap files left, they must come from a partial rename from .swap files to regular files. We can simply rename them to regular files.
credit: some code is copied from @dhruvilshah3 's PR: #10388
Reviewers: Dhruvil Shah <dhruvil@confluent.io>, Jun Rao <junrao@gmail.com>
If fetchOffset < startOffset, we currently throw OffsetOutOfRangeException when attempting to read from the log in the regular case. But for diverging epochs, we return Errors.NONE with the new leader start offset, hwm, etc. ReplicaFetcherThread throws OffsetOutOfRangeException when processing responses with Errors.NONE if the leader's offsets in the response are out of range, and this moves the partition to a failed state. The PR adds a check for this case when processing fetch requests and throws OffsetOutOfRangeException regardless of epoch.
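A small sketch of the added check; the exception class and message are illustrative stand-ins:
```scala
object FetchOffsetCheckSketch {
  class OffsetOutOfRangeException(msg: String) extends RuntimeException(msg)

  // Throw OffsetOutOfRangeException whenever the fetch offset is below the log
  // start offset, even on the diverging-epoch path, so the follower handles it
  // as an out-of-range error rather than failing the partition.
  def checkFetchOffset(tp: String, fetchOffset: Long, logStartOffset: Long): Unit =
    if (fetchOffset < logStartOffset)
      throw new OffsetOutOfRangeException(
        s"Received request for offset $fetchOffset for partition $tp, " +
          s"but we only have log segments starting from offset $logStartOffset.")
}
```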
Reviewers: Luke Chen <showuon@gmail.com>, Nikhil Bhatia <rite2nikhil@gmail.com>, Guozhang Wang <wangguoz@gmail.com>
This patch fixes the `ConsumerGroupCommand` to correctly handle missing offsets, which are returned as `null` by the admin API.
Reviewers: David Jacot <djacot@confluent.io>
Removed the condition that throws the error. Now we return UNKNOWN_TOPIC_ID, which allows clients to retry instead of failing. Updated the test for IBP < 2.8 that tries to delete topics using IDs.
Reviewers: Luke Chen <showuon@gmail.com>, Jason Gustafson <jason@confluent.io>
When the check "whether the last offset of the last batch in this segment overflows the indexes" returns an unexpected result, the path of the segment should be printed so that users can locate the problem.
Reviewers: Luke Chen, Guozhang Wang <wangguoz@gmail.com>
Use MockConfigRepository rather than CachedConfigRepository in unit
tests. This is useful for an upcoming change that will remove
CachedConfigRepository.
Reviewers: David Arthur <mumrah@gmail.com>
We added the DescribeQuorum API in KIP-595. This patch adds the logic to forward DescribeQuorum requests to the controller when KRaft is enabled. The KRaft broker listener has already been enabled in DescribeQuorumRequest.json. The zk broker is not enabled, however, so DescribeQuorum requests will not be advertised and will be rejected at the network layer.
Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, David Arthur <mumrah@gmail.com>
This patch fixes a match error in `TestRaftServer` which causes the process to crash. We should match against `null` to handle the case of a timeout when polling for events.
Reviewers: José Armando García Sancio <jsancio@users.noreply.github.com>, Jason Gustafson <jason@confluent.io>
To avoid overflowing the 4-byte relative offsets in the log index, the log cleaner groups log segments so that each group's offset range does not exceed Int.MaxValue.
This offset check currently does not consider whether the next log segment is empty, so an empty log file is left behind roughly every 2^31 messages.
These leftover empty logs are reprocessed on every clean cycle and rewritten with the same empty content, which causes unnecessary I/O.
For the __consumer_offsets topic, we can normally set cleanup.policy to compact,delete to work around this.
My cluster runs 0.10.1.1, but after analyzing the trunk code, it should have the same problem too.
Co-authored-by: Liu Qiang(BSS-HZ) <qliu.zj@best-inc.com>
Reviewers: Luke Chen <showuon@gmail.com>, Guozhang Wang <wangguoz@gmail.com>
We should process the entire batch in `BrokerMetadataListener` and make sure that `hasNext` is called before calling `next` on the iterator. The previous code worked because the raft client kept track of the position in the iterator, but it caused NoSuchElementException to be raised when the reader was empty (as might be the case with control records).
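A minimal sketch of the defensive iteration pattern:
```scala
object BatchReaderSketch {
  // Process the whole reader defensively: always check hasNext before next, so an
  // empty reader (e.g. only control records) does not raise NoSuchElementException.
  def handleCommits[T](reader: java.util.Iterator[java.util.List[T]])(handle: java.util.List[T] => Unit): Unit =
    while (reader.hasNext) {
      handle(reader.next())
    }
}
```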
Reviewers: Jason Gustafson <jason@confluent.io>
Added tiered storage related configs including remote log manager configs.
Added local log retention configs to LogConfig.
Added tests for the added configs.
Reviewers: Kowshik Prakasam <kprakasam@confluent.io>, Jun Rao <junrao@gmail.com>
Upon upgrading to IBP 2.8, topic ID can end up getting reassigned which can cause errors in LeaderAndIsr handling when the partition metadata files from the previous ID are still on the broker.
Topic IDs are stored in the TopicZNode. The behavior of the code before this fix is as follows:
Consider we had a controller with IBP 2.8+. Each topic will be assigned topic IDs and LeaderAndIsr requests will write partition.metadata files to the brokers. If we re-elect the controller and end up with a controller with an older IBP version and we reassign partitions, the TopicZNode is overwritten and we lose the topic ID. Upon electing a 2.8+ IBP controller, we will see the TopicZNode is missing a topic ID and will generate a new one. If the broker still has the old partition metadata file, we will see an ID mismatch that causes the error.
This patch changes the controller logic so that we maintain the topic ID in the controller and the ZNode even when IBP < 2.8. This means that in the scenario above, reassigning partitions will no longer result in losing the topic ID.
Topic IDs may be lost when downgrading the code below version 2.8, but upon re-upgrading to code version 2.8+, before bumping the IBP, all partition metadata files will be deleted to prevent any errors.
Reviewers: Lucas Bradstreet <lucas@confluent.io>, David Jacot <djacot@confluent.io>