kafka

Commit Graph

Author	SHA1	Message	Date
Nikolay	4bba2c8a32	KAFKA-14591: Move DeleteRecordsCommand to tools (#13278 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Federico Valeri <fedevaleri@gmail.com>	2023-07-21 17:30:28 +02:00
Satish Duggana	4ea9394e7e	MINOR Fix the build failure (#14065 ) Fixing the build failure caused by the earlier commit `27ea025e33` ``` [Error] /Users/satishd/repos/apache-kafka/core/src/test/scala/unit/kafka/server/ReplicaManagerTest.scala:3526:77: the result type of an implicit conversion must be more specific than Object [Error] /Users/satishd/repos/apache-kafka/core/src/test/scala/unit/kafka/server/ReplicaManagerTest.scala:3530:70: the result type of an implicit conversion must be more specific than Object [Warn] /Users/satishd/repos/apache-kafka/core/src/test/scala/unit/kafka/server/ServerGenerateBrokerIdTest.scala:23:21: imported `QuorumTestHarness` is permanently hidden by definition of object QuorumTestHarness in package server [Warn] /Users/satishd/repos/apache-kafka/core/src/test/scala/unit/kafka/server/ServerGenerateClusterIdTest.scala:29:21: imported `QuorumTestHarness` is permanently hidden by definition of object QuorumTestHarness in package server [Error] /Users/satishd/repos/apache-kafka/core/src/test/scala/unit/kafka/utils/TestUtils.scala:1438:15: ambiguous reference to overloaded definition, both method doReturn in class Mockito of type (x$1: Any, x$2: Object*)org.mockito.stubbing.Stubber and method doReturn in class Mockito of type (x$1: Any)org.mockito.stubbing.Stubber match argument types (kafka.log.UnifiedLog) ``` Reviewers: Luke Chen <showuon@gmail.com>	2023-07-21 13:56:02 +05:30
Luke Chen	27ea025e33	KAFKA-15176: add tests for tiered storage metrics (#13999 ) Added tests for metrics: 1. RemoteLogReaderTaskQueueSize 2. RemoteLogReaderAvgIdlePercent 3. RemoteLogManagerTasksAvgIdlePercent Also, added tests for OffsetOutOfRangeException will be thrown while reading logs Reviewers: Christo Lolov <christololov@gmail.com>, Satish Duggana <satishd@apache.org>	2023-07-21 10:30:33 +08:00
gaurav-narula	29f36d733b	KAFKA-15141: Initialize logger statically on hot codepaths (#13949 ) Log4j based loggers use `org.apache.logging.log4j.spi.AbstractLoggerAdapter::getContext` which invokes StackLocatorUtil to walk the stacktrace. This operation is quite CPU intensive and is performed each time during instantiation. To avoid walking the stack often, this change uses a static variable to initialize the logger for a few classes which seem to be instantiated frequently. Reviewers: Divij Vaidya <diviv@amazon.com>, Ismael Juma <ismael@juma.me.uk>	2023-07-19 12:24:40 -07:00
Jeff Kim	a500c3ecf9	KAFKA-14500; [5/N] Implement JoinGroup protocol in new GroupCoordinator (#13870 ) This patch implements the existing JoinGroup protocol within the new group coordinator. Some notable differences: * Methods return a CoordinatorResult to the runtime framework, which includes records to append to the log as well as a future to complete after the append succeeds/fails. * The coordinator runtime ensures that only a single thread will be processing a group at any given time, therefore there is no more locking on groups. * Instead of using on purgatories, we rely on the Timer interface to schedule/cancel delayed operations. Reviewers: David Jacot <djacot@confluent.io>	2023-07-19 09:15:13 +02:00
Abhijeet Kumar	fd3b1137d2	KAFKA-14953: Add tiered storage related metrics (#13944 ) * KAFKA-14953: Adding RemoteLogManager metrics In this PR, I have added the following metrics that are related to tiered storage mentioned in[ KIP-405](https://cwiki.apache.org/confluence/display/KAFKA/KIP-405%3A+Kafka+Tiered+Storage). \|Metric\|Description\| \|-----------------------------------------\|--------------------------------------------------------------\| \| RemoteReadRequestsPerSec \| Number of remote storage read requests per second \| \| RemoteWriteRequestsPerSec \| Number of remote storage write requests per second \| \| RemoteBytesInPerSec \| Number of bytes read from remote storage per second \| \| RemoteReadErrorsPerSec \| Number of remote storage read errors per second \| \| RemoteBytesOutPerSec \| Number of bytes copied to remote storage per second \| \| RemoteWriteErrorsPerSec \| Number of remote storage write errors per second \| \| RemoteLogReaderTaskQueueSize \| Number of remote storage read tasks pending for execution. \| \| RemoteLogReaderAvgIdlePercent \| Average idle percent of the remote storage reader thread pool\| \| RemoteLogManagerTasksAvgIdlePercent \| Average idle percent of RemoteLogManager thread pool \| Added unit tests for all the rate metrics. Reviewers: Luke Chen <showuon@gmail.com>, Divij Vaidya <diviv@amazon.com>, Kamal Chandraprakash<kamal.chandraprakash@gmail.com>, Jorge Esteban Quilcate Otoya <quilcate.jorge@gmail.com>, Staniel Yao<yaolixinylx@gmail.com>, hudeqi<1217150961@qq.com>, Satish Duggana <satishd@apache.org>	2023-07-18 20:16:19 +05:30
vamossagar12	fa5b493241	KAFKA-14647: Move TopicFilter to server-common/utils (#13158 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Federico Valeri <fedevaleri@gmail.com>	2023-07-18 10:38:56 +02:00
Hailey Ni	9e50f7cdd3	MINOR: Add ZK dual-write lag metric (#14009 ) This patch adds ZKWriteBehindLag metric to the KafkaController mbean as specified in KIP-866 Reviewers: David Arthur <mumrah@gmail.com>	2023-07-16 21:23:01 -04:00
Justine Olshan	ea0bb00126	KAFKA-14884: Include check transaction is still ongoing right before append (take 2) (#13787 ) Introduced extra mapping to track verification state. When verifying, there is a race condition that the add partitions verification response returns that the partition is in the ongoing transaction, but an abort marker is written before we get to append. Therefore, we track any given transaction we are verifying with an object unique to that transaction. We check this unique state upon the first append to the log. After that, we can rely on currentTransactionFirstOffset. We remove the verification state on appending to the log with a transactional data record or marker. We will also clean up lingering verification state entries via the producer state entry expiration mechanism. We do not update the the timestamp on retrying a verification for a transaction, so each entry must be verified before producer.id.expiration.ms. There were a few other fixes: - Moved the transaction manager handling for failed batch into the future completed exceptionally block to avoid processing it twice (this caused issues in unit tests) - handle interrupted exceptions encountered when callback thread encountered them - change handling to throw error if we try to set verification state and leaderLogIfLocal is None. Reviewers: David Jacot <djacot@confluent.io>, Artem Livshits <alivshits@confluent.io>, Jason Gustafson <jason@confluent.io>	2023-07-14 15:18:11 -07:00
David Arthur	d9253fed5c	MINOR Improve logging during the ZK to KRaft migration (#14008 ) * Adds an exponential backoff to 1m while the controller is waiting for brokers to show up * Increases one-time logs to INFO * Adds a summary of the migration records * Use RecordRedactor for summary of migration batches (TRACE only) Reviewers: Colin P. McCabe <cmccabe@apache.org>	2023-07-14 17:44:00 -04:00
David Jacot	32ff347b2c	KAFKA-14462; [23/23] Wire GroupCoordinatorService in BrokerServer (#13991 ) This patch wires the new group coordinator in BrokerServer (KRaft only). With this, it is now possible to run a cluster with the new group coordinator and to use the ConsumerGroupHeartbeat API by specifying the following two properties: - group.coordinator.new.enable = true (to enable the new group coordinator) - unstable.api.versions.enable = true (to enable unreleased APIs) Note that the new group coordinator does not support all the existing APIs yet. Reviewers: Jeff Kim <jeff.kim@confluent.io>, Justine Olshan <jolshan@confluent.io>	2023-07-14 17:41:06 +02:00
Kirk True	b3ce2e54f4	KAFKA-15180: Generalize integration tests to change use of KafkaConsumer to Consumer (#13997 ) Update the integration tests to swap the use of the concrete KafkaConsumer class to the generic Consumer interface. Reviewers: Divij Vaidya <diviv@amazon.com>, Philip Nee <philipnee@gmail.com>, Jun Rao <junrao@gmail.com>	2023-07-14 10:24:36 +02:00
Divij Vaidya	4960a5ebe9	MINOR: Remove thread leak from ConsumerBounceTest (#13956 ) This commit prevents the leak of daemon-bounce-broker thread which was causing test failures for tests which check for thread leak prior to running. Reviewers: Luke Chen <showuon@gmail.com>, Justine Olshan <jolshan@confluent.io>, Philip Nee <philipnee@gmail.com>	2023-07-14 10:22:14 +02:00
Satish Duggana	7e2f878713	KAFKA-14522 Rewrite/Move of RemoteIndexCache to storage module. (#13275 ) KAFKA-14522 Rewrite and Move of RemoteIndexCache to storage module. Cleanedup index file suffix usages and other minor cleanups Reviewers: Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Luke Chen <showuon@gmail.com>, Divij Vaidya <diviv@amazon.com>, Kamal Chandraprakash<kamal.chandraprakash@gmail.com>, Jorge Esteban Quilcate Otoya <quilcate.jorge@gmail.com>	2023-07-11 23:55:23 +05:30
Alyssa Huang	5b5f6fcafb	[KAFKA-15137] Do not log entire request payload in KRaftControllerChannelManager (#13988 ) Reviewers: David Arthur <mumrah@gmail.com>	2023-07-11 10:48:53 +02:00
Cheryl Simmons	e98508747a	Doc fixes: Fix format and other small errors in config documentation (#13661 ) Various formatting fixes in the config docs. Reviewers: Bill Bejeck <bbejeck@apache.org>	2023-07-10 12:48:35 -04:00
Divij Vaidya	5926840cab	MINOR: Check for thread leak at the end of @AfterEach, not at beginning (#13976 ) Reviewers: David Jacot <djacot@confluent.io>	2023-07-10 14:29:04 +02:00
DL1231	d481163d55	MINOR: Print startup time for RemoteIndexCache (#13970 ) Reviewers: Satish Duggana <satishd@apache.org>, Divij Vaidya <diviv@amazon.com> Co-authored-by: d00791190 <dinglan6@huawei.com>	2023-07-08 12:53:47 +02:00
Divij Vaidya	7bdcb22cf6	MINOR: Refactor & cleanup for RemoteIndexCache (#13936 ) - Add new unit tests - Change the on-disk filename from <offset>_<uuid>_.<indexSuffix> to <offset>_<uuid>.<indexSuffix> i.e. remove trailing underscore after - Fix a small bug where we were parsing offset as Int when reading the file name from disk. Offset is long. - Perform input validation in RemoteLogSegmentMetadata. - Remove an extra loop in cleaner thread. Shutdownable thread already performs looping. Reviewers: Jorge Esteban Quilcate Otoya <jorge.quilcate@aiven.io>, Satish Duggana <satishd@apache.org>	2023-07-08 12:52:22 +02:00
andymg3	1223b79973	KAFKA-15149: Fix handling of new partitions in dual-write mode (#13968 ) Fixes a bug where we don't send UMR and LISR requests in dual-write mode when new partitions are created. Prior to this patch, KRaftMigrationZkWriter was mutating the internal data-structures of TopicDelta which prevented MigrationPropagator from sending UMR and LISR for the changed partitions. Reviewers: David Arthur <mumrah@gmail.com>	2023-07-07 10:16:51 -04:00
hudeqi	1d8b07ed64	KAFKA-15129;[7/N] Remove metrics in TransactionMarkerChannelManager when TransactionCoordinator shutdown (#13962 ) Reviewers: Divij Vaidya <diviv@amazon.com> Co-authored-by: Deqi Hu <deqi.hu@shopee.com>	2023-07-07 10:27:10 +02:00
hudeqi	574f394a3e	MINOR: Fix regression introduced in #13924 (#13958 ) Fixes a regression introduced in PR #13924 by moving the map from static to a instance specific variable. --------- Co-authored-by: Deqi Hu <deqi.hu@shopee.com>	2023-07-07 10:18:38 +02:00
David Jacot	bd1f02b2be	MINOR: Move MockTimer to server-common (#13954 ) This patch rewrites MockTimer in Java and moves it from core to server-common. This continues the work started in https://github.com/apache/kafka/pull/13820. Reviewers: Divij Vaidya <diviv@amazon.com>	2023-07-06 14:56:05 +02:00
DL1231	701f924352	KAFKA-15140: Use TestUtils methods and add logs for assertion failure at TopicCommandIntegrationTest (#13950 ) This commit utilizes TestUtils methods to create a topic and adds logs when assertions fail. Reviewers: Divij Vaidya <diviv@amazon.com> --------- Co-authored-by: d00791190 <dinglan6@huawei.com>	2023-07-04 16:02:39 +02:00
hudeqi	48eb8c90ef	KAFKA-15129: [1/N] Remove metrics in LogCleanerManager when LogCleaner shutdown (#13924 ) Reviewers: Divij Vaidya <diviv@amazon.com>, Christo Lolov <lolovc@amazon.com> --------- Co-authored-by: Deqi Hu <deqi.hu@shopee.com>	2023-07-03 16:14:30 +02:00
Jorge Esteban Quilcate Otoya	0ae1d22879	KAFKA-15135: fix(storage): pass endpoint configurations as client common to TBRLMM (#13938 ) Pass endpoint properties from RLM to TBRLMM and validate those are not ignored. Reviewers: Luke Chen <showuon@gmail.com>	2023-07-03 09:16:15 +08:00
Gantigmaa Selenge	b2d647904c	KAFKA-8982: Add retry of fetching metadata to Admin.deleteRecords (#13760 ) Use AdminApiDriver class to refresh the metadata and retry the request that failed with retriable errors. Reviewers: Luke Chen <showuon@gmail.com>, Divij Vaidya <diviv@amazon.com>, Mickael Maison <mmaison@redhat.com>, Dimitar Dimitrov <30328539+dimitarndimitrov@users.noreply.github.com>	2023-07-03 09:13:55 +08:00
Ismael Juma	1f4cbc5d53	MINOR: Add JDK 20 CI build and remove some branch builds (#12948 ) It's good for us to add support for Java 20 in preparation for Java 21 - the next LTS. Given that Scala 2.12 support has been deprecated, a Scala 2.12 variant is not included. Also remove some branch builds that add load to the CI, but have low value: JDK 8 & Scala 2.13 (JDK 8 support has been deprecated), JDK 11 & Scala 2.12 (Scala 2.12 support has been deprecated) and JDK 17 & Scala 2.12 (Scala 2.12 support has been deprecated). A newer version of Mockito (4.9.0 -> 4.11.0) is required for Java 20 support, but we only use it with Scala 2.13+ since it causes compilation errors with Scala 2.12. Similarly, we upgrade easymock when the Java version is 16 or newer as it's incompatible with powermock (which doesn't support Java 16 or newer). Filed KAFKA-15117 for a test that fails with Java 20 (SslTransportLayerTest.testValidEndpointIdentificationCN). Finally, fixed some lossy conversions that were added after #13582 was submitted. Reviewers: Ismael Juma <ismael@juma.me.uk>	2023-06-30 01:12:00 -07:00
Proven Provenzano	586f89cb1c	KAFKA-15114: Update StorageTool help for creating SCRAM credentials to specify name instead of user. (#13904 ) The choice of using name vs. user as a parameter is because internally the record uses name, all tests using the StorageTool use name as a parameter, KafkaPrincipals are created with name and because creating SCRAM credentials is done with --entity-name Reviewers: Colin P. McCabe <cmccabe@apache.org>	2023-06-29 11:11:12 -07:00
Bo Gao	005416879e	KAFKA-15053: Use case insensitive validator for security.protocol config (#13831 ) Fixed a regression described in KAFKA-15053 that security.protocol only allows uppercase values like PLAINTEXT, SSL, SASL_PLAINTEXT, SASL_SSL. With this fix, both lower case and upper case values will be supported (e.g. PLAINTEXT, SSL, SASL_PLAINTEXT, SASL_SSL, plaintext, ssl, sasl_plaintext, sasl_ssl) Reviewers: Chris Egerton <chrise@aiven.io>, Divij Vaidya <diviv@amazon.com>	2023-06-29 10:13:21 +02:00
David Jacot	482299c4e2	KAFKA-14462; [19/N] Add CoordinatorLoader implementation (#13880 ) This patch adds a coordinator loader implementation. Reviewers: Jeff Kim <jeff.kim@confluent.io>, Justine Olshan <jolshan@confluent.io>	2023-06-29 08:12:53 +02:00
Justine Olshan	2f71708955	KAFKA-15028: AddPartitionsToTxnManager metrics (#13798 ) Adding the following metrics as per kip-890: VerificationTimeMs – number of milliseconds from adding partition info to the manager to the time the response is sent. This will include the round trip to the transaction coordinator if it is called. This will also account for verifications that fail before the coordinator is called. VerificationFailureRate – rate of verifications that returned in failure either from the AddPartitionsToTxn response or through errors in the manager. AddPartitionsToTxnVerification metrics – separating the verification request metrics from the typical add partitions ones similar to how fetch replication and fetch consumer metrics are separated. Reviewers: Divij Vaidya <diviv@amazon.com>	2023-06-28 09:00:37 -07:00
Chia-Ping Tsai	12005484af	MINOR: fix flaky ZkMigrationIntegrationTest.testNewAndChangedTopicsInDualWrite (#13902 ) Reviewers: David Arthur <mumrah@gmail.com>	2023-06-28 22:45:26 +08:00
Jeff Kim	1dbcb7da9e	KAFKA-14694: RPCProducerIdManager should not wait on new block (#13267 ) RPCProducerIdManager initiates an async request to the controller to grab a block of producer IDs and then blocks waiting for a response from the controller. This is done in the request handler threads while holding a global lock. This means that if many producers are requesting producer IDs and the controller is slow to respond, many threads can get stuck waiting for the lock. This patch aims to: * resolve the deadlock scenario mentioned above by not waiting for a new block and returning an error immediately * remove synchronization usages in RpcProducerIdManager.generateProducerId() * handle errors returned from generateProducerId() so that KafkaApis does not log unexpected errors * confirm producers backoff before retrying * introduce backoff if manager fails to process AllocateProducerIdsResponse Reviewers: Artem Livshits <alivshits@confluent.io>, Jason Gustafson <jason@confluent.io>	2023-06-22 10:19:39 -07:00
David Arthur	1bf7039999	KAFKA-15098 Allow authorizers to be configured in ZK migration (#13895 ) Reviewers: Ron Dagostino <rdagostino@confluent.io>	2023-06-22 09:34:49 -04:00
Divij Vaidya	88e784f7c6	KAFKA-15084: Remove lock contention from RemoteIndexCache (#13850 ) Use thread safe Caffeine to cache indexes fetched from RemoteTier locally. This PR removes a lock contention that led to higher fetch latencies as the IO threads spent time unnecessarily waiting on global cache lock while a single thread fetches the index from remote tier. See PR #13850 for details and rejected alternatives. Reviewers: Luke Chen <showuon@gmail.com>, Satish Duggana <satishd@apache.org>	2023-06-21 18:22:49 +02:00
hudeqi	d5dafe22fe	MINOR:Fill missing parameter annotations for LogCleaner methods (#13839 ) Reviewers: Josep Prat <josep.prat@aiven.io> --------- Co-authored-by: Deqi Hu <deqi.hu@shopee.com>	2023-06-21 15:54:32 +02:00
Joseph (Ting-Chou) Lin	72503904e8	MINOR: Log lastCaughtUpTime on ISR shrinkage (#13187 ) Reviewers: Divij Vaidya <diviv@amazon.com>	2023-06-21 10:15:50 +02:00
Divij Vaidya	dd25753aa2	MINOR: Close ReplicaManager correctly in ReplicaManagerTest (#13868 ) Fixes thread leaks by closing the ReplicaManager using try/finally at the end of each test. The leaks were leading to flaky test failures in ReplicaManagerTest. Reviewers: Justine Olshan <jolshan@confluent.io>, David Jacot <djacot@confluent.io>	2023-06-21 09:55:03 +02:00
minjian.cai	3d97743c67	MINOR: Fix some typos for core (#13882 ) Reviewers: Divij Vaidya <diviv@amazon.com>	2023-06-20 22:52:39 +02:00
Ismael Juma	dfaae317b8	MINOR: Upgrade Scala for Java 20/21 support (#13840 ) Upgrade to Scala 2.13.11 and Scala 2.12.18. A minor test change was required to fix compilation with Scala 2.13.11. Scala 2.13 release notes: * https://github.com/scala/scala/releases/tag/v2.13.11 Scala 2.12 release notes: * https://github.com/scala/scala/releases/tag/v2.12.16 * https://github.com/scala/scala/releases/tag/v2.12.17 * https://github.com/scala/scala/releases/tag/v2.12.18 Reviewers: Justine Olshan <jolshan@confluent.io>, Josep Prat <josep.prat@aiven.io>	2023-06-20 10:29:23 -07:00
Dimitar Dimitrov	b100f1efac	KAFKA-15087 Move/rewrite InterBrokerSendThread to server-commons (#13856 ) The Java rewrite is kept relatively close to the Scala original to minimize potential newly introduced bugs and to make reviewing simpler. The following details might be of note: - The `Logging` trait moved to InterBrokerSendThread with the rewrite of ShutdownableThread has been similarly moved to any subclasses that currently use it. InterBrokerSendThread's own logging has been made to use ShutdownableThread's logger which mimics the prefix/log identifier that the trait provided. - The case RequestAndCompletionHandler class has been made a separate POJO class and the internal-use UnsentRequests class has been kept as a static nested class. - The relatively commonly used but internal (not part of the public API) clients classes that InterBrokerSendThread relies on have been allowlisted in the server-common import control. - The accompanying test class has also been moved and rewritten with one new test added and most of the pre-existing tests made stricter. Reviewers: David Jacot <djacot@confluent.io>	2023-06-20 16:50:46 +02:00
Colin P. McCabe	cd3c0ab1a3	KAFKA-15060: fix the ApiVersionManager interface This PR expands the scope of ApiVersionManager a bit to include returning the current MetadataVersion and features that are in effect. This is useful in general because that information needs to be returned in an ApiVersionsResponse. It also allows us to fix the ApiVersionManager interface so that all subclasses implement all methods of the interface. Having subclasses that don't implement some methods is dangerous because they could cause exceptions at runtime in unexpected scenarios. On the KRaft controller, we were previously performing a read operation in the QuorumController thread to get the current metadata version and features. With this PR, we now read a volatile variable maintained by a separate MetadataVersionContextPublisher object. This will improve performance and simplify the code. It should not change the guarantees we are providing; in both the old and new scenarios, we need to be robust against version skew scenarios during updates. Add a Features class which just has a 3-tuple of metadata version, features, and feature epoch. Remove MetadataCache.FinalizedFeaturesAndEpoch, since it just duplicates the Features class. (There are some additional feature-related classes that can be consolidated in in a follow-on PR.) Create a java class, EndpointReadyFutures, for managing the futures associated with individual authorizer endpoints. This avoids code duplication between ControllerServer and BrokerServer and makes this code unit-testable. Reviewers: David Arthur <mumrah@gmail.com>, dengziming <dengziming1993@gmail.com>, Luke Chen <showuon@gmail.com>	2023-06-19 16:46:44 -07:00
Joobi S B	f4981790c4	KAFKA-15085: Make Timer.java implement AutoCloseable (#13872 ) Change Timer.java to implement AutoCloseable because automatic bug finders will flag a warning if an object of a class is marked as AutoCloseable but is not closed properly in the code. Reviewers: Divij Vaidya <diviv@amazon.com>	2023-06-19 15:50:30 +02:00
Alexandre Garnier	546b912b83	MINOR: Add and use new method TestUtils.tempPropertiesFile() (#12976 ) Reviewers: Divij Vaidya <diviv@amazon.com>	2023-06-19 13:09:10 +02:00
Manyanda Chitimbo	9b7f7e0fa0	MINOR: update LogCleaner.scala javadoc with a link to OffsetMap to help with code navigation in IDE (#13866 ) Reviewers: Divij Vaidya <diviv@amazon.com>	2023-06-19 11:05:07 +02:00
David Jacot	ce7758f3f3	MINOR: Fix testRackAwareRangeAssignor, second try (#13863 ) This test still fails regularly with the following error: ``` Error java.util.concurrent.ExecutionException: org.opentest4j.AssertionFailedError: Timed out while awaiting expected assignment Set(topicWithAllPartitionsOnAllRacks-0, topicWithSingleRackPartitions-0). The current assignment is [] Stacktrace java.util.concurrent.ExecutionException: org.opentest4j.AssertionFailedError: Timed out while awaiting expected assignment Set(topicWithAllPartitionsOnAllRacks-0, topicWithSingleRackPartitions-0). The current assignment is [] at java.base/java.util.concurrent.FutureTask.report(FutureTask.java:122) at java.base/java.util.concurrent.FutureTask.get(FutureTask.java:205) at integration.kafka.server.FetchFromFollowerIntegrationTest.$anonfun$testRackAwareRangeAssignor$9(FetchFromFollowerIntegrationTest.scala:211) at integration.kafka.server.FetchFromFollowerIntegrationTest.$anonfun$testRackAwareRangeAssignor$9$adapted(FetchFromFollowerIntegrationTest.scala:211) at scala.collection.IterableOnceOps.foreach(IterableOnce.scala:575) at scala.collection.IterableOnceOps.foreach$(IterableOnce.scala:573) ``` I propose to increase the timeouts to 30 secs to mitigate it. The test already uses 30 secs timeouts in many places. This patch uses 30 secs everywhere. This solution is not optimal but this is better than having a flaky test. Reviewers: Justine Olshan <jolshan@confluent.io>	2023-06-19 08:43:42 +02:00
David Jacot	48903c0a9f	MINOR: Make offsets topic creation more reliable in tests (zk mode) (#13848 ) I have seen failures like the following one in a few builds: ``` Build / JDK 11 and Scala 2.13 / testDescribeSimpleConsumerGroup() – kafka.admin.DescribeConsumerGroupTest org.apache.kafka.common.errors.TopicExistsException: Topic '__consumer_offsets' already exists. ``` Many tests still use `TestUtils.createOffsetsTopic(zkClient, servers)` to create the offsets topic. This method does not handle the case where the topic exists (e.g. in the case of a retry). This patch adds this logic. Reviewers: Divij Vaidya <diviv@amazon.com>, Justine Olshan <jolshan@confluent.io>	2023-06-19 08:38:45 +02:00
hudeqi	09e8adb330	MINOR:Optimize the use of metrics in ReplicaManager and remove checks (#13705 ) Co-authored-by: Deqi Hu <deqi.hu@shopee.com> Reviewers: Divij Vaidya <diviv@amazon.com>, Manyanda Chitimbo <manyanda.chitimbo@gmail.com>, Kirk True <ktrue@confluent.io>	2023-06-17 10:44:56 +02:00
Divij Vaidya	b10beaae77	MINOR: Add more information in assertion failure for non daemon threads (#13858 ) Reviewers: Luke Chen <showuon@gmail.com>	2023-06-16 15:17:57 +02:00

1 2 3 4 5 ...

4260 Commits