kafka

Commit Graph

Author	SHA1	Message	Date
Johnny Hsu	bf3f088c94	KAFKA-16341 fix the LogValidator for non-compressed type (#15476 ) - Fix the verifying logic. If it's LOG_APPEND_TIME, we choose the offset of the first record. Else, we choose the record with the maxTimeStamp. - rename the shallowOffsetOfMaxTimestamp to offsetOfMaxTimestamp Reviewers: Jun Rao <junrao@gmail.com>, Luke Chen <showuon@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2024-03-19 23:00:30 +08:00
Luke Chen	834efa6606	KAFKA-16342 fix getOffsetByMaxTimestamp for compressed records (#15474 ) Fix getOffsetByMaxTimestamp for compressed records. This PR adds: 1) For inPlaceAssignment case, compute the correct offset for maxTimestamp when traversing the batch records, and set to ValidationResult in the end, instead of setting to last offset always. 2) For not inPlaceAssignment, set the offsetOfMaxTimestamp for the log create time, like non-compressed, and inPlaceAssignment cases, instead of setting to last offset always. 3) Add tests to verify the fix. Reviewers: Jun Rao <junrao@apache.org>, Chia-Ping Tsai <chia7712@gmail.com>	2024-03-15 06:09:45 +08:00
Johnny Hsu	3fcaa9ccc0	MINOR: remove the copy constructor of LogSegment (#15488 ) In the LogSegment, the copy constructor is only used in LogLoaderTest Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-03-10 03:06:41 +08:00
John Yu	554fa57af8	KAFKA-16209 : fetchSnapshot might return null if topic is created before v2.8 (#15444 ) Change the function with a better way to deal with the NULL pointer exception. Reviewers: Luke Chen <showuon@gmail.com>	2024-03-06 09:00:58 +08:00
Nikolay	eea369af94	KAFKA-14588 Log cleaner configuration move to CleanerConfig (#15387 ) In order to move ConfigCommand to tools we must move all it's dependencies which includes KafkaConfig and other core classes to java. This PR moves log cleaner configuration to CleanerConfig class of storage module. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2024-03-05 18:11:56 +08:00
John Yu	1bb9a85174	MINOR: Remove the space between two words (#15439 ) Remove the space between two words Reviewers: Luke Chen <showuon@gmail.com>	2024-02-29 08:14:35 +08:00
Satish Duggana	fc8b644e56	MINOR Removed unused CommittedOffsetsFile class. (#15209 ) `CommittedOffsetsFile` can be introduced when it is required for enhancing TBRLMM to consume from a specific offset when snapshots are implemented. Reviewers: Kamal Chandraprakash<kamal.chandraprakash@gmail.com>, Divij Vaidya <diviv@amazon.com>	2024-02-12 17:35:01 +05:30
Jorge Esteban Quilcate Otoya	b25c96a915	KAFKA-16229: Fix slow expired producer id deletion (#15324 ) Expiration of ProducerIds is implemented with a slow removal of map keys: producers.keySet().removeAll(keys); Unnecessarily going through all producer ids and then throw all expired keys to be removed. This leads to exponential time on worst case when most/all keys need to be removed: Benchmark (numProducerIds) Mode Cnt Score Error Units ProducerStateManagerBench.testDeleteExpiringIds 100 avgt 3 9164.043 ± 10647.877 ns/op ProducerStateManagerBench.testDeleteExpiringIds 1000 avgt 3 341561.093 ± 20283.211 ns/op ProducerStateManagerBench.testDeleteExpiringIds 10000 avgt 3 44957983.550 ± 9389011.290 ns/op ProducerStateManagerBench.testDeleteExpiringIds 100000 avgt 3 5683374164.167 ± 1446242131.466 ns/op A simple fix is to use map#remove(key) instead, leading to a more linear growth: Benchmark (numProducerIds) Mode Cnt Score Error Units ProducerStateManagerBench.testDeleteExpiringIds 100 avgt 3 5779.056 ± 651.389 ns/op ProducerStateManagerBench.testDeleteExpiringIds 1000 avgt 3 61430.530 ± 21875.644 ns/op ProducerStateManagerBench.testDeleteExpiringIds 10000 avgt 3 643887.031 ± 600475.302 ns/op ProducerStateManagerBench.testDeleteExpiringIds 100000 avgt 3 7741689.539 ± 3218317.079 ns/op Flamegraph of the CPU usage at dealing with expiration when producers ids ~1Million: Reviewers: Justine Olshan <jolshan@confluent.io>	2024-02-09 17:17:17 -08:00
Divij Vaidya	65424ab484	MINOR: New year code cleanup - include final keyword (#15072 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Sagar Rao <sagarmeansocean@gmail.com>	2024-01-11 17:53:35 +01:00
Christo Lolov	d4f3bf93d3	KAFKA-16014: Implement RemoteLogSizeBytes (#15050 ) This pull request aims to implement RemoteLogSizeBytes from KIP-963. Reviewers: Kamal Chandraprakash <kamal.chandraprakash@gmail.com>, Satish Duggana <satishd@apache.org>, Luke Chen <showuon@gmail.com>	2023-12-22 15:00:44 +08:00
Christo Lolov	1a97de2fe6	KAFKA-16002: Implement RemoteCopyLagSegments, RemoteDeleteLagBytes and RemoteDeleteLagSegments (#15005 ) This pull request aims to implement RemoteCopyLagSegments, RemoteDeleteLagBytes and RemoteDeleteLagSegments from KIP-963. Reviewers: Luke Chen <showuon@gmail.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	2023-12-21 14:27:12 +08:00
Luke Chen	4e11de00a7	KAFKA-16014: Add RemoteLogMetadataCount metric (#15026 ) Reviewers: Christo Lolov <lolovc@amazon.com>, Kamal Chandraprakash<kamal.chandraprakash@gmail.com>, Satish Duggana <satishd@apache.org>	2023-12-20 14:21:30 +05:30
Gantigmaa Selenge	7b21da9712	KAFKA-15158: Add metrics for RemoteDelete and BuildRemoteLogAuxState (#14375 ) This PR implements part of KIP-963, specifically for adding new metrics. The metrics added in this PR are: RemoteDeleteRequestsPerSec (emitted when expired log segments on remote storage being deleted) RemoteDeleteErrorsPerSec (emitted when failed to delete expired log segments on remote storage) BuildRemoteLogAuxStateRequestsPerSec (emitted when building remote log aux state for replica fetchers) BuildRemoteLogAuxStateErrorsPerSec (emitted when failed to build remote log aux state for replica fetchers) Reviewers: Luke Chen <showuon@gmail.com>, Nikhil Ramakrishnan <ramakrishnan.nikhil@gmail.com>, Christo Lolov <lolovc@amazon.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>, Divij Vaidya <diviv@amazon.com>, Satish Duggana <satishd@apache.org>	2023-12-19 15:02:45 +08:00
Luke Chen	c240993be2	KAFKA-16014: Add RemoteLogSizeComputationTime metric (#15021 ) Reviewers: Satish Duggana <satishd@apache.org>, Kamal Chandraprakash<kamal.chandraprakash@gmail.com>, Christo Lolov <lolovc@amazon.com>	2023-12-18 21:39:43 +05:30
Christo Lolov	a87e86e015	KAFKA-15883: Implement RemoteCopyLagBytes (#14832 ) This pull request implements the first in the list of metrics in KIP-963: Additional metrics in Tiered Storage. Since each partition of a topic will be serviced by its own RLMTask we need an aggregator object for a topic. The aggregator object in this pull request is BrokerTopicAggregatedMetric. Since the RemoteCopyLagBytes is a gauge I have introduced a new GaugeWrapper. The GaugeWrapper is used by the metrics collection system to interact with the BrokerTopicAggregatedMetric. The RemoteLogManager interacts with the BrokerTopicAggregatedMetric directly. Reviewers: Luke Chen <showuon@gmail.com>, Satish Duggana <satishd@apache.org>, Kamal Chandraprakash<kamal.chandraprakash@gmail.com>	2023-12-14 09:21:37 +08:00
Okada Haruki	d71d0639d9	KAFKA-15046: Get rid of unnecessary fsyncs inside UnifiedLog.lock to stabilize performance (#14242 ) While any blocking operation under holding the UnifiedLog.lock could lead to serious performance (even availability) issues, currently there are several paths that calls fsync(2) inside the lock In the meantime the lock is held, all subsequent produces against the partition may block This easily causes all request-handlers to be busy on bad disk performance Even worse, when a disk experiences tens of seconds of glitch (it's not rare in spinning drives), it makes the broker to unable to process any requests with unfenced from the cluster (i.e. "zombie" like status) This PR gets rid of 4 cases of essentially-unnecessary fsync(2) calls performed under the lock: (1) ProducerStateManager.takeSnapshot at UnifiedLog.roll I moved fsync(2) call to the scheduler thread as part of existing "flush-log" job (before incrementing recovery point) Since it's still ensured that the snapshot is flushed before incrementing recovery point, this change shouldn't cause any problem (2) ProducerStateManager.removeAndMarkSnapshotForDeletion as part of log segment deletion This method calls Utils.atomicMoveWithFallback with needFlushParentDir = true internally, which calls fsync. I changed it to call Utils.atomicMoveWithFallback with needFlushParentDir = false (which is consistent behavior with index files deletion. index files deletion also doesn't flush parent dir) This change shouldn't cause problems neither. (3) LeaderEpochFileCache.truncateFromStart when incrementing log-start-offset This path is called from deleteRecords on request-handler threads. Here, we don't need fsync(2) either actually. On unclean shutdown, few leader epochs might be remained in the file but it will be handled by LogLoader on start-up so not a problem (4) LeaderEpochFileCache.truncateFromEnd as part of log truncation Likewise, we don't need fsync(2) here, since any epochs which are untruncated on unclean shutdown will be handled on log loading procedure Reviewers: Luke Chen <showuon@gmail.com>, Divij Vaidya <diviv@amazon.com>, Justine Olshan <jolshan@confluent.io>, Jun Rao <junrao@gmail.com>	2023-11-29 09:43:44 -08:00
Kamal Chandraprakash	20b0bf063b	MINOR: Fix the flaky TBRLMM `testInternalTopicExists` test (#14840 ) The internal topic creation is asynchronous so the test gets flaky. To fix the test flakiness and in this test I want to assert that doesTopicExist should return true when a topic exists, so created a dummy internal topic. Reviewers: Luke Chen <showuon@gmail.com>, Jun Rao <jun@confluent.io>, Satish Duggana <satishd@apache.org>	2023-11-29 10:50:22 +08:00
Kamal Chandraprakash	fade3d10ea	KAFKA-15047: Roll active segment when it breaches the retention policy (#14766 ) Roll the active segment and offload it to remote storage once it breaches the retention time policy. A segment is eligible for deletion once it gets uploaded to the remote storage. We have checks to allow only the passive segments to be uploaded, so the active segment never gets removed at all even if breaches the retention time. For low-throughput/stale topics, the active segment can hold the data beyond the configured retention time by the user. Reviewers: Satish Duggana <satishd@apache.org>, Christo Lolov <lolovc@amazon.com>	2023-11-28 09:38:11 +05:30
Kamal Chandraprakash	a42c846336	KAFKA-15241: Compute tiered copied offset by keeping the respective epochs in scope (#14787 ) "findHighestRemoteOffset" does not take into account the leader-epoch end offset. This can cause log divergence between the local and remote log segments when there is unclean leader election. To handle it correctly, the logic to find the highest remote offset can be updated to: find-highest-remote-offset = min(end-offset-for-epoch-in-the-checkpoint, highest-remote-offset-for-epoch) Discussion thread: https://github.com/apache/kafka/pull/14004#discussion_r1266864272 Reviewers: Satish Duggana <satishd@apache.org>, Christo Lolov <lolovc@amazon.com>	2023-11-27 09:10:46 +05:30
Jason Gustafson	e905ef1edf	MINOR: Small LogValidator clean ups (#14697 ) This patch contains a few small clean-ups in LogValidator and associated classes: 1. Set shallowOffsetOfMaxTimestamp consistently as the last offset in the batch for v2 compressed and non-compressed data. 2. Rename `RecordConversionStats` to `RecordValidationStats` since one of its fields `temporaryMemoryBytes` does not depend on conversion. 3. Rename `batchIndex` in `recordIndex` in loops over the records in each batch inside `LogValidator`. Reviewers: Qichao Chu <5326144+ex172000@users.noreply.github.com>, Jun Rao <junrao@gmail.com>	2023-11-20 10:40:45 -08:00
Kamal Chandraprakash	feee616f73	MINOR: Query before creating the internal remote log metadata topic (#14755 ) When a node starts (or) restarts, then we send a CREATE_TOPICS request to the controller to create the internal __remote_log_metadata topic. Topic creation event is costly and handled by the controller. During re-balance, the controller can have pending requests in its queue and can lead to CREATE_TOPICS timeout. Instead of firing the CREATE_TOPICS request when a node restarts, send a METADATA request (topic describe) which is handled by the least loaded node before sending a request to create the topic. Reviewers: Satish Duggana <satishd@apache.org>, Christo Lolov <lolovc@amazon.com>	2023-11-20 14:50:11 +05:30
David Mao	c6ea0a84ab	KAFKA-15780: Wait for consistent KRaft metadata when creating or deleting topics (#14695 ) TestUtils.createTopicWithAdmin calls waitForAllPartitionsMetadata which waits for partition(s) to be present in each brokers' metadata cache. This is a sufficient check in ZK mode because the controller sends an LISR request before sending an UpdateMetadataRequest which means that the partition in the ReplicaManager will be updated before the metadata cache. In KRaft mode, the metadata cache is updated first, so the check may return before partitions and other metadata listeners are fully initialized. Testing: Insert a Thread.sleep(100) in BrokerMetadataPublisher.onMetadataUpdate after // Publish the new metadata image to the metadata cache. metadataCache.setImage(newImage) and run EdgeCaseRequestTest.testProduceRequestWithNullClientId and the test will fail locally nearly deterministically. After the change(s), the test no longer fails. Reviewers: Justine Olshan <jolshan@confluent.io>	2023-11-06 17:07:56 -08:00
gongzhongqiang	d682b15eeb	KAFKA-15769: Fix logging with exception trace (#14683 ) Reviewers: Divij Vaidya <diviv@amazon.com>, hudeqi <1217150961@qq.com>	2023-11-06 11:02:05 +01:00
Alok Thatikunta	eca8502990	KAFKA-14484: [1/N] Move PartitionMetadataFile to storage module (#14607 ) This PR moves PartitionMetadataFile to the storage module. Existing unit tests in UnifiedLogTest like testLogFlushesPartitionMetadataOnAppend should suffice. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	2023-11-01 09:40:45 -07:00
Kamal Chandraprakash	57fd8f4c36	KAFKA-15632: Drop the invalid remote log metadata events (#14576 ) Reviewers: Christo Lolov <lolovc@amazon.com>, Luke Chen <showuon@gmail.com>, Satish Duggana <satishd@apache.org>	2023-10-31 15:21:33 +05:30
Calvin Liu	8f8ad6db38	KAFKA-15582: Move the clean shutdown file to the storage package (#14603 ) A follow-up change to move the clean shutdown file to the storage package. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	2023-10-30 16:27:40 -07:00
Jotaniya Jeel	4612fe42af	KAFKA-15481: Fix concurrency bug in RemoteIndexCache (#14483 ) RemoteIndexCache has a concurrency bug which leads to IOException while fetching data from remote tier. The bug could be reproduced as per the following order of events:- Thread 1 (cache thread): invalidates the entry, removalListener is invoked async, so the files have not been renamed to "deleted" suffix yet. Thread 2: (fetch thread): tries to find entry in cache, doesn't find it because it has been removed by 1, fetches the entry from S3, writes it to existing file (using replace existing) Thread 1: async removalListener is invoked, acquires a lock on old entry (which has been removed from cache), it renames the file to "deleted" and starts deleting it Thread 2: Tries to create in-memory/mmapped index, but doesn't find the file and hence, creates a new file of size 2GB in AbstractIndex constructor. JVM returns an error as it won't allow creation of 2GB random access file. This commit fixes the bug by using EvictionListener instead of RemovalListener to perform the eviction atomically with the file rename. It handles the manual removal (not handled by EvictionListener) by using computeIfAbsent() and enforcing atomic cache removal & file rename. Reviewers: Luke Chen <showuon@gmail.com>, Divij Vaidya <diviv@amazon.com>, Arpit Goyal <goyal.arpit.91@gmail.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	2023-10-23 14:50:46 +02:00
Justine Olshan	e8c8969330	KAFKA-15626: Replace verification guard object with an specific type (#14568 ) I've added a new class with an incrementing atomic long to represent the verification guard. Upon creation of verification guard, we will increment this value and assign it to the guard. The expected behavior is the same as the object guard, but with better debuggability with the string value and type safety (I found a type safety issue in the current code when implementing this) Reviewers: Ismael Juma <ismael@juma.me.uk>, Artem Livshits <alivshits@confluent.io>	2023-10-20 14:26:20 -07:00
Arpit Goyal	dc6a53e196	MINOR: Rename lock variable of the entry class (#14569 ) The RemoteIndexCache has a variable lock and the child class also have a variable lock in the same class file. Renaming lock of the entry(child class) to avoid confusion. Reviewers: Luke Chen <showuon@gmail.com>, hudeqi <1217150961@qq.com>	2023-10-18 18:20:55 +08:00
Ismael Juma	1073d434ec	KAFKA-14481: Move LogSegment/LogSegments to storage module (#14529 ) A few notes: * Delete a few methods from `UnifiedLog` that were simply invoking the related method in `LogFileUtils` * Fix `CoreUtils.swallow` to use the passed in `logging` * Fix `LogCleanerParameterizedIntegrationTest` to close `log` before reopening * Minor tweaks in `LogSegment` for readability For broader context on this change, please check: * KAFKA-14470: Move log layer to storage module Reviewers: Divij Vaidya <diviv@amazon.com>, Satish Duggana <satishd@apache.org>	2023-10-16 06:37:30 -07:00
hudeqi	b0b8693c72	KAFKA-15536: Dynamically resize remoteIndexCache (#14511 ) Dynamically resize remoteIndexCache Reviewers: Christo Lolov <lolovc@amazon.com>, Luke Chen <showuon@gmail.com>, Divij Vaidya <diviv@amazon.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	2023-10-16 15:24:36 +08:00
Matthias J. Sax	dc0f0db864	MINOR: fix typo (#14542 ) Reviewers: Bruno Cadonna <bruno@confluent.io>	2023-10-13 09:28:00 -07:00
Arpit Goyal	99ce2e081c	KAFKA-15169: Added TestCase in RemoteIndexCache (#14482 ) est Cases Covered 1. Index Files already exist on disk but not in Cache i.e. RemoteIndexCache should not call remoteStorageManager to fetch it instead cache it from the local index file present. 2. RSM returns CorruptedIndex File i.e. RemoteIndexCache should throw CorruptedIndexException instead of successfull execution. 3. Deleted Suffix Indexes file already present on disk i.e. If cleaner thread is slow , then there is a chance of deleted index files present on the disk while in parallel same index Entry is invalidated. To understand more refer https://issues.apache.org/jira/browse/KAFKA-15169 Reviewers: Divij Vaidya <diviv@amazon.com>, Luke Chen <showuon@gmail.com>, Kamal Chandraprakash<kamal.chandraprakash@gmail.com>	2023-10-11 10:58:17 +08:00
hudeqi	1c3eb4395a	KAFKA-14912:Add a dynamic config for remote index cache size (#14381 ) Reviewers: Luke Chen <showuon@gmail.com>, Satish Duggana <satishd@apache.org>, Kamal Chandraprakash<kamal.chandraprakash@gmail.com>, Divij Vaidya <diviv@amazon.com>, Subhrodip Mohanta <hello@subho.xyz>	2023-10-08 13:24:09 +05:30
David Mao	2c925e9f33	KAFKA-15526: Simplify the LogAppendInfo class (#14470 ) The LogAppendInfo class is a bit bloated in terms of class fields. That's because it is used as an umbrella class for both leader log appends and follower log appends and needs to carry fields for both. This makes the constructor for the class a bit cludgy to use. It also ends up being a bit confusing when fields are important and when they aren't. I noticed there were a few fields that didn't seem necessary. Below is a description of changes: firstOffset is a LogOffsetMetadata but there are no readers of the field that use anything but the messageOffset field - simplified to a long. LogAppendInfo.errorMessage is only set in one context - when calling LogAppendInfo.unknownLogAppendInfoWithAdditionalInfo. When we use this constructor, we pass up the original exception in LogAppendResult anyway, so the field is redundant with the LogAppendResult.exception field. This allows us to simplify the handling in KAFKA-15459: Convert coordinator retriable errors to a known producer response error #14378 since there are no custom error messages we just return whatever is in the exception message. We only use targetCompressionType when constructing the LogValidator - just inline the call instead of including it in the LogAppendInfo. offsetsMonotonic is only used when not assigning offsets to throw an exception - just throw the exception instead of setting a field to throw later. shallowCount is only there to determine whether there are any messages in the append. Instead, we can just check validBytes which is incremented with a non-zero value every time we increment shallowCount. Reviewers: Justine Olshan <jolshan@confluent.io>	2023-10-03 17:32:44 -07:00
iit2009060	13b119aa62	KAFKA-15511: Handle CorruptIndexException in RemoteIndexCache (#14459 ) A bug in the RemoteIndexCache leads to a situation where the cache does not replace the corrupted index with a new index instance fetched from remote storage. This commit fixes the bug by adding correct handling for `CorruptIndexException`. Reviewers: Divij Vaidya <diviv@amazon.com>, Satish Duggana <satishd@apache.org>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>, Alexandre Dupriez <duprie@amazon.com>	2023-09-29 12:26:46 +02:00
Kamal Chandraprakash	aa399a335f	KAFKA-15499: Fix the flaky DeleteSegmentsDueToLogStartOffsetBreach test (#14439 ) DeleteSegmentsDueToLogStartOffsetBreach configures the segment such that it can hold at-most 2 record-batches. And, it asserts that the local-log-start-offset based on the assumption that each segment will contain exactly two messages. During leader switch, the segment can get rotated and may not always contain two records. Previously, we were checking whether the expected local-log-start-offset is equal to the base-offset-of-the-first-local-log-segment. With this patch, we will scan the first local-log-segment for the expected offset. Reviewers: Divij Vaidya <diviv@amazon.com>	2023-09-28 15:04:37 +02:00
Ismael Juma	98febb989a	KAFKA-15485: Fix "this-escape" compiler warnings introduced by JDK 21 (1/N) (#14427 ) This is one of the steps required for kafka to compile with Java 21. For each case, one of the following fixes were applied: 1. Suppress warning if fixing would potentially result in an incompatible change (for public classes) 2. Add final to one or more methods so that the escape is not possible 3. Replace method calls with direct field access. In addition, we also fix a couple of compiler warnings related to deprecated references in the `core` module. See the following for more details regarding the new lint warning: https://www.oracle.com/java/technologies/javase/21-relnote-issues.html#JDK-8015831 Reviewers: Divij Vaidya <diviv@amazon.com>, Satish Duggana <satishd@apache.org>, Chris Egerton <chrise@aiven.io>	2023-09-24 05:59:29 -07:00
Wuzhengyu97	fcd382138e	MINOR: Used Admin instead of AdminClient to create Admin (#14411 ) Used Admin instead of AdminClient to create Admin Reviewers: Ziming Deng <dengziming1993@gmail.com>	2023-09-21 11:01:08 +08:00
Luke Chen	c00c5b1b66	MINOR: Add verification for (local) log start offset in txn tests (#14401 ) In this PR, we noticed failed tests caused by the verification of log start offset and local log start offset. After investigation: #14347 (comment) https://ci-builds.apache.org/job/Kafka/job/kafka-pr/job/PR-14347/13/#showFailuresLink After investigation, I found it's because the scala 2.12 cannot recognize the override method of maybeWaitForAtLeastOneSegmentUpload since it's using varargs in scala. I think there must be some bugs/gaps between java/scala that causes these issue. We can fix it by not using varargs, instead, using the Seq. This PR adds back the log start offset and local log start offset verification, and make sure all tests passed. Reviewers: Satish Duggana <satishd@apache.org>, Divij Vaidya <diviv@amazon.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	2023-09-19 20:46:09 +08:00
Kamal Chandraprakash	dacb3b31d9	KAFKA-15439: Transactions test with tiered storage (#14347 ) This test extends the existing TransactionsTest. It configures the broker and topic with tiered storage and expects at-least one log segment to be uploaded to the remote storage. Reviewers: Luke Chen <showuon@gmail.com>, Satish Duggana <satishd@apache.org>, Divij Vaidya <diviv@amazon.com>	2023-09-14 09:52:13 +08:00
Luke Chen	8a7e5e8ea0	MINOR: Fix errors in javadoc and docs in tiered storage (#14379 ) Reviewers: Satish Duggana <satishd@apache.org>	2023-09-13 12:45:36 +05:30
Luke Chen	4be174f5c1	MINOR: reduce default RLMM retry interval (#14374 ) Reduce default remote.log.metadata.initialization.retry.interval.ms value to 100ms. Reviewers: Satish Duggana <satishd@apache.org>, Kamal Chandraprakash<kamal.chandraprakash@gmail.com>	2023-09-12 22:06:31 +05:30
Abhijeet Kumar	a63bf93dce	KAFKA-14993: Improve TransactionIndex instance handling while copying to and fetching from RSM (#14363 ) - Updated the contract for RSM's fetchIndex to throw a ResourceNotFoundException instead of returning an empty InputStream when it does not have a TransactionIndex. - Updated the LocalTieredStorage implementation to adhere to the new contract. - Added Unit Tests for the change. Reviewers: Satish Duggana <satishd@apache.org>, Luke Chen <showuon@gmail.com>, Divij Vaidya <diviv@amazon.com>, Christo Lolov <lolovc@amazon.com>, Kamal Chandraprakash<kamal.chandraprakash@gmail.com>	2023-09-12 17:54:20 +05:30
Kamal Chandraprakash	672ea644f0	MINOR: Removed the RSM and RLMM classpath config validator (#14358 ) - RSM and RLMM classpath can be empty since it's optional so removed the non-empty string validator - Fix getting the `localTieredStorage` by brokerId after stopping a broker. Reviewers: Christo Lolov <lolovc@amazon.com>, Luke Chen <showuon@gmail.com>, Satish Duggana <satishd@apache.org>	2023-09-09 19:02:42 +05:30
Kamal Chandraprakash	6e818c6b02	KAFKA-15410: Delete records with tiered storage integration test (4/4) (#14330 ) * Added the integration test for DELETE_RECORDS API for tiered storage enabled topic * Added validation checks before removing remote log segments for log-start-offset breach Reviewers: Satish Duggana <satishd@apache.org>, Luke Chen <showuon@gmail.com>, Christo Lolov <lolovc@amazon.com>	2023-09-07 21:02:39 +05:30
Luke Chen	cd897e6c76	MINOR: Update the javadoc in RSM (#14352 ) Reviewers: Satish Duggana <satishd@apache.org>, Kamal Chandraprakash<kamal.chandraprakash@gmail.com>	2023-09-07 20:55:11 +05:30
Kamal Chandraprakash	6d762480c9	KAFKA-15351: Update log-start-offset after leader election for topics enabled with remote storage (#14340 ) On leadership failover, the new leader's start offset may be older than the start offset of old leader. This works fine for local storage scenario because the new leader still contains data associated with stale start offset. But in case of remote storage, although new leader has a stale offset, the data associated with it has been deleted from remote by the old leader. Hence, we end up in a situation where leader has a start offset but no data associated with it. This commit fixes the situation by ensuring that on every leadership failover, for topics with remote storage, the leader will update it's start offset from the base of first segment in current leader chain present in the remote storage (if any). Reviewers: Satish Duggana <satishd@apache.org>, Luke Chen <showuon@gmail.com>, Christo Lolov <lolovc@amazon.com>, Divij Vaidya <diviv@amazon.com>	2023-09-07 16:32:16 +02:00
Kamal Chandraprakash	80982c4ae3	KAFKA-15410: Delete topic integration test with LocalTieredStorage and TBRLMM (3/4) (#14329 ) Added delete topic integration tests for tiered storage enabled topics with LocalTieredStorage and TBRLMM Reviewers: Satish Duggana <satishd@apache.org>, Divij Vaidya <diviv@amazon.com>, Luke Chen <showuon@gmail.com>	2023-09-06 05:50:12 +05:30
Luke Chen	be0de2124a	MINOR: Update comment in consumeAction (#14335 ) Reviewers: Satish Duggana <satishd@apache.org>, Divij Vaidya <diviv@amazon.com>	2023-09-05 21:36:28 +05:30

1 2 3

148 Commits