Commit Graph

933 Commits

Author SHA1 Message Date
A. Sophie Blee-Goldman f20f299492
KAFKA-18839: Drop EAGER rebalancing support in Kafka Streams (#18988)
In 3.1 we deprecated the eager rebalancing protocol and marked it for
removal in a later release. We aim to officially drop support and remove
the protocol from Streams in 4.0.

The effect of this PR is that it will no longer be possible to perform a
live upgrade of Kafka Streams directly to 4.0 from version 2.3 or below.
Users will have to go through a bridge release between 2.4 and 3.9
instead.
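
The upgrade constraint above can be sketched as a small check. This is only an illustration and not code from the Kafka repository: the names `parse_version` and `needs_bridge_release` are hypothetical, and the version bounds are taken from the commit message.

```python
def parse_version(version: str) -> tuple:
    """Turn a dotted version string such as "2.3.1" into (2, 3)."""
    major, minor = version.split(".")[:2]
    return int(major), int(minor)


def needs_bridge_release(current: str, target: str = "4.0") -> bool:
    """Return True when a live upgrade from `current` to `target` needs a bridge release.

    Hypothetical helper: per the commit message, Kafka Streams 2.3 or below
    cannot be live-upgraded directly to 4.0 and must first move to 2.4 - 3.9.
    """
    return parse_version(target) >= (4, 0) and parse_version(current) <= (2, 3)
```

For example, `needs_bridge_release("2.3.1")` is true, while `needs_bridge_release("2.4.0")` is false because 2.4 is itself a bridge release.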

Reviewers: Matthias J. Sax <matthias@confluent.io>
2025-02-25 19:05:03 -08:00
Calvin Liu 10da082184
MINOR: update truncation test (#18952)
Reduce the minISR to 1 for the truncation test in order to skip the protection from KIP-966.

Reviewers: David Jacot <djacot@confluent.io>, Colin P. McCabe <cmccabe@apache.org>, Chia-Ping Tsai <chia7712@gmail.com>
2025-02-25 04:32:29 +08:00
Ismael Juma 48527a1e7f
MINOR: Clean-up imports, imports and unused parameter in upgrade_test.py (#19018)
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2025-02-24 06:35:07 -08:00
Ken Huang c6335c2ae8
MINOR: Fix fail e2e transactions_upgrade_test.py::TransactionsUpgradeTest.test_transactions_upgrade (#19004)
The main root cause is
3dba3125e9:
that commit removed the metadata versions older than 3.3, so this
test fails when it uses metadata version 3.1 or 3.2.

Reviewers: David Jacot <djacot@confluent.io>
2025-02-22 14:45:39 +01:00
TengYao Chi 1e9565788c
MINOR: Fix fail e2e TestUpgrade#test_combined_mode_upgrade and test_isolated_mode_upgrade (#19003)
#18845 assumed a baseline of 3.3 for server protocol versions, so
lower versions can no longer roll up to 4.0. Hence,
`TestUpgrade#test_combined_mode_upgrade` and `test_isolated_mode_upgrade`
failed for the 3.1 and 3.2 versions.
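
One way such a failure could be avoided is to restrict the upgrade matrix to source versions at or above the 3.3 baseline. The sketch below is an assumption for illustration, not the change made by this patch; `MIN_BASELINE` and `supported_upgrade_sources` are hypothetical names.

```python
MIN_BASELINE = (3, 3)  # baseline quoted in the commit message


def supported_upgrade_sources(versions):
    """Keep only the versions whose (major, minor) is at or above the baseline."""
    kept = []
    for version in versions:
        major, minor = (int(part) for part in version.split(".")[:2])
        if (major, minor) >= MIN_BASELINE:
            kept.append(version)
    return kept
```

With the input `["3.1.2", "3.2.3", "3.3.2", "3.9.0"]`, only the 3.3.2 and 3.9.0 entries survive the filter.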

e2e test results with this patch on Jenkins:
![Screenshot from 2025-02-22 13-22-17](https://github.com/user-attachments/assets/2de6f707-8281-4f30-b5d0-83dd4de9666d)
e2e test results with this patch on a local machine:
![Screenshot from 2025-02-22 13-28-16](https://github.com/user-attachments/assets/2e5e563a-1ac4-4894-ba30-593304697d1d)

Reviewers: David Jacot <djacot@confluent.io>
2025-02-22 08:53:34 +01:00
Ken Huang d820559751
MINOR: Fix fail e2e TransactionsMixedVersionsTest#test_transactions_mixed_versions (#19002)
The main root cause is
3dba3125e9:
that commit removed the metadata versions older than 3.3, so this
test fails when it uses metadata version 3.1 or 3.2.

Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, David Jacot <djacot@confluent.io>
2025-02-22 08:52:22 +01:00
Matthias J. Sax 9f23b25f6e
MINOR: fix Kafka Streams "smoke test" pass criteria (#18835)
Reviewers: Bill Bejeck <bill@confluent.io>, Bruno Cadonna <bruno@confluent.io>
2025-02-19 14:33:31 -08:00
kevin-wu24 929a90da50
KAFKA-18667 Add replication system test case for combined broker + controller failure (#18757)
This patch adds a test case to replication_test.py test_replication_with_broker_failure which validates the scenario when we have failures of a combined mode broker/controller.

Reviewers: David Arthur <mumrah@gmail.com>
2025-02-18 12:58:58 -05:00
David Jacot aec0e555be
MINOR: Mark IBP_4_0_IV3 as production ready! (#18902)
This patch marks IBP_4_0_IV3 as production ready for the Apache Kafka 4.0 release. It also introduces IBP_4_1_IV0 as the next development version.

Reviewers: Justine Olshan <jolshan@confluent.io>
2025-02-14 08:17:11 -08:00
Calvin Liu e7a2af8414
KAFKA-18634: Fix ELR metadata version issues (#18680)
This patch cleans up the places that should not use the MV to determine whether ELR is enabled, and marks 4.0IV1 stable.

Reviewers: Alyssa Huang <ahuang@confluent.io>, Colin P. McCabe <cmccabe@apache.org>
2025-02-13 23:40:31 -08:00
Chirag Wadhwa 0989a6a389
KAFKA-18756: Enabled share group configs for queues related system tests (#18829)
Reviewers: Andrew Schofield <aschofield@confluent.io>
2025-02-11 17:15:33 +00:00
David Jacot 84b639d932
MINOR: Fix reassign partitions system test (#18860)
The tests which set reassign_from_offset_zero=False have a setup phase which produces records with old timestamps to the topic and waits until they are cleaned by the retention in order to run the main phase of the test based on non-zero offsets. The setup phase did not wait long enough for the cleaning task to kick in, mainly because the scheduled task had not started yet due to log.initial.task.delay.ms being set to 30s by default. Reducing it to 5s helps to stabilize the test. The patch also changes the sleep to 12s in order to have a bit more headroom.
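
The timing argument above comes down to simple arithmetic, sketched here. The constant and function names are hypothetical; only the 30s default, the 5s override and the 12s sleep come from the commit message.

```python
# Values quoted in the commit message; names are made up for this sketch.
DEFAULT_INITIAL_TASK_DELAY_MS = 30_000
PATCHED_INITIAL_TASK_DELAY_MS = 5_000
SETUP_SLEEP_S = 12


def cleaner_has_started(initial_delay_ms: int, waited_s: float) -> bool:
    """True if the wait is longer than the delay before the scheduled log tasks first run."""
    return waited_s * 1000 > initial_delay_ms


def headroom_s(initial_delay_ms: int, waited_s: float) -> float:
    """Seconds left between the first scheduled run and the end of the wait."""
    return waited_s - initial_delay_ms / 1000
```

With the default delay a 12s wait ends before the scheduled task first runs, whereas with the 5s override the same wait leaves 7 seconds of headroom.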

```
================================================================================
SESSION REPORT (ALL TESTS)
ducktape version: 0.12.0
session_id:       2025-02-11--016
run time:         26 minutes 9.451 seconds
tests run:        12
passed:           12
flaky:            0
failed:           0
ignored:          0
================================================================================
```

Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2025-02-11 15:46:19 +01:00
David Jacot 1bebdd9fe8
MINOR: Fix log compaction system test (#18857)
`log.segment.bytes` must be greater than or equal to 1MB (KIP-1030).
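
A test helper could guard against this constraint before starting a broker. The following is a minimal sketch, assuming that 1MB means 1024 * 1024 bytes; `validate_log_segment_bytes` is a hypothetical name and not part of kafkatest.

```python
MIN_LOG_SEGMENT_BYTES = 1024 * 1024  # assumed byte value of the 1MB lower bound


def validate_log_segment_bytes(value: int) -> int:
    """Return `value` unchanged, or raise if it is below the assumed minimum."""
    if value < MIN_LOG_SEGMENT_BYTES:
        raise ValueError(
            "log.segment.bytes=%d is below the minimum of %d" % (value, MIN_LOG_SEGMENT_BYTES)
        )
    return value
```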

```
================================================================================
SESSION REPORT (ALL TESTS)
ducktape version: 0.12.0
session_id:       2025-02-10--003
run time:         55.903 seconds
tests run:        1
passed:           1
flaky:            0
failed:           0
ignored:          0
================================================================================
```

Reviewers: Divij Vaidya <diviv@amazon.com>
2025-02-11 14:50:45 +01:00
Chirag Wadhwa 7fef5b8646
KAFKA-18763: changed the assertion statement for acknowledgements to include only successful acks (#18846)
Reviewers: Andrew Schofield <aschofield@confluent.io>
2025-02-10 11:36:26 +00:00
David Jacot 676293d709
MINOR: Fix TestBounce sys test (#18798)
```
================================================================================
SESSION REPORT (ALL TESTS)
ducktape version: 0.12.0
session_id:       2025-02-04--005
run time:         4 minutes 0.023 seconds
tests run:        4
passed:           4
flaky:            0
failed:           0
ignored:          0
================================================================================
```

Reviewers: Lianet Magrans <lmagrans@confluent.io>
2025-02-04 14:49:20 +01:00
David Jacot 4c6af67eb1
MINOR: Fix PerformanceService sys test (#18797)
This patch fixes the PerformanceService system test which was still using ZK.

```
================================================================================
SESSION REPORT (ALL TESTS)
ducktape version: 0.12.0
session_id:       2025-02-04--003
run time:         1 minute 42.629 seconds
tests run:        4
passed:           4
flaky:            0
failed:           0
ignored:          0
================================================================================
```

Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2025-02-04 14:39:24 +01:00
David Jacot 17d1447f9c
MINOR: Fix Benchmark sys tests (#18796)
This patch fixes the Benchmark system tests. We misconfigured the quorum in bc7b87001b.

```
================================================================================
SESSION REPORT (ALL TESTS)
ducktape version: 0.12.0
session_id:       2025-02-04--001
run time:         57 minutes 27.169 seconds
tests run:        62
passed:           62
flaky:            0
failed:           0
ignored:          0
================================================================================
```

Reviewers: PoAn Yang <payang@apache.org>, Christo Lolov <lolovc@amazon.com>
2025-02-04 14:34:57 +01:00
PoAn Yang bc7b87001b
KAFKA-18676; Update Benchmark system tests (#18785)
Update `benchmark_test.py` to use KRaft.

```
> TC_PATHS="tests/kafkatest/benchmarks/core/benchmark_test.py" /bin/bash tests/docker/run_tests.sh

================================================================================
SESSION REPORT (ALL TESTS)
ducktape version: 0.12.0
session_id:       2025-02-03--001
run time:         96 minutes 48.900 seconds
tests run:        120
passed:           120
flaky:            0
failed:           0
ignored:          0
================================================================================
```

Reviewers: David Jacot <djacot@confluent.io>
2025-02-03 14:42:22 +01:00
PoAn Yang 5268fcdc98
KAFKA-18678 Update TestVerifiableProducer system test (#18768)
Reviewers: David Jacot <djacot@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>
2025-02-03 14:14:54 +08:00
David Jacot 0ff4dafb7d
KAFKA-18146; tests/kafkatest/tests/core/upgrade_test.py needs to be re-added as KRaft (#18766)
This patch renames kraft_upgrade_test.py to upgrade_test.py. This is enough to cover the old upgrade/downgrade tests.

Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2025-01-31 06:07:11 -08:00
TengYao Chi d7a5b877f2
KAFKA-18677; Update ConsoleConsumerTest system test (#18763)
This patch converts the ConsoleConsumerTest system test to only use KRaft.

Reviewers: David Jacot <djacot@confluent.io>
2025-01-31 12:19:49 +01:00
Bill Bejeck 20b073bbee
KAFKA-18498: Update lock ownership from main thread (#18732)
Once a StreamThread receives its assignment, it will close the startup tasks. But during the closing process, the StandbyTask.closeClean() method will eventually call the StateManagerUtil.closeStateManager method, which needs to lock the state directory, and locking requires that the calling thread be the current owner. Since the main thread grabs the lock on startup but moves on without releasing it, we need to update ownership explicitly here in order for the stream thread to close the startup task and begin processing.
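
The ownership problem can be illustrated with a small owner-tracked lock. This is a simplified Python sketch of the idea and not the Kafka Streams implementation (which is Java); the class `OwnedDirectoryLock` and its methods are hypothetical.

```python
import threading


class OwnedDirectoryLock:
    """Toy lock that records which thread owns it (hypothetical, for illustration)."""

    def __init__(self):
        self._owner = None
        self._mutex = threading.Lock()

    def lock(self) -> bool:
        """Succeed only if the lock is free or already owned by the calling thread."""
        me = threading.current_thread()
        with self._mutex:
            if self._owner is None:
                self._owner = me
            return self._owner is me

    def transfer_ownership(self, new_owner: threading.Thread) -> None:
        """Hand the lock to another thread without releasing it first."""
        with self._mutex:
            self._owner = new_owner
```

If the main thread takes the lock and never releases it, a worker thread calling `lock()` is refused until `transfer_ownership(worker)` has been called, which mirrors the explicit ownership update described above.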

Reviewers: Matthias Sax <mjsax@apache.org>, Nick Telford
2025-01-29 14:09:44 -05:00
Chirag Wadhwa c09d2c940f
KAFKA-18642: Increased the timeouts in share_consumer_test.py system tests (#18699)
Reviewers: Andrew Schofield <aschofield@confluent.io>
2025-01-25 19:23:40 +00:00
Bill Bejeck adb0332114
MINOR: Ignore streams eos test until we have an agreed upon fix. (#18664)
Ignore the streams_eos_test.py until we come up with a fix for creating local standby tasks on startup.
Reviewers: Matthias Sax <mjsax@apache.org>
2025-01-21 18:28:52 -05:00
TengYao Chi 78e35457ee
KAFKA-18480 Fix fail e2e `test_offset_truncate` (#18492)
Reviewers: TaiJuWu <tjwu1217@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>
2025-01-18 10:54:14 +08:00
Chirag Wadhwa 8cc560e954
Infrastructure for system tests for the new share consumer client (#18209)
Reviewers: Apoorv Mittal <apoorvmittal10@gmail.com>, Andrew Schofield <aschofield@confluent.io>
2025-01-17 12:03:32 +00:00
Bill Bejeck f4fca746cc
use version.py get_version to get version (#18563)
A prior commit introduced a check for the version of a node as part of the move to log4j2, but it was causing the error
AttributeError("'ClusterNode' object has no attribute 'version'"). This PR uses the get_version method from version.py, which checks whether the node has a version attribute, preventing the error.
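
The defensive lookup described above can be sketched as follows. This is not the actual `get_version` from version.py; `safe_version` and the `"dev"` default are assumptions made for the example.

```python
def safe_version(node, default="dev"):
    """Return node.version when the attribute exists and is set, otherwise `default`.

    Hypothetical stand-in for the kind of check the commit message describes.
    """
    version = getattr(node, "version", None)
    return default if version is None else version
```

Using `getattr` with a fallback avoids the `AttributeError` raised by a direct `node.version` access on objects that lack the attribute.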

Reviewers: Matthias Sax <mjsax@apache.org>
2025-01-15 15:14:24 -05:00
TaiJuWu ceee1a732e
KAFKA-18346 Fix e2e TestKRaftUpgrade for v3.3.2 (#18386)
Due to an issue with handling folders in Kafka version 3.3.2 (see https://github.com/apache/kafka/pull/13130), this end-to-end test requires using a single folder for upgrade/downgrade scenarios involving 3.3.2.

Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>
2025-01-15 20:37:55 +08:00
Kirk True 45e3c21e9a
KAFKA-17915: Convert Kafka Client system tests to use KRaft (#17669)
Reviewers: Lianet Magrans <lmagrans@confluent.io>, Chia-Ping Tsai <chia7712@gmail.com>
2025-01-14 14:47:15 +01:00
kevin-wu24 25fdcd05fc
KAFKA-17915: Convert remaining Kafka Client system tests to use KRaft (#18367)
Reviewers: Lianet Magrans <lmagrans@confluent.io>
2025-01-09 19:20:34 +01:00
TaiJuWu 5acbd42dd7
KAFKA-18277 Convert network_degrade_test to Kraft mode (#18247)
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2025-01-10 01:49:13 +08:00
Jhen-Yung Hsu f95726a211
KAFKA-18417 Remove controlled.shutdown.max.retries and controlled.shutdown.retry.backoff.ms (#18431)
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2025-01-08 17:13:42 +08:00
TaiJuWu 6737178c12
KAFKA-18280 fix e2e TestSecurityRollingUpgrade.test_rolling_upgrade_sasl_mechanism_phase_one (#18308)
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2024-12-30 22:36:47 +08:00
TengYao Chi 96527be90d
KAFKA-18243 Fix compatibility of Loggers class between log4j and log4j2 (#18185)
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2024-12-30 01:50:52 +08:00
Kuan-Po Tseng 4567f39825
KAFKA-18343: Use java_pids to implement pids (#18312)
Related to KAFKA-18343.

Currently, there is an issue that the ps ax output is truncated, which causes the Kafka process ID to be unavailable. This issue can be mitigated by replacing ps ax with jcmd (i.e. using java_pids in ducktape), as it does not suffer from the truncation problem.
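
To illustrate why a jcmd-style listing is easier to work with, here is a minimal parsing sketch. It assumes each listing line starts with the process ID followed by the main class and arguments; the function name and the sample text are invented for the example, and ducktape's own `java_pids` may be implemented differently.

```python
def java_pids_from_listing(listing: str, match: str) -> list:
    """Collect the PIDs of listing lines whose command part contains `match`."""
    pids = []
    for line in listing.splitlines():
        parts = line.split(None, 1)  # split into "<pid>" and "<rest of line>"
        if len(parts) == 2 and parts[0].isdigit() and match in parts[1]:
            pids.append(int(parts[0]))
    return pids


# Invented sample listing, used only to exercise the parser.
SAMPLE_LISTING = (
    "1234 kafka.Kafka /opt/kafka/config/server.properties\n"
    "5678 jdk.jcmd/sun.tools.jcmd.JCmd -l\n"
)
```

Calling `java_pids_from_listing(SAMPLE_LISTING, "kafka.Kafka")` picks out only the first line's process ID.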

Reviewers: Justine Olshan <jolshan@confluent.io>
2024-12-24 14:52:26 -08:00
Justine Olshan 8bd3746e0c
KAFKA-17705: Add Transactions V2 system tests and mark as production ready (#18132)
Added transaction version 2 to some of the system tests. Also marking TV2 as production ready.

Also fixes the defaultVersion test. 

Reviewers: Jun Rao <jun@confluent.io>
2024-12-21 14:01:54 -08:00
TengYao Chi 8c55dcc979
KAFKA-18237: Upgrade system tests from using 3.7.1 to 3.7.2 (#18180)
Reviewers: Matthias J. Sax <matthias@confluent.io>
2024-12-16 09:49:03 -08:00
David Jacot 84049369c1
MINOR: Bump trunk to 4.1.0-SNAPSHOT (#18213)
Reviewers: Mickael Maison <mickael.maison@gmail.com>
2024-12-16 16:47:13 +01:00
Jhen-Yung Hsu 9cc1547672
KAFKA-18247 Use log4j2 yaml to rewrite the fix of KAFKA-18145 (#18192)
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2024-12-16 20:53:12 +08:00
TengYao Chi ca7b1dcb00
MINOR: Add an example of how to run multiple test file to tests/README (#18181)
Reviewers: TaiJuWu <tjwu1217@gmail.com>, Yung <yungyung7654321@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>
2024-12-16 13:52:14 +08:00
ClarkChen a2a098de6a
KAFKA-18246 Fix ConnectRestApiTest.test_rest_api by adding multiversioning configs (#18191)
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2024-12-15 15:39:43 +08:00
Nick Guo 2966856049
KAFKA-18240 Remove nonexistent `LOG4J_CONFIG` from transactional_message_copier.py (#18184)
Reviewers: Ken Huang <s7133700@gmail.com>, TengYao Chi <kitingiao@gmail.com>, TaiJuWu <tjwu1217@gmail.com>, Yung <yungyung7654321@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>
2024-12-15 05:54:14 +08:00
TaiJuWu 161d1cdf85
KAFKA-18218 fix Trogdor system test (#18156)
Reviewers: Ken Huang <s7133700@gmail.com>, TengYao Chi <kitingiao@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>
2024-12-14 01:23:58 +08:00
TengYao Chi b37b89c668
KAFKA-9366 Upgrade log4j to log4j2 (#17373)
This pull request replaces Log4j with Log4j2 across the entire project, including dependencies, configurations, and code. The notable changes are listed below:

1. Introduce Log4j2 Instead of Log4j
2. Change Configuration File Format from Properties to YAML
3. Add warnings to notify users if they are still using Log4j properties, encouraging them to transition to Log4j2 configurations

Co-authored-by: Lee Dongjin <dongjin@apache.org>

Reviewers: Luke Chen <showuon@gmail.com>, Mickael Maison <mickael.maison@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>
2024-12-14 01:14:31 +08:00
Ken Huang ce77a7413e
KAFKA-18194 Flaky test_broker_rolling_bounce due to metadata update (#18153)
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2024-12-13 07:33:03 +08:00
Kuan-Po Tseng d2ad418cfd
KAFKA-18156 VerifiableConsumer should ignore "--session-timeout" when using CONSUMER protocol (#18036)
Reviewers: TaiJuWu <tjwu1217@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>
2024-12-11 21:12:46 +08:00
Yung 104fa57933
KAFKA-18118 Fix the incorrect soft link of results/latest (#17992)
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2024-12-08 08:14:53 +08:00
ShivsundarR 50b6953661
KAFKA-18122 : Added support for ShareConsumeBenchWorker (#17984)
Added ShareConsumeBenchSpec and ShareConsumeBenchWorker similar to ConsumeBenchSpec/ConsumeBenchWorker. This will help us run trogdor workloads for share consumers as well.
Added a sample json workload running 5 share consumers.

Reviewers: Andrew Schofield <aschofield@confluent.io>, Manikumar Reddy <manikumar.reddy@gmail.com>
2024-12-05 18:46:32 +05:30
Ken Huang 6ef8b1c4c0
KAFKA-18132 Remove "session.timeout.ms" from connect-distributed.properties to fix connect e2e (#18005)
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2024-12-05 10:28:03 +08:00
mingdaoy 0322764ab2
KAFKA-17460 Remove downgrade_test.py (#18038)
Reviewers: Chia-Ping Tsai <chia7712@gmail.com>
2024-12-05 07:07:01 +08:00