kafka

Commit Graph

Author	SHA1	Message	Date
Jeff Huang	2988eac082	KAFKA-9944: Added supporting customized HTTP response headers for Kafka Connect. (#8620 ) Added support for customizing the HTTP response headers for Kafka Connect as described in KIP-577. Author: Jeff Huang <jeff.huang@confluent.io> Reviewer: Randall Hauch <rhauch@gmail.com>	2020-05-24 08:56:27 -05:00
Randall Hauch	981ef5166d	KAFKA-9931: Implement KIP-605 to expand support for Connect worker internal topic configurations (#8654 ) Added support for -1 replication factor and partitions for distributed worker internal topics by expanding the allowed values for the internal topics’ replication factor and partitions from positive values to also include -1 to signify that the broker defaults should be used. The Kafka storage classes were already constructing a `NewTopic` object (always with a replication factor and partitions) and sending it to Kafka when required. This change will avoid setting the replication factor and/or number of partitions on this `NewTopic` if the worker configuration uses -1 for the corresponding configuration value. Also added support for extra settings for internal topics on distributed config, status, and offset internal topics. Quite a few new tests were added to verify that the `TopicAdmin` utility class is correctly using the AdminClient, and that the `DistributedConfig` validators for these configurations are correct. Also added integration tests for internal topic creation, covering preexisting functionality plus the new functionality. Author: Randall Hauch <rhauch@gmail.com> Reviewer: Konstantine Karantasis <konstantine@confluent.io>	2020-05-23 09:00:32 -05:00
Matthias J. Sax	1daa8f638b	KAFKA-9748: Add Streams eos-beta integration test (#8496 ) Reviewers: Boyang Chen <boyang@confluent.io>, Guozhang Wang <guozhang@confluent.io>	2020-05-04 22:45:54 -07:00
A. Sophie Blee-Goldman	b5de449377	KAFKA-9127: don't create StreamThreads for global-only topology (#8540 ) Reviewers: Matthias J. Sax <matthias@confluent.io>, John Roesler <vvcephei@apache.org>	2020-04-28 22:34:17 -05:00
Colin Patrick McCabe	bf6dffe93b	KAFKA-9309: Add the ability to translate Message classes to and from JSON (#7844 ) Reviewers: David Arthur <mumrah@gmail.com>, Ron Dagostino <rdagostino@confluent.io>	2020-04-09 13:11:36 -07:00
Chia-Ping Tsai	833dc7725c	HOTFIX: exclude ConsumerCoordinator from NPathComplexity check (#8447 ) Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>	2020-04-08 13:00:03 +01:00
Lucas Bradstreet	46540eb5e0	KAFKA-9820: validateMessagesAndAssignOffsetsCompressed allocates unused iterator (#8422 ) `3e9d1c1411` introduced skipKeyValueIterator(s) which were intended to be used, but in this case were created but were not used in offset validation. A subset of the benchmark results follow. Looks like a 20% improvement in validation performance and a 40% reduction in garbage allocation for 1-2 batch sizes. # Parameters: (bufferSupplierStr = NO_CACHING, bytes = RANDOM, compressionType = LZ4, maxBatchSize = 1, messageSize = 1000, messageVersion = 2) Before: Result "org.apache.kafka.jmh.record.RecordBatchIterationBenchmark.measureValidation": 64851.837 ±(99.9%) 944.248 ops/s [Average] (min, avg, max) = (64505.317, 64851.837, 65114.359), stdev = 245.218 CI (99.9%): [63907.589, 65796.084] (assumes normal distribution) "org.apache.kafka.jmh.record.RecordBatchIterationBenchmark.measureValidation:·gc.alloc.rate.norm": 164088.003 ±(99.9%) 0.004 B/op [Average] (min, avg, max) = (164088.001, 164088.003, 164088.004), stdev = 0.001 CI (99.9%): [164087.998, 164088.007] (assumes normal distribution) After: Result "org.apache.kafka.jmh.record.RecordBatchIterationBenchmark.measureValidation": 78910.273 ±(99.9%) 707.024 ops/s [Average] (min, avg, max) = (78785.486, 78910.273, 79234.007), stdev = 183.612 CI (99.9%): [78203.249, 79617.297] (assumes normal distribution) "org.apache.kafka.jmh.record.RecordBatchIterationBenchmark.measureValidation:·gc.alloc.rate.norm": 96440.002 ±(99.9%) 0.001 B/op [Average] (min, avg, max) = (96440.002, 96440.002, 96440.002), stdev = 0.001 CI (99.9%): [96440.002, 96440.003] (assumes normal distribution) # Parameters: (bufferSupplierStr = NO_CACHING, bytes = RANDOM, compressionType = LZ4, maxBatchSize = 2, messageSize = 1000, messageVersion = 2) Before: Result "org.apache.kafka.jmh.record.RecordBatchIterationBenchmark.measureValidation": 64815.364 ±(99.9%) 639.309 ops/s [Average] (min, avg, max) = (64594.545, 64815.364, 64983.305), stdev = 166.026 CI (99.9%): [64176.056, 65454.673] (assumes normal distribution) "org.apache.kafka.jmh.record.RecordBatchIterationBenchmark.measureValidation:·gc.alloc.rate.norm": 163944.003 ±(99.9%) 0.001 B/op [Average] (min, avg, max) = (163944.002, 163944.003, 163944.003), stdev = 0.001 CI (99.9%): [163944.002, 163944.004] (assumes normal distribution) After: Result "org.apache.kafka.jmh.record.RecordBatchIterationBenchmark.measureValidation": 77075.096 ±(99.9%) 201.092 ops/s [Average] (min, avg, max) = (77021.537, 77075.096, 77129.693), stdev = 52.223 CI (99.9%): [76874.003, 77276.188] (assumes normal distribution) "org.apache.kafka.jmh.record.RecordBatchIterationBenchmark.measureValidation:·gc.alloc.rate.norm": 96504.002 ±(99.9%) 0.003 B/op [Average] (min, avg, max) = (96504.001, 96504.002, 96504.003), stdev = 0.001 CI (99.9%): [96503.999, 96504.005] (assumes normal distribution) Reviewers: Chia-Ping Tsai <chia7712@gmail.com>, Ismael Juma <ismael@juma.me.uk>	2020-04-04 10:05:51 -07:00
A. Sophie Blee-Goldman	6e0d553350	MINOR: clean up Streams assignment classes and tests (#8406 ) First set of cleanup pushed to followup PR after KIP-441 Pt. 5. Main changes are: 1. Moved `RankedClient` and the static `buildClientRankingsByTask` to a new file 2. Moved `Movement` and the static `getMovements` to a new file (also renamed to `TaskMovement`) 3. Consolidated the many common variables throughout the assignment tests to the new `AssignmentTestUtils` 4. New utility to generate comparable/predictable UUIDs for tests, and removed the generic from `TaskAssignor` and all related classes Reviewers: John Roesler <vvcephei@apache.org>, Andrew Choi <a24choi@edu.uwaterloo.ca>	2020-04-03 13:53:51 -05:00
A. Sophie Blee-Goldman	2322bc0a6f	KAFKA-6145: Pt. 5 Implement high availability assignment (#8337 ) Adds a new TaskAssignor implementation, currently hidden behind an internal feature flag, that implements the high availability algorithm of KIP-441. Reviewers: Bruno Cadonna <bruno@confluent.io>, John Roesler <vvcephei@apache.org>	2020-04-02 13:36:03 -05:00
Gardner Vickers	8cf781ef01	MINOR: Improve performance of checkpointHighWatermarks, patch 1/2 (#6741 ) This PR works to improve high watermark checkpointing performance. `ReplicaManager.checkpointHighWatermarks()` was found to be a major contributor to GC pressure, especially on Kafka clusters with high partition counts and low throughput. Added a JMH benchmark for `checkpointHighWatermarks` which establishes a performance baseline. The parameterized benchmark was run with 100, 1000 and 2000 topics. Modified `ReplicaManager.checkpointHighWatermarks()` to avoid extra copies and cached the Log parent directory Sting to avoid frequent allocations when calculating `File.getParent()`. A few clean-ups: * Changed all usages of Log.dir.getParent to Log.parentDir and Log.dir.getParentFile to Log.parentDirFile. * Only expose public accessor for `Log.dir` (consistent with `Log.parentDir`) * Removed unused parameters in `Partition.makeLeader`, `Partition.makeFollower` and `Partition.createLogIfNotExists`. Benchmark results: \| Topic Count \| Ops/ms \| MB/sec allocated \| \|-------------\|---------\|------------------\| \| 100 \| + 51% \| - 91% \| \| 1000 \| + 143% \| - 49% \| \| 2000 \| + 149% \| - 50% \| Reviewers: Lucas Bradstreet <lucas@confluent.io>. Ismael Juma <ismael@juma.me.uk> Co-authored-by: Gardner Vickers <gardner@vickers.me> Co-authored-by: Ismael Juma <ismael@juma.me.uk>	2020-03-25 20:53:42 -07:00
A. Sophie Blee-Goldman	6cf27c9c77	KAFKA-6145: Pt 2.5 Compute overall task lag per client (#8252 ) Once we have encoded the offset sums per task for each client, we can compute the overall lag during assign by fetching the end offsets for all changelog and subtracting. If the listOffsets request fails, we simply return a "completely sticky" assignment, ie all active tasks are given to previous owners regardless of balance. Builds (but does not yet use) the statefulTasksToRankedCandidates map with the ranking: Rank -1: active running task Rank 0: standby or restoring task whose overall lag is within acceptableRecoveryLag Rank 1: tasks whose lag is unknown (eg during version probing) Rank 1+: all other tasks are ranked according to their actual total lag Implements: KIP-441 Reviewers: Bruno Cadonna <bruno@confluent.io>, John Roesler <vvcephei@apache.org>	2020-03-21 13:40:34 -05:00
Colin Patrick McCabe	56051e7639	KAFKA-8820: kafka-reassign-partitions.sh should support the KIP-455 API (#8244 ) Rewrite ReassignPartitionsCommand to use the KIP-455 API when possible, rather than direct communication with ZooKeeper. Direct ZK access is still supported, but deprecated, as described in KIP-455. As specified in KIP-455, the tool has several new flags. --cancel stops an assignment which is in progress. --preserve-throttle causes the --verify and --cancel commands to leave the throttles alone. --additional allows users to execute another partition assignment even if there is already one in progress. Finally, --show displays all of the current partition reassignments. Reorganize the reassignment code and tests somewhat to rely more on unit testing using the MockAdminClient and less on integration testing. Each integration test where we bring up a cluster seems to take about 5 seconds, so it's good when we can get similar coverage from unit tests. To enable this, MockAdminClient now supports incrementalAlterConfigs, alterReplicaLogDirs, describeReplicaLogDirs, and some other APIs. MockAdminClient is also now thread-safe, to match the real AdminClient implementation. In DeleteTopicTest, use the KIP-455 API rather than invoking the reassignment command.	2020-03-19 20:44:34 -07:00
Matthias J. Sax	89cd2f2a0b	KAFKA-9441: Unify committing within TaskManager (#8218 ) - part of KIP-447 - commit all tasks at once using non-eos (and eos-beta in follow up work) - unified commit logic into TaskManager - split existing methods of Task interface in pre/post parts Reviewers: Boyang Chen <boyang@confluent.io>, Guozhang Wang <guozhang@confluent.io>	2020-03-19 11:31:51 -07:00
Manikumar Reddy	a0e1407820	KAFKA-9670; Reduce allocations in Metadata Response preparation (#8236 ) This PR removes intermediate conversions between `MetadataResponse.TopicMetadata` => `MetadataResponseTopic` and `MetadataResponse.PartitionMetadata` => `MetadataResponsePartition` objects. There is 15-20% reduction in object allocations and 5-10% improvement in metadata request performance. Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson<jason@confluent.io>	2020-03-16 09:30:48 -07:00
Brian Byrne	227a7322b7	KIP-546: Implement describeClientQuotas and alterClientQuotas. (#8083 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>	2020-03-14 23:03:13 -07:00
jiao	e3ccf20794	KAFKA-9685: Solve Set concatenation perf issue in AclAuthorizer To dismiss the usage of operation ++ against Set which is slow when Set has many entries. This pr introduces a new class 'AclSets' which takes multiple Sets as parameters and do 'find' against them one by one. For more details about perf and benchmark, refer to [KAFKA-9685](https://issues.apache.org/jira/browse/KAFKA-9685) Author: jiao <jiao.zhang@linecorp.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #8261 from jiao-zhangS/jira-9685	2020-03-13 20:53:29 +05:30
John Roesler	78374a1549	KAFKA-9615: Clean up task/producer create and close (#8213 ) * Consolidate task/producer management. Now, exactly one component manages the creation and destruction of Producers, whether they are per-thread or per-task. * Add missing test coverage on TaskManagerTest Reviewers: Guozhang Wang <wangguoz@gmail.com>, Boyang Chen <boyang@confluent.io>	2020-03-05 14:20:46 -06:00
Manikumar Reddy	8dff0b168a	Kafka 9626: Improve ACLAuthorizer.acls() performance This PR avoids creation of unnecessary sets in AclAuthorizer.acls() method implementation. Perf results: Old ``` Benchmark (aclCount) (resourceCount) Mode Cnt Score Error Units AclAuthorizerBenchmark.testAclsIterator 5 5000 avgt 15 5.821 ? 0.309 ms/op AclAuthorizerBenchmark.testAclsIterator 5 10000 avgt 15 15.303 ? 0.107 ms/op AclAuthorizerBenchmark.testAclsIterator 5 50000 avgt 15 74.976 ? 0.543 ms/op AclAuthorizerBenchmark.testAclsIterator 10 5000 avgt 15 15.366 ? 0.184 ms/op AclAuthorizerBenchmark.testAclsIterator 10 10000 avgt 15 29.899 ? 0.129 ms/op AclAuthorizerBenchmark.testAclsIterator 10 50000 avgt 15 167.301 ? 1.723 ms/op AclAuthorizerBenchmark.testAclsIterator 15 5000 avgt 15 21.980 ? 0.114 ms/op AclAuthorizerBenchmark.testAclsIterator 15 10000 avgt 15 44.385 ? 0.255 ms/op AclAuthorizerBenchmark.testAclsIterator 15 50000 avgt 15 241.919 ? 3.955 ms/op ``` New ``` Benchmark (aclCount) (resourceCount) Mode Cnt Score Error Units AclAuthorizerBenchmark.testAclsIterator 5 5000 avgt 15 0.666 ? 0.004 ms/op AclAuthorizerBenchmark.testAclsIterator 5 10000 avgt 15 1.427 ? 0.015 ms/op AclAuthorizerBenchmark.testAclsIterator 5 50000 avgt 15 21.410 ? 0.225 ms/op AclAuthorizerBenchmark.testAclsIterator 10 5000 avgt 15 1.230 ? 0.018 ms/op AclAuthorizerBenchmark.testAclsIterator 10 10000 avgt 15 4.303 ? 0.744 ms/op AclAuthorizerBenchmark.testAclsIterator 10 50000 avgt 15 36.724 ? 0.409 ms/op AclAuthorizerBenchmark.testAclsIterator 15 5000 avgt 15 2.433 ? 0.379 ms/op AclAuthorizerBenchmark.testAclsIterator 15 10000 avgt 15 9.818 ? 0.214 ms/op AclAuthorizerBenchmark.testAclsIterator 15 50000 avgt 15 52.886 ? 0.525 ms/op ``` Author: Manikumar Reddy <manikumar.reddy@gmail.com> Author: Lucas Bradstreet <lucas@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com>, Lucas Bradstreet <lucas@confluent.io> Closes #8199 from omkreddy/KAFKA-9626	2020-03-03 01:51:09 +05:30
Bob Barrett	937f1f741c	KAFKA-8805; Bump producer epoch on recoverable errors (#7389 ) This change is the client-side part of KIP-360. It identifies cases where it is safe to abort a transaction, bump the producer epoch, and allow the application to continue without closing the producer. In these cases, when KafkaProducer.abortTransaction() is called, the producer sends an InitProducerId following the transaction abort, which causes the producer epoch to be bumped. The application can then start a new transaction and continue processing. For recoverable errors in the idempotent producer, the epoch is bumped locally. In-flight requests for partitions with an error are rewritten to reflect the new epoch, and in-flights of all other partitions are allowed to complete using the old epoch. Reviewers: Boyang Chen <boyang@confluent.io>, Jason Gustafson <jason@confluent.io>	2020-02-15 22:47:10 -08:00
Konstantine Karantasis	16ee326755	KAFKA-9556; Fix two issues with KIP-558 and expand testing coverage (#8085 ) Correct the Connect worker logic to properly disable the new topic status (KIP-558) feature when `topic.tracking.enable=false`, and fix automatic topic status reset after a connector is deleted. Also adds new `ConnectorTopicsIntegrationTest` and expanded unit tests. Reviewers: Randall Hauch <rhauch@gmail.com>	2020-02-14 14:34:34 -08:00
Xavier Léauté	7e1c39f75a	KAFKA-9106 make metrics exposed via jmx configurable (#7674 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>, Rajini Sivaram <rajinisivaram@googlemail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>	2020-02-13 10:21:14 -08:00
Boyang Chen	07db26c20f	KAFKA-9417: New Integration Test for KIP-447 (#8000 ) This change mainly have 2 components: 1. extend the existing transactions_test.py to also try out new sendTxnOffsets(groupMetadata) API to make sure we are not introducing any regression or compatibility issue a. We shrink the time window to 10 seconds for the txn timeout scheduler on broker so that we could trigger expiration earlier than later 2. create a completely new system test class called group_mode_transactions_test which is more complicated than the existing system test, as we are taking rebalance into consideration and using multiple partitions instead of one. For further breakdown: a. The message count was done on partition level, instead of global as we need to visualize the per partition order throughout the test. For this sake, we extend ConsoleConsumer to print out the data partition as well to help message copier interpret the per partition data. b. The progress count includes the time for completing the pending txn offset expiration c. More visibility and feature improvements on TransactionMessageCopier to better work under either standalone or group mode. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	2020-02-12 12:34:12 -08:00
John Roesler	1681c78f60	KAFKA-9500: Fix FK Join Topology (#8015 ) Corrects a flaw leading to an exception while building topologies that include both: * A foreign-key join with the result not explicitly materialized * An operation after the join that requires source materialization Also corrects a flaw in TopologyTestDriver leading to output records being enqueued in the wrong order under some (presumably rare) circumstances. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	2020-02-11 22:38:05 -06:00
Guozhang Wang	4090f9a2b0	KAFKA-9113: Clean up task management and state management (#7997 ) This PR is collaborated by Guozhang Wang and John Roesler. It is a significant tech debt cleanup on task management and state management, and is broken down by several sub-tasks listed below: Extract embedded clients (producer and consumer) into RecordCollector from StreamTask. guozhangwang#2 guozhangwang#5 Consolidate the standby updating and active restoring logic into ChangelogReader and extract out of StreamThread. guozhangwang#3 guozhangwang#4 Introduce Task state life cycle (created, restoring, running, suspended, closing), and refactor the task operations based on the current state. guozhangwang#6 guozhangwang#7 Consolidate AssignedTasks into TaskManager and simplify the logic of changelog management and task management (since they are already moved in step 2) and 3)). guozhangwang#8 guozhangwang#9 Also simplified the StreamThread logic a bit as the embedded clients / changelog restoration logic has been moved into step 1) and 2). guozhangwang#10 Reviewers: A. Sophie Blee-Goldman <sophie@confluent.io>, Bruno Cadonna <bruno@confluent.io>, Boyang Chen <boyang@confluent.io>	2020-02-04 21:06:39 -08:00
Rajini Sivaram	a565d1a182	KAFKA-9181; Maintain clean separation between local and group subscriptions in consumer's SubscriptionState (#7941 ) Reviewers: Jason Gustafson <jason@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	2020-01-24 10:38:21 +00:00
Boyang Chen	de90175fc2	KAFKA-9418; Add new sendOffsetsToTransaction API to KafkaProducer (#7952 ) This patch adds a new API to the producer to implement transactional offset commit fencing through the group coordinator as proposed in KIP-447. This PR mainly changes on the Producer end for compatible paths to old `sendOffsetsToTxn(offsets, groupId)` vs new `sendOffsetsToTxn(offsets, groupMetadata)`. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	2020-01-22 13:48:36 -08:00
Mickael Maison	3953204d35	MINOR: Fix connect:mirror checkstyle (#7951 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	2020-01-13 15:25:24 -08:00
John Roesler	717ce42a6d	KAFKA-9138: Add system test for relational joins (#7664 ) Add a system test to verify the new foreign-key join introduced in KIP-213 Reviewers: Guozhang Wang <wangguoz@gmail.com>	2019-12-11 09:48:23 -08:00
Colin Patrick McCabe	02df8e1496	KAFKA-8986: Allow null as a valid default for tagged fields. (#7585 ) Allow null as a valid default for tagged fields. Fix a bunch of cases where this would previously result in null pointer dereferences. Also allow inferring FieldSpec#versions based on FieldSpec#taggedVersions. Prefix 'key' with an underscore when it is used in the generated code, to avoid potential name collisions if someone names an RPC field "key". Allow setting setting hexadecimal constants and 64-bit contstants. Add a lot more test cases to SimpleExampleMessage.json. Reviewers: Jason Gustafson <jason@confluent.io>	2019-11-20 16:40:18 -08:00
John Roesler	4a5155c934	KAFKA-8868: Generate SubscriptionInfo protocol message (#7248 ) Rather than maintain hand coded protocol serialization code, Streams could use the same code-generation framework as Clients/Core. There isn't a perfect match, since the code generation framework includes an assumption that you're generating "protocol messages", rather than just arbitrary blobs, but I think it's close enough to justify using it, and improving it over time. Using the code generation allows us to drop a lot of detail-oriented, brittle, and hard-to-maintain serialization logic in favor of a schema spec. Reviewers: Colin P. McCabe <cmccabe@apache.org>, Boyang Chen <boyang@confluent.io>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	2019-11-01 10:03:55 -07:00
Nikolay	adb2bdb122	KAFKA-8584: The RPC code generator should support ByteBuffer. (#7342 ) The RPC code generator should support using the ByteBuffer class in addition to byte arrays. By using the ByteBuffer class, we can avoid performing a copy in many situations. Also modify TestByteBufferDataTest to test the new feature. Reviewers: Colin P. McCabe <cmccabe@apache.org>, Guozhang Wang <wangguoz@gmail.com>	2019-10-23 12:39:12 -07:00
John Roesler	f93c473be1	KAFKA-9000: fix flaky FK join test by using TTD (#7517 ) Migrate this integration test to use TopologyTestDriver instead of running 3 Streams instances. Dropped one test that was attempting to produce specific interleavings. If anything, these should be verified deterministically by unit testing. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	2019-10-16 22:40:57 -07:00
Greg Harris	ff68b60429	KAFKA-8340, KAFKA-8819: Use PluginClassLoader while statically initializing plugins (#7315 ) Added plugin isolation unit tests for various scenarios, with a `TestPlugins` class that compiles and builds multiple test plugins without them being on the classpath and verifies that the Plugins and DelegatingClassLoader behave properly. These initially failed for several cases, but now pass since the issues have been fixed. KAFKA-8340 and KAFKA-8819 are closely related, and this fix corrects the problems reported in both issues. Author: Greg Harris <gregh@confluent.io> Reviewers: Chris Egerton <chrise@confluent.io>, Magesh Nandakumar <mageshn@confluent.io>, Konstantine Karantasis <konstantine@confluent.io>, Randall Hauch <rhauch@gmail.com>	2019-10-16 20:43:00 -05:00
Lucas Bradstreet	8966d066bd	KAFKA-9039: Optimize ReplicaFetcher fetch path (#7443 ) Improves the performance of the replica fetcher for high partition count fetch requests, where a majority of the partitions did not update between fetch requests. All benchmarks were run on an r5x.large. Vanilla Benchmark (partitionCount) Mode Cnt Score Error Units ReplicaFetcherThreadBenchmark.testFetcher 100 avgt 15 26491.825 ± 438.463 ns/op ReplicaFetcherThreadBenchmark.testFetcher 500 avgt 15 153941.952 ± 4337.073 ns/op ReplicaFetcherThreadBenchmark.testFetcher 1000 avgt 15 339868.602 ± 4201.462 ns/op ReplicaFetcherThreadBenchmark.testFetcher 5000 avgt 15 2588878.448 ± 22172.482 ns/op From 100 to 5000 partitions the latency increase is 2588878.448 / 26491.825 = 97. Avoid gettimeofdaycalls in steady state fetch states `8545888` Benchmark (partitionCount) Mode Cnt Score Error Units ReplicaFetcherThreadBenchmark.testFetcher 100 avgt 15 22685.381 ± 267.727 ns/op ReplicaFetcherThreadBenchmark.testFetcher 500 avgt 15 113622.521 ± 1854.254 ns/op ReplicaFetcherThreadBenchmark.testFetcher 1000 avgt 15 273698.740 ± 9269.554 ns/op ReplicaFetcherThreadBenchmark.testFetcher 5000 avgt 15 2189223.207 ± 1706.945 ns/op From 100 to 5000 partitions the latency increase is 2189223.207 / 22685.381 = 97X Avoid copying partition states to maintain fetch offsets `29fdd60` Benchmark (partitionCount) Mode Cnt Score Error Units ReplicaFetcherThreadBenchmark.testFetcher 100 avgt 15 17039.989 ± 609.355 ns/op ReplicaFetcherThreadBenchmark.testFetcher 500 avgt 15 99371.086 ± 1833.256 ns/op ReplicaFetcherThreadBenchmark.testFetcher 1000 avgt 15 216071.333 ± 3714.147 ns/op ReplicaFetcherThreadBenchmark.testFetcher 5000 avgt 15 2035678.223 ± 5195.232 ns/op From 100 to 5000 partitions the latency increase is 2035678.223 / 17039.989 = 119X Keep lag alongside PartitionFetchState to avoid expensive isReplicaInSync check `0e57e3e` Benchmark (partitionCount) Mode Cnt Score Error Units ReplicaFetcherThreadBenchmark.testFetcher 100 avgt 15 15131.684 ± 382.088 ns/op ReplicaFetcherThreadBenchmark.testFetcher 500 avgt 15 86813.843 ± 3346.385 ns/op ReplicaFetcherThreadBenchmark.testFetcher 1000 avgt 15 193050.381 ± 3281.833 ns/op ReplicaFetcherThreadBenchmark.testFetcher 5000 avgt 15 1801488.513 ± 2756.355 ns/op From 100 to 5000 partitions the latency increase is 1801488.513 / 15131.684 = 119X Fetch session optimizations (mostly presizing the next hashmap, and avoiding making a copy of sessionPartitions, as a deep copy is not required for the ReplicaFetcher) `2614b24` Benchmark (partitionCount) Mode Cnt Score Error Units ReplicaFetcherThreadBenchmark.testFetcher 100 avgt 15 11386.203 ± 416.701 ns/op ReplicaFetcherThreadBenchmark.testFetcher 500 avgt 15 60820.292 ± 3163.001 ns/op ReplicaFetcherThreadBenchmark.testFetcher 1000 avgt 15 146242.158 ± 1937.254 ns/op ReplicaFetcherThreadBenchmark.testFetcher 5000 avgt 15 1366768.926 ± 3305.712 ns/op From 100 to 5000 partitions the latency increase is 1366768.926 / 11386.203 = 120 Reviewers: Jun Rao <junrao@gmail.com>, Guozhang Wang <wangguoz@gmail.com>	2019-10-16 09:49:53 -07:00
A. Sophie Blee-Goldman	d88f1048da	KAFKA-8179: Part 7, cooperative rebalancing in Streams (#7386 ) Key improvements with this PR: * tasks will remain available for IQ during a rebalance (but not during restore) * continue restoring and processing standby tasks during a rebalance * continue processing active tasks during rebalance until the RecordQueue is empty* * only revoked tasks must suspended/closed * StreamsPartitionAssignor tries to return tasks to their previous consumers within a client * but do not try to commit, for now (pending KAFKA-7312) Reviewers: John Roesler <john@confluent.io>, Boyang Chen <boyang@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	2019-10-07 09:27:09 -07:00
Ryanne Dolan	4ac892ca78	KAFKA-7500: MirrorMaker 2.0 (KIP-382) Implementation of [KIP-382 "MirrorMaker 2.0"](https://cwiki.apache.org/confluence/display/KAFKA/KIP-382%3A+MirrorMaker+2.0) Author: Ryanne Dolan <ryannedolan@gmail.com> Author: Arun Mathew <arunmathew88@gmail.com> Author: In Park <inpark@cloudera.com> Author: Andre Price <obsoleted@users.noreply.github.com> Author: christian.hagel@rio.cloud <christian.hagel@rio.cloud> Reviewers: Eno Thereska <eno.thereska@gmail.com>, William Hammond <william.t.hammond@gmail.com>, Viktor Somogyi <viktorsomogyi@gmail.com>, Jakub Korzeniowski, Tim Carey-Smith, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>, Arun Mathew, Jeremy-l-ford, vpernin, Oleg Kasian <oleg.kasian@gmail.com>, Mickael Maison <mickael.maison@gmail.com>, Qihong Chen, Sriharsha Chintalapani <sriharsha@apache.org>, Jun Rao <junrao@gmail.com>, Randall Hauch <rhauch@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #6295 from ryannedolan/KIP-382	2019-10-07 13:57:54 +05:30
Colin Patrick McCabe	0de61a4683	KAFKA-8885; The Kafka Protocol should Support Optional Tagged Fields (#7325 ) This patch implements support for optional (tagged) fields in the Kafka protocol as documented in KIP-482: https://cwiki.apache.org/confluence/display/KAFKA/KIP-482%3A+The+Kafka+Protocol+should+Support+Optional+Tagged+Fields#KIP-482:TheKafkaProtocolshouldSupportOptionalTaggedFields-TypeClasses. Reviewers: David Jacot <djacot@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	2019-10-06 21:13:23 -07:00
Adam Bellemare	c87fe9402c	KAFKA-3705 Added a foreignKeyJoin implementation for KTable. (#5527 ) https://issues.apache.org/jira/browse/KAFKA-3705 Allows for a KTable to map its value to a given foreign key and join on another KTable keyed on that foreign key. Applies the joiner, then returns the tuples keyed on the original key. This supports updates from both sides of the join. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Matthias J. Sax <mjsax@apache.org>, John Roesler <john@confluent.io>, Boyang Chen <boyang@confluent.io>, Christopher Pettitt <cpettitt@confluent.io>, Bill Bejeck <bbejeck@gmail.com>, Jan Filipiak <Jan.Filipiak@trivago.com>, pgwhalen, Alexei Daniline	2019-10-03 18:59:31 -04:00
Chris Egerton	791d0d61bf	KAFKA-8804: Secure internal Connect REST endpoints (#7310 ) Implemented KIP-507 to secure the internal Connect REST endpoints that are only for intra-cluster communication. A new V2 of the Connect subprotocol enables this feature, where the leader generates a new session key, shares it with the other workers via the configuration topic, and workers send and validate requests to these internal endpoints using the shared key. Currently the internal `POST /connectors/<connector>/tasks` endpoint is the only one that is secured. This change adds unit tests and makes some small alterations to system tests to target the new `sessioned` Connect subprotocol. A new integration test ensures that the endpoint is actually secured (i.e., requests with missing/invalid signatures are rejected with a 400 BAD RESPONSE status). Author: Chris Egerton <chrise@confluent.io> Reviewed: Konstantine Karantasis <konstantine@confluent.io>, Randall Hauch <rhauch@gmail.com>	2019-10-02 17:06:57 -05:00
Arjun Satish	1c831c22e1	KAFKA-7772: Dynamically Adjust Log Levels in Connect (#7403 ) Implemented KIP-495 to expose a new `admin/loggers` endpoint for the Connect REST API that lists the current log levels and allows the caller to change log levels. Author: Arjun Satish <arjun@confluent.io> Reviewer: Randall Hauch <rhauch@gmail.com>	2019-10-02 17:00:37 -05:00
Yaroslav Tkachenko	70d1bb40d9	KAFKA-7273: Extend Connect Converter to support headers (#6362 ) Implemented KIP-440 to allow Connect converters to use record headers when serializing or deserializing keys and values. This change is backward compatible in that the new methods default to calling the older existing methods, so existing Converter implementations need not be changed. This changes the WorkerSinkTask and WorkerSourceTask to use the new converter methods, but Connect's existing Converter implementations and the use of converters for internal topics are intentionally not modified. Added unit tests. Author: Yaroslav Tkachenko <sapiensy@gmail.com> Reviewers: Ryanne Dolan <ryannedolan@gmail.com>, Ewen Cheslack-Postava <me@ewencp.org>, Randall Hauch <rhauch@gmail.com>	2019-09-25 11:23:03 -05:00
Colin Patrick McCabe	92688ef82c	MINOR: improve the Kafka RPC code generator (#7340 ) Move the generator checkstyle suppressions to a special section, rather than mixing them in with the other sections. For generated code, do not complain about variable names or cyclic complexity. FieldType.java: remove isInteger since it isn't used anywhere. This way, we don't have to decide whether a UUID is an integer or not (there are arguments for both choices). Add FieldType#serializationIsDifferentInFlexibleVersions and FieldType#isVariableLength. HeaderGenerator: add the ability to generate static imports. Add IsNullConditional, VersionConditional, and ClauseGenerator as easier ways of generating "if" statements.	2019-09-25 11:58:54 -04:00
Guozhang Wang	a0470726c4	MINOR: Move Murmur3 to Streams	2019-09-19 16:38:18 -07:00
Adam Bellemare	2d0cd2ef54	MINOR: Murmur3 Hash with Guava dependency Part of supporting KIP-213 ( https://cwiki.apache.org/confluence/display/KAFKA/KIP-213+Support+non-key+joining+in+KTable ). Murmur3 hash is used as a hashing mechanism in KIP-213 for the large range of uniqueness. The Murmur3 class and tests are ported directly from Apache Hive, with no alterations to the code or dependencies. Author: Adam Bellemare <adam.bellemare@wishabi.com> Reviewers: John Roesler <vvcephei@users.noreply.github.com>, Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com> Closes #7271 from bellemare/murmur3hash	2019-09-19 15:36:32 -07:00
Lucas Bradstreet	f3ded39a05	KAFKA-8841; Reduce overhead of ReplicaManager.updateFollowerFetchState (#7324 ) This PR makes two changes to code in the ReplicaManager.updateFollowerFetchState path, which is in the hot path for follower fetches. Although calling ReplicaManager.updateFollowerFetch state is inexpensive on its own, it is called once for each partition every time a follower fetch occurs. 1. updateFollowerFetchState no longer calls maybeExpandIsr when the follower is already in the ISR. This avoid repeated expansion checks. 2. Partition.maybeIncrementLeaderHW is also in the hot path for ReplicaManager.updateFollowerFetchState. Partition.maybeIncrementLeaderHW calls Partition.remoteReplicas four times each iteration, and it performs a toSet conversion. maybeIncrementLeaderHW now avoids generating any intermediate collections when updating the HWM. Benchmark results for Partition.updateFollowerFetchState on a r5.xlarge: Old: ``` 1288.633 ±(99.9%) 1.170 ns/op [Average] (min, avg, max) = (1287.343, 1288.633, 1290.398), stdev = 1.037 CI (99.9%): [1287.463, 1289.802] (assumes normal distribution) ``` New (when follower fetch offset is updated): ``` 261.727 ±(99.9%) 0.122 ns/op [Average] (min, avg, max) = (261.565, 261.727, 261.937), stdev = 0.114 CI (99.9%): [261.605, 261.848] (assumes normal distribution) ``` New (when follower fetch offset is the same): ``` 68.484 ±(99.9%) 0.025 ns/op [Average] (min, avg, max) = (68.446, 68.484, 68.520), stdev = 0.023 CI (99.9%): [68.460, 68.509] (assumes normal distribution) ``` Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	2019-09-18 09:11:39 -07:00
Scott Hendricks	39afe9fe0e	KAFKA-8853; Create sustained connections test for Trogdor This creates a test that generates sustained connections against Kafka. There are three different components we can stress with this, KafkaConsumer, KafkaProducer, and AdminClient. This test tries use minimal bandwidth per connection to reduce overhead impacts. This test works by creating a threadpool that creates connections and then maintains a central pool of connections at a specified keepalive rate. The keepalive action varies by which component is being stressed: * KafkaProducer: Sends one single produce record. The configuration for the produce request uses the same key/value generator as the ProduceBench test. * KafkaConsumer: Subscribes to a single partition, seeks to the end, and then polls a minimal number of records. Each consumer connection is its own consumer group, and defaults to 1024 bytes as FETCH_MAX_BYTES to keep traffic to a minimum. * AdminClient: Makes an API call to get the nodes in the cluster. NOTE: This test is designed to be run alongside a ProduceBench test for a specific topic, due to the way the Consumer test polls a single partition. There may be no data returned by the consumer test if this is run on its own. The connection should still be kept alive, but with no data returned. Author: Scott Hendricks <scott.hendricks@confluent.io> Reviewers: Stanislav Kozlovski, Gwen Shapira Closes #7289 from scott-hendricks/trunk	2019-09-08 19:49:13 -07:00
Boyang Chen	c0019e6538	KAFKA-8590; Use automated TxnOffsetCommit type and add tests for OffsetCommit (#6994 ) This PR changes the TxnOffsetCommit protocol to auto-generated types, and add more unit test coverage to the plain OffsetCommit protocol. Reviewers: Jason Gustafson <jason@confluent.io>	2019-09-05 23:07:42 -07:00
Rajini Sivaram	364794866f	KAFKA-8760; New Java Authorizer API (KIP-504) (#7268 ) New Java Authorizer API and a new out-of-the-box authorizer (AclAuthorizer) that implements the new interface. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	2019-09-02 14:43:17 +01:00
cpettitt-confluent	7334222a71	KAFKA-8412: Fix nullpointer exception thrown on flushing before closing producers (#7207 ) Prior to this change an NPE is raised when calling AssignedTasks.close under the following conditions: 1. EOS is enabled 2. The task was in a suspended state The cause for the NPE is that when a clean close is requested for a StreamTask the StreamTask tries to commit. However, in the suspended state there is no producer so ultimately an NPE is thrown for the contained RecordCollector in flush. The fix put forth in this commit is to have AssignedTasks call closeSuspended when it knows the underlying StreamTask is suspended. Note also that this test is quite involved. I could have just tested that AssignedTasks calls closeSuspended when appropriate, but that is testing, IMO, a detail of the implementation and doesn't actually verify we reproduced the original problem as it was described. I feel much more confident that we are reproducing the behavior - and we can test exactly the conditions that lead to it - when testing across AssignedTasks and StreamTask. I believe this is an additional support for the argument of eventually consolidating the state split across classes. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	2019-08-26 09:53:36 -07:00
Stanislav Kozlovski	5129ab53ee	KAFKA-8345: KIP-455: Admin API changes (Part 2) (#7120 ) Add the AlterPartitionReassignments and ListPartitionReassignments APIs. Also remove an unused methodlength suppression for KafkaAdminClient. Reviewers: Colin P. McCabe <cmccabe@apache.org>, Viktor Somogyi <viktorsomogyi@gmail.com>	2019-08-14 09:25:17 -07:00
Justine Olshan	717c55be97	KAFKA-8601: Implement KIP-480: Sticky Partitioning for keyless records (#6997 ) Implement KIP-480, which specifies that the default partitioner should use a "sticky" partitioning strategy for records that have a null key. Reviewers: Colin P. McCabe <cmccabe@apache.org>, Lucas Bradstreet <lucasbradstreet@gmail.com>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Jun Rao <junrao@gmail.com>, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>	2019-08-01 14:36:12 -07:00
Boyang Chen	74c90f46c3	KAFKA-8221; Add batch leave group request (#6714 ) This patch is part of KIP-345. We are aiming to support batch leave group request issued from admin client. This diff is the first effort to bump leave group request version. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	2019-07-26 23:13:37 -07:00
Andy Coates	2a133ba656	KAFKA-8454; Add Java AdminClient Interface (KIP-476) (#7087 ) Adds an `Admin` interface as specified in [KIP-476](https://cwiki.apache.org/confluence/display/KAFKA/KIP-476%3A+Add+Java+AdminClient+Interface). Reviewers: Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io>	2019-07-22 15:47:34 -07:00
Ismael Juma	57903be496	MINOR: Remove zkclient dependency (#7036 ) ZkUtils was removed so we don't need this anymore. Also: * Fix ZkSecurityMigrator and ReplicaManagerTest not to reference ZkClient classes. * Remove references to zkclient in various `log4j.properties` and `import-control.xml`. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>	2019-07-05 07:50:32 -07:00
Boyang Chen	b4e20495fa	remove cs (#6950 ) cleanup of some redundant checkstyle Reviewers: Bill Bejeck <bbejeck@gmail.com>	2019-06-28 18:47:16 -04:00
A. Sophie Blee-Goldman	16769d263e	KAFKA-8215: Upgrade Rocks to v5.18.3 (#6743 ) This upgrade exposes a number of new options, including the WriteBufferManager which -- along with existing TableConfig options -- allows users to limit the total memory used by RocksDB across instances. This can alleviate some cascading OOM potential when, for example, a large number of stateful tasks are suddenly migrated to the same host. The RocksDB docs guarantee backwards format compatibility across versions Reviewers: Matthias J. Sax <mjsax@apache.org>, Bill Bejeck <bbejeck@gmail.com>,	2019-05-17 16:32:17 -04:00
Magesh Nandakumar	2e91a310d7	KAFKA-8265: Initial implementation for ConnectorClientConfigPolicy to enable overrides (KIP-458) (#6624 ) Implementation to enable policy for Connector Client config overrides. This is implemented per the KIP-458. Reviewers: Randall Hauch <rhauch@gmail.com>	2019-05-17 01:37:32 -07:00
Chris Egerton	cc097e909c	KAFKA-8304: Fix registration of Connect REST extensions (#6651 ) Fix registration of Connect REST extensions to prevent deadlocks when extensions get the list of connectors before the herder is available. Added integration test to check the behavior. Author: Chris Egerton <cegerton@oberlin.edu> Reviewers: Arjun Satish <arjun@confluent.io>, Randall Hauch <rhauch@gmail.com>	2019-05-07 17:20:51 -05:00
Boyang Chen	0f995ba6be	KAFKA-7862 & KIP-345 part-one: Add static membership logic to JoinGroup protocol (#6177 ) This is the first diff for the implementation of JoinGroup logic for static membership. The goal of this diff contains: * Add group.instance.id to be unique identifier for consumer instances, provided by end user; Modify group coordinator to accept JoinGroupRequest with/without static membership, refactor the logic for readability and code reusability. * Add client side support for incorporating static membership changes, including new config for group.instance.id, apply stream thread client id by default, and new join group exception handling. * Increase max session timeout to 30 min for more user flexibility if they are inclined to tolerate partial unavailability than burdening rebalance. * Unit tests for each module changes, especially on the group coordinator logic. Crossing the possibilities like: 6.1 Dynamic/Static member 6.2 Known/Unknown member id 6.3 Group stable/unstable 6.4 Leader/Follower The rest of the 345 change will be broken down to 4 separate diffs: * Avoid kicking out members through rebalance.timeout, only do the kick out through session timeout. * Changes around LeaveGroup logic, including version bumping, broker logic, client logic, etc. * Admin client changes to add ability to batch remove static members * Deprecate group.initial.rebalance.delay Reviewers: Liquan Pei <liquanpei@gmail.com>, Stanislav Kozlovski <familyguyuser192@windowslive.com>, Jason Gustafson <jason@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	2019-04-26 11:44:38 -07:00
Manikumar Reddy	3b1524c5df	KAFKA-7466: Add IncrementalAlterConfigs API (KIP-339) (#6247 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>, Viktor Somogyi <viktorsomogyi@gmail.com>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk>	2019-04-16 16:26:33 -07:00
Viktor Somogyi	7bd81628d9	KAFKA-6635; Producer close awaits pending transactions (#5971 ) Currently close() only awaits completion of pending produce requests. If there is a transaction ongoing, it may be dropped. For example, if one thread is calling commitTransaction() and another calls close(), then the commit may never happen even if the caller is willing to wait for it (by using a long timeout). What's more, the thread blocking in commitTransaction() will be stuck since the result will not be completed once the producer has shutdown. This patch ensures that 1) completing transactions are awaited, 2) ongoing transactions are aborted, and 3) pending callbacks are completed before close() returns. Reviewers: Jason Gustafson <jason@confluent.io>	2019-04-15 15:56:36 -07:00
Konstantine Karantasis	e4cad35312	KAFKA-8014: Extend Connect integration tests to add and remove workers dynamically (#6342 ) Extend Connect's integration test framework to add or remove workers to EmbeddedConnectCluster, and choosing whether to fail the test on ungraceful service shutdown. Also added more JavaDoc and other minor improvements. Author: Konstantine Karantasis <konstantine@confluent.io> Reviewers: Arjun Satish <arjun@confluent.io>, Randall Hauch <rhauch@gmail.com> Closes #6342 from kkonstantine/KAFKA-8014	2019-03-25 09:29:33 -05:00
Mickael Maison	4824dc994d	KAFKA-7972: Use automatic RPC generation in SaslHandshake Author: Mickael Maison <mickael.maison@gmail.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6301 from mimaison/sasl-handshake	2019-02-25 11:20:07 +05:30
Alex Diachenko	ec42e0378e	KAFKA-7799; Use httpcomponents-client in RestServerTest. The test `org.apache.kafka.connect.runtime.rest.RestServerTest#testCORSEnabled` assumes Jersey client can send restricted HTTP headers(`Origin`). Jersey client uses `sun.net.www.protocol.http.HttpURLConnection`. `sun.net.www.protocol.http.HttpURLConnection` drops restricted headers(`Host`, `Keep-Alive`, `Origin`, etc) based on static property `allowRestrictedHeaders`. This property is initialized in a static block by reading Java system property `sun.net.http.allowRestrictedHeaders`. So, if classloader loads `HttpURLConnection` before we set `sun.net.http.allowRestrictedHeaders=true`, then all subsequent changes of this system property won't take any effect(which happens if `org.apache.kafka.connect.integration.ExampleConnectIntegrationTest` is executed before `RestServerTest`). To prevent this, we have to either make sure we set `sun.net.http.allowRestrictedHeaders=true` as early as possible or do not rely on this system property at all. This PR adds test dependency on `httpcomponents-client` which doesn't depend on `sun.net.http.allowRestrictedHeaders` system property. Thus none of existing tests should interfere with `RestServerTest`. Author: Alex Diachenko <sansanichfb@gmail.com> Reviewers: Randall Hauch, Konstantine Karantasis, Gwen Shapira Closes #6236 from avocader/KAFKA-7799	2019-02-12 12:03:08 -08:00
Chia-Ping Tsai	3ebf058123	MINOR: fix checkstyle suppressions for generated RPC code to work on Windows Reviewed-by: Colin P. McCabe <cmccabe@apache.org>	2019-01-31 10:29:55 -08:00
Matthias J. Sax	1fa02d5aef	MINOR: Update usage of deprecated API (#6146 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Ismael Juma <ismael@confluent.io>, Jorge Quilcate Otoya <quilcate.jorge@gmail.com>, John Roesler <john@confluent.io>, Bill Bejeck <bill@confluent.io>	2019-01-29 11:15:43 -08:00
Tom Bentley	269b65279c	KAFKA-5692: Change PreferredReplicaLeaderElectionCommand to use Admin… (#3848 ) See also KIP-183. This implements the following algorithm: AdminClient sends ElectPreferredLeadersRequest. KafakApis receives ElectPreferredLeadersRequest and delegates to ReplicaManager.electPreferredLeaders() ReplicaManager delegates to KafkaController.electPreferredLeaders() KafkaController adds a PreferredReplicaLeaderElection to the EventManager, ReplicaManager.electPreferredLeaders()'s callback uses the delayedElectPreferredReplicasPurgatory to wait for the results of the election to appear in the metadata cache. If there are no results because of errors, or because the preferred leaders are already leading the partitions then a response is returned immediately. In the EventManager work thread the preferred leader is elected as follows: The EventManager runs PreferredReplicaLeaderElection.process() process() calls KafkaController.onPreferredReplicaElectionWithResults() KafkaController.onPreferredReplicaElectionWithResults() calls the PartitionStateMachine.handleStateChangesWithResults() to perform the election (asynchronously the PSM will send LeaderAndIsrRequest to the new and old leaders and UpdateMetadataRequest to all brokers) then invokes the callback. Reviewers: Colin P. McCabe <cmccabe@apache.org>, Jun Rao <junrao@gmail.com>	2019-01-25 14:06:18 -08:00
Colin Patrick McCabe	a79d6dcdb6	KAFKA-7793: Improve the Trogdor command line. (#6133 ) * Allow the Trogdor agent to be started in "exec mode", where it simply runs a single task and exits after it is complete. * For AgentClient and CoordinatorClient, allow the user to pass the path to a file containing JSON, instead of specifying the JSON object in the command-line text itself. This means that we can get rid of the bash scripts whose only function was to load task specs into a bash string and run a Trogdor command. * Print dates and times in a human-readable way, rather than as numbers of milliseconds. * When listing tasks or workers, output human-readable tables of information. * Allow the user to filter on task ID name, task ID pattern, or task state. * Support a --json flag to provide raw JSON output if desired. Reviewed-by: David Arthur <mumrah@gmail.com>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>	2019-01-24 09:26:51 -08:00
Arjun Satish	dc935c4beb	MINOR: Handle case where connector status endpoints returns 404 (#6176 ) Reviewers: Randall Hauch <randall@confluent.io>, Matthias J. Sax <matthias@confluent.io>	2019-01-20 19:31:20 -08:00
Arjun Satish	69d8d2ea11	KAFKA-7503: Connect integration test harness Expose a programmatic way to bring up a Kafka and Zk cluster through Java API to facilitate integration tests for framework level changes in Kafka Connect. The Kafka classes would be similar to KafkaEmbedded in streams. The new classes would reuse the kafka.server.KafkaServer classes from :core, and provide a simple interface to bring up brokers in integration tests. Signed-off-by: Arjun Satish <arjunconfluent.io> Author: Arjun Satish <arjun@confluent.io> Author: Arjun Satish <wicknicks@users.noreply.github.com> Reviewers: Randall Hauch <rhauch@gmail.com>, Konstantine Karantasis <konstantine@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #5516 from wicknicks/connect-integration-test	2019-01-14 13:50:23 -08:00
Matthias J. Sax	82d1db6358	MINOR: code cleanup (#6054 ) Reviewers: Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Ryanne Dolan <ryannedolan@gmail.com>, Ismael Juma <ismael@confuent.io>	2019-01-14 13:36:36 -08:00
Colin Patrick McCabe	71e85f5e84	KAFKA-7609; Add Protocol Generator for Kafka (#5893 ) This patch adds a framework to automatically generate the request/response classes for Kafka's protocol. The code will be updated to use the generated classes in follow-up patches. Below is a brief summary of the included components: buildSrc/src The message generator code is here. This code is automatically re-run by gradle when one of the schema files changes. The entire directory is processed at once to minimize the number of times we have to start a new JVM. We use Jackson to translate the JSON files into Java objects. clients/src/main/java/org/apache/kafka/common/protocol/Message.java This is the interface implemented by all automatically generated messages. clients/src/main/java/org/apache/kafka/common/protocol/MessageUtil.java Some utility functions used by the generated message code. clients/src/main/java/org/apache/kafka/common/protocol/Readable.java, Writable.java, ByteBufferAccessor.java The generated message code uses these classes for writing to a buffer. clients/src/main/message/README.md This README file explains how the JSON schemas work. *clients/src/main/message/\.json The JSON files in this directory implement every supported version of every Kafka API. The unit tests automatically validate that the generated schemas match the hand-written schemas in our code. Additionally, there are some things like request and response headers that have schemas here. clients/src/main/java/org/apache/kafka/common/utils/ImplicitLinkedHashSet.java** I added an optimization here for empty sets. This is useful here because I want all messages to start with empty sets by default prior to being loaded with data. This is similar to the "empty list" optimizations in the `java.util.ArrayList` class. Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Ismael Juma <ismael@juma.me.uk>, Bob Barrett <bob.barrett@outlook.com>, Jason Gustafson <jason@confluent.io>	2019-01-11 16:40:21 -08:00
Stanislav Kozlovski	7fadf0a11d	Trogdor: Add Task State filter to /coordinator/tasks endpoint (#5907 ) Reviewers: Colin McCabe <cmccabe@apache.org>	2018-11-26 16:46:58 -08:00
Ron Dagostino	e8a3bc7425	KAFKA-7352; Allow SASL Connections to Periodically Re-Authenticate (KIP-368) (#5582 ) KIP-368 implementation to enable periodic re-authentication of SASL clients. Also adds a broker configuration option to terminate client connections that do not re-authenticate within the configured interval.	2018-10-26 23:18:15 +01:00
Rajini Sivaram	4c602e6130	KAFKA-7498: Remove references from `common.requests` to `clients` (#5784 ) Add CreatePartitionsRequest.PartitionDetails similar to CreateTopicsRequest.TopicDetails to avoid references from `common.requests` package to `clients`. Reviewers: Ismael Juma <ismael@juma.me.uk>	2018-10-15 13:21:15 +01:00
Ismael Juma	578205cadd	KAFKA-7439; Replace EasyMock and PowerMock with Mockito in clients module Development of EasyMock and PowerMock has stagnated while Mockito continues to be actively developed. With the new Java release cadence, it's a problem to depend on libraries that do bytecode manipulation and are not actively maintained. In addition, Mockito is also easier to use. While updating the tests, I attempted to go from failing test to passing test. In cases where the updated test passed on the first attempt, I artificially broke it to ensure the test was still doing its job. I included a few improvements that were helpful while making these changes: 1. Better exception if there are no nodes in `leastLoadedNodes` 2. Always close the producer in `KafkaProducerTest` 3. requestsInFlight producer metric should not hold a reference to `Sender` Finally, `Metadata` is no longer final so that we don't need `PowerMock` to mock it. It's an internal class, so it's OK. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Viktor Somogyi <viktorsomogyi@gmail.com>, Dong Lin <lindong28@gmail.com> Closes #5691 from ijuma/kafka-7438-mockito	2018-10-09 15:55:09 -07:00
Randall Hauch	f0282498e7	KAFKA-6926: Simplified some logic to eliminate some suppressions of NPath complexity checks (#5051 ) Modified several classes' `equals` methods and simplified a complex method to reduce the NPath complexity so they could be removed from the checkstyle suppressions that were required with the recent move to Java 8 and upgrade of Checkstyle: https://github.com/apache/kafka/pull/5046. Reviewers: Robert Yokota <rayokota@gmail.com>, Arjun Satish <arjun@confluent.io>, Ismael Juma <ismael@juma.me.uk>	2018-09-12 21:21:46 -07:00
Anna Povzner	e2ec2d79c8	KAFKA-7044; Fix Fetcher.fetchOffsetsByTimes and NPE in describe consumer group (#5627 ) A call to `kafka-consumer-groups --describe --group ...` can result in NullPointerException for two reasons: 1) `Fetcher.fetchOffsetsByTimes()` may return too early, without sending list offsets request for topic partitions that are not in cached metadata. 2) `ConsumerGroupCommand.getLogEndOffsets()` and `getLogStartOffsets()` assumed that endOffsets()/beginningOffsets() which eventually call Fetcher.fetchOffsetsByTimes(), would return a map with all the topic partitions passed to endOffsets()/beginningOffsets() and that values are not null. Because of (1), null values were possible if some of the topic partitions were already known (in metadata cache) and some not (metadata cache did not have entries for some of the topic partitions). However, even with fixing (1), endOffsets()/beginningOffsets() may return a map with some topic partitions missing, when list offset request returns a non-retriable error. This happens in corner cases such as message format on broker is before 0.10, or maybe in cases of some other errors. Testing: -- added unit test to verify fix in Fetcher.fetchOffsetsByTimes() -- did some manual testing with `kafka-consumer-groups --describe`, causing NPE. Was not able to reproduce any NPE cases with DescribeConsumerGroupTest.scala, Reviewers: Jason Gustafson <jason@confluent.io>	2018-09-11 10:10:42 -07:00
John Roesler	d57fe1b053	MINOR: single Jackson serde for PageViewTypedDemo (#5590 ) Previously, we depicted creating a Jackson serde for every pojo class, which becomes a burden in practice. There are many ways to avoid this and just have a single serde, so we've decided to model this design choice instead. Reviewers: Viktor Somogyi <viktorsomogyi@gmail.com>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	2018-08-31 13:13:42 -07:00
Ismael Juma	b282b2ab10	KAFKA-7308: Fix rat and checkstyle config for Java 11 support (#5529 ) Relative paths in Gradle break when the Gradle daemon is used unless user.dir can be changed while the process is running. Java 11 disallows this, so we use project paths instead. Verified that rat and checkstyle work with Java 11 after these changes. Reviewers: Dong Lin <lindong28@gmail.com>	2018-08-18 08:18:28 -07:00
Colin Patrick McCabe	609c81ec8b	KAFKA-7183: Add a trogdor test that creates many connections to brokers (#5393 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com>	2018-08-06 08:47:25 +01:00
John Roesler	3637b2c374	MINOR: Require final variables in Streams (#5452 ) Reviewers: Bill Bejeck <bill@confluent.io>, Guozhang Wang <guozhang@confluent.io>, Matthias J. Sax <matthias@confluent.io>	2018-08-03 13:19:46 -07:00
Manikumar Reddy O	96c53e96b8	MINOR: Remove deprecated ZkUtils usage from EmbeddedKafkaCluster (#5324 ) Reviewers: Matthias J. Sax <mjsax@apache.org>, Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>	2018-07-19 14:28:12 -07:00
Joan Goyeau	05c5854d1f	MINOR: Add Scalafmt to Streams Scala API (#4965 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>	2018-07-09 16:48:34 -07:00
Rajini Sivaram	1f8527b331	KAFKA-7136: Avoid deadlocks in synchronized metrics reporters (#5341 ) We need to use the same lock for metric update and read to avoid NPE and concurrent modification exceptions. Sensor add/remove/update are synchronized on Sensor since they access lists and maps that are not thread-safe. Reporters are notified of metrics add/remove while holding (Sensor, Metrics) locks and reporters may synchronize on the reporter lock. Metric read may be invoked by metrics reporters while holding a reporter lock. So read/update cannot be synchronized using Sensor since that could lead to deadlock. This PR introduces a new lock in Sensor for update/read. Locking order: - Sensor#add: Sensor -> Metrics -> MetricsReporter - Metrics#removeSensor: Sensor -> Metrics -> MetricsReporter - KafkaMetric#metricValue: MetricsReporter -> Sensor#metricLock - Sensor#record: Sensor -> Sensor#metricLock Reviewers: Jun Rao <junrao@gmail.com>, Guozhang Wang <wangguoz@gmail.com>	2018-07-06 10:54:28 -07:00
Ismael Juma	cc4dce94af	KAFKA-2983: Remove Scala consumers and related code (#5230 ) - Removed Scala consumers (`SimpleConsumer` and `ZooKeeperConsumerConnector`) and their tests. - Removed Scala request/response/message classes. - Removed any mention of new consumer or new producer in the code with the exception of MirrorMaker where the new.consumer option was never deprecated so we have to keep it for now. The non-code documentation has not been updated either, that will be done separately. - Removed a number of tools that only made sense in the context of the Scala consumers (see upgrade notes). - Updated some tools that worked with both Scala and Java consumers so that they only support the latter (see upgrade notes). - Removed `BaseConsumer` and related classes apart from `BaseRecord` which is used in `MirrorMakerMessageHandler`. The latter is a pluggable interface so effectively public API. - Removed `ZkUtils` methods that were only used by the old consumers. - Removed `ZkUtils.registerBroker` and `ZKCheckedEphemeral` since the broker now uses the methods in `KafkaZkClient` and no-one else should be using that method. - Updated system tests so that they don't use the Scala consumers except for multi-version tests. - Updated LogDirFailureTest so that the consumer offsets topic would continue to be available after all the failures. This was necessary for it to work with the Java consumer. - Some multi-version system tests had not been updated to include recently released Kafka versions, fixed it. - Updated findBugs and checkstyle configs not to refer to deleted classes and packages. Reviewers: Dong Lin <lindong28@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>	2018-06-19 07:32:54 -07:00
Andy Coates	b3aa655a70	KAFKA-6841: Support Prefixed ACLs (KIP-290) (#5117 ) Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com> Co-authored-by: Piyush Vijay <pvijay@apple.com> Co-authored-by: Andy Coates <big-andy-coates@users.noreply.github.com>	2018-06-06 07:22:57 -07:00
Robert Yokota	08e8facdc9	KAFKA-6886: Externalize secrets from Connect configs (KIP-297) This commit allows secrets in Connect configs to be externalized and replaced with variable references of the form `${provider:[path:]key}`, where the "path" is optional. There are 2 main additions to `org.apache.kafka.common.config`: a `ConfigProvider` and a `ConfigTransformer`. The `ConfigProvider` is an interface that allows key-value pairs to be provided by an external source for a given "path". An a TTL can be associated with the key-value pairs returned from the path. The `ConfigTransformer` will use instances of `ConfigProvider` to replace variable references in a set of configuration values. In the Connect framework, `ConfigProvider` classes can be specified in the worker config, and then variable references can be used in the connector config. In addition, the herder can be configured to restart connectors (or not) based on the TTL returned from a `ConfigProvider`. The main class that performs restarts and transformations is `WorkerConfigTransformer`. Finally, a `configs()` method has been added to both `SourceTaskContext` and `SinkTaskContext`. This allows connectors to get configs with variables replaced by the latest values from instances of `ConfigProvider`. Most of the other changes in the Connect framework are threading various objects through classes to enable the above functionality. Author: Robert Yokota <rayokota@gmail.com> Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Randall Hauch <rhauch@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #5068 from rayokota/KAFKA-6886-connect-secrets	2018-05-30 14:43:11 -07:00
Magesh Nandakumar	98094954a2	KAFKA-6776: ConnectRestExtension Interfaces & Rest integration (KIP-285) This PR provides the implementation for KIP-285 and also a reference implementation for authenticating BasicAuth credentials using JAAS LoginModule Author: Magesh Nandakumar <magesh.n.kumar@gmail.com> Reviewers: Randall Hauch <rhauch@gmail.com>, Arjun Satish <wicknicks@users.noreply.github.com>, Konstantine Karantasis <konstantine@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #4931 from mageshn/KIP-285	2018-05-29 21:35:22 -07:00
Ron Dagostino	8c5d7e0408	KAFKA-6562: OAuth Authentication via SASL/OAUTHBEARER (KIP-255) (#4994 ) This KIP adds the following functionality related to SASL/OAUTHBEARER: 1) Allow clients (both brokers when SASL/OAUTHBEARER is the inter-broker protocol as well as non-broker clients) to flexibly retrieve an access token from an OAuth 2 authorization server based on the declaration of a custom login CallbackHandler implementation and have that access token transparently and automatically transmitted to a broker for authentication. 2) Allow brokers to flexibly validate provided access tokens when a client establishes a connection based on the declaration of a custom SASL Server CallbackHandler implementation. 3) Provide implementations of the above retrieval and validation features based on an unsecured JSON Web Token that function out-of-the-box with minimal configuration required (i.e. implementations of the two types of callback handlers mentioned above will be used by default with no need to explicitly declare them). 4) Allow clients (both brokers when SASL/OAUTHBEARER is the inter-broker protocol as well as non-broker clients) to transparently retrieve a new access token in the background before the existing access token expires in case the client has to open new connections.	2018-05-26 08:18:41 +01:00
Ismael Juma	e70a191d30	KAFKA-4423: Drop support for Java 7 (KIP-118) and update deps (#5046 ) * Set --source, --target and --release to 1.8. * Build Scala 2.12 by default. * Remove some conditionals in the build file now that Java 8 is the minimum version. * Bump the version of Jetty, Jersey and Checkstyle (the newer versions require Java 8). * Fixed issues uncovered by the new version if Checkstyle. * A couple of minor updates to handle an incompatible source change in the new version of Jetty. * Add dependency to jersey-hk2 to fix failing tests caused by Jersey upgrade. * Update release script to use Java 8 and to take into account that Scala 2.12 is now built by default. * While we're at it, bump the version of Gradle, Gradle plugins, ScalaLogging, JMH and apache directory api. * Minor documentation updates including the readme and upgrade notes. A number of Streams Java 7 examples can be removed subsequently.	2018-05-21 23:17:42 -07:00
John Roesler	ed51b2cdf5	KAFKA-6376; refactor skip metrics in Kafka Streams * unify skipped records metering * log warnings when things get skipped * tighten up metrics usage a bit ### Testing strategy: Unit testing of the metrics and the logs should be sufficient. Author: John Roesler <john@confluent.io> Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #4812 from vvcephei/kip-274-streams-skip-metrics	2018-04-23 11:41:03 -07:00
Colin Patrick McCabe	832b096f4f	KAFKA-6696 Trogdor should support destroying tasks (#4759 ) Implement destroying tasks and workers. This means erasing all record of them on the Coordinator and the Agent. Workers should be identified by unique 64-bit worker IDs, rather than by the names of the tasks they are implementing. This ensures that when a task is destroyed and re-created with the same task ID, the old workers will be not be treated as part of the new task instance. Fix some return results from RPCs. In some cases RPCs were returning values that were never used. Attempting to re-create the same task ID with different arguments should fail. Add RequestConflictException to represent HTTP error code 409 (CONFLICT) for this scenario. If only one worker in a task stops, don't stop all the other workers for that task, unless the worker that stopped had an error. Reviewers: Anna Povzner <anna@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	2018-04-16 08:51:33 +01:00
Jorge Quilcate Otoya	6a99da87ab	KAFKA-6058: KIP-222; Add Consumer Group operations to Admin API KIP: https://cwiki.apache.org/confluence/display/KAFKA/KIP-222+-+Add+Consumer+Group+operations+to+Admin+API Author: Jorge Quilcate Otoya <quilcate.jorge@gmail.com> Author: Jorge Esteban Quilcate Otoya <quilcate.jorge@gmail.com> Author: Guozhang Wang <wangguoz@gmail.com> Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Guozhang Wang <wangguoz@gmail.com> Closes #4454 from jeqo/feature/admin-client-describe-consumer-group	2018-04-11 14:17:46 -07:00
Manikumar Reddy O	47918f2d79	KAFKA-6447: Add Delegation Token Operations to KafkaAdminClient (KIP-249) (#4427 ) Reviewers: Jun Rao <junrao@gmail.com>	2018-04-11 10:48:04 -07:00
Matthias J. Sax	0c0d8363e5	KAFKA-6054: Fix upgrade path from Kafka Streams v0.10.0 (#4779 ) Reviewers: Guozhang Wang <guozhang@confluent.io>, Bill Bejeck <bill@confluent.io>, John Roesler <john@confluent.io>, Damian Guy <damian@confluent.io>	2018-04-06 17:00:52 -07:00
Anna Povzner	5c24295d44	Trogdor's ProducerBench does not fail if topics exists (#4673 ) Added configs to ProducerBenchSpec: topicPrefix: name of topics will be of format topicPrefix + topic index. If not provided, default is "produceBenchTopic". partitionsPerTopic: number of partitions per topic. If not provided, default is 1. replicationFactor: replication factor per topic. If not provided, default is 3. The behavior of producer bench is changed such that if some or all topics already exist (with topic names = topicPrefix + topic index), and they have the same number of partitions as requested, the worker uses those topics and does not fail. The producer bench fails if one or more existing topics has number of partitions that is different from expected number of partitions. Added unit test for WorkerUtils -- for existing methods and new methods. Fixed bug in MockAdminClient, where createTopics() would over-write existing topic's replication factor and number of partitions while correctly completing the appropriate futures exceptionally with TopicExistsException. Reviewers: Colin P. Mccabe <cmccabe@confluent.io>, Rajini Sivaram <rajinisivaram@googlemail.com>	2018-03-20 13:51:45 +00:00
Colin Patrick McCabe	9e0e6e43a7	MINOR: Trogdor should not assume an agent co-located with the controller (#4712 )	2018-03-16 17:57:38 +00:00
Guozhang Wang	f26fbb9adc	MINOR: Rename stream partition assignor to streams partition assignor (#4621 ) This is a straight-forward change that make the name of the partition assignor to be aligned with Streams. Reviewers: Matthias J. Sax <mjsax@apache.org>	2018-02-26 14:39:47 -08:00
Colin Patrick McCabe	66039b1312	MINOR: Fix ConcurrentModificationException in TransactionManager (#4608 )	2018-02-22 22:23:14 -08:00
Konstantine Karantasis	17aaff3606	KAFKA-6288: Broken symlink interrupts scanning of the plugin path Submitting a fail safe fix for rare IOExceptions on symbolic links. The fix is submitted without a test case since it does seem easy to reproduce such type of failures (just having a broken symbolic link does not reproduce the issue) and it's considered pretty low risk. If accepted, needs to be ported at least to 1.0, if not 0.11 Author: Konstantine Karantasis <konstantine@confluent.io> Reviewers: Randall Hauch <rhauch@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #4481 from kkonstantine/KAFKA-6288-Broken-symlink-interrupts-scanning-the-plugin-path	2018-02-04 14:57:25 -08:00
Rajini Sivaram	4019b21d60	KAFKA-6246; Dynamic update of listeners and security configs (#4488 ) Dynamic update of listeners as described in KIP-226. This includes: - Addition of new listeners with listener-prefixed security configs - Removal of existing listeners - Password encryption - sasl.jaas.config property for broker's JAAS config prefixed with listener and mechanism name	2018-02-04 09:19:16 -08:00
Randall Hauch	4c48942f9d	KAFKA-5142: Add Connect support for message headers (KIP-145) [KIP-145](https://cwiki.apache.org/confluence/display/KAFKA/KIP-145+-+Expose+Record+Headers+in+Kafka+Connect) has been accepted, and this PR implements KIP-145 except without the SMTs. Changed the Connect API and runtime to support message headers as described in [KIP-145](https://cwiki.apache.org/confluence/display/KAFKA/KIP-145+-+Expose+Record+Headers+in+Kafka+Connect). The new `Header` interface defines an immutable representation of a Kafka header (key-value pair) with support for the Connect value types and schemas. This interface provides methods for easily converting between many of the built-in primitive, structured, and logical data types. The new `Headers` interface defines an ordered collection of headers and is used to track all headers associated with a `ConnectRecord` (and thus `SourceRecord` and `SinkRecord`). This does allow multiple headers with the same key. The `Headers` contains methods for adding, removing, finding, and modifying headers. Convenience methods allow connectors and transforms to easily use and modify the headers for a record. A new `HeaderConverter` interface is also defined to enable the Connect runtime framework to be able to serialize and deserialize headers between the in-memory representation and Kafka’s byte[] representation. A new `SimpleHeaderConverter` implementation has been added, and this serializes to strings and deserializes by inferring the schemas (`Struct` header values are serialized without the schemas, so they can only be deserialized as `Map` instances without a schema.) The `StringConverter`, `JsonConverter`, and `ByteArrayConverter` have all been extended to also be `HeaderConverter` implementations. Each connector can be configured with a different header converter, although by default the `SimpleHeaderConverter` is used to serialize header values as strings without schemas. Unit and integration tests are added for `ConnectHeader` and `ConnectHeaders`, the two implementation classes for headers. Additional test methods are added for the methods added to the `Converter` implementations. Finally, the `ConnectRecord` object is already used heavily, so only limited tests need to be added while quite a few of the existing tests already cover the changes. Author: Randall Hauch <rhauch@gmail.com> Reviewers: Arjun Satish <arjun@confluent.io>, Ted Yu <yuzhihong@gmail.com>, Magesh Nandakumar <magesh.n.kumar@gmail.com>, Konstantine Karantasis <konstantine@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #4319 from rhauch/kafka-5142-b	2018-01-31 10:40:24 -08:00
Matthias J. Sax	a78f66a5ae	KAFKA-3625: Add public test utils for Kafka Streams (#4402 ) * KAFKA-3625: Add public test utils for Kafka Streams - add new artifact test-utils - add TopologyTestDriver - add MockTime, TestRecord, add TestRecordFactory Reviewers: Guozhang Wang <wangguoz@gmail.com>, Damian Guy <damian.guy@gmail.com>, Bill Bejeck <bill@confluent.io>	2018-01-29 17:21:48 -08:00
Rajini Sivaram	b814a16b96	KAFKA-6241; Enable dynamic updates of broker SSL keystore (#4263 ) Enable dynamic broker configuration (see KIP-226 for details). Includes - Base implementation to allow specific broker configs and custom configs to be dynamically updated - Extend DescribeConfigsRequest/Response to return all synonym configs and their sources in the order of precedence - Extend AdminClient to alter dynamic broker configs - Dynamic update of SSL keystores Reviewers: Ted Yu <yuzhihong@gmail.com>, Jason Gustafson <jason@confluent.io>	2018-01-23 08:44:31 -08:00
Manikumar Reddy	27a8d0f9e7	KAFKA-4541; Support for delegation token mechanism - Add capability to create delegation token - Add authentication based on delegation token. - Add capability to renew/expire delegation tokens. - Add units tests and integration tests Author: Manikumar Reddy <manikumar.reddy@gmail.com> Reviewers: Jun Rao <junrao@gmail.com> Closes #3616 from omkreddy/KAFKA-4541	2018-01-16 09:50:31 -08:00
Manikumar Reddy	488ea4b9fd	KAFKA-5647; Use KafkaZkClient in ReassignPartitionsCommand and PreferredReplicaLeaderElectionCommand * Use KafkaZkClient in ReassignPartitionsCommand * Use KafkaZkClient in PreferredReplicaLeaderElectionCommand * Updated test classes to use new methods * All existing tests should pass Author: Manikumar Reddy <manikumar.reddy@gmail.com> Reviewers: Jun Rao <junrao@gmail.com> Closes #4260 from omkreddy/KAFKA-5647-ADMINCOMMANDS	2017-12-20 12:19:36 -08:00
Matthias J. Sax	234ec8a8af	KAFKA-4857: Replace StreamsKafkaClient with AdminClient in Kafka Streams Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Bill Bejeck <bbejeck@gmail.com>, Guozhang Wang <wangguoz@gmail.com> Closes #4242 from mjsax/kafka-4857-admit-client	2017-12-07 16:16:54 -08:00
Jorge Quilcate Otoya	30f08d158a	KAFKA-5520: KIP-171; Extend Consumer Group Reset Offset for Stream Application KIP: https://cwiki.apache.org/confluence/display/KAFKA/KIP-171+-+Extend+Consumer+Group+Reset+Offset+for+Stream+Application Merge changes from KIP-198 Ref: https://github.com/apache/kafka/pull/3831 Author: Jorge Quilcate Otoya <quilcate.jorge@gmail.com> Author: Ismael Juma <ismael@juma.me.uk> Author: Matthias J. Sax <matthias@confluent.io> Author: Manikumar Reddy <manikumar.reddy@gmail.com> Author: Guozhang Wang <wangguoz@gmail.com> Author: Apurva Mehta <apurva@confluent.io> Author: Rajini Sivaram <rajinisivaram@googlemail.com> Author: Jason Gustafson <jason@confluent.io> Author: Vahid Hashemian <vahidhashemian@us.ibm.com> Author: Bill Bejeck <bill@confluent.io> Author: Dong Lin <lindong28@gmail.com> Author: Soenke Liebau <soenke.liebau@opencore.com> Author: Colin P. Mccabe <cmccabe@confluent.io> Author: Damian Guy <damian.guy@gmail.com> Author: Xavier Léauté <xl+github@xvrl.net> Author: Maytee Chinavanichkit <maytee.chinavanichkit@linecorp.com> Author: Joel Hamill <git config --global user.email> Author: Paolo Patierno <ppatierno@live.com> Author: siva santhalingam <siva.santhalingam@gmail.com> Author: Tommy Becker <tobecker@tivo.com> Author: Mickael Maison <mickael.maison@gmail.com> Author: Onur Karaman <okaraman@linkedin.com> Author: tedyu <yuzhihong@gmail.com> Author: Xin Li <Xin.Li@trivago.com> Author: Magnus Edenhill <magnus@edenhill.se> Author: Manjula K <manjula@kafka-summit.org> Author: Hugo Louro <hmclouro@gmail.com> Author: Jeff Widman <jeff@jeffwidman.com> Author: bartdevylder <bartdevylder@gmail.com> Author: Ewen Cheslack-Postava <me@ewencp.org> Author: Jacek Laskowski <jacek@japila.pl> Author: Tom Bentley <tbentley@redhat.com> Author: Konstantine Karantasis <konstantine@confluent.io> Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #4159 from jeqo/feature/kip-171	2017-12-06 11:38:38 -08:00
Colin P. Mccabe	d9cbc6b1a2	KAFKA-5811; Add Kibosh integration for Trogdor and Ducktape For ducktape: add Kibosh to the testing Dockerfile. Create files_unreadable_fault_spec.py. For trogdor: create FilesUnreadableFaultSpec.java. Add a unit test of using the Kibosh service. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com> Closes #4195 from cmccabe/KAFKA-5811	2017-11-16 17:59:24 +00:00
Colin P. Mccabe	4fac83ba1f	KAFKA-6060; Add workload generation capabilities to Trogdor Previously, Trogdor only handled "Faults." Now, Trogdor can handle "Tasks" which may be either faults, or workloads to execute in the background. The Agent and Coordinator have been refactored from a mutexes-and-condition-variables paradigm into a message passing paradigm. No locks are necessary, because only one thread can access the task state or worker state. This makes them a lot easier to reason about. The MockTime class can now handle mocking deferred message passing (adding a message to an ExecutorService with a delay). I added a MockTimeTest. MiniTrogdorCluster now starts up Agent and Coordinator classes in paralle in order to minimize junit test time. RPC messages now inherit from a common Message.java class. This class handles implementing serialization, equals, hashCode, etc. Remove FaultSet, since it is no longer necessary. Previously, if CoordinatorClient or AgentClient hit a networking problem, they would throw an exception. They now retry several times before giving up. Additionally, the REST RPCs to the Coordinator and Agent have been changed to be idempotent. If a response is lost, and the request is resent, no harm will be done. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk> Closes #4073 from cmccabe/KAFKA-6060	2017-11-03 09:37:29 +00:00
Randall Hauch	11afff0990	KAFKA-5990: Enable generation of metrics docs for Connect (KIP-196) A new mechanism was added recently to the Metrics framework to make it easier to generate the documentation. It uses a registry with a MetricsNameTemplate for each metric, and then those templates are used when creating the actual metrics. The metrics framework provides utilities that can generate the HTML documentation from the registry of templates. This change moves the recently-added Connect metrics over to use these templates and to then generate the metric documentation for Connect. This PR is based upon #3975 and can be rebased once that has been merged. Author: Randall Hauch <rhauch@gmail.com> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #3987 from rhauch/kafka-5990	2017-10-04 11:05:50 -07:00
Rajini Sivaram	021d8a8e96	KAFKA-5746; Add new metrics to support health checks (KIP-188) Adds new metrics to support health checks: 1. Error rates for each request type, per-error code 2. Request size and temporary memory size 3. Message conversion rate and time 4. Successful and failed authentication rates 5. ZooKeeper latency and status 6. Client version Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3705 from rajinisivaram/KAFKA-5746-new-metrics	2017-09-28 21:58:59 +01:00
Bill Bejeck	271f6b5aec	KAFKA-5862: Remove ZK dependency from Streams reset tool, Part I Author: Bill Bejeck <bill@confluent.io> Author: bbejeck <bbejeck@gmail.com> Reviewers: Matthias J. Sax <matthias@confluent.io>, Ted Yu <yuzhihong@gmail.com>, Guozhang Wang <wangguoz@gmail.com> Closes #3927 from bbejeck/KAFKA-5862_remove_zk_dependency_from_streams_reset_tool	2017-09-23 12:05:16 +08:00
Rajini Sivaram	96ba21e0df	KAFKA-5947; Handle authentication failure in admin client, txn producer 1. Raise AuthenticationException for authentication failures in admin client 2. Handle AuthenticationException as a fatal error for transactional producer 3. Add comments to authentication exceptions Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Vahid Hashemian <vahidhashemian@us.ibm.com>, Ismael Juma <ismael@juma.me.uk> Closes #3928 from rajinisivaram/KAFKA-5947-auth-failure	2017-09-21 13:58:43 +01:00
Jason Gustafson	0cf7708007	MINOR: Move request/response schemas to the corresponding object representation This refactor achieves the following: 1. Breaks up the increasingly unmanageable `Protocol` class and moves schemas closer to their actual usage. 2. Removes the need for redundant field identifiers maintained separately in `Protocol` and the respective request/response objects. 3. Provides a better mechanism for sharing common fields between different schemas (e.g. topics, partitions, error codes, etc.). 4. Adds convenience helpers to `Struct` for common patterns (such as setting a field only if it exists). Author: Jason Gustafson <jason@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3813 from hachikuji/protocol-schema-refactor	2017-09-19 05:12:55 +01:00
Rajini Sivaram	8fca432223	KAFKA-4764; Wrap SASL tokens in Kafka headers to improve diagnostics (KIP-152) SASL handshake protocol changes from KIP-152. Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Manikumar Reddy <manikumar.reddy@gmail.com> Closes #3708 from rajinisivaram/KAFKA-4764-SASL-diagnostics	2017-09-15 17:16:29 +01:00
Jason Gustafson	3b5d88febb	KAFKA-5783; Add KafkaPrincipalBuilder with support for SASL (KIP-189) Author: Jason Gustafson <jason@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com>, Manikumar Reddy <manikumar.reddy@gmail.com> Closes #3795 from hachikuji/KAFKA-5783	2017-09-14 10:16:00 +01:00
James Cheng	2fb5664bf4	KAFKA-5597: Autogenerate producer sender metrics Subtask of https://issues.apache.org/jira/browse/KAFKA-3480 The changes are very similar to what was done for the consumer in https://issues.apache.org/jira/browse/KAFKA-5191 (pull request https://github.com/apache/kafka/pull/2993) Author: James Cheng <jylcheng@yahoo.com> Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com>, Guozhang Wang <wangguoz@gmail.com> Closes #3535 from wushujames/producer_sender_metrics_docs Fix one minor naming bug	2017-09-05 17:38:58 -07:00
Colin P. Mccabe	0772fde562	KAFKA-5776; Add the Trogdor fault injection daemon Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com> Closes #3699 from cmccabe/trogdor-review	2017-08-25 12:29:40 -07:00
huxihx	607c3c21f6	KAFKA-5755; KafkaProducer should be refactored to use LogContext With LogContext, each producer log item is automatically prefixed with client id and transactional id. Author: huxihx <huxi_2b@hotmail.com> Reviewers: Jason Gustafson <jason@confluent.io> Closes #3703 from huxihx/KAFKA-5755	2017-08-25 10:42:40 -07:00
Ismael Juma	ed96523a2c	KAFKA-4501; Java 9 compilation and runtime fixes Compilation error fixes: - Avoid ambiguity error when appending to Properties in Scala code (https://github.com/scala/bug/issues/10418) - Use position() and limit() to fix ambiguity issue ( https://github.com/scala/bug/issues/10418#issuecomment-316364778) - Disable findBugs if Java 9 is used ( https://github.com/findbugsproject/findbugs/issues/105) Compilation warning fixes: - Avoid deprecated Class.newInstance in Utils.newInstance - Silence a few Java 9 deprecation warnings - var -> val and unused fixes Runtime error fixes: - Introduce Base64 class that works in Java 7 and Java 9 Also: - Set --release option if building with Java 9 Note that tests involving EasyMock (https://github.com/easymock/easymock/issues/193) or PowerMock (https://github.com/powermock/powermock/issues/783) will fail as neither supports Java 9 currently. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jason Gustafson <jason@confluent.io> Closes #3647 from ijuma/kafka-4501-support-java-9	2017-08-19 08:55:29 +01:00
Randall Hauch	3b1cea60e9	KAFKA-5731; Corrected how the sink task worker updates the last committed offsets Prior to this change, it was possible for the synchronous consumer commit request to be handled before previously-submitted asynchronous commit requests. If that happened, the out-of-order handlers improperly set the last committed offsets, which then became inconsistent with the offsets the connector task is working with. This change ensures that the last committed offsets are updated only for the most recent commit request, even if the consumer reorders the calls to the callbacks. Author: Randall Hauch <rhauch@gmail.com> Reviewers: Jason Gustafson <jason@confluent.io> Closes #3662 from rhauch/kafka-5731	2017-08-15 14:16:00 -07:00
Colin P. Mccabe	c9ffab1622	KAFKA-5658; Fix AdminClient request timeout handling bug resulting in continual BrokerNotAvailableExceptions The AdminClient does not properly clear calls from the callsInFlight structure. Later, in an effort to clear the lingering call objects, it closes the connection they are associated with. This disrupts new incoming calls, which then get BrokerNotAvailableException. This patch fixes this bug by properly removing completed calls from the callsInFlight structure. It also adds the Call#aborted flag, which ensures that we throw the right exception (TimeoutException instead of DisconnectException) and only abort a connection once -- even if there is a similar bug in the future which causes old Call objects to linger. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3584 from cmccabe/KAFKA-5658	2017-08-08 09:38:22 +01:00
radai-rosenblatt	47ee8e954d	KAFKA-4602; KIP-72 - Allow putting a bound on memory consumed by Incoming requests this is the initial implementation. Author: radai-rosenblatt <radai.rosenblatt@gmail.com> Reviewers: Ewen Cheslack-Postava <me@ewencp.org>, Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com>, Jun Rao <junrao@gmail.com> Closes #2330 from radai-rosenblatt/broker-memory-pool-with-muting	2017-07-26 08:19:56 +02:00
Matthias J. Sax	b04bed022a	KAFKA-3856; Cleanup Kafka Stream builder API (KIP-120) Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Damian Guy <damian.guy@gmail.com>, Bill Bejeck <bbejeck@gmail.com> Closes #2301 from mjsax/kafka-3856-topology-builder-API	2017-07-20 18:47:47 +01:00
Ismael Juma	a293e1dc0c	MINOR: Adjust checkstyle suppression paths to work on Windows Use the file name whenever possible and replace / with [/\\] when it's not. Also remove unnecessary suppresions. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Vahid Hashemian <vahidhashemian@us.ibm.com>, Jason Gustafson <jason@confluent.io> Closes #3431 from ijuma/fix-checkstyle-suppressions-on-windows	2017-06-28 01:47:00 +01:00
Eno Thereska	55a90938a1	MINOR: add Yahoo benchmark to nightly runs Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Damian Guy <damian.guy@gmail.com> Closes #3289 from enothereska/yahoo-benchmark	2017-06-21 11:46:59 +01:00
Ismael Juma	b20d9333be	KAFKA-5274: AdminClient Javadoc improvements Publish Javadoc for common.annotation package, which contains InterfaceStability. Finally, mark AdminClient classes with `Evolving` instead of `Unstable`. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Colin Mccabe, Gwen Shapira Closes #3316 from ijuma/kafka-5274-admin-client-javadoc	2017-06-14 08:57:49 -07:00
Matthias J. Sax	ba07d828c5	KAFKA-5362: Add EOS system tests for Streams API Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Damian Guy <damian.guy@gmail.com>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #3201 from mjsax/kafka-5362-add-eos-system-tests-for-streams-api	2017-06-08 14:08:54 -07:00
Ismael Juma	39eb31feae	MINOR: Optimise performance of `Topic.validate()` I included a JMH benchmark and the results follow. The implementation in this PR takes no more than 1/10th of the time when compared to trunk. I also included results for an alternative implementation that is a little slower than the one in the PR. Trunk: ```text TopicBenchmark.testValidate topic avgt 15 134.107 ± 3.956 ns/op TopicBenchmark.testValidate longer-topic-name avgt 15 316.241 ± 13.379 ns/op TopicBenchmark.testValidate very-long-topic-name_with_more_text avgt 15 636.026 ± 30.272 ns/op ``` Implementation in the PR: ```text TopicBenchmark.testValidate topic avgt 15 13.153 ± 0.383 ns/op TopicBenchmark.testValidate longer-topic-name avgt 15 26.139 ± 0.896 ns/op TopicBenchmark.testValidate very-long-topic-name.with_more_text avgt 15 44.829 ± 1.390 ns/op ``` Alternative implementation where boolean validChar = Character.isLetterOrDigit(c) \|\| c == '.' \|\| c == '_' \|\| c == '-'; ```text TopicBenchmark.testValidate topic avgt 15 18.883 ± 1.044 ns/op TopicBenchmark.testValidate longer-topic-name avgt 15 36.696 ± 1.220 ns/op TopicBenchmark.testValidate very-long-topic-name_with_more_text avgt 15 65.956 ± 0.669 ns/op ``` Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Guozhang Wang <wangguoz@gmail.com> Closes #3234 from ijuma/optimise-topic-is-valid	2017-06-06 03:08:40 +01:00
Ismael Juma	c7bc8f7d8c	MINOR: Remove redundant volatile write in RecordHeaders The JMH benchmark included shows that the redundant volatile write causes the constructor of `ProducerRecord` to take more than 50% longer: ProducerRecordBenchmark.constructorBenchmark avgt 15 24.136 ± 1.458 ns/op (before) ProducerRecordBenchmark.constructorBenchmark avgt 15 14.904 ± 0.231 ns/op (after) Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jason Gustafson <jason@confluent.io> Closes #3233 from ijuma/remove-volatile-write-in-records-header-constructor	2017-06-04 10:48:34 -07:00
Colin P. Mccabe	f389b71570	KAFKA-5374; Set allow auto topic creation to false when requesting node information only It avoids the need to handle protocol downgrades and it's safe (i.e. it will never cause the auto creation of topics). Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3220 from ijuma/kafka-5374-admin-client-metadata	2017-06-03 06:26:16 +01:00
Mario Molina	dc5bf4bd45	KAFKA-5218; New Short serializer, deserializer, serde Author: Mario Molina <mmolimar@gmail.com> Reviewers: Matthias J. Sax <matthias@confluent.io>, Damian Guy <damian.guy@gmail.com>, Michael G. Noll <michael@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #3017 from mmolimar/KAFKA-5218	2017-05-31 15:09:58 -07:00
Colin P. Mccabe	da9a171c99	KAFKA-5265; Move ACLs, Config, Topic classes into org.apache.kafka.common Also introduce TopicConfig. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3120 from cmccabe/KAFKA-5265	2017-05-31 17:35:31 +01:00
Xavier Léauté	c060c48285	KAFKA-5150; Reduce LZ4 decompression overhead - reuse decompression buffers in consumer Fetcher - switch lz4 input stream to operate directly on ByteBuffers - avoids performance impact of catching exceptions when reaching the end of legacy record batches - more tests with both compressible / incompressible data, multiple blocks, and various other combinations to increase code coverage - fixes bug that would cause exception instead of invalid block size for invalid incompressible blocks - fixes bug if incompressible flag is set on end frame block size Overall this improves LZ4 decompression performance by up to 40x for small batches. Most improvements are seen for batches of size 1 with messages on the order of ~100B. We see at least 2x improvements for for batch sizes of < 10 messages, containing messages < 10kB This patch also yields 2-4x improvements on v1 small single message batches for other compression types. Full benchmark results can be found here https://gist.github.com/xvrl/05132e0643513df4adf842288be86efd Author: Xavier Léauté <xavier@confluent.io> Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #2967 from xvrl/kafka-5150	2017-05-31 02:22:07 +01:00
Jason Gustafson	dfa3c8a92d	KAFKA-5316; LogCleaner should account for larger record sets after cleaning Author: Jason Gustafson <jason@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com> Closes #3142 from hachikuji/KAFKA-5316	2017-05-28 09:57:59 -07:00
Rajini Sivaram	73ca0d215e	KAFKA-5320: Include all request throttling in client throttle metrics Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #3137 from rajinisivaram/KAFKA-5320	2017-05-25 20:28:18 +01:00
Matthias J. Sax	495836a4a2	KAFKA-4923: Modify compatibility test for Exaclty-Once Semantics in Streams - add broker compatibility system tests Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Damian Guy, Eno Thereska, Guozhang Wang Closes #2974 from mjsax/kafka-4923-add-eos-to-streams-add-broker-check-and-system-test	2017-05-21 22:16:18 -07:00
Jiangjie Qin	7fad45557e	KAFKA-3995; KIP-126 Allow KafkaProducer to split and resend oversized batches Author: Jiangjie Qin <becket.qin@gmail.com> Reviewers: Joel Koshy <jjkoshy.w@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2638 from becketqin/KAFKA-3995	2017-05-21 17:31:31 -07:00
Xavier Léauté	e287523577	KAFKA-5192: add WindowStore range scan (KIP-155) Implements range scan for keys in windowed and session stores Modifies caching session and windowed stores to use segmented cache keys. Cache keys are internally prefixed with their segment id to ensure key ordering in the cache matches the ordering in the underlying store for keys spread across multiple segments. This should also result in fewer cache keys getting scanned for queries spanning only some segments. Author: Xavier Léauté <xavier@confluent.io> Reviewers: Damian Guy, Guozhang Wang Closes #3027 from xvrl/windowstore-range-scan	2017-05-18 17:02:51 -07:00
Konstantine Karantasis	45f2261763	KAFKA-3487: Support classloading isolation in Connect (KIP-146) Author: Konstantine Karantasis <konstantine@confluent.io> Reviewers: Randall Hauch <rhauch@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #3028 from kkonstantine/KAFKA-3487-Support-classloading-isolation-in-Connect	2017-05-18 10:39:15 -07:00
Ismael Juma	972b754536	KAFKA-3267; Describe and Alter Configs Admin APIs (KIP-133) Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jun Rao <junrao@gmail.com> Closes #3076 from ijuma/kafka-3267-describe-alter-configs-protocol	2017-05-18 06:51:02 +01:00
Colin P. Mccabe	9815e18fef	KAFKA-3266; Describe, Create and Delete ACLs Admin APIs (KIP-140) Includes server-side code, protocol and AdminClient. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2941 from cmccabe/KAFKA-3266	2017-05-18 03:20:30 +01:00
Guozhang Wang	c64cfd2e2b	KAFKA-5231: Bump up producer epoch when sending abort txn markers on InitPid Author: Guozhang Wang <wangguoz@gmail.com> Reviewers: Jason Gustafson, Jun Rao Closes #3066 from guozhangwang/K5231-bump-up-epoch-when-abort-txn	2017-05-17 18:05:12 -07:00
Rajini Sivaram	4c75f31a5f	KAFKA-5179; Log connection termination during authentication Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Ismael Juma, Jun Rao Closes #2980 from rajinisivaram/KAFKA-5179	2017-05-15 18:13:20 -04:00
Colin P. Mccabe	4aed28d189	KAFKA-3265; Add a public AdminClient API in Java (KIP-117) Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Dan Norwood <norwood@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #2472 from cmccabe/KAFKA-3265	2017-05-02 00:20:22 +01:00
Michael Andre Pearce	6185bc0276	KAFKA-4208; Add Record Headers As per KIP-82 Adding record headers api to ProducerRecord, ConsumerRecord Support to convert from protocol to api added Kafka Producer, Kafka Fetcher (Consumer) Updated MirrorMaker, ConsoleConsumer and scala BaseConsumer Add RecordHeaders and RecordHeader implementation of the interfaces Headers and Header Some bits using are reverted to being Java 7 compatible, for the moment until KIP-118 is implemented. Author: Michael Andre Pearce <Michael.Andre.Pearce@me.com> Reviewers: Radai Rosenblatt <radai.rosenblatt@gmail.com>, Jiangjie Qin <becket.qin@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io> Closes #2772 from michaelandrepearce/KIP-82	2017-04-28 19:18:27 -07:00
Apurva Mehta	a82f194b21	KAFKA-4818; Exactly once transactional clients Author: Apurva Mehta <apurva@confluent.io> Reviewers: Guozhang Wang <wangguoz@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io> Closes #2840 from apurvam/exactly-once-transactional-clients	2017-04-27 14:11:36 -07:00
Damian Guy	f69d94158c	KAFKA-5059: Implement Transactional Coordinator Author: Damian Guy <damian.guy@gmail.com> Author: Guozhang Wang <wangguoz@gmail.com> Author: Apurva Mehta <apurva@confluent.io> Reviewers: Guozhang Wang, Jason Gustafson, Apurva Mehta, Jun Rao Closes #2849 from dguy/exactly-once-tc	2017-04-26 14:10:38 -07:00
Matthias J. Sax	148f8c2545	KAFKA-4986; Producer per StreamTask support (KIP-129) Enable producer per task if exactly-once config is enabled. Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Eno Thereska <eno@confluent.io>, Damian Guy <damian.guy@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2773 from mjsax/exactly-once-streams-producer-per-task	2017-04-14 15:19:52 +01:00
Matthias J. Sax	865d82af2c	KAFKA-4990; Request/response classes for transactions (KIP-98) Author: Matthias J. Sax <matthias@confluent.io> Author: Guozhang Wang <wangguoz@gmail.com> Author: Jason Gustafson <jason@confluent.io> Reviewers: Apurva Mehta <apurva@confluent.io>, Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2799 from mjsax/kafka-4990-add-api-stub-config-parameters-request-types	2017-04-07 12:07:25 +01:00
Ben Stopford	0baea2ac13	KIP-101: Alter Replication Protocol to use Leader Epoch rather than High Watermark for Truncation This PR replaces https://github.com/apache/kafka/pull/2743 (just raising from Confluent repo) This PR describes the addition of Partition Level Leader Epochs to messages in Kafka as a mechanism for fixing some known issues in the replication protocol. Full details can be found here: [KIP-101 Reference](https://cwiki.apache.org/confluence/display/KAFKA/KIP-101+-+Alter+Replication+Protocol+to+use+Leader+Epoch+rather+than+High+Watermark+for+Truncation) The key elements are: - Epochs are stamped on messages as they enter the leader. - Epochs are tracked in both leader and follower in a new checkpoint file. - A new API allows followers to retrieve the leader's latest offset for a particular epoch. - The logic for truncating the log, when a replica becomes a follower, has been moved from Partition into the ReplicaFetcherThread - When partitions are added to the ReplicaFetcherThread they are added in an initialising state. Initialising partitions request leader epochs and then truncate their logs appropriately. This test provides a good overview of the workflow `EpochDrivenReplicationProtocolAcceptanceTest.shouldFollowLeaderEpochBasicWorkflow()` The corrupted log use case is covered by the test `EpochDrivenReplicationProtocolAcceptanceTest.offsetsShouldNotGoBackwards()` Remaining work: There is a do list here: https://docs.google.com/document/d/1edmMo70MfHEZH9x38OQfTWsHr7UGTvg-NOxeFhOeRew/edit?usp=sharing Author: Ben Stopford <benstopford@gmail.com> Author: Jun Rao <junrao@gmail.com> Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com> Closes #2808 from benstopford/kip-101-v2	2017-04-06 14:51:09 -07:00
Apurva Mehta	bdf4cba047	KAFKA-4817; Add idempotent producer semantics This is from the KIP-98 proposal. The main points of discussion surround the correctness logic, particularly the Log class where incoming entries are validated and duplicates are dropped, and also the producer error handling to ensure that the semantics are sound from the users point of view. There is some subtlety in the idempotent producer semantics. This patch only guarantees idempotent production upto the point where an error has to be returned to the user. Once we hit a such a non-recoverable error, we can no longer guarantee message ordering nor idempotence without additional logic at the application level. In particular, if an application wants guaranteed message order without duplicates, then it needs to do the following in the error callback: 1. Close the producer so that no queued batches are sent. This is important for guaranteeing ordering. 2. Read the tail of the log to inspect the last message committed. This is important for avoiding duplicates. Author: Apurva Mehta <apurva@confluent.io> Author: hachikuji <jason@confluent.io> Author: Apurva Mehta <apurva.1618@gmail.com> Author: Guozhang Wang <wangguoz@gmail.com> Author: fpj <fpj@apache.org> Author: Jason Gustafson <jason@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com> Closes #2735 from apurvam/exactly-once-idempotent-producer	2017-04-02 19:41:44 -07:00
Colin P. Mccabe	d345d53e4e	KAFKA-4902; Utils#delete should correctly handle I/O errors and symlinks Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Jun Rao <junrao@gmail.com>, Apurva Mehta <apurva@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #2691 from cmccabe/KAFKA-4902	2017-03-30 13:38:09 +01:00
Jason Gustafson	5bd06f1d54	KAFKA-4816; Message format changes for idempotent/transactional producer (KIP-98) Author: Jason Gustafson <jason@confluent.io> Reviewers: Jun Rao <junrao@gmail.com>, Apurva Mehta <apurva@confluent.io>, Guozhang Wang <wangguoz@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2614 from hachikuji/exactly-once-message-format	2017-03-24 19:38:43 +00:00
Ewen Cheslack-Postava	52a15d7c0b	KAFKA-4783: Add ByteArrayConverter (KIP-128) Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Guozhang Wang <wangguoz@gmail.com> Closes #2599 from ewencp/kafka-4783-byte-array-converter	2017-03-14 17:20:49 -07:00
bbejeck	79f85039d7	KAFKA-3989; Initial support for adding a JMH benchmarking module Author: bbejeck <bbejeck@gmail.com> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #1712 from bbejeck/KAFKA-3989_create_jmh_benchmarking_module	2017-03-06 11:56:14 +00:00
Colin P. Mccabe	d9b784e147	KAFKA-4796; Fix some findbugs warnings in Kafka Java client Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2593 from cmccabe/KAFKA-4796	2017-03-04 00:52:26 +00:00
Damian Guy	de05c9d3a0	MINOR: Add code quality checks (and suppressions) to checkstyle.xml Author: Damian Guy <damian.guy@gmail.com> Reviewers: Guozhang Wang <wangguoz@gmail.com>, Ewen Cheslack-Postava <me@ewencp.org>, Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #2594 from dguy/checkstyle	2017-02-28 22:57:57 +00:00
Matthias J. Sax	d0e436c471	MINOR: improve license header check by providing head file instead of (prefix) header regex Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #2303 from mjsax/licenseHeader	2017-02-28 12:35:04 -08:00
Maysam Yabandeh	cb674e5487	KAFKA-4039; Fix deadlock during shutdown due to log truncation not allowed Author: Maysam Yabandeh <myabandeh@dropbox.com> Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jun Rao <junrao@gmail.com> Closes #2474 from ijuma/kafka-4039-deadlock-during-shutdown	2017-02-02 22:23:49 +00:00
Damian Guy	825f225bc5	KAFKA-4588: Wait for topics to be created in QueryableStateIntegrationTest.shouldNotMakeStoreAvailableUntilAllStoresAvailable After debugging this i can see the times that it fails there is a race between when the topic is actually created/ready on the broker and when the assignment happens. When it fails `StreamPartitionAssignor.assign(..)` gets called with a `Cluster` with no topics. Hence the test hangs as no tasks get assigned. To fix this I added a `waitForTopics` method to `EmbeddedKafkaCluster`. This will wait until the topics have been created. Author: Damian Guy <damian.guy@gmail.com> Reviewers: Matthias J. Sax, Guozhang Wang Closes #2371 from dguy/integration-test-fix	2017-01-17 12:33:11 -08:00
Ismael Juma	da57bc27e7	KAFKA-4591; Create Topic Policy (KIP-108) Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Apurva Mehta <apurva.1618@gmail.com>, Gwen Shapira <cshapi@gmail.com>, Jason Gustafson <jason@confluent.io> Closes #2361 from ijuma/kafka-4591-create-topic-policy	2017-01-13 19:46:02 -08:00
Shikhar Bhushan	2f90488323	KAFKA-3209: KIP-66: single message transforms Besides API and runtime changes, this PR also includes 2 data transformations (`InsertField`, `HoistToStruct`) and 1 routing transformation (`TimestampRouter`). There is some gnarliness in `ConnectorConfig` / `ConfigDef` around creating, parsing and validating a dynamic `ConfigDef`. Author: Shikhar Bhushan <shikhar@confluent.io> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #2299 from shikhar/smt-2017	2017-01-12 16:14:53 -08:00
Rajini Sivaram	275c5e1df2	KAFKA-3751; SASL/SCRAM implementation Implementation of KIP-84: https://cwiki.apache.org/confluence/display/KAFKA/KIP-84%3A+Support+SASL+SCRAM+mechanisms Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2086 from rajinisivaram/KAFKA-3751	2017-01-10 08:05:07 -05:00
Jason Gustafson	67f1e5b91b	KAFKA-4390; Replace MessageSet usage with client-side alternatives Author: Jason Gustafson <jason@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>, Jun Rao <junrao@gmail.com> Closes #2140 from hachikuji/KAFKA4390	2016-12-13 10:26:25 -08:00
Matthias J. Sax	9bed8fbcfc	KAFKA-4393: Improve invalid/negative TS handling Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Michael G. Noll, Eno Thereska, Damian Guy, Guozhang Wang Closes #2117 from mjsax/kafka-4393-improveInvalidTsHandling	2016-12-09 16:17:36 -08:00
Jason Gustafson	3b4c347949	KAFKA-2066; Use client-side FetchRequest/FetchResponse on server Author: Jason Gustafson <jason@confluent.io> Reviewers: Jun Rao <junrao@gmail.com> Closes #2069 from hachikuji/KAFKA-2066	2016-11-14 16:31:04 -08:00
Sumit Arrawatia	ecc1fb10fa	KAFKA-4093; Cluster Id (KIP-78) This PR implements KIP-78:Cluster Identifiers [(link)](https://cwiki.apache.org/confluence/display/KAFKA/KIP-78%3A+Cluster+Id#KIP-78:ClusterId-Overview) and includes the following changes: 1. Changes to broker code - generate cluster id and store it in Zookeeper - update protocol to add cluster id to metadata request and response - add ClusterResourceListener interface, ClusterResource class and ClusterMetadataListeners utility class - send ClusterResource events to the metric reporters 2. Changes to client code - update Cluster and Metadata code to support cluster id - update clients for sending ClusterResource events to interceptors, (de)serializers and metric reporters 3. Integration tests for interceptors, (de)serializers and metric reporters for clients and for protocol changes and metric reporters for broker. 4. System tests for upgrading from previous versions. Author: Sumit Arrawatia <sumit.arrawatia@gmail.com> Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #1830 from arrawatia/kip-78	2016-09-17 07:53:25 +01:00
Matthias J. Sax	f7976d2fc1	KAFKA-4008: Module "tools" should not be dependent on "core" moved streams application reset tool from tools to core Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Damian Guy <damian.guy@gmail.com>, Guozhang Wang <wangguoz@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #1685 from mjsax/moveResetTool (cherry picked from commit `f2405a73ea`) Signed-off-by: Ewen Cheslack-Postava <me@ewencp.org>	2016-08-01 20:12:38 -07:00
Matthias J. Sax	8deedcacb6	KAFKA-3185: Allow users to cleanup internal Kafka Streams data - added Kafka Stream Application Reset Tool Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Guozhang Wang, Michael G. Noll Closes #1636 from mjsax/kafka-3185	2016-07-27 14:11:40 -07:00
Damian Guy	7d9d1cb235	KAFKA-3561: Auto create through topic for KStream aggregation and join guozhangwang enothereska mjsax miguno If you get a chance can you please take a look at this. I've done the repartitioning in the join, but it results in 2 internal topics for each join. This seems like overkill as sometimes we wouldn't need to repartition at all, others just 1 topic, and then sometimes both, but I'm not sure how we can know that. I'd also need to implement something similar for leftJoin, but again, i'd like to see if i'm heading down the right path or if anyone has any other bright ideas. For reference - https://github.com/apache/kafka/pull/1453 - the previous PR Thanks for taking the time and looking forward to getting some welcome advice :-) Author: Damian Guy <damian.guy@gmail.com> Author: Damian Guy <damian@continuum.local> Reviewers: Guozhang Wang <wangguoz@gmail.com> Closes #1472 from dguy/KAFKA-3561	2016-06-16 11:56:32 -07:00
Guozhang Wang	768328689a	MINOR: Modify checkstyle to allow import classes only used in javadoc Author: Guozhang Wang <wangguoz@gmail.com> Reviewers: Gwen Shapira, Ismael Juma Closes #1317 from guozhangwang/KJavaDocImport	2016-05-04 11:25:26 -07:00
Eno Thereska	b3d2c0dabb	MINOR: Added more integration tests Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Ismael Juma, Michael G. Noll, Guozhang Wang Closes #1285 from enothereska/more-integration-tests	2016-05-03 13:26:22 -07:00
Liquan Pei	316389d6ad	KAFKA-3611: Remove warnings when using reflections ewencp granders Can you take a look? Thanks! Author: Liquan Pei <liquanpei@gmail.com> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #1259 from Ishiihara/fix-warning	2016-04-28 11:59:02 -07:00
Eno Thereska	94aee2143e	KAFKA-3612: Added structure for integration tests Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Ismael Juma, Damian Guy, Michael G. Noll, Guozhang Wang Closes #1260 from enothereska/KAFKA-3612-integration-tests	2016-04-27 16:55:51 -07:00
Rajini Sivaram	5b375d7bf9	KAFKA-3149; Extend SASL implementation to support more mechanisms Code changes corresponding to KIP-43 to enable review of the KIP. Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Jun Rao <junrao@apache.org>, Ismael Juma <ismael@juma.me.uk> Closes #812 from rajinisivaram/KAFKA-3149	2016-04-26 16:56:42 -07:00
Rajini Sivaram	9d71489ff0	KAFKA-3548: Use root locale for case transformation of constant strings For enums and other constant strings, use locale independent case conversions to enable comparisons to work regardless of the default locale. Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Manikumar Reddy, Ismael Juma, Guozhang Wang, Gwen Shapira Closes #1220 from rajinisivaram/KAFKA-3548	2016-04-20 18:54:30 -07:00
Guozhang Wang	edeb11bc56	MINOR: Move streams-examples source files under src folder Also remove some unused imports. Author: Guozhang Wang <wangguoz@gmail.com> Reviewers: Ismael Juma <ismael@juma.me.uk>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #992 from guozhangwang/KSExamples	2016-03-01 18:53:58 -08:00
Jiangjie Qin	45c8195fa1	KAFKA-3025; Added timetamp to Message and use relative offset. See KIP-31 and KIP-32 for details. A few notes on the patch: 1. This patch implements KIP-31 and KIP-32. The patch includes features in both KAFKA-3025, KAFKA-3026 and KAFKA-3036 2. All unit tests passed. 3. The unit tests were run with new and old message format. 4. When message format conversion occurs during consumption, the consumer will not be able to detect the message size too large situation. I did not try to fix this because the situation seems rare and only happen during migration phase. Author: Jiangjie Qin <becket.qin@gmail.com> Author: Ismael Juma <ismael@juma.me.uk> Author: Jiangjie (Becket) Qin <becket.qin@gmail.com> Reviewers: Jason Gustafson <jason@confluent.io>, Anna Povzner <anna@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>, Jun Rao <junrao@gmail.com> Closes #764 from becketqin/KAFKA-3025	2016-02-19 07:56:40 -08:00
Gwen Shapira	b93f48f749	KAFKA-2422: Allow copycat connector plugins to be aliased to simpler names …names Author: Gwen Shapira <cshapi@gmail.com> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #687 from gwenshap/KAFKA-2422	2016-01-04 15:01:58 -05:00
Grant Henke	64b746bd8b	KAFKA-3020; Ensure CheckStyle runs on all Java code - Adds CheckStyle to core and examples modules - Fixes any existing CheckStyle issues Author: Grant Henke <granthenke@gmail.com> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #703 from granthenke/checkstyle-core	2015-12-21 22:48:03 -08:00
manasvigupta	a0d21407cb	KAFKA-3009; Disallow star imports Summary of code changes ------------------------------------ 1) Added a new Checkstyle rule to flag any code using star imports 2) Fixed ALL existing code violations using star imports Testing ----------- Local build was successful ALL JUnits ran successfully on local. ewencp - Request you to please review changes. Thank you ! I state that the contribution is my original work and I license the work to the project under the project's open source license. Author: manasvigupta <manasvigupta@yahoo.co.in> Reviewers: Ewen Cheslack-Postava <ewen@confluent.io> Closes #700 from manasvigupta/KAFKA-3009	2015-12-21 13:30:59 -08:00
Guozhang Wang	d05fa0a03b	KAFKA-2804: manage changelog topics through ZK in PartitionAssignor Author: Guozhang Wang <wangguoz@gmail.com> Author: wangguoz@gmail.com <guozhang@Guozhang-Macbook.local> Author: Guozhang Wang <guozhang@Guozhang-Macbook.local> Reviewers: Yasuhiro Matsuda Closes #579 from guozhangwang/K2804	2015-12-07 15:12:09 -08:00
Grant Henke	0a52ddfd03	MINOR: Fix building from subproject directory This patch fixes some releative paths so bulding from a subproject directory like ($rootDir/core) will not fail Author: Grant Henke <granthenke@gmail.com> Reviewers: Ewen Chesklack-Postava Closes #509 from granthenke/minor	2015-12-01 11:52:38 -08:00
Ismael Juma	52d5e88393	KAFKA-2847; Remove principal builder class from client configs Also mark `PrincipalBuilder` as `Unstable` and tweak docs. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jun Rao <junrao@gmail.com> Closes #542 from ijuma/kafka-2847-remove-principal-builder-class-from-client-configs	2015-11-17 08:36:43 -08:00
Ewen Cheslack-Postava	1408c670ea	KAFKA-2807: Fix Kafka Connect packaging and move VerifiableSource/Sink into runtime jar. Gradle does not handle subprojects with the same name (top-level tools vs connect/tools) properly, making the dependency impossible to express correctly since we need to move the ThroughputThrottler class into the top level tools project. Moving the current set of tools into the runtime jar works fine since they are only used for system tests at the moment. Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Gwen Shapira Closes #512 from ewencp/kafka-2807-redux	2015-11-12 11:11:56 -08:00
Ewen Cheslack-Postava	8db55618d5	KAFKA-2752: Add VerifiableSource/Sink connectors and rolling bounce Copycat system tests. Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Ben Stopford, Geoff Anderson, Guozhang Wang Closes #432 from ewencp/kafka-2752-copycat-clean-bounce-test	2015-11-10 14:54:15 -08:00
Jason Gustafson	bce664b42a	KAFKA-2274: verifiable consumer and integration testing Author: Jason Gustafson <jason@confluent.io> Reviewers: Guozhang Wang, Geoff Anderson Closes #465 from hachikuji/KAFKA-2274	2015-11-09 18:38:22 -08:00
Ewen Cheslack-Postava	f2031d4063	KAFKA-2774: Rename Copycat to Kafka Connect Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Gwen Shapira Closes #456 from ewencp/kafka-2774-rename-copycat	2015-11-08 22:11:03 -08:00
Ewen Cheslack-Postava	c001b2040c	KAFKA-2369: Add REST API for Copycat. Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Gwen Shapira, James Cheng Closes #378 from ewencp/kafka-2369-copycat-rest-api	2015-10-30 15:00:00 -07:00
Grant Henke	2e4aed7070	KAFKA-2516: Rename o.a.k.client.tools to o.a.k.tools Author: Grant Henke <granthenke@gmail.com> Reviewers: Gwen Shapira, Ewen Cheslack-Postava Closes #310 from granthenke/tools-packaging	2015-10-27 07:44:32 -07:00
Ewen Cheslack-Postava	2e61773590	KAFKA-2371: Add distributed support for Copycat. This adds coordination between DistributedHerders using the generalized consumer support, allowing automatic balancing of connectors and tasks across workers. A few pieces that require interaction between workers (resolving config inconsistencies, forwarding of configuration changes to the leader worker) are incomplete because they require REST API support to implement properly. Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Jason Gustafson, Gwen Shapira Closes #321 from ewencp/kafka-2371-distributed-herder	2015-10-23 16:37:30 -07:00
Sriharsha Chintalapani	403158b54b	KAFKA-1686; Implement SASL/Kerberos This PR implements SASL/Kerberos which was originally submitted by harshach as https://github.com/apache/kafka/pull/191. I've been submitting PRs to Harsha's branch with fixes and improvements and he has integrated all, but the most recent one. I'm creating this PR so that the Jenkins can run the tests on the branch (they pass locally). Author: Ismael Juma <ismael@juma.me.uk> Author: Sriharsha Chintalapani <harsha@hortonworks.com> Author: Harsha <harshach@users.noreply.github.com> Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com>, Parth Brahmbhatt <brahmbhatt.parth@gmail.com>, Jun Rao <junrao@gmail.com> Closes #334 from ijuma/KAFKA-1686-V1	2015-10-20 14:13:34 -07:00
flavio junqueira	ce306ba4eb	KAFKA-2639: Refactoring of ZkUtils I've split the work of KAFKA-1695 because this refactoring touches a large number of files. Most of the changes are trivial, but I feel it will be easier to review this way. This pull request includes the one Parth-Brahmbhatt started to address KAFKA-1695. Author: flavio junqueira <fpj@apache.org> Author: Flavio Junqueira <fpj@apache.org> Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com> Closes #303 from fpj/KAFKA-2639	2015-10-18 15:23:52 -07:00
Ismael Juma	4e7db39556	MINOR: Make indenting in `checkstyle.xml` and `import-control.xml` consistent They now both use 2 spaces for indents, which is what `checkstyle.xml` was already doing. `import.xml` had a mixture of tabs and 4 spaces previously. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Gwen Shapira Closes #253 from ijuma/fix-xml-indents	2015-09-28 14:51:06 -07:00
Ewen Cheslack-Postava	c20e3bf8d8	HOTFIX: Checkstye fixes follow up for KAKFA-2531. Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Gwen Shapira Closes #248 from ewencp/hotfix-kafka-2531-checkstyle	2015-09-28 05:58:11 -07:00
Guozhang Wang	263c10ab7c	KIP-28: Add a processor client for Kafka Streaming This work has been contributed by Jesse Anderson, Randall Hauch, Yasuhiro Matsuda and Guozhang Wang. The detailed design can be found in https://cwiki.apache.org/confluence/display/KAFKA/KIP-28+-+Add+a+processor+client. Author: Guozhang Wang <wangguoz@gmail.com> Author: Yasuhiro Matsuda <yasuhiro.matsuda@gmail.com> Author: Yasuhiro Matsuda <yasuhiro@confluent.io> Author: ymatsuda <yasuhiro.matsuda@gmail.com> Author: Randall Hauch <rhauch@gmail.com> Author: Jesse Anderson <jesse@smokinghand.com> Author: Ismael Juma <ismael@juma.me.uk> Author: Jesse Anderson <eljefe6a@gmail.com> Reviewers: Ismael Juma, Randall Hauch, Edward Ribeiro, Gwen Shapira, Jun Rao, Jay Kreps, Yasuhiro Matsuda, Guozhang Wang Closes #130 from guozhangwang/streaming	2015-09-25 17:27:58 -07:00
Ewen Cheslack-Postava	48b4d6938d	KAFKA-2373: Add Kafka-backed offset storage for Copycat. Author: Ewen Cheslack-Postava <me@ewencp.org> Reviewers: Gwen Shapira, James Cheng Closes #202 from ewencp/kafka-2373-copycat-distributed-offset	2015-09-24 19:01:11 -07:00

... 2 3 4 5 6 ...

357 Commits