kafka

Commit Graph

Author	SHA1	Message	Date
Yash Mayya	7ff2dbb107	KAFKA-14368: Connect offset write REST API (#13465 ) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>	2023-05-26 12:08:06 -04:00
Colin P. McCabe	12130cfcec	MINOR: Create the MetadataNode classes to introspect MetadataImage Metadata image classes such as MetadataImage, ClusterImage, FeaturesImage, and so forth contain numerous sub-images. This PR adds a structured way of traversing those sub-images. This is useful for the metadata shell, and also for implementing toString functions. In both cases, the previous solution was suboptimal. The metadata shell was previously implemented in an ad-hoc way by mutating text-based tree nodes when records were replayed. This was difficult to keep in sync with changes to the record types (for example, we forgot to do this for SCRAM). It was also pretty low-level, being done at a level below that of the image classes. For toString, it was difficult to keep the implementations consistent previously, and also support both redacted and non-redacted output. The metadata shell directory was getting crowded since we never had submodules for it. This PR creates glob/, command/, node/, and state/ directories to keep things better organized. Reviewers: David Arthur <mumrah@gmail.com>, Ron Dagostino <rdagostino@confluent.io>	2023-05-23 10:11:26 -07:00
Jeff Kim	cc011f77aa	KAFKA-14500; [2/N] Rewrite GroupMetadata in Java (#13663 ) This patch introduces `GenericGroup` which rewrite the `GroupMetadata` in Java. The `GenericGroup` is basically a group using the current rebalance protocol in the new group coordinator. Reviewers: Ritika Reddy <rreddy@confluent.io>, Christo Lolov <lolovc@amazon.com>, David Jacot <djacot@confluent.io>	2023-05-12 11:22:29 +02:00
Gantigmaa Selenge	ea540fa400	KAFKA-14592: Move FeatureCommand to tools (#13459 ) KAFKA-14592: Move FeatureCommand to tools Reviewers: Luke Chen <showuon@gmail.com>	2023-04-25 20:28:37 +08:00
Ron Dagostino	e27926f92b	KAFKA-14735: Improve KRaft metadata image change performance at high … (#13280 ) topic counts. Introduces the use of persistent data structures in the KRaft metadata image to avoid copying the entire TopicsImage upon every change. Performance that was O(<number of topics in the cluster>) is now O(<number of topics changing>), which has dramatic time and GC improvements for the most common topic-related metadata events. We abstract away the chosen underlying persistent collection library via ImmutableMap<> and ImmutableSet<> interfaces and static factory methods. Reviewers: Luke Chen <showuon@gmail.com>, Colin P. McCabe <cmccabe@apache.org>, Ismael Juma <ismael@juma.me.uk>, Purshotam Chauhan <pchauhan@confluent.io>	2023-04-17 17:52:28 -04:00
David Jacot	e1e3900ba1	KAFKA-14462; [4/N] Add Group, Record and Result (#13520 ) This patch adds Group, Record and Result. Reviewers: Jason Gustafson <jason@confluent.io>, Jeff Kim <jeff.kim@confluent.io>, Justine Olshan <jolshan@confluent.io>	2023-04-12 13:16:49 +02:00
José Armando García Sancio	672dd3ab6a	KAFKA-13020; Implement reading Snapshot log append timestamp (#13345 ) The SnapshotReader exposes the "last contained log time". This is mainly used during snapshot cleanup. The previous implementation used the append time of the snapshot record. This is not accurate as this is the time when the snapshot was created and not the log append time of the last record included in the snapshot. The log append time of the last record included in the snapshot is store in the header control record of the snapshot. The header control record is the first record of the snapshot. To be able to read this record, this change extends the RecordsIterator to decode and expose the control records in the Records type. Reviewers: Colin Patrick McCabe <cmccabe@apache.org>	2023-04-07 09:25:54 -07:00
Yash Mayya	970dea60e8	KAFKA-14785 (KIP-875): Connect offset read REST API (#13434 ) Reviewers: Chris Egerton <chrise@aiven.io>	2023-04-02 13:09:33 -04:00
vamossagar12	c14f56b484	KAFKA-14586: Moving StreamResetter to tools (#13127 ) Moves StreamResetter to tools project. Reviewers: Federico Valeri <fedevaleri@gmail.com>, Christo Lolov <lolovc@amazon.com>, Bruno Cadonna <cadonna@apache.org>	2023-03-28 14:43:22 +02:00
David Arthur	f1b3732fa6	KAFKA-14796 Migrate ACLs from AclAuthorizor to KRaft (#13368 ) This patch refactors the loadCache method in AclAuthorizer to make it reusable by ZkMigrationClient. The loaded ACLs are converted to AccessControlEntryRecord. I noticed we still have the defunct AccessControlRecord, so I've deleted it. Also included here are the methods to write ACL changes back to ZK while in dual-write mode. Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Colin P. McCabe <cmccabe@apache.org>	2023-03-27 16:12:02 -07:00
Colin Patrick McCabe	ed400e4c0d	KAFKA-14835: Create ControllerMetadataMetricsPublisher (#13438 ) Separate out KRaft controller metrics into two groups: metrics directly managed by the QuorumController, and metrics handled by an external publisher. This separation of concerns makes the code easier to reason about, by clarifying what metrics can be changed where. The external publisher, ControllerServerMetricsPublisher, handles all metrics which are related to the content of metadata. For example, metrics about number of topics or number of partitions, etc. etc. It fits into the MetadataLoader metadata publishing framework as another publisher. Since ControllerServerMetricsPublisher operates off of a MetadataImage, we don't have to create (essentially) another copy of the metadata in memory, as ControllerMetricsManager. This reduces memory consumption. Another benefit of operating off of the MetadataImage is that we don't have to have special handling for each record type, like we do now in ControllerMetricsManager. Reviewers: David Arthur <mumrah@gmail.com>	2023-03-24 11:26:53 -07:00
David Jacot	788cc11f45	KAFKA-14462; [3/N] Add `onNewMetadataImage` to `GroupCoordinator` interface (#13357 ) The new group coordinator needs to access cluster metadata (e.g. topics, partitions, etc.) and it needs a mechanism to be notified when the metadata changes (e.g. to trigger a rebalance). In KRaft clusters, the easiest is to subscribe to metadata changes via the MetadataPublisher. Reviewers: Justine Olshan <jolshan@confluent.io>	2023-03-08 08:52:01 +01:00
Proven Provenzano	38c409cf33	KAFKA-14084: SCRAM support in KRaft. (#13114 ) This commit adds support to store the SCRAM credentials in a cluster with KRaft quorum servers and no ZK cluster backing the metadata. This includes creating ScramControlManager in the controller, and adding support for SCRAM to MetadataImage and MetadataDelta. Change UserScramCredentialRecord to contain only a single tuple (name, mechanism, salt, pw, iter) rather than a mapping between name and a list. This will avoid creating an excessively large record if a single user has many entries. Because record ID 11 (UserScramCredentialRecord) has not been used before, this is a compatible change. SCRAM will be supported in 3.5-IV0 and later. This commit does not include KIP-900 SCRAM bootstrapping support, or updating the credential cache on the controller (as opposed to broker). We will implement these in follow-on commits. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Colin P. McCabe <cmccabe@apache.org>	2023-03-03 10:23:34 -08:00
vamossagar12	bb3111f472	KAFKA-14580: Moving EndToEndLatency from core to tools module (#13095 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Federico Valeri <fedevaleri@gmail.com>, Ismael Juma <mlists@juma.me.uk>	2023-03-02 12:05:22 +01:00
Ron Dagostino	631e6be3a0	KAFKA-14711: kafaka-metadata-quorum.sh does not honor --command-confi… (#13241 ) …g option https://github.com/apache/kafka/pull/12951 accidentally changed the behavior of the `kafaka-metadata-quorum.sh` CLI by making it silently ignore a `--command-config <filename>` properties file that exists. This was an undetected regression in the 3.4.0 release. This patch fixes the issue such that any such specified file will be honored. Reviewers: José Armando García Sancio <jsancio@apache.org>, Ismael Juma <ismael@juma.me.uk>	2023-02-13 18:33:20 -05:00
David Jacot	39962eeeb3	KAFKA-14513; Add broker side PartitionAssignor interface (#13202 ) This patch adds the broker side `PartitionAssignor` interface as detailed in KIP-848. The interfaces differs a bit from the KIP in the following ways: * The POJOs are not defined within the interface because the interface is to heavy like this. * The interface is kept in the `group-coordinator` module for now. We don't want to have it out there until KIP-848 is ready to be released. We will move it to its final destination later. Reviewers: Jeff Kim <jeff.kim@confluent.io>, Justine Olshan <jolshan@confluent.io>, Christo Lolov <lolovc@amazon.com>, Guozhang Wang <wangguoz@gmail.com>	2023-02-10 08:26:00 +01:00
Chris Egerton	f93d5af839	KAFKA-15086, KAFKA-9981: Intra-cluster communication for Mirror Maker 2 (#13137 ) Reviewers: Daniel Urban <durban@cloudera.com>, Greg Harris <greg.harris@aiven.io>, Viktor Somogyi-Vass <viktorsomogyi@gmail.com>, Mickael Maison <mickael.maison@gmail.com>	2023-02-09 10:50:07 -05:00
Satish Duggana	da2e8dce71	KAFKA-14551 Move/Rewrite LeaderEpochFileCache and its dependencies to the storage module. (#13046 ) KAFKA-14551 Move/Rewrite LeaderEpochFileCache and its dependencies to the storage module. For broader context on this change, you may want to look at KAFKA-14470: Move log layer to the storage module Reviewers: Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>, Alexandre Dupriez <alexandre.dupriez@gmail.com>	2023-02-07 15:37:23 +05:30
David Jacot	094e343f18	KAFKA-14678; Move `__consumer_offsets` records from `core` to `group-coordinator` (#13200 ) This patch moves the current `__consumer_offsets` records from the `core` module to the new `group-coordinator` module. Reviewers: Christo Lolov <lolovc@amazon.com>, Mickael Maison <mickael.maison@gmail.com>	2023-02-07 09:06:56 +01:00
Federico Valeri	50e0e3c257	KAFKA-14582: Move JmxTool to tools (#13136 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>	2023-02-02 11:23:26 +01:00
Federico Valeri	72cfc994f5	KAFKA-14628: Move CommandLineUtils and CommandDefaultOptions to tools (#13131 ) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Christo Lolov <christololov@gmail.com>, Sagar Rao <sagarmeansocean@gmail.com>	2023-01-26 20:06:09 +01:00
Akhilesh C	db49070760	KAFKA-14493: Introduce Zk to KRaft migration state machine STUBs in KRaft controller. (#12998 ) This patch introduces a preliminary state machine that can be used by KRaft controller to drive online migration from Zk to KRaft. MigrationState -- Defines the states we can have while migration from Zk to KRaft. KRaftMigrationDriver -- Defines the state transitions, and events to handle actions like controller change, metadata change, broker change and have interfaces through which it claims Zk controllership, performs zk writes and sends RPCs to ZkBrokers. MigrationClient -- Interface that defines the functions used to claim and relinquish Zk controllership, read to and write from Zk. Co-authored-by: David Arthur <mumrah@gmail.com> Reviewers: Colin P. McCabe <cmccabe@apache.org>	2023-01-09 10:44:11 -08:00
Ismael Juma	96d9710c17	KAFKA-14478: Move LogConfig/CleanerConfig and related to storage module (#13049 ) Additional notable changes to fix multiple dependency ordering issues: * Moved `ConfigSynonym` to `server-common` * Moved synonyms from `LogConfig` to `ServerTopicConfigSynonyms ` * Removed `LogConfigDef` `define` overrides and rely on `ServerTopicConfigSynonyms` instead. * Moved `LogConfig.extractLogConfigMap` to `KafkaConfig` * Consolidated relevant defaults from `KafkaConfig`/`LogConfig` in the latter * Consolidate relevant config name definitions in `TopicConfig` * Move `ThrottledReplicaListValidator` to `storage` Reviewers: Satish Duggana <satishd@apache.org>, Mickael Maison <mickael.maison@gmail.com>	2023-01-04 02:42:52 -08:00
Colin Patrick McCabe	29c09e2ca1	MINOR: ControllerServer should use the new metadata loader and snapshot generator (#12983 ) This PR introduces the new metadata loader and snapshot generator. For the time being, they are only used by the controller, but a PR for the broker will come soon. The new metadata loader supports adding and removing publishers dynamically. (In contrast, the old loader only supported adding a single publisher.) It also passes along more information about each new image that is published. This information can be found in the LogDeltaManifest and SnapshotManifest classes. The new snapshot generator replaces the previous logic for generating snapshots in QuorumController.java and associated classes. The new generator is intended to be shared between the broker and the controller, so it is decoupled from both. There are a few small changes to the old snapshot generator in this PR. Specifically, we move the batch processing time and batch size metrics out of BrokerMetadataListener.scala and into BrokerServerMetrics.scala. Finally, fix a case where we are using 'is' rather than '==' for a numeric comparison in snapshot_test.py. Reviewers: David Arthur <mumrah@gmail.com>	2022-12-15 16:53:07 -08:00
Ismael Juma	88725669e7	MINOR: Move MetadataQuorumCommand from `core` to `tools` (#12951 ) `core` should only be used for legacy cli tools and tools that require access to `core` classes instead of communicating via the kafka protocol (typically by using the client classes). Summary of changes: 1. Convert the command implementation and tests to Java and move it to the `tools` module. 2. Introduce mechanism to capture stdout and stderr from tests. 3. Change `kafka-metadata-quorum.sh` to point to the new command class. 4. Adjusted the test classpath of the `tools` module so that it supports tests that rely on the `@ClusterTests` annotation. 5. Improved error handling when an exception different from `TerseFailure` is thrown. 6. Changed `ToolsUtils` to avoid usage of arrays in favor of `List`. Reviewers: dengziming <dengziming1993@gmail.com>	2022-12-09 09:22:58 -08:00
David Arthur	d40561e90a	KAFKA-14427 ZK client support for migrations (#12946 ) This patch adds support for reading and writing ZooKeeper metadata during a KIP-866 migration. For reading metadata from ZK, methods from KafkaZkClient and ZkData are reused to ensure we are decoding the JSON consistently. For writing metadata, we use a new multi-op transaction that ensures only a single controller is writing to ZK. This is similar to the existing multi-op transaction that KafkaController uses, but it also includes a check on the new "/migration" ZNode. The transaction consists of three operations: * CheckOp on /controller_epoch * SetDataOp on /migration with zkVersion * CreateOp/SetDataOp/DeleteOp (the actual operation being applied) In the case of a batch of operations (such as topic creation), only the final MultiOp has a SetDataOp on /migration while the other requests use a CheckOp (similar to /controller_epoch). Reviewers: Colin Patrick McCabe <cmccabe@apache.org>, dengziming <dengziming1993@gmail.com>	2022-12-08 13:14:01 -05:00
Colin Patrick McCabe	100e874671	MINOR: Move dynamic config logic to DynamicConfigPublisher (#12958 ) Split out the logic for applying dynamic configurations to a KafkaConfig object from BrokerMetadataPublisher into a new class, DynamicConfigPublisher. This will allow the ControllerServer to also run this code, in a follow-up change. Create separate KafkaConfig objects in BrokerServer versus ControllerServer. This is necessary because the controller will apply configuration changes as soon as its raft client catches up to the high water mark, whereas the broker will wait for the active controller to acknowledge it has caught up in a heartbeat response. So when running in combined mode, we want two separate KafkaConfig objects that are changed at different times. Minor changes: improve the error message when catching up broker metadata fails. Fix incorrect indentation in checkstyle/import-control.xml. Invoke AppInfoParser.unregisterAppInfo from SharedServer.stop so that it happens only when both the controller and broker have shut down. Reviewers: David Arthur <mumrah@gmail.com>	2022-12-07 10:43:34 -08:00
Patrik Marton	1c10d107fe	KAFKA-14293: Basic Auth filter should set the SecurityContext after a successful login (#12846 ) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>	2022-12-05 09:38:40 -05:00
Colin Patrick McCabe	a3f5eb6e35	MINOR: Implement EventQueue#size and EventQueue#empty (#12930 ) Implement functions to measure the number of events in the event queue. Reviewers: David Arthur <mumrah@gmail.com>	2022-12-01 09:04:04 -08:00
David Jacot	98e19b3000	KAFKA-14367; Add `JoinGroup` to the new `GroupCoordinator` interface (#12845 ) This patch adds `joinGroup` to the new `GroupCoordinator` interface and updates `KafkaApis` to use it. For the context, I will do the same for all the other interactions with the current group coordinator. In order to limit the changes, I have chosen to introduce the `GroupCoordinatorAdapter` that translates the new interface to the old one. It is basically a wrapper. This allows keeping the current group coordinator untouched for now and focus on the `KafkaApis` changes. Eventually, we can remove `GroupCoordinatorAdapter`. Reviewers: Justine Olshan <jolshan@confluent.io>, Jeff Kim <jeff.kim@confluent.io>, Luke Chen <showuon@gmail.com>, Jason Gustafson <jason@confluent.io>	2022-11-29 20:39:12 +01:00
Greg Harris	fca5bfe13c	KAFKA-14346: Remove hard-to-mock RestClient calls (#12828 ) Reviewers: Chris Egerton <chrise@aiven.io>	2022-11-17 17:51:54 -05:00
Colin Patrick McCabe	dac81161db	MINOR; Introduce ImageWriter and ImageWriterOptions (#12715 ) This PR adds a new ImageWriter interface which replaces the generic Consumer interface which accepted lists of records. It is better to do batching in the ImageWriter than to try to deal with that complexity in the MetadataImage#write functions, especially since batching is not semantically meaningful in KRaft snapshots. The new ImageWriter interface also supports freeze and close, which more closely matches the semantics of the underlying Raft classes. The PR also adds an ImageWriterOptions class which we can use to pass parameters to control how the new image is written. Right now, the parameters that we are interested in are the target metadata version (which may be more or less than the original image's version) and a handler function which is invoked whenever metadata is lost due to the target version. Convert over the MetadataImage#write function (and associated functions) to use the new ImageWriter and ImageWriterOptions. In particular, we now have a way to handle metadata losses by invoking ImageWriterOptions#handleLoss. This allows us to handle writing an image at a lower version, for the first time. This support is still not enabled externally by this PR, though. That will come in a future PR. Get rid of the use of SOME_RECORD_TYPE.highestSupportedVersion() in several places. In general, we do not want to "silently" change the version of a record that we output, just because a new version was added. We should be explicit about what record version numbers we are outputting. Implement ProducerIdsDelta#toString, to make debug logs look better. Move MockRandom to the server-common package so that other internal broker packages can use it. Reviewers: José Armando García Sancio <jsancio@apache.org>	2022-10-13 09:56:19 -07:00
Chris Egerton	18e60cb000	KAFKA-12497: Skip periodic offset commits for failed source tasks (#10528 ) Also moves the Streams LogCaptureAppender class into the clients module so that it can be used by both Streams and Connect. Reviewers: Nigel Liang <nigel@nigelliang.com>, Kalpesh Patel <kpatel@confluent.io>, John Roesler <vvcephei@apache.org>, Tom Bentley <tbentley@redhat.com>	2022-10-13 10:15:42 -04:00
Alexandre Garnier	62914129c7	KAFKA-14099 - Fix request logging in connect (#12434 ) Reviewers: Chris Egerton <chrise@aiven.io>	2022-10-12 10:28:55 -04:00
Jason Gustafson	c5745d2845	MINOR: Add initial property tests for StandardAuthorizer (#12703 ) In https://github.com/apache/kafka/pull/12695, we discovered a gap in our testing of `StandardAuthorizer`. We addressed the specific case that was failing, but I think we need to establish a better methodology for testing which incorporates randomized inputs. This patch is a start in that direction. We implement a few basic property tests using jqwik which focus on prefix searching. It catches the case from https://github.com/apache/kafka/pull/12695 prior to the fix. In the future, we can extend this to cover additional operation types, principal matching, etc. Reviewers: David Arthur <mumrah@gmail.com>	2022-10-04 16:31:43 -07:00
Kirk True	8e43548175	KAFKA-13725: KIP-768 OAuth code mixes public and internal classes in same package (#12039 ) * KAFKA-13725: KIP-768 OAuth code mixes public and internal classes in same package Move classes into a sub-package of "internal" named "secured" that matches the layout more closely of the "unsecured" package. Replaces the concrete implementations in the former packages with sub-classes of the new package layout and marks them as deprecated. If anyone is already using the newer OAuth code, this should still work. * Fix checkstyle and spotbugs violations Co-authored-by: Kirk True <kirk@mustardgrain.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	2022-09-23 13:15:15 +05:30
Colin Patrick McCabe	f0f918b242	KAFKA-14177: Correctly support older kraft versions without FeatureLevelRecord (#12513 ) The main changes here are ensuring that we always have a metadata.version record in the log, making sure that the bootstrap file can be used for records other than the metadata.version record (for example, we will want to put SCRAM initialization records there), and fixing some bugs. If no feature level record is in the log and the IBP is less than 3.3IV0, then we assume the minimum KRaft version for all records in the log. Fix some issues related to initializing new clusters. If there are no records in the log at all, then insert the bootstrap records in a single batch. If there are records, but no metadata version, process the existing records as though they were metadata.version 3.3IV0 and then append a metadata version record setting version 3.3IV0. Previously, we were not clearly distinguishing between the case where the metadata log was empty, and the case where we just needed to add a metadata.version record. Refactor BootstrapMetadata into an immutable class which contains a 3-tuple of metadata version, record list, and source. The source field is used to log where the bootstrap metadata was obtained from. This could be a bootstrap file, the static configuration, or just the software defaults. Move the logic for reading and writing bootstrap files into BootstrapDirectory.java. Add LogReplayTracker, which tracks whether the log is empty. Fix a bug in FeatureControlManager where it was possible to use a "downgrade" operation to transition to a newer version. Do not store whether we have seen a metadata version or not in FeatureControlManager, since that is now handled by LogReplayTracker. Introduce BatchFileReader, which is a simple way of reading a file containing batches of snapshots that does not require spawning a thread. Rename SnapshotFileWriter to BatchFileWriter to be consistent, and to reflect the fact that bootstrap files aren't snapshots. QuorumController#processBrokerHeartbeat: add an explanatory comment. Reviewers: David Arthur <mumrah@gmail.com>, Jason Gustafson <jason@confluent.io>	2022-08-25 18:12:31 -07:00
dengziming	150fd5b0b1	KAFKA-13914: Add command line tool kafka-metadata-quorum.sh (#12469 ) Add `MetadataQuorumCommand` to describe quorum status, I'm trying to use arg4j style command format, currently, we only support one sub-command which is "describe" and we can specify 2 arguments which are --status and --replication. ``` # describe quorum status kafka-metadata-quorum.sh --bootstrap-server localhost:9092 describe --replication ReplicaId LogEndOffset Lag LastFetchTimeMs LastCaughtUpTimeMs Status 0 10 0 -1 -1 Leader 1 10 0 -1 -1 Follower 2 10 0 -1 -1 Follower kafka-metadata-quorum.sh --bootstrap-server localhost:9092 describe --status ClusterId: fMCL8kv1SWm87L_Md-I2hg LeaderId: 3002 LeaderEpoch: 2 HighWatermark: 10 MaxFollowerLag: 0 MaxFollowerLagTimeMs: -1 CurrentVoters: [3000,3001,3002] CurrentObservers: [0,1,2] # specify AdminClient properties kafka-metadata-quorum.sh --bootstrap-server localhost:9092 --command-config config.properties describe --status ``` Reviewers: Jason Gustafson <jason@confluent.io>	2022-08-20 08:37:26 -07:00
Colin Patrick McCabe	555744da70	KAFKA-14124: improve quorum controller fault handling (#12447 ) Before trying to commit a batch of records to the __cluster_metadata log, the active controller should try to apply them to its current in-memory state. If this application process fails, the active controller process should exit, allowing another node to take leadership. This will prevent most bad metadata records from ending up in the log and help to surface errors during testing. Similarly, if the active controller attempts to renounce leadership, and the renunciation process itself fails, the process should exit. This will help avoid bugs where the active controller continues in an undefined state. In contrast, standby controllers that experience metadata application errors should continue on, in order to avoid a scenario where a bad record brings down the whole controller cluster. The intended effect of these changes is to make it harder to commit a bad record to the metadata log, but to continue to ride out the bad record as well as possible if such a record does get committed. This PR introduces the FaultHandler interface to implement these concepts. In junit tests, we use a FaultHandler implementation which does not exit the process. This allows us to avoid terminating the gradle test runner, which would be very disruptive. It also allows us to ensure that the test surfaces these exceptions, which we previously were not doing (the mock fault handler stores the exception). In addition to the above, this PR fixes a bug where RaftClient#resign was not being called from the renounce() function. This bug could have resulted in the raft layer not being informed of an active controller resigning. Reviewers: David Arthur <mumrah@gmail.com>	2022-08-04 22:49:45 -07:00
Mickael Maison	4a06458633	KAFKA-13780: Generate OpenAPI file for Connect REST API (#12067 ) New gradle task `connect:runtime:genConnectOpenAPIDocs` that generates `connect_rest.yaml` under `docs/generated`. This task is executed when `siteDocsTar` runs.	2022-06-10 11:35:22 +02:00
David Arthur	1135f22eaf	KAFKA-13830 MetadataVersion integration for KRaft controller (#12050 ) This patch builds on #12072 and adds controller support for metadata.version. The kafka-storage tool now allows a user to specify a specific metadata.version to bootstrap into the cluster, otherwise the latest version is used. Upon the first leader election of the KRaft quroum, this initial metadata.version is written into the metadata log. When writing snapshots, a FeatureLevelRecord for metadata.version will be written out ahead of other records so we can decode things at the correct version level. This also includes additional validation in the controller when setting feature levels. It will now check that a given metadata.version is supportable by the quroum, not just the brokers. Reviewers: José Armando García Sancio <jsancio@gmail.com>, Colin P. McCabe <cmccabe@apache.org>, dengziming <dengziming1993@gmail.com>, Alyssa Huang <ahuang@confluent.io>	2022-05-18 12:08:36 -07:00
Colin Patrick McCabe	1521813a3a	KAFKA-13807: Fix incrementalAlterConfig and refactor some things (#12033 ) Ensure that we can set log.flush.interval.ms at the broker or cluster level via IncrementalAlterConfigs. This was broken by KAFKA-13749, which added log.flush.interval.ms as the second synonym rather than the first. Add a regression test to DynamicConfigChangeTest. Create ControllerRequestContext and pass it to every controller API. This gives us a uniform way to pass through information like the deadline (if there is one) and the Kafka principal which is making the request (in the future we will want to log this information). In ControllerApis, enforce a timeout for broker heartbeat requests which is equal to the heartbeat request interval, to avoid heartbeats piling up on the controller queue. This should have been done previously, but we overlooked it. Add a builder for ClusterControlManager and ReplicationControlManager to avoid the need to deal with a lot of churn (especially in test code) whenever a new constructor parameter gets added for one of these. In ControllerConfigurationValidator, create a separate function for when we just want to validate that a ConfigResource is a valid target for DescribeConfigs. Previously we had been re-using the validation code for IncrementalAlterConfigs, but this was messy. Split out the replica placement code into a separate package and reorganize it a bit. Reviewers: David Arthur <mumrah@gmail.com	2022-04-15 16:07:23 -07:00
Colin Patrick McCabe	62ea4c46a9	KAFKA-13749: CreateTopics in KRaft must return configs (#11941 ) Previously, when in KRaft mode, CreateTopics did not return the active configurations for the topic(s) it had just created. This PR addresses that gap. We will now return these topic configuration(s) when the user has DESCRIBE_CONFIGS permission. (In the case where the user does not have this permission, we will omit the configurations and set TopicErrorCode. We will also omit the number of partitions and replication factor data as well.) For historical reasons, we use different names to refer to each topic configuration when it is set in the broker context, as opposed to the topic context. For example, the topic configuration "segment.ms" corresponds to the broker configuration "log.roll.ms". Additionally, some broker configurations have synonyms. For example, the broker configuration "log.roll.hours" can be used to set the log roll time instead of "log.roll.ms". In order to track all of this, this PR adds a table in LogConfig.scala which maps each topic configuration to an ordered list of ConfigSynonym classes. (This table is then passed to KafkaConfigSchema as a constructor argument.) Some synonyms require transformations. For example, in order to convert from "log.roll.hours" to "segment.ms", we must convert hours to milliseconds. (Note that our assumption right now is that topic configurations do not have synonyms, only broker configurations. If this changes, we will need to add some logic to handle it.) This PR makes the 8-argument constructor for ConfigEntry public. We need this in order to make full use of ConfigEntry outside of the admin namespace. This change is probably inevitable in general since otherwise we cannot easily test the output from various admin APIs in junit tests outside the admin package. Testing: This PR adds PlaintextAdminIntegrationTest#testCreateTopicsReturnsConfigs. This test validates some of the configurations that it gets back from the call to CreateTopics, rather than just checking if it got back a non-empty map like some of the existing tests. In order to test the configuration override logic, testCreateDeleteTopics now sets up some custom static and dynamic configurations. In QuorumTestHarness, we now allow tests to configure what the ID of the controller should be. This allows us to set dynamic configurations for the controller in testCreateDeleteTopics. We will have a more complete fix for setting dynamic configuations on the controller later. This PR changes ConfigurationControlManager so that it is created via a Builder. This will make it easier to add more parameters to its constructor without having to update every piece of test code that uses it. It will also make the test code easier to read. Reviewers: David Arthur <mumrah@gmail.com>	2022-04-01 10:50:25 -07:00
Jason Gustafson	b2cb6caa1e	MINOR: Move `KafkaYammerMetrics` to server-common (#11970 ) With major server components like the new quorum controller being moved outside of the `core` module, it is useful to have shared dependencies moved into `server-common`. An example of this is Yammer metrics which server components still rely heavily upon. All server components should have access to the default registry used by the broker so that new metrics can be registered and metric naming conventions should be standardized. This is particularly important in KRaft where we are attempting to recreate identically named metrics in the controller context. This patch takes a step in this direction. It moves `KafkaYammerMetrics` into `server-common` and it implements standard metric naming utilities there. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	2022-03-30 13:59:22 -07:00
Idan Kamara	eddb98df67	MINOR: Fix class comparison in `AlterConfigPolicy.RequestMetadata.equals()` (#11900 ) This patch fixes a bug in the `AlterConfigPolicy.RequestMetadata.equals` method where we were not comparing the class correctly. Co-authored-by: David Jacot <djacot@confluent.io> Reviewers: David Jacot <djacot@confluent.io>	2022-03-22 09:45:04 +01:00
Colin Patrick McCabe	07553d13f7	MINOR: create KafkaConfigSchema and TimelineObject (#11809 ) Create KafkaConfigSchema to encapsulate the concept of determining the types of configuration keys. This is useful in the controller because we can't import KafkaConfig, which is part of core. Also introduce the TimelineObject class, which is a more generic version of TimelineInteger / TimelineLong. Reviewers: David Arthur <mumrah@gmail.com>	2022-03-02 14:26:31 -08:00
Wenjun Ruan	760e6f3741	Add license header in suppressions.xml (#11753 ) Add license header in suppressions.xml Reviewers: Luke Chen <showuon@gmail.com>	2022-02-17 14:35:36 +08:00
Colin Patrick McCabe	d35283f011	KAFKA-13646; Implement KIP-801: KRaft authorizer (#11649 ) Currently, when using KRaft mode, users still have to have an Apache ZooKeeper instance if they want to use AclAuthorizer. We should have a built-in Authorizer for KRaft mode that does not depend on ZooKeeper. This PR introduces such an authorizer, called StandardAuthorizer. See KIP-801 for a full description of the new Authorizer design. Authorizer.java: add aclCount API as described in KIP-801. StandardAuthorizer is currently the only authorizer that implements it, but eventually we may implement it for AclAuthorizer and others as well. ControllerApis.scala: fix a bug where createPartitions was authorized using CREATE on the topic resource rather than ALTER on the topic resource as it should have been. QuorumTestHarness: rename the controller endpoint to CONTROLLER for consistency (the brokers already called it that). This is relevant in AuthorizerIntegrationTest where we are examining endpoint names. Also add the controllerServers call. TestUtils.scala: adapt the ACL functions to be usable from KRaft, by ensuring that they use the Authorizer from the current active controller. BrokerMetadataPublisher.scala: add broker-side ACL application logic. Controller.java: add ACL APIs. Also add a findAllTopicIds API in order to make junit tests that use KafkaServerTestHarness#getTopicNames and KafkaServerTestHarness#getTopicIds work smoothly. AuthorizerIntegrationTest.scala: convert over testAuthorizationWithTopicExisting (more to come soon) QuorumController.java: add logic for replaying ACL-based records. This means storing them in the new AclControlManager object, and integrating them into controller snapshots. It also means applying the changes in the Authorizer, if one is configured. In renounce, when reverting to a snapshot, also set newBytesSinceLastSnapshot to 0. 
Reviewers: YeonCheol Jang <YeonCheolGit@users.noreply.github.com>, Jason Gustafson <jason@confluent.io>	2022-02-09 10:38:52 -08:00
Kirk True	7b379539a5	KAFKA-13202: KIP-768: Extend SASL/OAUTHBEARER with Support for OIDC (#11284 ) This task is to provide a concrete implementation of the interfaces defined in KIP-255 to allow Kafka to connect to an OAuth/OIDC identity provider for authentication and token retrieval. While KIP-255 provides an unsecured JWT example for development, this will fill in the gap and provide a production-grade implementation. The OAuth/OIDC work will allow out-of-the-box configuration by any Apache Kafka users to connect to an external identity provider service (e.g. Okta, Auth0, Azure, etc.). The code will implement the standard OAuth client credentials grant type. The proposed change is largely composed of a pair of AuthenticateCallbackHandler implementations: one to login on the client and one to validate on the broker. See the following for more detail: KIP-768 KAFKA-13202 Reviewers: Yi Ding <dingyi.zj@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jun Rao <junrao@gmail.com>	2021-10-28 11:36:53 -07:00
José Armando García Sancio	da58d75c43	MINOR: Fix highest offset when loading KRaft metadata snapshots (#11386 ) When loading a snapshot the broker BrokerMetadataListener was using the batch's append time, offset and epoch. These are not the same as the append time, offset and epoch from the log. This PR fixes it to instead use the lastContainedLogTimeStamp, lastContainedLogOffset and lastContainedLogEpoch from the SnapshotReader. This PR refactors the MetadataImage and MetadataDelta to include an offset and epoch. It also swaps the order of the arguments for ReplicaManager.applyDelta, in order to be more consistent with MetadataPublisher.publish. Reviewers: Colin P. McCabe <cmccabe@apache.org>	2021-10-12 17:19:03 -07:00
Satish Duggana	34d56dc8d0	KAFKA-12802 Added a file based cache for consumed remote log metadata for each partition to avoid consuming again in case of broker restarts. (#11058 ) Added snapshots for consumed remote log metadata for each partition to avoid consuming again in case of broker restarts. These snapshots are stored in the respective topic partition log directories. Reviewers: Kowshik Prakasam <kprakasam@confluent.io>, Cong Ding <cong@ccding.com>, Jun Rao <junrao@gmail.com>	2021-10-11 10:24:55 -07:00
Colin Patrick McCabe	3f3a0e0d9e	KAFKA-13280: Avoid O(N) behavior in KRaftMetadataCache#topicNamesToIds (#11311 ) Avoid O(N) behavior in KRaftMetadataCache#topicNamesToIds and KRaftMetadataCache#topicIdsToNames by returning a map subclass that exposes the TopicsImage data structures without copying them. Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk>	2021-10-07 09:41:57 -07:00
Colin Patrick McCabe	85548acafb	KAFKA-13279: allow CreateTopicsPolicy, AlterConfigsPolicy in KRaft mode (#11310 ) Add support for CreateTopicsPolicy and AlterConfigsPolicy when running in KRaft mode. Reviewers: David Arthur <mumrah@gmail.com>, Niket Goel <ngoel@confluent.io>	2021-09-22 13:07:45 -07:00
Satish Duggana	e8ce93bd53	KAFKA-9555 Added default RLMM implementation based on internal topic storage. (#10579 ) KAFKA-9555 Added default RLMM implementation based on internal topic storage. This is the initial version of the default RLMM implementation. This includes changes containing default RLMM configs, RLMM implementation, producer/consumer managers. Introduced TopicBasedRemoteLogMetadataManagerHarness which takes care of bringing up a Kafka cluster, creating the remote log metadata topic, and initializing TopicBasedRemoteLogMetadataManager. Refactored existing RemoteLogMetadataCacheTest to RemoteLogSegmentLifecycleTest to have parameterized tests to run both RemoteLogMetadataCache and also TopicBasedRemoteLogMetadataManager. Refactored existing InmemoryRemoteLogMetadataManagerTest, RemoteLogMetadataManagerTest to have parameterized tests to run both InmemoryRemoteLogMetadataManager and also TopicBasedRemoteLogMetadataManager. This is part of tiered storage KIP-405 efforts. Reviewers: Kowshik Prakasam <kprakasam@confluent.io>, Cong Ding <cong@ccding.com>, Jun Rao <junrao@gmail.com>	2021-07-19 09:05:46 -07:00
Colin Patrick McCabe	b4e45cd0d2	KAFKA-13019: Add MetadataImage and MetadataDelta classes for KRaft Snapshots (#10949 ) Create the image/ module for storing, reading, and writing broker metadata images. Metadata images are immutable. New images are produced from existing images using delta classes. Delta classes are mutable, and represent changes to a base image. MetadataImage objects can be converted to lists of KRaft metadata records. This is essentially writing a KRaft snapshot. The resulting snapshot can be read back into a MetadataDelta object. In practice, we will typically read the snapshot, and then read a few more records to get fully up to date. After that, the MetadataDelta can be converted to a MetadataImage as usual. Sometimes, we have to load a snapshot even though we already have an existing non-empty MetadataImage. We would do this if the broker fell too far behind and needed to receive a snapshot to catch up. This is handled just like the normal snapshot loading process. Anything that is not in the snapshot will be marked as deleted in the MetadataDelta once finishSnapshot() is called. In addition to being used for reading and writing snapshots, MetadataImage also serves as a cache for broker information in memory. A follow-up PR will replace MetadataCache, CachedConfigRepository, and the client quotas cache with the corresponding Image classes. TopicsDelta also replaces the "deferred partition" state that the RaftReplicaManager currently implements. (That change is also in a follow-up PR.) Reviewers: Jason Gustafson <jason@confluent.io>, David Arthur <mumrah@gmail.com>	2021-07-01 00:08:25 -07:00
Niket	d3ec9f940c	KAFKA-12952 Add header and footer records for raft snapshots (#10899 ) Add header and footer records for raft snapshots. This helps identify when the snapshot starts and ends. The header also contains a time. The time field is currently set to 0. KAFKA-12997 will add in the necessary wiring to use the correct timestamp. Reviewers: Jose Sancio <jsancio@gmail.com>, Colin P. McCabe <cmccabe@apache.org>	2021-06-29 09:37:20 -07:00
Ismael Juma	d27a84f70c	KAFKA-12945: Remove port, host.name and related configs in 3.0 (#10872 ) They have been deprecated since 0.10.0. Full list of removed configs: * port * host.name * advertised.port * advertised.host.name Also adjust tests to take the removals into account. Some tests were no longer relevant and have been removed. Finally, took the chance to: * Clean up unnecessary usage of `KafkaConfig$.MODULE$` in related files. * Add missing `Test` annotations to `AdvertiseBrokerTest` and make necessary changes for the tests to pass. Reviewers: David Jacot <djacot@confluent.io>, Luke Chen <showuon@gmail.com>	2021-06-17 05:32:34 -07:00
José Armando García Sancio	b67a77d5b9	KAFKA-12787; Integrate controller snapshotting with raft client (#10786 ) Directly use `RaftClient.Listener`, `SnapshotWriter` and `SnapshotReader` in the quorum controller. 1. Allow `RaftClient` users to create snapshots by specifying the last committed offset and last committed epoch. These values are validated against the log and leader epoch cache. 2. Remove duplicate classes in the metadata module for writing and reading snapshots. 3. Changed the logic for comparing snapshots. The old logic was assuming a certain batch grouping. This didn't match the implementation of the snapshot writer. The snapshot writer is free to merge batches before writing them. 4. Improve `LocalLogManager` to keep track of multiple snapshots. 5. Improve the documentation and API for the snapshot classes to highlight the distinction between the offset of batches in the snapshot vs the offset of batches in the log. These two offsets are independent of one another. `SnapshotWriter` and `SnapshotReader` expose a method called `lastOffsetFromLog` which represents the last inclusive offset from the log that is represented in the snapshot. Reviewers: dengziming <swzmdeng@163.com>, Jason Gustafson <jason@confluent.io>	2021-06-15 10:32:01 -07:00
José Armando García Sancio	f50f13d781	KAFKA-12342: Remove MetaLogShim and use RaftClient directly (#10705 ) This patch removes the temporary shim layer we added to bridge the interface differences between MetaLogManager and RaftClient. Instead, we now use the RaftClient directly from the metadata module. This also means that the metadata gradle module now depends on raft, rather than the other way around. Finally, this PR also consolidates the handleResign and handleNewLeader APIs into a single handleLeaderChange API. Co-authored-by: Jason Gustafson <jason@confluent.io>	2021-05-20 15:39:46 -07:00
Colin Patrick McCabe	9e5b77fb96	KAFKA-12788: improve KRaft replica placement (#10494 ) Implement a striped replica placement algorithm for KRaft. This also means implementing rack awareness. Previously, KRaft just chose replicas randomly in a non-rack-aware fashion. Also, allow replicas to be placed on fenced brokers if there are no other choices. This was specified in KIP-631 but previously not implemented. Reviewers: Jun Rao <junrao@gmail.com>	2021-05-17 16:49:47 -07:00
Daniyar Yeralin	6d1ae8bc00	KAFKA-8326: Introduce List Serde (#6592 ) Introduce List serde for primitive types or custom serdes with a serializer and a deserializer according to KIP-466 Reviewers: Anna Sophie Blee-Goldman <ableegoldman@apache.org>, Matthias J. Sax <mjsax@conflunet.io>, John Roesler <roesler@confluent.io>, Michael Noll <michael@confluent.io>	2021-05-13 15:54:00 -07:00
Satish Duggana	7ef3879429	KAFKA-12758 Added `server-common` module to have server side common classes. (#10638 ) Added server-common module to have server side common classes. Moved ApiMessageAndVersion, RecordSerde, AbstractApiMessageSerde, and BytesApiMessageSerde to server-common module. Reviewers: Kowshik Prakasam <kprakasam@confluent.io>, Jun Rao <junrao@gmail.com>	2021-05-11 09:58:28 -07:00
Satish Duggana	a1367f57f5	KAFKA-12429: Added serdes for the default implementation of RLMM based on an internal topic as storage. (#10271 ) KAFKA-12429: Added serdes for the default implementation of RLMM based on an internal topic as storage. This topic will receive events of RemoteLogSegmentMetadata, RemoteLogSegmentUpdate, and RemotePartitionDeleteMetadata. These events are serialized into Kafka protocol message format. Added tests for all the event types for that topic. This is part of the tiered storage implementation KIP-405. Reviewers: Kowshik Prakasam <kprakasam@confluent.io>, Jun Rao <junrao@gmail.com>	2021-05-05 07:48:52 -07:00
José Armando García Sancio	6203bf8b94	KAFKA-12154; Raft Snapshot Loading API (#10085 ) Implement Raft Snapshot loading API. 1. Adds a new method `handleSnapshot` to `raft.Listener` which is called whenever the `RaftClient` determines that the `Listener` needs to load a new snapshot before reading the log. This happens when the `Listener`'s next offset is less than the log start offset also known as the earliest snapshot. 2. Adds a new type `SnapshotReader<T>` which provides an `Iterator<Batch<T>>` interface and de-serializes records in the `RawSnapshotReader` into `T`s. 3. Adds a new type `RecordsIterator<T>` that implements an `Iterator<Batch<T>>` by scanning a `Records` object and deserializes the batches and records into `Batch<T>`. This type is used by both `SnapshotReader<T>` and `RecordsBatchReader<T>` internally to implement the `Iterator` interface that they expose. 4. Changes the `MockLog` implementation to read one or two batches at a time. The previous implementation always read from the given offset to the high-watermark. This made it impossible to test interesting snapshot loading scenarios. 5. Removed `throws IOException` from some methods. Some of the types were inconsistently throwing `IOException` in some cases and throwing `RuntimeException(..., new IOException(...))` in others. This PR improves consistency by wrapping `IOException` in `RuntimeException` in a few more places and replacing `Closeable` with `AutoCloseable`. 6. Updated the Kafka Raft simulation test to take snapshots into account. `ReplicatedCounter` was updated to generate a snapshot after 10 records get committed. This means that the `ConsistentCommittedData` validation was extended to take snapshots into account. Also added a new invariant to ensure that the log start offset is consistently set with the earliest snapshot. Reviewers: dengziming <swzmdeng@163.com>, David Arthur <mumrah@gmail.com>, Jason Gustafson <jason@confluent.io>	2021-05-01 10:05:45 -07:00
Satish Duggana	327809024f	KAFKA-12368: Added inmemory implementations for RemoteStorageManager and RemoteLogMetadataManager. (#10218 ) KAFKA-12368: Added inmemory implementations for RemoteStorageManager and RemoteLogMetadataManager. Added inmemory implementation for RemoteStorageManager and RemoteLogMetadataManager. A major part of inmemory RLMM will be used in the default RLMM implementation which will be based on topic storage. These will be used in unit tests for tiered storage. Added tests for both the implementations and their supported classes. This is part of tiered storage implementation, KIP-405. Reviewers: Kowshik Prakasam <kprakasam@confluent.io>, Jun Rao <junrao@gmail.com>	2021-04-13 10:14:03 -07:00
Jason Gustafson	8ef1619f3e	KAFKA-12459; Use property testing library for raft event simulation tests (#10323 ) This patch changes the raft simulation tests to use jqwik, which is a property testing library. This provides two main benefits: - It simplifies the randomization of test parameters. Currently the tests use a fixed set of `Random` seeds, which means that most builds are doing redundant work. We get a bigger benefit from allowing each build to test different parameterizations. - It makes it easier to reproduce failures. Whenever a test fails, jqwik will report the random seed that failed. A developer can then modify the `@Property` annotation to use that specific seed in order to reproduce the failure. This patch also includes an optimization for `MockLog.earliestSnapshotId` which reduces the time to run the simulation tests dramatically. Reviewers: Ismael Juma <ismael@juma.me.uk>, Chia-Ping Tsai <chia7712@gmail.com>, José Armando García Sancio <jsancio@gmail.com>, David Jacot <djacot@confluent.io>	2021-03-17 19:20:07 -07:00
Lee Dongjin	28ee656081	MINOR: Remove redundant allows in import-control.xml (#10339 ) 1. Remove org.apache.log4j from allowed import list of shell, trogdor subpackage; they use slf4j, not log4j. 2. Remove org.slf4j from allowed import list of clients, server subpackage: org.slf4j is allowed globally. 3. Remove org.apache.log4j from streams subpackage's allowed import list Reviewers: David Jacot <david.jacot@gmail.com>, Chia-Ping Tsai <chia7712@gmail.com>	2021-03-17 19:03:29 +08:00
Colin Patrick McCabe	5eac5a822f	KAFKA-12276: Add the quorum controller code (#10070 ) The quorum controller stores metadata in the KIP-500 metadata log, not in Apache ZooKeeper. Each controller node is a voter in the metadata quorum. The leader of the quorum is the active controller, which processes write requests. The followers are standby controllers, which replay the operations written to the log. If the active controller goes away, a standby controller can take its place. Like the ZooKeeper-based controller, the quorum controller is based on an event queue backed by a single-threaded executor. However, unlike the ZK-based controller, the quorum controller can have multiple operations in flight-- it does not need to wait for one operation to be finished before starting another. Therefore, calls into the QuorumController return CompletableFuture objects which are completed with either a result or an error when the operation is done. The QuorumController will also time out operations that have been sitting on the queue too long without being processed. In this case, the future is completed with a TimeoutException. The controller uses timeline data structures to store multiple "versions" of its in-memory state simultaneously. "Read operations" read only committed state, which is slightly older than the most up-to-date in-memory state. "Write operations" read and write the latest in-memory state. However, we can not return a successful result for a write operation until its state has been committed to the log. Therefore, if a client receives an RPC response, it knows that the requested operation has been performed, and can not be undone by a controller failover. Reviewers: Jun Rao <junrao@gmail.com>, Ron Dagostino <rdagostino@confluent.io>	2021-02-19 18:03:23 -08:00
Colin P. Mccabe	690f72dd69	KAFKA-12334: Add the KIP-500 metadata shell The Kafka Metadata shell is a new command which allows users to interactively examine the metadata stored in a KIP-500 cluster. It can examine snapshot files that are specified via --snapshot. The metadata tool works by replaying the log and storing the state into in-memory nodes. These nodes are presented in a fashion similar to filesystem directories. Reviewers: Jason Gustafson <jason@confluent.io>, David Arthur <mumrah@gmail.com>, Igor Soarez <soarez@apple.com>	2021-02-19 15:46:34 -08:00
Jason Gustafson	698319b8e2	KAFKA-12278; Ensure exposed api versions are consistent within listener (#10666 ) Previously all APIs were accessible on every listener exposed by the broker, but with KIP-500, that is no longer true. We now have more complex requirements for API accessibility. For example, the KIP-500 controller exposes some APIs which are not exposed by brokers, such as BrokerHeartbeatRequest, and does not expose most client APIs, such as JoinGroupRequest, etc. Similarly, the KIP-500 broker does not implement some APIs that the ZK-based broker does, such as LeaderAndIsrRequest and UpdateFeaturesRequest. All of this means that we need more sophistication in how we expose APIs and keep them consistent with the ApiVersions API. Up until now, we have been working around this using the controllerOnly flag inside ApiKeys, but this is not rich enough to support all of the cases listed above. This PR introduces a new "listeners" field to the request schema definitions. This field is an array of strings which indicate the listener types in which the API should be exposed. We currently support "zkBroker", "broker", and "controller". ("broker" indicates the KIP-500 broker, whereas zkBroker indicates the old broker). This PR also creates ApiVersionManager to encapsulate the creation of the ApiVersionsResponse based on the listener type. Additionally, it modifies SocketServer to check the listener type of received requests before forwarding them to the request handler. Finally, this PR also fixes a bug in the handling of the ApiVersionsResponse prior to authentication. Previously a static response was sent, which means that changes to features would not get reflected. This also meant that the logic to ensure that only the intersection of version ranges supported by the controller would get exposed did not work. 
I think this is important because some clients rely on the initial pre-authenticated ApiVersions response rather than doing a second round after authentication as the Java client does. One final cleanup note: I have removed the expectation that envelope requests are only allowed on "privileged" listeners. This made sense initially because we expected to use forwarding before the KIP-500 controller was available. That is not the case anymore and we expect the Envelope API to only be exposed on the controller listener. I have nevertheless preserved the existing workarounds to allow verification of the forwarding behavior in integration testing. Reviewers: Colin P. McCabe <cmccabe@apache.org>, Ismael Juma <ismael@juma.me.uk>	2021-02-18 16:25:51 -08:00
Ron Dagostino	a30f92bf59	MINOR: Add KIP-500 BrokerServer and ControllerServer (#10113 ) This PR adds the KIP-500 BrokerServer and ControllerServer classes and makes some related changes to get them working. Note that the ControllerServer does not instantiate a QuorumController object yet, since that will be added in PR #10070. * Add BrokerServer and ControllerServer * Change ApiVersions#computeMaxUsableProduceMagic so that it can handle endpoints which do not support PRODUCE (such as KIP-500 controller nodes) * KafkaAdminClientTest: fix some lingering references to decommissionBroker that should be references to unregisterBroker. * Make some changes to allow SocketServer to be used by ControllerServer as well as by the broker. * We now return a random active Broker ID as the Controller ID in MetadataResponse for the Raft-based case as per KIP-590. * Add the RaftControllerNodeProvider * Add EnvelopeUtils * Add MetaLogRaftShim * In ducktape, in config_property.py: use a KIP-500 compatible cluster ID. Reviewers: Colin P. McCabe <cmccabe@apache.org>, David Arthur <mumrah@gmail.com>	2021-02-17 21:35:13 -08:00
Ismael Juma	744d05b128	KAFKA-12327: Remove MethodHandle usage in CompressionType (#10123 ) We don't really need it and it causes problems in older Android versions and GraalVM native image usage (there are workarounds for the latter). Move the logic to separate classes that are only invoked when the relevant compression library is actually used. Place such classes in their own package and enforce via checkstyle that only these classes refer to compression library packages. To avoid cyclic dependencies, moved `BufferSupplier` to the `utils` package. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2021-02-14 08:12:25 -08:00
Colin Patrick McCabe	bf5e1f1cc0	MINOR: add the MetaLogListener, LocalLogManager, and Controller interface. (#10106 ) Add MetaLogListener, LocalLogManager, and related classes. These classes are used by the KIP-500 controller and broker to interface with the Raft log. Also add the Controller interface. The implementation will be added in a separate PR. Reviewers: Ron Dagostino <rdagostino@confluent.io>, David Arthur <mumrah@gmail.com>	2021-02-11 08:42:59 -08:00
Jason Gustafson	f58c2acf26	KAFKA-12250; Add metadata record serde for KIP-631 (#9998 ) This patch adds a `RecordSerde` implementation for the metadata record format expected by KIP-631. Reviewers: Colin McCabe <cmccabe@apache.org>, Ismael Juma <mlists@juma.me.uk>	2021-02-03 16:16:35 -08:00
Colin Patrick McCabe	772f2cfc82	MINOR: Replace BrokerStates.scala with BrokerState.java (#10028 ) Replace BrokerStates.scala with BrokerState.java, to make it easier to use from Java code if needed. This also makes it easier to go from a numeric type to an enum. Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2021-02-03 13:41:38 -08:00
Ismael Juma	24a2ed26a6	MINOR: Update zstd-jni to 1.4.8-2 (#9957 ) * The latest version of zstd-jni doesn't use `RecyclingBufferPool` by default, so we pass it via the relevant constructors to maintain the behavior before this change. * zstd-jni fixes an issue when using Alpine, see https://github.com/luben/zstd-jni/issues/157. * zstd 1.4.7 includes several months of improvements across many axes, from performance to various fixes. Details: https://github.com/facebook/zstd/releases/tag/v1.4.7 * zstd 1.4.8 is a hotfix release, details: https://github.com/facebook/zstd/releases/tag/v1.4.8 Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2021-01-24 20:20:52 -08:00
Colin Patrick McCabe	217334b0f4	KAFKA-12183: Add the KIP-631 metadata record definitions (#9876 ) Add the metadata gradle module, which will contain the metadata record definitions, and other metadata-related broker-side code. Add MetadataParser, MetadataParseException, etc. Reviewers: José Armando García Sancio <jsancio@gmail.com>, Ismael Juma <ismael@juma.me.uk>, David Arthur <mumrah@gmail.com>	2021-01-14 09:58:52 -08:00
Ning Zhang	2cde6f61b8	KAFKA-10304: Refactor MM2 integration tests (#9224 ) Co-authored-by: Ning Zhang <nzhang1220@fb.com> Reviewers: Mickael Maison <mickael.maison@gmail.com>	2021-01-14 14:48:17 +00:00
Ismael Juma	52b8aa0fdc	KAFKA-7340: Migrate clients module to JUnit 5 (#9874 ) * Use the packages/classes from JUnit 5 * Move description in `assert` methods to last parameter * Convert parameterized tests so that they work with JUnit 5 * Remove `hamcrest`, it didn't seem to add much value * Fix `Utils.mkEntry` to have correct `equals` implementation * Add a missing `@Test` annotation in `SslSelectorTest` override * Adjust regex in `SaslAuthenticatorTest` due to small change in the assert failure string in JUnit 5 Reviewers: Chia-Ping Tsai <chia7712@gmail.com>	2021-01-13 16:17:45 -08:00
José Armando García Sancio	ab0807dd85	KAFKA-10394: Add classes to read and write snapshot for KIP-630 (#9512 ) This PR adds support for generating snapshots for KIP-630. 1. Adds the interfaces `RawSnapshotWriter` and `RawSnapshotReader` and the implementations `FileRawSnapshotWriter` and `FileRawSnapshotReader` respectively. These interfaces and implementations are low-level APIs for writing and reading snapshots. They are internal to the Raft implementation and are not exposed to the users of `RaftClient`. They operate at the `Record` level. These types are exposed to the `RaftClient` through the `ReplicatedLog` interface. 2. Adds a buffered snapshot writer: `SnapshotWriter<T>`. This type is a higher-level type and it is exposed through the `RaftClient` interface. A future PR will add the related `SnapshotReader<T>`, which will be used by the state machine to load a snapshot. Reviewers: Jason Gustafson <jason@confluent.io>	2020-12-07 14:06:25 -08:00
Boyang Chen	0814e4f645	KAFKA-10181: Use Envelope RPC to do redirection for (Incremental)AlterConfig, AlterClientQuota and CreateTopics (#9103 ) This PR adds support for forwarding of the following RPCs: AlterConfigs IncrementalAlterConfigs AlterClientQuotas CreateTopics Co-authored-by: Jason Gustafson <jason@confluent.io> Reviewers: Jason Gustafson <jason@confluent.io>	2020-11-04 14:21:44 -08:00
Lee Dongjin	8d4bbf22ad	MINOR: trivial cleanups, javadoc errors, omitted StateStore tests, etc. (#8130 ) Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Guozhang Wang <guozhang@confluent.io>, Matthias J. Sax <matthias@confluent.io>	2020-10-07 19:08:31 -07:00
Rajini Sivaram	7be8bd8cbf	KAFKA-10338; Support PEM format for SSL key and trust stores (KIP-651) (#9345 ) Adds support for SSL key and trust stores to be specified in PEM format either as files or directly as configuration values. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	2020-10-06 19:13:43 +01:00
Guozhang Wang	53a35c1de3	MINOR: Refactor unit tests around RocksDBConfigSetter (#9358 ) * Extract the mock RocksDBConfigSetter into a separate class. * De-dup unit tests covering RocksDBConfigSetter. Reviewers: Boyang Chen <boyang@confluent.io>	2020-10-06 09:09:54 -07:00
Jason Gustafson	b7c8490cf4	KAFKA-10492; Core Kafka Raft Implementation (KIP-595) (#9130 ) This is the core Raft implementation specified by KIP-595: https://cwiki.apache.org/confluence/display/KAFKA/KIP-595%3A+A+Raft+Protocol+for+the+Metadata+Quorum. We have created a separate "raft" module where most of the logic resides. The new APIs introduced in this patch in order to support Raft election and such are disabled in the server until the integration with the controller is complete. Until then, there is a standalone server which can be used for testing the performance of the Raft implementation. See `raft/README.md` for details. Reviewers: Guozhang Wang <wangguoz@gmail.com>, Boyang Chen <boyang@confluent.io> Co-authored-by: Boyang Chen <boyang@confluent.io> Co-authored-by: Guozhang Wang <wangguoz@gmail.com>	2020-09-22 11:32:44 -07:00
Colin Patrick McCabe	b6ba67482f	KAFKA-10384: Separate converters from generated messages (#9194 ) For the generated message code, put the JSON conversion functionality in a separate JsonConverter class. Make MessageDataGenerator simply another generator class, alongside the new JsonConverterGenerator class. Move some of the utility functions from MessageDataGenerator into FieldSpec and other places, so that they can be used by other generator classes. Use argparse4j to support a better command-line for the generator. Reviewers: David Arthur <mumrah@gmail.com>	2020-08-26 15:10:09 -07:00
Jason Gustafson	3a189ad868	KAFKA-10386; Fix flexible version support for `records` type (#9163 ) This patch fixes the generated serde logic for the 'records' type so that it uses the compact byte array representation consistently when flexible versions are enabled. Reviewers: David Arthur <mumrah@gmail.com>	2020-08-13 09:52:23 -07:00
David Arthur	4cd2396db3	KAFKA-9629 Use generated protocol for Fetch API (#9008 ) Refactored FetchRequest and FetchResponse to use the generated message classes for serialization and deserialization. This allows us to bypass unnecessary Struct conversion in a few places. A new "records" type was added to the message protocol which uses BaseRecords as the field type. When sending, we can set a FileRecords instance on the message, and when receiving the message class will use MemoryRecords. Also included a few JMH benchmarks which indicate a small performance improvement for requests with high partition counts or small record sizes. Reviewers: Jason Gustafson <jason@confluent.io>, Boyang Chen <boyang@confluent.io>, David Jacot <djacot@confluent.io>, Lucas Bradstreet <lucas@confluent.io>, Ismael Juma <ismael@juma.me.uk>, Colin P. McCabe <cmccabe@apache.org>	2020-07-30 13:29:39 -04:00
Mickael Maison	caa806cd82	KAFKA-10232: MirrorMaker2 internal topics Formatters KIP-597 (#8604 ) This PR includes 3 MessageFormatters for MirrorMaker2 internal topics: - HeartbeatFormatter - CheckpointFormatter - OffsetSyncFormatter This also introduces a new public interface org.apache.kafka.common.MessageFormatter that users can implement to build custom formatters. Reviewers: Konstantine Karantasis <k.karantasis@gmail.com>, Ryanne Dolan <ryannedolan@gmail.com>, David Jacot <djacot@confluent.io> Co-authored-by: Mickael Maison <mickael.maison@gmail.com> Co-authored-by: Edoardo Comar <ecomar@uk.ibm.com>	2020-07-03 10:41:45 +01:00
Adam Bellemare	bcf45b09d3	KAFKA-10049: Fixed FKJ bug where wrapped serdes are set incorrectly when using default StreamsConfig serdes (#8764 ) Bug Details: Mistakenly setting the value serde to the key serde for an internal wrapped serde in the FKJ workflow. Testing: Modified the existing test to reproduce the issue, then verified that the test passes. Reviewers: Guozhang Wang <wangguoz@gmail.com>, John Roesler <vvcephei@apache.org>	2020-06-12 10:00:38 -05:00
Kowshik Prakasam	4f96c5b424	KAFKA-10027: Implement read path for feature versioning system (KIP-584) (#8680 ) In this PR, I have implemented various classes and integration for the read path of the feature versioning system (KIP-584). The ultimate plan is that the cluster-wide finalized features information is going to be stored in ZK under the node /feature. The read path implemented in this PR is centered around reading this finalized features information from ZK, and, processing it inside the Broker. Here is a summary of what's in this PR (a lot of it is new classes): A facility is provided in the broker to declare its supported features, and advertise its supported features via its own BrokerIdZNode under a features key. A facility is provided in the broker to listen to and propagate cluster-wide finalized feature changes from ZK. When new finalized features are read from ZK, feature incompatibilities are detected by comparing against the broker's own supported features. ApiVersionsResponse is now served containing supported and finalized feature information (using the newly added tagged fields). Reviewers: Boyang Chen <boyang@confluent.io>, Jun Rao <junrao@gmail.com>	2020-06-11 11:28:57 -07:00
Jeff Huang	2988eac082	KAFKA-9944: Added supporting customized HTTP response headers for Kafka Connect. (#8620 ) Added support for customizing the HTTP response headers for Kafka Connect as described in KIP-577. Author: Jeff Huang <jeff.huang@confluent.io> Reviewer: Randall Hauch <rhauch@gmail.com>	2020-05-24 08:56:27 -05:00
Colin Patrick McCabe	bf6dffe93b	KAFKA-9309: Add the ability to translate Message classes to and from JSON (#7844 ) Reviewers: David Arthur <mumrah@gmail.com>, Ron Dagostino <rdagostino@confluent.io>	2020-04-09 13:11:36 -07:00
Gardner Vickers	8cf781ef01	MINOR: Improve performance of checkpointHighWatermarks, patch 1/2 (#6741 ) This PR works to improve high watermark checkpointing performance. `ReplicaManager.checkpointHighWatermarks()` was found to be a major contributor to GC pressure, especially on Kafka clusters with high partition counts and low throughput. Added a JMH benchmark for `checkpointHighWatermarks` which establishes a performance baseline. The parameterized benchmark was run with 100, 1000 and 2000 topics. Modified `ReplicaManager.checkpointHighWatermarks()` to avoid extra copies and cached the Log parent directory String to avoid frequent allocations when calculating `File.getParent()`. A few clean-ups: * Changed all usages of Log.dir.getParent to Log.parentDir and Log.dir.getParentFile to Log.parentDirFile. * Only expose public accessor for `Log.dir` (consistent with `Log.parentDir`) * Removed unused parameters in `Partition.makeLeader`, `Partition.makeFollower` and `Partition.createLogIfNotExists`. Benchmark results: \| Topic Count \| Ops/ms \| MB/sec allocated \| \|-------------\|---------\|------------------\| \| 100 \| + 51% \| - 91% \| \| 1000 \| + 143% \| - 49% \| \| 2000 \| + 149% \| - 50% \| Reviewers: Lucas Bradstreet <lucas@confluent.io>, Ismael Juma <ismael@juma.me.uk> Co-authored-by: Gardner Vickers <gardner@vickers.me> Co-authored-by: Ismael Juma <ismael@juma.me.uk>	2020-03-25 20:53:42 -07:00
Brian Byrne	227a7322b7	KIP-546: Implement describeClientQuotas and alterClientQuotas. (#8083 ) Reviewers: Colin P. McCabe <cmccabe@apache.org>	2020-03-14 23:03:13 -07:00
Konstantine Karantasis	16ee326755	KAFKA-9556; Fix two issues with KIP-558 and expand testing coverage (#8085 ) Correct the Connect worker logic to properly disable the new topic status (KIP-558) feature when `topic.tracking.enable=false`, and fix automatic topic status reset after a connector is deleted. Also adds new `ConnectorTopicsIntegrationTest` and expanded unit tests. Reviewers: Randall Hauch <rhauch@gmail.com>	2020-02-14 14:34:34 -08:00
Mickael Maison	3953204d35	MINOR: Fix connect:mirror checkstyle (#7951 ) Reviewers: Guozhang Wang <wangguoz@gmail.com>, Jason Gustafson <jason@confluent.io>	2020-01-13 15:25:24 -08:00
Greg Harris	ff68b60429	KAFKA-8340, KAFKA-8819: Use PluginClassLoader while statically initializing plugins (#7315 ) Added plugin isolation unit tests for various scenarios, with a `TestPlugins` class that compiles and builds multiple test plugins without them being on the classpath and verifies that the Plugins and DelegatingClassLoader behave properly. These initially failed for several cases, but now pass since the issues have been fixed. KAFKA-8340 and KAFKA-8819 are closely related, and this fix corrects the problems reported in both issues. Author: Greg Harris <gregh@confluent.io> Reviewers: Chris Egerton <chrise@confluent.io>, Magesh Nandakumar <mageshn@confluent.io>, Konstantine Karantasis <konstantine@confluent.io>, Randall Hauch <rhauch@gmail.com>	2019-10-16 20:43:00 -05:00
Ryanne Dolan	4ac892ca78	KAFKA-7500: MirrorMaker 2.0 (KIP-382) Implementation of [KIP-382 "MirrorMaker 2.0"](https://cwiki.apache.org/confluence/display/KAFKA/KIP-382%3A+MirrorMaker+2.0) Author: Ryanne Dolan <ryannedolan@gmail.com> Author: Arun Mathew <arunmathew88@gmail.com> Author: In Park <inpark@cloudera.com> Author: Andre Price <obsoleted@users.noreply.github.com> Author: christian.hagel@rio.cloud <christian.hagel@rio.cloud> Reviewers: Eno Thereska <eno.thereska@gmail.com>, William Hammond <william.t.hammond@gmail.com>, Viktor Somogyi <viktorsomogyi@gmail.com>, Jakub Korzeniowski, Tim Carey-Smith, Kamal Chandraprakash <kamal.chandraprakash@gmail.com>, Arun Mathew, Jeremy-l-ford, vpernin, Oleg Kasian <oleg.kasian@gmail.com>, Mickael Maison <mickael.maison@gmail.com>, Qihong Chen, Sriharsha Chintalapani <sriharsha@apache.org>, Jun Rao <junrao@gmail.com>, Randall Hauch <rhauch@gmail.com>, Manikumar Reddy <manikumar.reddy@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #6295 from ryannedolan/KIP-382	2019-10-07 13:57:54 +05:30
Chris Egerton	791d0d61bf	KAFKA-8804: Secure internal Connect REST endpoints (#7310 ) Implemented KIP-507 to secure the internal Connect REST endpoints that are only for intra-cluster communication. A new V2 of the Connect subprotocol enables this feature, where the leader generates a new session key, shares it with the other workers via the configuration topic, and workers send and validate requests to these internal endpoints using the shared key. Currently the internal `POST /connectors/<connector>/tasks` endpoint is the only one that is secured. This change adds unit tests and makes some small alterations to system tests to target the new `sessioned` Connect subprotocol. A new integration test ensures that the endpoint is actually secured (i.e., requests with missing/invalid signatures are rejected with a 400 Bad Request status). Author: Chris Egerton <chrise@confluent.io> Reviewed: Konstantine Karantasis <konstantine@confluent.io>, Randall Hauch <rhauch@gmail.com>	2019-10-02 17:06:57 -05:00
Arjun Satish	1c831c22e1	KAFKA-7772: Dynamically Adjust Log Levels in Connect (#7403 ) Implemented KIP-495 to expose a new `admin/loggers` endpoint for the Connect REST API that lists the current log levels and allows the caller to change log levels. Author: Arjun Satish <arjun@confluent.io> Reviewer: Randall Hauch <rhauch@gmail.com>	2019-10-02 17:00:37 -05:00
Colin Patrick McCabe	92688ef82c	MINOR: improve the Kafka RPC code generator (#7340 ) Move the generator checkstyle suppressions to a special section, rather than mixing them in with the other sections. For generated code, do not complain about variable names or cyclic complexity. FieldType.java: remove isInteger since it isn't used anywhere. This way, we don't have to decide whether a UUID is an integer or not (there are arguments for both choices). Add FieldType#serializationIsDifferentInFlexibleVersions and FieldType#isVariableLength. HeaderGenerator: add the ability to generate static imports. Add IsNullConditional, VersionConditional, and ClauseGenerator as easier ways of generating "if" statements.	2019-09-25 11:58:54 -04:00
Rajini Sivaram	364794866f	KAFKA-8760; New Java Authorizer API (KIP-504) (#7268 ) New Java Authorizer API and a new out-of-the-box authorizer (AclAuthorizer) that implements the new interface. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>	2019-09-02 14:43:17 +01:00
cpettitt-confluent	7334222a71	KAFKA-8412: Fix nullpointer exception thrown on flushing before closing producers (#7207 ) Prior to this change an NPE is raised when calling AssignedTasks.close under the following conditions: 1. EOS is enabled 2. The task was in a suspended state The cause for the NPE is that when a clean close is requested for a StreamTask the StreamTask tries to commit. However, in the suspended state there is no producer so ultimately an NPE is thrown for the contained RecordCollector in flush. The fix put forth in this commit is to have AssignedTasks call closeSuspended when it knows the underlying StreamTask is suspended. Note also that this test is quite involved. I could have just tested that AssignedTasks calls closeSuspended when appropriate, but that is testing, IMO, a detail of the implementation and doesn't actually verify we reproduced the original problem as it was described. I feel much more confident that we are reproducing the behavior - and we can test exactly the conditions that lead to it - when testing across AssignedTasks and StreamTask. I believe this is an additional support for the argument of eventually consolidating the state split across classes. Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	2019-08-26 09:53:36 -07:00
Ismael Juma	57903be496	MINOR: Remove zkclient dependency (#7036 ) ZkUtils was removed so we don't need this anymore. Also: * Fix ZkSecurityMigrator and ReplicaManagerTest not to reference ZkClient classes. * Remove references to zkclient in various `log4j.properties` and `import-control.xml`. Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com>, Stanislav Kozlovski <stanislav_kozlovski@outlook.com>	2019-07-05 07:50:32 -07:00
Magesh Nandakumar	2e91a310d7	KAFKA-8265: Initial implementation for ConnectorClientConfigPolicy to enable overrides (KIP-458) (#6624 ) Implementation to enable policy for Connector Client config overrides. This is implemented per the KIP-458. Reviewers: Randall Hauch <rhauch@gmail.com>	2019-05-17 01:37:32 -07:00
Chris Egerton	cc097e909c	KAFKA-8304: Fix registration of Connect REST extensions (#6651 ) Fix registration of Connect REST extensions to prevent deadlocks when extensions get the list of connectors before the herder is available. Added integration test to check the behavior. Author: Chris Egerton <cegerton@oberlin.edu> Reviewers: Arjun Satish <arjun@confluent.io>, Randall Hauch <rhauch@gmail.com>	2019-05-07 17:20:51 -05:00
Konstantine Karantasis	e4cad35312	KAFKA-8014: Extend Connect integration tests to add and remove workers dynamically (#6342 ) Extend Connect's integration test framework to add or remove workers to EmbeddedConnectCluster, and choosing whether to fail the test on ungraceful service shutdown. Also added more JavaDoc and other minor improvements. Author: Konstantine Karantasis <konstantine@confluent.io> Reviewers: Arjun Satish <arjun@confluent.io>, Randall Hauch <rhauch@gmail.com> Closes #6342 from kkonstantine/KAFKA-8014	2019-03-25 09:29:33 -05:00
Mickael Maison	4824dc994d	KAFKA-7972: Use automatic RPC generation in SaslHandshake Author: Mickael Maison <mickael.maison@gmail.com> Reviewers: Manikumar Reddy <manikumar.reddy@gmail.com> Closes #6301 from mimaison/sasl-handshake	2019-02-25 11:20:07 +05:30
Alex Diachenko	ec42e0378e	KAFKA-7799; Use httpcomponents-client in RestServerTest. The test `org.apache.kafka.connect.runtime.rest.RestServerTest#testCORSEnabled` assumes Jersey client can send restricted HTTP headers(`Origin`). Jersey client uses `sun.net.www.protocol.http.HttpURLConnection`. `sun.net.www.protocol.http.HttpURLConnection` drops restricted headers(`Host`, `Keep-Alive`, `Origin`, etc) based on static property `allowRestrictedHeaders`. This property is initialized in a static block by reading Java system property `sun.net.http.allowRestrictedHeaders`. So, if classloader loads `HttpURLConnection` before we set `sun.net.http.allowRestrictedHeaders=true`, then all subsequent changes of this system property won't take any effect(which happens if `org.apache.kafka.connect.integration.ExampleConnectIntegrationTest` is executed before `RestServerTest`). To prevent this, we have to either make sure we set `sun.net.http.allowRestrictedHeaders=true` as early as possible or do not rely on this system property at all. This PR adds test dependency on `httpcomponents-client` which doesn't depend on `sun.net.http.allowRestrictedHeaders` system property. Thus none of existing tests should interfere with `RestServerTest`. Author: Alex Diachenko <sansanichfb@gmail.com> Reviewers: Randall Hauch, Konstantine Karantasis, Gwen Shapira Closes #6236 from avocader/KAFKA-7799	2019-02-12 12:03:08 -08:00
Tom Bentley	269b65279c	KAFKA-5692: Change PreferredReplicaLeaderElectionCommand to use Admin… (#3848 ) See also KIP-183. This implements the following algorithm: AdminClient sends ElectPreferredLeadersRequest. KafkaApis receives ElectPreferredLeadersRequest and delegates to ReplicaManager.electPreferredLeaders(). ReplicaManager delegates to KafkaController.electPreferredLeaders(). KafkaController adds a PreferredReplicaLeaderElection to the EventManager. ReplicaManager.electPreferredLeaders()'s callback uses the delayedElectPreferredReplicasPurgatory to wait for the results of the election to appear in the metadata cache. If there are no results because of errors, or because the preferred leaders are already leading the partitions, then a response is returned immediately. In the EventManager work thread the preferred leader is elected as follows: The EventManager runs PreferredReplicaLeaderElection.process(). process() calls KafkaController.onPreferredReplicaElectionWithResults(). KafkaController.onPreferredReplicaElectionWithResults() calls the PartitionStateMachine.handleStateChangesWithResults() to perform the election (asynchronously the PSM will send LeaderAndIsrRequest to the new and old leaders and UpdateMetadataRequest to all brokers) then invokes the callback. Reviewers: Colin P. McCabe <cmccabe@apache.org>, Jun Rao <junrao@gmail.com>	2019-01-25 14:06:18 -08:00
Arjun Satish	dc935c4beb	MINOR: Handle case where connector status endpoints returns 404 (#6176 ) Reviewers: Randall Hauch <randall@confluent.io>, Matthias J. Sax <matthias@confluent.io>	2019-01-20 19:31:20 -08:00
Arjun Satish	69d8d2ea11	KAFKA-7503: Connect integration test harness Expose a programmatic way to bring up a Kafka and Zk cluster through Java API to facilitate integration tests for framework level changes in Kafka Connect. The Kafka classes would be similar to KafkaEmbedded in streams. The new classes would reuse the kafka.server.KafkaServer classes from :core, and provide a simple interface to bring up brokers in integration tests. Signed-off-by: Arjun Satish <arjunconfluent.io> Author: Arjun Satish <arjun@confluent.io> Author: Arjun Satish <wicknicks@users.noreply.github.com> Reviewers: Randall Hauch <rhauch@gmail.com>, Konstantine Karantasis <konstantine@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #5516 from wicknicks/connect-integration-test	2019-01-14 13:50:23 -08:00
Colin Patrick McCabe	71e85f5e84	KAFKA-7609; Add Protocol Generator for Kafka (#5893 ) This patch adds a framework to automatically generate the request/response classes for Kafka's protocol. The code will be updated to use the generated classes in follow-up patches. Below is a brief summary of the included components: buildSrc/src The message generator code is here. This code is automatically re-run by gradle when one of the schema files changes. The entire directory is processed at once to minimize the number of times we have to start a new JVM. We use Jackson to translate the JSON files into Java objects. clients/src/main/java/org/apache/kafka/common/protocol/Message.java This is the interface implemented by all automatically generated messages. clients/src/main/java/org/apache/kafka/common/protocol/MessageUtil.java Some utility functions used by the generated message code. clients/src/main/java/org/apache/kafka/common/protocol/Readable.java, Writable.java, ByteBufferAccessor.java The generated message code uses these classes for writing to a buffer. clients/src/main/message/README.md This README file explains how the JSON schemas work. clients/src/main/message/*.json The JSON files in this directory implement every supported version of every Kafka API. The unit tests automatically validate that the generated schemas match the hand-written schemas in our code. Additionally, there are some things like request and response headers that have schemas here. clients/src/main/java/org/apache/kafka/common/utils/ImplicitLinkedHashSet.java I added an optimization here for empty sets. This is useful here because I want all messages to start with empty sets by default prior to being loaded with data. This is similar to the "empty list" optimizations in the `java.util.ArrayList` class.
Reviewers: Stanislav Kozlovski <stanislav_kozlovski@outlook.com>, Ismael Juma <ismael@juma.me.uk>, Bob Barrett <bob.barrett@outlook.com>, Jason Gustafson <jason@confluent.io>	2019-01-11 16:40:21 -08:00
Rajini Sivaram	4c602e6130	KAFKA-7498: Remove references from `common.requests` to `clients` (#5784 ) Add CreatePartitionsRequest.PartitionDetails similar to CreateTopicsRequest.TopicDetails to avoid references from `common.requests` package to `clients`. Reviewers: Ismael Juma <ismael@juma.me.uk>	2018-10-15 13:21:15 +01:00
Ismael Juma	578205cadd	KAFKA-7439; Replace EasyMock and PowerMock with Mockito in clients module Development of EasyMock and PowerMock has stagnated while Mockito continues to be actively developed. With the new Java release cadence, it's a problem to depend on libraries that do bytecode manipulation and are not actively maintained. In addition, Mockito is also easier to use. While updating the tests, I attempted to go from failing test to passing test. In cases where the updated test passed on the first attempt, I artificially broke it to ensure the test was still doing its job. I included a few improvements that were helpful while making these changes: 1. Better exception if there are no nodes in `leastLoadedNodes` 2. Always close the producer in `KafkaProducerTest` 3. requestsInFlight producer metric should not hold a reference to `Sender` Finally, `Metadata` is no longer final so that we don't need `PowerMock` to mock it. It's an internal class, so it's OK. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Viktor Somogyi <viktorsomogyi@gmail.com>, Dong Lin <lindong28@gmail.com> Closes #5691 from ijuma/kafka-7438-mockito	2018-10-09 15:55:09 -07:00
John Roesler	d57fe1b053	MINOR: single Jackson serde for PageViewTypedDemo (#5590 ) Previously, we depicted creating a Jackson serde for every pojo class, which becomes a burden in practice. There are many ways to avoid this and just have a single serde, so we've decided to model this design choice instead. Reviewers: Viktor Somogyi <viktorsomogyi@gmail.com>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com>	2018-08-31 13:13:42 -07:00
Colin Patrick McCabe	609c81ec8b	KAFKA-7183: Add a trogdor test that creates many connections to brokers (#5393 ) Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com>	2018-08-06 08:47:25 +01:00
Manikumar Reddy O	96c53e96b8	MINOR: Remove deprecated ZkUtils usage from EmbeddedKafkaCluster (#5324 ) Reviewers: Matthias J. Sax <mjsax@apache.org>, Ismael Juma <ismael@juma.me.uk>, Guozhang Wang <wangguoz@gmail.com>	2018-07-19 14:28:12 -07:00
Andy Coates	b3aa655a70	KAFKA-6841: Support Prefixed ACLs (KIP-290) (#5117 ) Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Jun Rao <junrao@gmail.com> Co-authored-by: Piyush Vijay <pvijay@apple.com> Co-authored-by: Andy Coates <big-andy-coates@users.noreply.github.com>	2018-06-06 07:22:57 -07:00
Magesh Nandakumar	98094954a2	KAFKA-6776: ConnectRestExtension Interfaces & Rest integration (KIP-285) This PR provides the implementation for KIP-285 and also a reference implementation for authenticating BasicAuth credentials using JAAS LoginModule Author: Magesh Nandakumar <magesh.n.kumar@gmail.com> Reviewers: Randall Hauch <rhauch@gmail.com>, Arjun Satish <wicknicks@users.noreply.github.com>, Konstantine Karantasis <konstantine@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #4931 from mageshn/KIP-285	2018-05-29 21:35:22 -07:00
Ron Dagostino	8c5d7e0408	KAFKA-6562: OAuth Authentication via SASL/OAUTHBEARER (KIP-255) (#4994 ) This KIP adds the following functionality related to SASL/OAUTHBEARER: 1) Allow clients (both brokers when SASL/OAUTHBEARER is the inter-broker protocol as well as non-broker clients) to flexibly retrieve an access token from an OAuth 2 authorization server based on the declaration of a custom login CallbackHandler implementation and have that access token transparently and automatically transmitted to a broker for authentication. 2) Allow brokers to flexibly validate provided access tokens when a client establishes a connection based on the declaration of a custom SASL Server CallbackHandler implementation. 3) Provide implementations of the above retrieval and validation features based on an unsecured JSON Web Token that function out-of-the-box with minimal configuration required (i.e. implementations of the two types of callback handlers mentioned above will be used by default with no need to explicitly declare them). 4) Allow clients (both brokers when SASL/OAUTHBEARER is the inter-broker protocol as well as non-broker clients) to transparently retrieve a new access token in the background before the existing access token expires in case the client has to open new connections.	2018-05-26 08:18:41 +01:00
Ismael Juma	e70a191d30	KAFKA-4423: Drop support for Java 7 (KIP-118) and update deps (#5046 ) * Set --source, --target and --release to 1.8. * Build Scala 2.12 by default. * Remove some conditionals in the build file now that Java 8 is the minimum version. * Bump the version of Jetty, Jersey and Checkstyle (the newer versions require Java 8). * Fixed issues uncovered by the new version of Checkstyle. * A couple of minor updates to handle an incompatible source change in the new version of Jetty. * Add dependency to jersey-hk2 to fix failing tests caused by Jersey upgrade. * Update release script to use Java 8 and to take into account that Scala 2.12 is now built by default. * While we're at it, bump the version of Gradle, Gradle plugins, ScalaLogging, JMH and apache directory api. * Minor documentation updates including the readme and upgrade notes. A number of Streams Java 7 examples can be removed subsequently.	2018-05-21 23:17:42 -07:00
John Roesler	ed51b2cdf5	KAFKA-6376; refactor skip metrics in Kafka Streams * unify skipped records metering * log warnings when things get skipped * tighten up metrics usage a bit ### Testing strategy: Unit testing of the metrics and the logs should be sufficient. Author: John Roesler <john@confluent.io> Reviewers: Bill Bejeck <bill@confluent.io>, Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #4812 from vvcephei/kip-274-streams-skip-metrics	2018-04-23 11:41:03 -07:00
Jorge Quilcate Otoya	6a99da87ab	KAFKA-6058: KIP-222; Add Consumer Group operations to Admin API KIP: https://cwiki.apache.org/confluence/display/KAFKA/KIP-222+-+Add+Consumer+Group+operations+to+Admin+API Author: Jorge Quilcate Otoya <quilcate.jorge@gmail.com> Author: Jorge Esteban Quilcate Otoya <quilcate.jorge@gmail.com> Author: Guozhang Wang <wangguoz@gmail.com> Reviewers: Colin Patrick McCabe <colin@cmccabe.xyz>, Guozhang Wang <wangguoz@gmail.com> Closes #4454 from jeqo/feature/admin-client-describe-consumer-group	2018-04-11 14:17:46 -07:00
Rajini Sivaram	4019b21d60	KAFKA-6246; Dynamic update of listeners and security configs (#4488 ) Dynamic update of listeners as described in KIP-226. This includes: - Addition of new listeners with listener-prefixed security configs - Removal of existing listeners - Password encryption - sasl.jaas.config property for broker's JAAS config prefixed with listener and mechanism name	2018-02-04 09:19:16 -08:00
Randall Hauch	4c48942f9d	KAFKA-5142: Add Connect support for message headers (KIP-145) [KIP-145](https://cwiki.apache.org/confluence/display/KAFKA/KIP-145+-+Expose+Record+Headers+in+Kafka+Connect) has been accepted, and this PR implements KIP-145 except without the SMTs. Changed the Connect API and runtime to support message headers as described in [KIP-145](https://cwiki.apache.org/confluence/display/KAFKA/KIP-145+-+Expose+Record+Headers+in+Kafka+Connect). The new `Header` interface defines an immutable representation of a Kafka header (key-value pair) with support for the Connect value types and schemas. This interface provides methods for easily converting between many of the built-in primitive, structured, and logical data types. The new `Headers` interface defines an ordered collection of headers and is used to track all headers associated with a `ConnectRecord` (and thus `SourceRecord` and `SinkRecord`). This does allow multiple headers with the same key. The `Headers` contains methods for adding, removing, finding, and modifying headers. Convenience methods allow connectors and transforms to easily use and modify the headers for a record. A new `HeaderConverter` interface is also defined to enable the Connect runtime framework to be able to serialize and deserialize headers between the in-memory representation and Kafka’s byte[] representation. A new `SimpleHeaderConverter` implementation has been added, and this serializes to strings and deserializes by inferring the schemas (`Struct` header values are serialized without the schemas, so they can only be deserialized as `Map` instances without a schema.) The `StringConverter`, `JsonConverter`, and `ByteArrayConverter` have all been extended to also be `HeaderConverter` implementations. Each connector can be configured with a different header converter, although by default the `SimpleHeaderConverter` is used to serialize header values as strings without schemas. 
Unit and integration tests are added for `ConnectHeader` and `ConnectHeaders`, the two implementation classes for headers. Additional test methods are added for the methods added to the `Converter` implementations. Finally, the `ConnectRecord` object is already used heavily, so only limited tests need to be added while quite a few of the existing tests already cover the changes. Author: Randall Hauch <rhauch@gmail.com> Reviewers: Arjun Satish <arjun@confluent.io>, Ted Yu <yuzhihong@gmail.com>, Magesh Nandakumar <magesh.n.kumar@gmail.com>, Konstantine Karantasis <konstantine@confluent.io>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #4319 from rhauch/kafka-5142-b	2018-01-31 10:40:24 -08:00
Manikumar Reddy	488ea4b9fd	KAFKA-5647; Use KafkaZkClient in ReassignPartitionsCommand and PreferredReplicaLeaderElectionCommand * Use KafkaZkClient in ReassignPartitionsCommand * Use KafkaZkClient in PreferredReplicaLeaderElectionCommand * Updated test classes to use new methods * All existing tests should pass Author: Manikumar Reddy <manikumar.reddy@gmail.com> Reviewers: Jun Rao <junrao@gmail.com> Closes #4260 from omkreddy/KAFKA-5647-ADMINCOMMANDS	2017-12-20 12:19:36 -08:00
Jorge Quilcate Otoya	30f08d158a	KAFKA-5520: KIP-171; Extend Consumer Group Reset Offset for Stream Application KIP: https://cwiki.apache.org/confluence/display/KAFKA/KIP-171+-+Extend+Consumer+Group+Reset+Offset+for+Stream+Application Merge changes from KIP-198 Ref: https://github.com/apache/kafka/pull/3831 Author: Jorge Quilcate Otoya <quilcate.jorge@gmail.com> Author: Ismael Juma <ismael@juma.me.uk> Author: Matthias J. Sax <matthias@confluent.io> Author: Manikumar Reddy <manikumar.reddy@gmail.com> Author: Guozhang Wang <wangguoz@gmail.com> Author: Apurva Mehta <apurva@confluent.io> Author: Rajini Sivaram <rajinisivaram@googlemail.com> Author: Jason Gustafson <jason@confluent.io> Author: Vahid Hashemian <vahidhashemian@us.ibm.com> Author: Bill Bejeck <bill@confluent.io> Author: Dong Lin <lindong28@gmail.com> Author: Soenke Liebau <soenke.liebau@opencore.com> Author: Colin P. Mccabe <cmccabe@confluent.io> Author: Damian Guy <damian.guy@gmail.com> Author: Xavier Léauté <xl+github@xvrl.net> Author: Maytee Chinavanichkit <maytee.chinavanichkit@linecorp.com> Author: Joel Hamill <git config --global user.email> Author: Paolo Patierno <ppatierno@live.com> Author: siva santhalingam <siva.santhalingam@gmail.com> Author: Tommy Becker <tobecker@tivo.com> Author: Mickael Maison <mickael.maison@gmail.com> Author: Onur Karaman <okaraman@linkedin.com> Author: tedyu <yuzhihong@gmail.com> Author: Xin Li <Xin.Li@trivago.com> Author: Magnus Edenhill <magnus@edenhill.se> Author: Manjula K <manjula@kafka-summit.org> Author: Hugo Louro <hmclouro@gmail.com> Author: Jeff Widman <jeff@jeffwidman.com> Author: bartdevylder <bartdevylder@gmail.com> Author: Ewen Cheslack-Postava <me@ewencp.org> Author: Jacek Laskowski <jacek@japila.pl> Author: Tom Bentley <tbentley@redhat.com> Author: Konstantine Karantasis <konstantine@confluent.io> Reviewers: Matthias J. Sax <matthias@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #4159 from jeqo/feature/kip-171	2017-12-06 11:38:38 -08:00
Colin P. Mccabe	4fac83ba1f	KAFKA-6060; Add workload generation capabilities to Trogdor Previously, Trogdor only handled "Faults." Now, Trogdor can handle "Tasks" which may be either faults, or workloads to execute in the background. The Agent and Coordinator have been refactored from a mutexes-and-condition-variables paradigm into a message passing paradigm. No locks are necessary, because only one thread can access the task state or worker state. This makes them a lot easier to reason about. The MockTime class can now handle mocking deferred message passing (adding a message to an ExecutorService with a delay). I added a MockTimeTest. MiniTrogdorCluster now starts up Agent and Coordinator classes in parallel in order to minimize junit test time. RPC messages now inherit from a common Message.java class. This class handles implementing serialization, equals, hashCode, etc. Remove FaultSet, since it is no longer necessary. Previously, if CoordinatorClient or AgentClient hit a networking problem, they would throw an exception. They now retry several times before giving up. Additionally, the REST RPCs to the Coordinator and Agent have been changed to be idempotent. If a response is lost, and the request is resent, no harm will be done. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Rajini Sivaram <rajinisivaram@googlemail.com>, Ismael Juma <ismael@juma.me.uk> Closes #4073 from cmccabe/KAFKA-6060	2017-11-03 09:37:29 +00:00
Rajini Sivaram	021d8a8e96	KAFKA-5746; Add new metrics to support health checks (KIP-188) Adds new metrics to support health checks: 1. Error rates for each request type, per-error code 2. Request size and temporary memory size 3. Message conversion rate and time 4. Successful and failed authentication rates 5. ZooKeeper latency and status 6. Client version Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3705 from rajinisivaram/KAFKA-5746-new-metrics	2017-09-28 21:58:59 +01:00
Rajini Sivaram	96ba21e0df	KAFKA-5947; Handle authentication failure in admin client, txn producer 1. Raise AuthenticationException for authentication failures in admin client 2. Handle AuthenticationException as a fatal error for transactional producer 3. Add comments to authentication exceptions Author: Rajini Sivaram <rajinisivaram@googlemail.com> Reviewers: Vahid Hashemian <vahidhashemian@us.ibm.com>, Ismael Juma <ismael@juma.me.uk> Closes #3928 from rajinisivaram/KAFKA-5947-auth-failure	2017-09-21 13:58:43 +01:00
Jason Gustafson	0cf7708007	MINOR: Move request/response schemas to the corresponding object representation This refactor achieves the following: 1. Breaks up the increasingly unmanageable `Protocol` class and moves schemas closer to their actual usage. 2. Removes the need for redundant field identifiers maintained separately in `Protocol` and the respective request/response objects. 3. Provides a better mechanism for sharing common fields between different schemas (e.g. topics, partitions, error codes, etc.). 4. Adds convenience helpers to `Struct` for common patterns (such as setting a field only if it exists). Author: Jason Gustafson <jason@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3813 from hachikuji/protocol-schema-refactor	2017-09-19 05:12:55 +01:00
Jason Gustafson	3b5d88febb	KAFKA-5783; Add KafkaPrincipalBuilder with support for SASL (KIP-189) Author: Jason Gustafson <jason@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com>, Manikumar Reddy <manikumar.reddy@gmail.com> Closes #3795 from hachikuji/KAFKA-5783	2017-09-14 10:16:00 +01:00
Colin P. Mccabe	0772fde562	KAFKA-5776; Add the Trogdor fault injection daemon Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com> Closes #3699 from cmccabe/trogdor-review	2017-08-25 12:29:40 -07:00
Ismael Juma	ed96523a2c	KAFKA-4501; Java 9 compilation and runtime fixes Compilation error fixes: - Avoid ambiguity error when appending to Properties in Scala code (https://github.com/scala/bug/issues/10418) - Use position() and limit() to fix ambiguity issue ( https://github.com/scala/bug/issues/10418#issuecomment-316364778) - Disable findBugs if Java 9 is used ( https://github.com/findbugsproject/findbugs/issues/105) Compilation warning fixes: - Avoid deprecated Class.newInstance in Utils.newInstance - Silence a few Java 9 deprecation warnings - var -> val and unused fixes Runtime error fixes: - Introduce Base64 class that works in Java 7 and Java 9 Also: - Set --release option if building with Java 9 Note that tests involving EasyMock (https://github.com/easymock/easymock/issues/193) or PowerMock (https://github.com/powermock/powermock/issues/783) will fail as neither supports Java 9 currently. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jason Gustafson <jason@confluent.io> Closes #3647 from ijuma/kafka-4501-support-java-9	2017-08-19 08:55:29 +01:00
radai-rosenblatt	47ee8e954d	KAFKA-4602; KIP-72 - Allow putting a bound on memory consumed by Incoming requests this is the initial implementation. Author: radai-rosenblatt <radai.rosenblatt@gmail.com> Reviewers: Ewen Cheslack-Postava <me@ewencp.org>, Ismael Juma <ismael@juma.me.uk>, Rajini Sivaram <rajinisivaram@googlemail.com>, Jun Rao <junrao@gmail.com> Closes #2330 from radai-rosenblatt/broker-memory-pool-with-muting	2017-07-26 08:19:56 +02:00
Eno Thereska	55a90938a1	MINOR: add Yahoo benchmark to nightly runs Author: Eno Thereska <eno.thereska@gmail.com> Reviewers: Damian Guy <damian.guy@gmail.com> Closes #3289 from enothereska/yahoo-benchmark	2017-06-21 11:46:59 +01:00
Ismael Juma	b20d9333be	KAFKA-5274: AdminClient Javadoc improvements Publish Javadoc for common.annotation package, which contains InterfaceStability. Finally, mark AdminClient classes with `Evolving` instead of `Unstable`. Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Colin Mccabe, Gwen Shapira Closes #3316 from ijuma/kafka-5274-admin-client-javadoc	2017-06-14 08:57:49 -07:00
Matthias J. Sax	ba07d828c5	KAFKA-5362: Add EOS system tests for Streams API Author: Matthias J. Sax <matthias@confluent.io> Reviewers: Damian Guy <damian.guy@gmail.com>, Bill Bejeck <bill@confluent.io>, Guozhang Wang <wangguoz@gmail.com> Closes #3201 from mjsax/kafka-5362-add-eos-system-tests-for-streams-api	2017-06-08 14:08:54 -07:00
Ismael Juma	c7bc8f7d8c	MINOR: Remove redundant volatile write in RecordHeaders The JMH benchmark included shows that the redundant volatile write causes the constructor of `ProducerRecord` to take more than 50% longer: ProducerRecordBenchmark.constructorBenchmark avgt 15 24.136 ± 1.458 ns/op (before) ProducerRecordBenchmark.constructorBenchmark avgt 15 14.904 ± 0.231 ns/op (after) Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jason Gustafson <jason@confluent.io> Closes #3233 from ijuma/remove-volatile-write-in-records-header-constructor	2017-06-04 10:48:34 -07:00
Colin P. Mccabe	f389b71570	KAFKA-5374; Set allow auto topic creation to false when requesting node information only It avoids the need to handle protocol downgrades and it's safe (i.e. it will never cause the auto creation of topics). Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3220 from ijuma/kafka-5374-admin-client-metadata	2017-06-03 06:26:16 +01:00
Colin P. Mccabe	da9a171c99	KAFKA-5265; Move ACLs, Config, Topic classes into org.apache.kafka.common Also introduce TopicConfig. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #3120 from cmccabe/KAFKA-5265	2017-05-31 17:35:31 +01:00
Xavier Léauté	c060c48285	KAFKA-5150; Reduce LZ4 decompression overhead - reuse decompression buffers in consumer Fetcher - switch lz4 input stream to operate directly on ByteBuffers - avoids performance impact of catching exceptions when reaching the end of legacy record batches - more tests with both compressible / incompressible data, multiple blocks, and various other combinations to increase code coverage - fixes bug that would cause exception instead of invalid block size for invalid incompressible blocks - fixes bug if incompressible flag is set on end frame block size Overall this improves LZ4 decompression performance by up to 40x for small batches. Most improvements are seen for batches of size 1 with messages on the order of ~100B. We see at least 2x improvements for batch sizes of < 10 messages, containing messages < 10kB. This patch also yields 2-4x improvements on v1 small single message batches for other compression types. Full benchmark results can be found here https://gist.github.com/xvrl/05132e0643513df4adf842288be86efd Author: Xavier Léauté <xavier@confluent.io> Author: Ismael Juma <ismael@juma.me.uk> Reviewers: Jason Gustafson <jason@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #2967 from xvrl/kafka-5150	2017-05-31 02:22:07 +01:00
Konstantine Karantasis	45f2261763	KAFKA-3487: Support classloading isolation in Connect (KIP-146) Author: Konstantine Karantasis <konstantine@confluent.io> Reviewers: Randall Hauch <rhauch@gmail.com>, Ewen Cheslack-Postava <ewen@confluent.io> Closes #3028 from kkonstantine/KAFKA-3487-Support-classloading-isolation-in-Connect	2017-05-18 10:39:15 -07:00
Colin P. Mccabe	9815e18fef	KAFKA-3266; Describe, Create and Delete ACLs Admin APIs (KIP-140) Includes server-side code, protocol and AdminClient. Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Ismael Juma <ismael@juma.me.uk> Closes #2941 from cmccabe/KAFKA-3266	2017-05-18 03:20:30 +01:00
Colin P. Mccabe	4aed28d189	KAFKA-3265; Add a public AdminClient API in Java (KIP-117) Author: Colin P. Mccabe <cmccabe@confluent.io> Reviewers: Dan Norwood <norwood@confluent.io>, Ismael Juma <ismael@juma.me.uk> Closes #2472 from cmccabe/KAFKA-3265	2017-05-02 00:20:22 +01:00
Michael Andre Pearce	6185bc0276	KAFKA-4208; Add Record Headers As per KIP-82 Adding record headers api to ProducerRecord, ConsumerRecord Support to convert from protocol to api added Kafka Producer, Kafka Fetcher (Consumer) Updated MirrorMaker, ConsoleConsumer and scala BaseConsumer Add RecordHeaders and RecordHeader implementation of the interfaces Headers and Header Some bits using are reverted to being Java 7 compatible, for the moment until KIP-118 is implemented. Author: Michael Andre Pearce <Michael.Andre.Pearce@me.com> Reviewers: Radai Rosenblatt <radai.rosenblatt@gmail.com>, Jiangjie Qin <becket.qin@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io> Closes #2772 from michaelandrepearce/KIP-82	2017-04-28 19:18:27 -07:00
Apurva Mehta	a82f194b21	KAFKA-4818; Exactly once transactional clients Author: Apurva Mehta <apurva@confluent.io> Reviewers: Guozhang Wang <wangguoz@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Jason Gustafson <jason@confluent.io> Closes #2840 from apurvam/exactly-once-transactional-clients	2017-04-27 14:11:36 -07:00
Jason Gustafson	5bd06f1d54	KAFKA-4816; Message format changes for idempotent/transactional producer (KIP-98) Author: Jason Gustafson <jason@confluent.io> Reviewers: Jun Rao <junrao@gmail.com>, Apurva Mehta <apurva@confluent.io>, Guozhang Wang <wangguoz@gmail.com>, Ismael Juma <ismael@juma.me.uk> Closes #2614 from hachikuji/exactly-once-message-format	2017-03-24 19:38:43 +00:00

293 Commits