Why:
A RabbitMQ operator should be able to see whether RabbitMQ drops MQTT
QoS 0 messages due to overload protection. It's an indication that an
MQTT subscriber does not consume fast enough.
How:
Use Prometheus global counters.
There are 2 valid solutions:
1. Introduce a new metric called messages_dropped specifically for the
rabbitmq_mqtt_qos0_queue type. This would work similarly to how
streams extend the per-protocol global counters, but requires
extending the per-protocol & queue type global counters for the MQTT
QoS 0 queue type. The emitted metrics would look as follows:
```
rabbitmq_global_messages_dropped_total{protocol="mqtt310",queue_type="rabbit_mqtt_qos0_queue"} 0
rabbitmq_global_messages_dropped_total{protocol="mqtt311",queue_type="rabbit_mqtt_qos0_queue"} 0
rabbitmq_global_messages_dropped_total{protocol="mqtt50",queue_type="rabbit_mqtt_qos0_queue"} 0
```
2. Reuse the existing metric rabbitmq_global_messages_dead_lettered_maxlen_total.
This commit goes for the 2nd approach because:
a) There is no need to add a new metric. Even though dead lettering is not supported
for the MQTT QoS 0 queue type, this metric maps nicely to
what happens: the queue drops messages since its max length
(mqtt.mailbox_soft_limit) is exceeded with overflow behaviour
drop-head. Furthermore, the label `dead_letter_strategy="disabled"` tells
us that no dead lettering takes place from this queue type.
b) This metric allows us to support dead lettering for the MQTT QoS 0 queue
type in the future.
The new dead lettering metrics look as follows:
```
rabbitmq_global_messages_dead_lettered_maxlen_total{queue_type="rabbit_classic_queue",dead_letter_strategy="at_most_once"} 0
rabbitmq_global_messages_dead_lettered_maxlen_total{queue_type="rabbit_classic_queue",dead_letter_strategy="disabled"} 0
rabbitmq_global_messages_dead_lettered_maxlen_total{queue_type="rabbit_mqtt_qos0_queue",dead_letter_strategy="disabled"} 0
rabbitmq_global_messages_dead_lettered_maxlen_total{queue_type="rabbit_quorum_queue",dead_letter_strategy="at_most_once"} 0
rabbitmq_global_messages_dead_lettered_maxlen_total{queue_type="rabbit_quorum_queue",dead_letter_strategy="disabled"} 0
rabbitmq_global_messages_dead_lettered_expired_total{queue_type="rabbit_classic_queue",dead_letter_strategy="at_most_once"} 0
rabbitmq_global_messages_dead_lettered_expired_total{queue_type="rabbit_classic_queue",dead_letter_strategy="disabled"} 0
rabbitmq_global_messages_dead_lettered_expired_total{queue_type="rabbit_quorum_queue",dead_letter_strategy="at_least_once"} 0
rabbitmq_global_messages_dead_lettered_expired_total{queue_type="rabbit_quorum_queue",dead_letter_strategy="at_most_once"} 0
rabbitmq_global_messages_dead_lettered_expired_total{queue_type="rabbit_quorum_queue",dead_letter_strategy="disabled"} 0
rabbitmq_global_messages_dead_lettered_rejected_total{queue_type="rabbit_classic_queue",dead_letter_strategy="at_most_once"} 0
rabbitmq_global_messages_dead_lettered_rejected_total{queue_type="rabbit_classic_queue",dead_letter_strategy="disabled"} 0
rabbitmq_global_messages_dead_lettered_rejected_total{queue_type="rabbit_quorum_queue",dead_letter_strategy="at_least_once"} 0
rabbitmq_global_messages_dead_lettered_rejected_total{queue_type="rabbit_quorum_queue",dead_letter_strategy="at_most_once"} 0
rabbitmq_global_messages_dead_lettered_rejected_total{queue_type="rabbit_quorum_queue",dead_letter_strategy="disabled"} 0
rabbitmq_global_messages_dead_lettered_delivery_limit_total{queue_type="rabbit_quorum_queue",dead_letter_strategy="at_least_once"} 0
rabbitmq_global_messages_dead_lettered_delivery_limit_total{queue_type="rabbit_quorum_queue",dead_letter_strategy="at_most_once"} 0
rabbitmq_global_messages_dead_lettered_delivery_limit_total{queue_type="rabbit_quorum_queue",dead_letter_strategy="disabled"} 0
rabbitmq_global_messages_dead_lettered_confirmed_total{queue_type="rabbit_quorum_queue",dead_letter_strategy="at_least_once"} 0
```
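For example, an operator could watch for QoS 0 drops with a Prometheus query
along these lines (the query is illustrative, not part of this commit):
```
rate(rabbitmq_global_messages_dead_lettered_maxlen_total{queue_type="rabbit_mqtt_qos0_queue"}[5m]) > 0
```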
MQTT 5.0 supports negative acknowledgements thanks to reason codes
in the PUBACK packet.
We could choose either reason code 128 or 131. The description of
reason code 131 applies to rejected messages, hence this commit uses 131:
> The PUBLISH is valid but the receiver is not willing to accept it.
The AMQP 1.0 spec states:
> The annotations type is a map where the keys are restricted to be of type symbol or of type ulong.
> All ulong keys, and all symbolic keys except those beginning with "x-" are reserved.
> Keys beginning with "x-opt-" MUST be ignored if not understood.
> On receiving an annotation key which is not understood, and which does not begin with "x-opt",
> the receiving AMQP container MUST detach the link with a not-implemented error.
So, for new headers being introduced, it makes sense to comply with that
AMQP 1.0 requirement: we don't want to force the receiver to understand
this header.
Starting with RabbitMQ 3.13 mqtt.max_session_expiry_interval_seconds
(set in seconds) will replace the previous setting
mqtt.subscription_ttl.
MQTT 5.0 introduces the Session Expiry Interval
feature, which applies not only to subscribers but also to
publishers.
The new config name mqtt.max_session_expiry_interval_seconds makes it clear
that it also applies to publishers.
Prior to this commit, when mqtt.subscription_ttl was set, a warning was
logged and the value was ignored. This is dangerous if an operator does
not see the warning but relies, for example, on `mqtt.subscription_ttl =
infinity` to not expire non-clean sessions.
It's safer to make the boot fail if that unsupported config name is
still set. A clear error message will be logged:
```
[error] <0.142.0> Error preparing configuration in phase apply_translations:
[error] <0.142.0> - Translation for 'rabbitmq_mqtt.subscription_ttl' found invalid configuration:
Since 3.13 mqtt.subscription_ttl (in milliseconds) is unsupported.
Use mqtt.max_session_expiry_interval_seconds (in seconds) instead.
```
Alternatively, RabbitMQ could translate mqtt.subscription_ttl to
mqtt.max_session_expiry_interval_seconds.
However, forcing the new config option seems the better way to go.
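For example, in rabbitmq.conf (the value is illustrative):
```
mqtt.max_session_expiry_interval_seconds = 86400
```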
Once we write MQTT 5.0 docs, this change must go into the 3.13 release notes.
This commit also renames max_session_expiry_interval_secs to max_session_expiry_interval_seconds.
The latter is clearer to users.
Display the User Property of the CONNECT packet in the Management UI
and CLI, as requested in
https://github.com/rabbitmq/rabbitmq-server/issues/2554#issuecomment-1604205277:
"User Properties on the CONNECT packet can be used to send connection related properties from the Client to the Server.
The meaning of these properties is not defined by this specification."
[v5 3.1.2.11.8]
It makes sense to display the User Property of the CONNECT packet in the
connection's Client Properties in the Management UI.
For correlation IDs greater than 255 bytes, AMQP 1.0 to AMQP 0.9.1
conversion puts the correlation ID into an AMQP 0.9.1 header called
x-correlation-id (see
a17b28ec61/deps/rabbit/src/rabbit_msg_record.erl (L380)
)
Let's use the same key for the MQTT 5.0 to AMQP 0.9.1 conversion.
This commit fixes the following crash:
```
2388-2023-06-21 08:53:28.189519+00:00 [error] <0.2191.0> exception error: bad argument
2389-2023-06-21 08:53:28.189519+00:00 [error] <0.2191.0> in function lists:keymember/3
2390-2023-06-21 08:53:28.189519+00:00 [error] <0.2191.0> called as lists:keymember(<<"x-mqtt-publish-qos">>,1,undefined)
2391-2023-06-21 08:53:28.189519+00:00 [error] <0.2191.0> *** argument 3: not a list
2392-2023-06-21 08:53:28.189519+00:00 [error] <0.2191.0> in call from rabbit_mqtt_processor:amqp_props_to_mqtt_props/2 (rabbit_mqtt_processor.erl, line 2219)
```
This crash occurs when a message without AMQP 0.9.1 #P_basic.headers
is sent to the MQTT v4 connection process, but consumed by an MQTT v5
connection process.
This is the case when an AMQP 0.9.1 client sends a message to a v4 MQTT
client, and this same client subsequently upgrades to v5 consuming the
message.
When sending from AMQP 0.9.1 client directly to a v5 MQTT connection,
the AMQP 0.9.1 header <<"x-binding-keys">> is set.
When sending from AMQP 0.9.1 client directly to a v4 MQTT connection,
no header is set, but this code branch was not evaluated.
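The fix amounts to treating absent headers (undefined) as an empty list
before the lookup. A minimal sketch, with a hypothetical helper name:
```
%% Hypothetical helper; the actual fix is in
%% rabbit_mqtt_processor:amqp_props_to_mqtt_props/2.
header_member(Key, undefined) ->
    header_member(Key, []);
header_member(Key, Headers) when is_list(Headers) ->
    lists:keymember(Key, 1, Headers).
```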
Allow rabbit_exchange:route_return() to return infos in addition to
the matched binding keys.
For example, some exchange types read the full #amqqueue{} record from the database
and can in future return it from the route/3 function to avoid reading the full
record again in the channel.
RabbitMQ MQTT already supports authenticating clients via
username + password, OAuth tokens, and certificates.
We could make use of RabbitMQ SASL mechanisms in the future,
if needed. For now, if the client wants to perform extended
authentication, we return Bad Authentication Method in the CONNACK
packet.
Once the server's Topic Alias cache for messages from server to client
is full, this commit does not replace any existing aliases.
So, the first topics "win" and stay in the cache forever.
This matches the behaviour of VerneMQ and EMQX.
For now that's good enough.
In the future, we can easily change that behaviour to some smarter strategy,
for example
1. Hash the TopicName to a Topic Alias and replace the old
alias, or
2. For the Topic Alias cache from server to client, keep 2 maps:
#{TopicName => TopicAlias} and #{TopicAlias => TopicName}, and a
counter that wraps to 1 once the Topic Alias Maximum is reached, and
just replace an existing alias if the TopicName is not cached
(sketched below).
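A minimal sketch of strategy 2 (module, record, and function names are
made up; this is not the committed implementation):
```
-module(mqtt_topic_alias_cache).
-export([new/1, alias_for/2]).

-record(cache, {max :: pos_integer(),
                next = 1 :: pos_integer(),
                name_to_alias = #{} :: #{binary() => pos_integer()},
                alias_to_name = #{} :: #{pos_integer() => binary()}}).

new(TopicAliasMaximum) when TopicAliasMaximum > 0 ->
    #cache{max = TopicAliasMaximum}.

%% Returns {Alias, Cache1}. If the TopicName is not cached, the alias slot
%% pointed at by the wrapping counter is (re)assigned to it.
alias_for(TopicName, #cache{name_to_alias = N2A} = C) ->
    case maps:find(TopicName, N2A) of
        {ok, Alias} ->
            {Alias, C};
        error ->
            #cache{next = Next, max = Max, alias_to_name = A2N} = C,
            %% Evict whichever topic currently holds alias Next, if any.
            N2A1 = case maps:find(Next, A2N) of
                       {ok, OldName} -> maps:remove(OldName, N2A);
                       error -> N2A
                   end,
            {Next, C#cache{name_to_alias = N2A1#{TopicName => Next},
                           alias_to_name = A2N#{Next => TopicName},
                           next = case Next of Max -> 1; _ -> Next + 1 end}}
    end.
```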
Also, refactor Topic Alias Maximum:
* Remove code duplication
* Allow an operator to prohibit Topic Aliases by allowing value 0 to be
configured
* Change the config name to topic_alias_maximum so that it matches exactly
the MQTT feature name
* Fix wrong code formatting
* Add the invalid or unknown Topic Alias to the log message for easier
troubleshooting
For MQTT 5.0 destination queues, the topic exchange does not only have
to return the destination queue names, but also the matched binding
keys.
This is needed to implement MQTT 5.0 subscription options No Local,
Retain As Published and Subscription Identifiers.
Prior to this commit, as the trie was walked down, we remembered the
edges being walked and assembled the final binding key with
list_to_binary/1.
list_to_binary/1 is very expensive with long lists (long topic names),
even in OTP 26.
The CPU flame graph showed ~3% of CPU usage was spent only in
list_to_binary/1.
Unfortunately and unnecessarily, the current topic exchange
implementation stores topic levels as lists.
It would be better to store topic levels as binaries:
split_topic_key/1 should ideally use binary:split/3, similar to the following:
```
1> P = binary:compile_pattern(<<".">>).
{bm,#Ref<0.1273071188.1488322568.63736>}
2> Bin = <<"aaa.bbb..ccc">>.
<<"aaa.bbb..ccc">>
3> binary:split(Bin, P, [global]).
[<<"aaa">>,<<"bbb">>,<<>>,<<"ccc">>]
```
The compiled pattern could be placed into a persistent term.
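For illustration, assuming a made-up persistent_term key:
```
%% Compile the pattern once at application start:
init_topic_split_pattern() ->
    persistent_term:put(mqtt_topic_split_pattern,
                        binary:compile_pattern(<<".">>)).

%% On the hot path:
split_topic_key(Key) when is_binary(Key) ->
    binary:split(Key, persistent_term:get(mqtt_topic_split_pattern), [global]).
```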
This commit decided to avoid migrating Mnesia tables to use binaries
instead of lists. Mnesia migrations are non-trivial, especially with the
current feature flag subsystem.
Furthermore the Mnesia topic tables are already getting migrated to
their Khepri counterparts in 3.13.
Adding additional migration only for Mnesia does not make sense.
So, instead of assembling the binding key as we walk down the trie and
then calling list_to_binary/1 in the leaf, it
would be better to just fetch the binding key from the database in the leaf.
As we reach the leaf of the trie, we know both source and destination.
Unfortunately, we cannot fetch the binding key efficiently with the
current rabbit_route (sorted by source exchange) and
rabbit_reverse_route (sorted by destination) tables as the key is in
the middle between source and destination.
If there is a huge number of bindings for a given source exchange (very
realistic in MQTT use cases) or a large number of bindings for a given
destination (also realistic), it would require scanning this large
number of bindings.
Therefore this commit takes the simplest possible solution:
The solution leverages the fact that binding arguments are already part of
table rabbit_topic_trie_binding.
So, if we simply include the binding key into the binding arguments, we
can fetch and return it efficiently in the topic exchange
implementation.
The following patch, which omits fetching the empty-list binding argument
(the default), makes routing slower because function
`analyze_pattern.constprop.0` requires significantly more (~2.5%) CPU time:
```
@@ -273,7 +273,11 @@ trie_bindings(X, Node) ->
node_id = Node,
destination = '$1',
arguments = '$2'}},
- mnesia:select(?MNESIA_BINDING_TABLE, [{MatchHead, [], [{{'$1', '$2'}}]}]).
+ mnesia:select(
+ ?MNESIA_BINDING_TABLE,
+ [{MatchHead, [{'andalso', {'is_list', '$2'}, {'=/=', '$2', []}}], [{{'$1', '$2'}}]},
+ {MatchHead, [], ['$1']}
+ ]).
```
Hence, this commit always fetches the binding arguments.
All MQTT 5.0 destination queues will create a binding that
contains the binding key in the binding arguments.
Not only does this solution avoid expensive list_to_binary/1 calls, but
it also means that the Erlang app rabbit (specifically the topic exchange
implementation) does not need to be aware of MQTT anymore:
it just returns the binding key when the binding arguments tell it to do so.
In the future, once the Khepri migration has completed, we should be able to
remove the binding key from the binding arguments again relatively simply
to free up some storage space.
Note that one of the advantages of a trie data structure is its space
efficiency: the same prefixes don't have to be stored multiple
times.
However, for RabbitMQ the binding key is already stored at least N times
in various routing tables, so storing it a few times more via the
binding arguments should be acceptable.
The speed improvements are favoured over a few more MBs ETS usage.
The following PUBLISH and Will properties are forwarded unaltered by the
server:
* Payload Format Indicator
* Content Type
* Response Topic
* Correlation Data
* User Property
Not only must these properties be forwarded unaltered from an MQTT
publishing client to an MQTT receiving client, but it would also be nice
to allow for protocol interoperability:
Think about RPC request-response style patterns where the requester is
an MQTT client and the responder is an AMQP 0.9.1 or STOMP client.
We reuse the P_basic fields where possible:
* content_type (if <= 255 bytes)
* correlation_id (if <= 255 bytes)
Otherwise, we add custom AMQP 0.9.1 headers.
The headers follow the naming pattern "x-mqtt-<property>", where
<property> is the MQTT v5 property, if that property makes sense only
(or mainly) in the MQTT world:
* x-mqtt-user-property
* x-mqtt-payload-format-indicator
If the MQTT v5 property also makes sense outside of the MQTT world, we
name it more generically:
* x-correlation-id (if > 255 bytes)
* x-reply-to-topic (since P_basic.reply_to assumes a queue name)
In the future, we can think about adding a header x-reply-to-exchange
and have the MQTT plugin set its value to the configured mqtt.exchange
such that clients don't have to assume the default topic exchange amq.topic.
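For illustration, a v5 PUBLISH carrying these properties might map to
AMQP 0.9.1 roughly as follows (a sketch of the mapping described above;
the encoding of the User Property pairs is simplified):
```
%% #'P_basic'{} is defined in rabbit_common/include/rabbit_framing.hrl.
example_props() ->
    #'P_basic'{
       content_type   = <<"application/json">>,   %% Content Type (=< 255 bytes)
       correlation_id = <<"req-42">>,             %% Correlation Data (=< 255 bytes)
       headers = [{<<"x-mqtt-payload-format-indicator">>, byte, 1},
                  {<<"x-reply-to-topic">>, longstr, <<"responses/client-1">>},
                  {<<"x-mqtt-user-property">>, table,
                   [{<<"k1">>, longstr, <<"v1">>}]}]}.
```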
Previously, the Will Message was kept in memory in the MQTT
connection process state. Upon termination, the Will Message was sent.
The new MQTT 5.0 feature Will Delay Interval requires storing the Will
Message outside of the MQTT connection process state.
The Will Message should not be stored node local because the client
could reconnect to a different node.
Storing the Will Message in Mnesia is not an option because we want to
get rid of Mnesia. Storing the Will Message in a Ra cluster or in Khepri
is only an option if the Will Payload is small as there is currently no
way in Ra to **efficiently** snapshot large binary data (Note that these
Will Messages are not consumed in a FIFO style workload like messages in
quorum queues. A Will Message needs to be stored for as long as the
Session lasts - up to 1 day by default, but could also be much longer if
RabbitMQ is configured with a higher maximum session expiry interval.)
Usually Will Payloads are small: they are just a notification that the
client's MQTT session ended abnormally. However, we don't know how users leverage
the Will Message feature. The MQTT protocol allows for large Will Payloads.
Therefore, the solution implemented in this commit - which should work
well enough - is storing the Will Message in a queue.
Each MQTT session with a Session Expiry Interval and Will Delay
Interval of > 0 seconds will create a queue when the current Network
Connection ends, in which it stores its Will Message. The Will Message has a
message TTL set (corresponding to the Will Delay Interval) and the queue
has a queue TTL set (corresponding to the Session Expiry Interval).
If the client does not reconnect within the Will Delay Interval, the
message is dead lettered to the configured MQTT topic exchange
(amq.topic by default).
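A sketch of the queue and message properties involved (helper names are
made up; the x-* arguments are the standard RabbitMQ queue arguments):
```
%% The queue dies with the session:
will_queue_args(SessionExpiryIntervalMs) ->
    [{<<"x-expires">>, long, SessionExpiryIntervalMs},
     {<<"x-dead-letter-exchange">>, longstr, <<"amq.topic">>}].

%% The Will Message dies with the Will Delay Interval and is then
%% dead lettered to the topic exchange (#'P_basic'{} is from
%% rabbit_common/include/rabbit_framing.hrl):
will_message_props(WillDelayIntervalMs) ->
    #'P_basic'{expiration = integer_to_binary(WillDelayIntervalMs)}.
```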
The Will Delay Interval can be set by both publishers and subscribers.
Therefore, the Will Message is the 1st session state that RabbitMQ keeps
for publish-only MQTT clients.
One current limitation of this commit is that a Will Message that is
delayed (i.e. Will Delay Interval is set) and retained (i.e. Will Retain
flag set) will not be retained.
One solution to retain delayed Will Messages is that the retainer
process consumes from a queue and the queue binds to the topic exchange
with a topic starting with `$`, for example `$retain/#`.
The AMQP 0.9.1 Will Message that is dead lettered could then get a
CC header added such that it is published not only with the Will Topic,
but also with the `$retain` topic. For example, if the Will Topic is `a/b`,
it will be published with routing key `a/b` and CC header `$retain/a/b`.
The reason this is not implemented in this commit is that, to keep the
currently broken retained message store behaviour, we would require
creating at least one queue per node and publishing only to that local
queue. In the future, once we have a replicated retained message store based
on a Stream, for example, we could just publish all retained messages to
the `$retain` topic and therefore into the Stream.
So, for now, we list delayed and retained Will Messages as a limitation:
they actually won't be retained.
Every queue type sets the queue pid when it creates the queue.
Prior to this commit, the queue pid set within the MQTT connection
process was a bit confusing as the queue pid differs between
classic queues and quorum queues.
This commit fixes 2 separate issues:
1. No quorum queue got created in v5 because Session Expiry Interval was 0.
2. Fix a function_clause error. Pass the decoded properties further to other
functions looking up headers.
The MQTT v5 spec is a bit vague on Retain Handling 1:
"If Retain Handling is set to 1 then if the subscription did not
already exist, the Server MUST send all retained message matching the
Topic Filter of the subscription to the Client, and if the subscription
did exist the Server MUST NOT send the retained messages.
[MQTT-3.3.1-10]." [v5 3.3.1.3]
Does a subscription with the same topic filter but different
subscription options mean that "the subscription did exist"?
This commit interprets "subscription exists" as meaning that both the
topic filter and the subscription options are the same.
Therefore, if a client creates a subscription whose topic filter is
identical to a previous subscription but whose subscription options
differ, with Retain Handling 1 the server sends the retained
message.
Support subscription options "No Local" and "Retain As Published"
as well as Subscription Identifiers.
All three MQTT 5.0 features can be set on a per subscription basis.
Due to wildcards in topic filters, multiple subscriptions
can match a given topic. Therefore, to implement Retain As Published and
Subscription Identifiers, the destination MQTT connection process needs
to know what subscription(s) caused it to receive the message.
There are a few ways how this could be implemented:
1. The destination MQTT connection process is aware of all its
subscriptions. Whenever it receives a message, it can match the
message's routing key / topic against all its known topic filters.
However, iteratively matching the routing key against all topic
filters for every received message can become very expensive in the
worst case when the MQTT client creates many subscriptions containing
wildcards. This could be the case for an MQTT client that acts as a
bridge or proxy or dispatcher: it could subscribe via a wildcard for
each of its own clients.
2. Instead of iteratively matching the topic of the received message
against all topic filters that contain wildcards, a better approach
would be for every MQTT subscriber connection process to maintain a
local trie data structure (similar to how topic exchanges are
implemented) and therefore perform matching more efficiently.
However, this does not sound optimal either because routing is
effectively performed twice: in the topic exchange and again against
a much smaller trie in each destination connection process.
3. Given that the topic exchange already performs routing, a much more
sensible way would be to send the matched binding key(s) to the
destination MQTT connection process. A subscription (topic filter)
maps to a binding key in AMQP 0.9.1 routing. Therefore, for the first
time in RabbitMQ, the routing function should not only output a list
of unique destination queues, but also the binding keys (subscriptions)
that caused the message to be routed to the destination queue.
This commit therefore implements the 3rd approach.
The downside of the 3rd approach is that it requires API changes to the
routing function and topic exchange.
Specifically, this commit adds a new function rabbit_exchange:route/3
that accepts a list of routing options. If that list contains version 2,
the caller of the routing function knows how to handle the return value
that could also contain binding keys.
This commit allows an MQTT connection process, the channel process, and
at-most-once dead lettering to handle binding keys. Binding keys are
included as AMQP 0.9.1 headers into the basic message.
Therefore, whenever a message is sent from an MQTT client or AMQP 0.9.1
client or AMQP 1.0 client or STOMP client, the MQTT receiver will know
the subscription identifier that caused the message to be received.
Note that due to the low number of allowed wildcard characters (# and
+), the cardinality of matched binding keys shouldn't be high even if
the topic contains for example 3 levels and the message is sent to for
example 5 million destination queues. In other words, sending multiple
distinct basic messages to the destination shouldn't hurt the delegate
optimisation too much. The delegate optimisation implemented for classic
queues and rabbit_mqtt_qos0_queue(s) still takes place for all basic
messages that contain the same set of matched binding keys.
The topic exchange returns all matched binding keys by remembering the
edges walked down to the leaves. As an optimisation, only for MQTT
queues are binding keys being returned. This does add a small dependency
from app rabbit to app rabbitmq_mqtt which is not optimal. However, this
dependency should be simple to remove when omitting this optimisation.
Another important feature of this commit is persisting subscription
options and subscription identifiers because they are part of the
MQTT 5.0 session state.
In MQTT v3 and v4, the only subscription information that was part of
the session state was the topic filter and the QoS level.
Both pieces of information were implicitly stored in the form of bindings:
the topic filter as the binding key and the QoS level as the destination
queue name of the binding.
For MQTT v5 we need to persist more subscription information.
From a domain perspective, it makes sense to store subscription options
as part of subscriptions, i.e. bindings, even though they are currently
not used in routing.
Therefore, this commit stores subscription options as binding arguments.
Storing subscription options as binding arguments in turn comes with
new challenges: how to handle mixed version clusters and how to upgrade an
MQTT session from v3 or v4 to v5?
Imagine an MQTT client connects via v5 with Session Expiry Interval > 0
to a new node in a mixed version cluster, creates a subscription,
disconnects, and subsequently connects via v3 to an old node. The
client should continue to receive messages.
To simplify such edge cases, this commit introduces a new feature flag
called mqtt_v5. If mqtt_v5 is disabled, clients cannot connect to
RabbitMQ via MQTT 5.0.
This still doesn't entirely solve the problem of MQTT session upgrades
(v4 to v5 client) or session downgrades (v5 to v4 client).
Ideally, once mqtt_v5 is enabled, all MQTT bindings contain non-empty binding
arguments. However, this will require a feature flag migration function
to modify all MQTT bindings. To be more precise, all MQTT bindings need
to be deleted and added because the binding argument is part of the
Mnesia table key.
Since feature flag migration functions are non-trivial to implement in
RabbitMQ (they can run on every node multiple times and concurrently),
this commit takes a simpler approach:
All v3 / v4 sessions keep the empty binding argument [].
All v5 sessions use the new binding argument [#mqtt_subscription_opts{}].
This requires only handling a session upgrade / downgrade by
creating a binding (with the new binding arg) and deleting the old
binding (with the old binding arg) when processing the CONNECT packet.
Note that such session upgrades or downgrades should be rather rare in
practice. Therefore these binding transactions shouldn't hurt performance.
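A sketch of that binding swap on CONNECT (helper names are hypothetical):
```
%% v3/v4 bindings carry binding argument [];
%% v5 bindings carry [#mqtt_subscription_opts{}].
maybe_swap_binding(_TopicFilter, _Qos, Args, Args) ->
    ok;  %% same protocol generation, nothing to do
maybe_swap_binding(TopicFilter, Qos, OldArgs, NewArgs) ->
    ok = add_binding(TopicFilter, Qos, NewArgs),
    ok = delete_binding(TopicFilter, Qos, OldArgs).
```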
The No Local option is implemented within the MQTT publishing connection
process: The message is not sent to the MQTT destination if the
destination queue name matches the current MQTT client ID and the
message was routed due to a subscription that has the No Local flag set.
This avoids unnecessary traffic on the MQTT queue.
The alternative would have been that the "receiving side" (same process)
filters the message out - which would have been more consistent with how
Retain As Published and Subscription Identifiers are implemented, but
would have caused unnecessary load on the MQTT queue.
- This is a revision of commit 670d6b2, which implemented
Receive Maximum by sending as many unacknowledged messages
as the client has set as its receive maximum.
This commit adds back the server-side limit (10) on unacknowledged
messages, which is a much safer limit than a user-provided maximum.
- mqtt v5 has more descriptive reason codes for the SUBACK packet
- added two possible failure reason codes for the SUBACK packet:
one for permission errors, another for quota exceeded errors
- modified the auth suite to assert on reason codes for v5
- no new test case since failures were already covered
- renamed the processor state field prefetch to receive_maximum
to better match the property name in MQTT 5
- defaults to 10 (as previously) when not set;
not saved in session state, the configuration is per
connection
Allow Session Expiry Interval to be changed when client DISCONNECTs.
Deprecate config subscription_ttl in favour of max_session_expiry_interval_secs
because the Session Expiry Interval will also apply to publishers that
connect with a will message and will delay interval.
"The additional session state of an MQTT v5 server includes:
* The Will Message and the Will Delay Interval
* If the Session is currently not connected, the time at which the Session
will end and Session State will be discarded."
The Session Expiry Interval picked by the server and sent to the client
in the CONNACK is the minimum of max_session_expiry_interval_secs and
the requested Session Expiry Interval by the client in CONNECT.
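In other words (a one-line sketch):
```
%% Both values in seconds; the result is sent in the CONNACK.
granted_session_expiry_interval(RequestedByClient, MaxConfigured) ->
    min(RequestedByClient, MaxConfigured).
```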
This commit favours dynamically changing the queue argument x-expires
over creating millions of different policies since that many policies
will come with new scalability issues.
Dynamically changing queue arguments is not allowed by AMQP 0.9.1
clients. However, it should be perfectly okay for the MQTT plugin to do
so for the queues it manages. MQTT clients are not aware that these
queues exist.
"If a Server receives a CONNECT packet containing a Will QoS that
exceeds its capabilities, it MUST reject the connection. It SHOULD
use a CONNACK packet with Reason Code 0x9B (QoS not supported) as
described in section 4.13 Handling errors, and MUST close the Network
Connection."
MQTT v5 allows client and server to negatively ack a message by setting
a reason code of 128 or greater indicating failure.
"If PUBACK or PUBREC is received containing a Reason Code of 0x80 or greater
the corresponding PUBLISH packet is treated as acknowledged, and MUST NOT be
retransmitted [MQTT-4.4.0-2]."
Even though the spec prohibits resending such messages, if a client does
not accept a message, RabbitMQ can still dead letter the message.
MQTT v5 spec:
"If the current retained message for a Topic expires, it is discarded
and there will be no retained message for that topic."
This commit also supports Message Expiry Interval for retained messages
when a node is restarted.
Therefore, the insertion timestamp needs to be stored on disk.
Upon recovery, the Erlang timers are re-created.
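A sketch of that recovery step (function and message names are made up):
```
%% Re-arm the expiry timer from the stored insertion timestamp.
recover_expiry_timer(Topic, InsertedAtMs, MessageExpiryIntervalMs) ->
    ElapsedMs = erlang:system_time(millisecond) - InsertedAtMs,
    case MessageExpiryIntervalMs - ElapsedMs of
        RemainingMs when RemainingMs > 0 ->
            erlang:send_after(RemainingMs, self(), {expire_retained, Topic});
        _ ->
            %% Expired while the node was down: discard immediately.
            self() ! {expire_retained, Topic}
    end.
```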
- "If the Server included a Maximum QoS in its CONNACK response
to a Client and it receives a PUBLISH packet with a QoS greater than this
then it uses DISCONNECT with Reason Code 0x9B (QoS not supported)"
- only affects mqtt v5, server max qos is 1
This commit does not yet implement Message Expiry Interval of
* retained messages: "If the current retained message for a Topic
expires, it is discarded and there will be no retained message for
that topic."
"Retained messages do not form part of the Session State in the Server,
they are not deleted as a result of a Session ending."
Both retained message stores ETS and DETS implement recovery.
This commit adds a test that recovery works as intended.
"Allow the Client and Server to independently specify the maximum
packet size they support. It is an error for the session partner
to send a larger packet."
This commit implements the part where the Server specifies the maximum
packet size.
"In the case of an error in a CONNECT packet it MAY send a CONNACK
packet containing the Reason Code, before closing the Network
Connection. In the case of an error in any other packet it SHOULD send a
DISCONNECT packet containing the Reason Code before closing the Network
Connection."
This commit implements only the "SHOULD" (second) part, not the "MAY"
(first) part.
There are now 2 different server-wide MQTT settings:
1. max_packet_size_unauthenticated which applies to the CONNECT packet
(and maybe AUTH packet in the future)
2. max_packet_size_authenticated which applies to all other MQTT
packets (that is, after the client successfully authenticated).
These two settings will apply to all MQTT versions.
In MQTT v5, if a non-CONNECT packet is too large, the server will send a
DISCONNECT packet to the client with Reason Code "Packet Too Large"
before closing the network connection.
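A sketch of the server-side check (the config names are the ones above;
the surrounding handling is illustrative):
```
%% CONNECT is checked against the unauthenticated limit; everything
%% after successful authentication against the authenticated limit.
max_packet_size(connect, Config) ->
    maps:get(max_packet_size_unauthenticated, Config);
max_packet_size(_PacketType, Config) ->
    maps:get(max_packet_size_authenticated, Config).

check_packet_size(PacketType, PacketSize, Config) ->
    case PacketSize =< max_packet_size(PacketType, Config) of
        true  -> ok;
        false -> {error, packet_too_large}  %% v5: DISCONNECT, then close
    end.
```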
"Allow the Client and Server to independently specify the maximum
packet size they support. It is an error for the session partner
to send a larger packet."
This commit implements the part where the Client specifies the maximum
packet size.
As per the protocol spec, instead of sending it, the server drops an MQTT
packet that is too large.
A debug message is logged for "infrequent" packet types.
For PUBLISH packets, the message is rejected back to the queue such that it
will be dead lettered, if dead lettering is configured.
At the very least, Prometheus metrics for dead lettered messages will
be increased, even if dead lettering is not configured.