Commit Graph

225 Commits

Author SHA1 Message Date
Gerhard Lazu dec54306c9 Use publisher confirms for Quorum Queue workload
It activates and extra graph on the RabbitMQ-Overview dashboard and
let's be honest - why use Quorum Queues if the workload didn't care
whether the broker received the message? They go together, seriously!

Signed-off-by: Gerhard Lazu <gerhard@lazu.co.uk>
2020-02-11 13:23:26 +00:00
Gerhard Lazu e7c997744d Improve config for returning metrics per object
Since metrics are now aggregated by default, it made more sense to use
the inverse meaning of disabling aggregation, and call it a positive and
explicit action: return_per_object_metrics.

Naming pair: @michaelklishin

Signed-off-by: Gerhard Lazu <gerhard@lazu.co.uk>
2020-02-11 13:08:00 +00:00
Gerhard Lazu 4622974d1b Bump Grafana & Prometheus to latest
Signed-off-by: Gerhard Lazu <gerhard@lazu.co.uk>
2020-02-10 17:50:13 +00:00
Gerhard Lazu c079459e9c Bump Docker image to latest RabbitMQ 3.9 dev
Signed-off-by: Gerhard Lazu <gerhard@lazu.co.uk>
2020-02-10 17:49:06 +00:00
Gerhard Lazu e91e4ea32b Bump to latest RabbitMQ 3.9.0 dev build & Erlang/OTP v22.2.6
Signed-off-by: Gerhard Lazu <gerhard@lazu.co.uk>
2020-02-06 17:57:31 +00:00
Gerhard Lazu 8088a50e13 Merge pull request #28 from rabbitmq/metrics-aggregation
Option to aggregate channel, queue and connection metrics
2020-02-04 12:43:49 +00:00
Gerhard Lazu f632014e2c Bump RabbitMQ to latest dev & OTP to latest stable
Signed-off-by: Gerhard Lazu <gerhard@lazu.co.uk>
2020-01-15 19:02:33 +00:00
Gerhard Lazu 29c5d2e241 Fix QQ PerfTest instance name in Prometheus config
Signed-off-by: Gerhard Lazu <gerhard@lazu.co.uk>
2020-01-15 12:58:54 +00:00
Gerhard Lazu 89efb964d9 Convert raft_entry_commit_latency to seconds & be explicit about unit
This is a follow-up to https://github.com/rabbitmq/ra/pull/160

Had to introduce mf_convert/3 so that METRICS_REQUIRING_CONVERSIONS
proplist does not clash with METRICS_RAW proplists that have the same
number of elements. This is begging to be refactored, but I know that
@dcorbacho is working on https://github.com/rabbitmq/rabbitmq-prometheus/issues/26

Also modified the RabbitMQ-Quorum-Queues-Raft dashboard

Signed-off-by: Gerhard Lazu <gerhard@lazu.co.uk>
2020-01-07 16:20:59 +00:00
Gerhard Lazu 5602a9eb4c Update Docker image to latest dev
Signed-off-by: Gerhard Lazu <gerhard@lazu.co.uk>
2020-01-07 16:11:57 +00:00
Gerhard Lazu 1e96189826 Bump grafana version to latest stable since flant-statusmap-panel v0.2.0
Thanks @diafour & @briangann for grafana/grafana-plugin-repository#531 👍

Signed-off-by: Gerhard Lazu <gerhard@lazu.co.uk>
2019-12-20 10:09:04 +00:00
Gerhard Lazu 0af70418b9 Bump OTP to latest stable & RabbitMQ to latest dev in Dockerfile
Signed-off-by: Gerhard Lazu <gerhard@lazu.co.uk>
2019-12-20 10:07:58 +00:00
Gerhard Lazu a10c7ce6f1 Move the __inputs partial out of the Grafan dashboards dir
Grafan will keep failing with the following error message otherwise:

    failed to load dashboard from /dashboards/__inputs.json Dashboard title cannot be empty
2019-12-04 21:12:26 +00:00
Gerhard Lazu 400ebdf9f8 Publish Erlang-Distribution Grafana dashboard to grafana.com
https://grafana.com/grafana/dashboards/11352

[finishes #166355345]
2019-12-04 21:12:16 +00:00
Gerhard Lazu b51288cfc5 Drop Visualise in Erlang-Memory-Allocators dashboard description
[#169264435]
2019-12-04 12:15:35 +00:00
Gerhard Lazu ad450779ba Publish Erlang-Memory-Allocators Grafana dashboard to grafana.com
https://grafana.com/grafana/dashboards/11350

[finishes #169264435]
2019-12-04 12:11:41 +00:00
Gerhard Lazu 076c65becb Version control descriptions for all our grafana.com dashboards 2019-12-03 13:34:43 +00:00
Gerhard Lazu 35525db9df Publish RabbitMQ-Quorum-Queues-Raft Grafana dashboard to grafana.com
https://grafana.com/grafana/dashboards/11340

[finishes #166926415]
2019-12-03 13:33:31 +00:00
Gerhard Lazu 4df7e701ee Decrease load on qq deployment
It still puts a significant load on the host, but any lower and we won't
see any change in the Uncommited log entries graph, and too little
variation in the Log entry commit latency.
2019-12-03 11:28:01 +00:00
Gerhard Lazu 79284d0b02 Bump RabbitMQ to latest alpha 2019-12-03 11:26:40 +00:00
Gerhard Lazu d6f7c8b884 Update Prometheus & Grafana to latest stable
Well, almost. flat-statusmap-panel v0.1.1 breaks on Grafana v6.5.0.
Since it's already been mentioned in
https://github.com/flant/grafana-statusmap/issues/76 for a different
reason, let's wait until it this is addressed.
2019-11-26 21:44:55 +00:00
Gerhard Lazu 1723ac4357 Bump RabbitMQ to latest dev version + master
Makes it easy to test
https://github.com/rabbitmq/rabbitmq-prometheus/issues/19
2019-11-26 21:44:10 +00:00
Gerhard Lazu f752fa0640 Add make target for import-friendly rabbitmq-exporter_vs_rabbitmq-prometheus dashboard 2019-10-30 13:46:01 +00:00
Gerhard Lazu 098387db8b Bump to latest RabbitMQ 3.9.x alpha, OTP & PerfTest stable 2019-10-30 13:36:48 +00:00
Michal Kuratczyk 7b073f503d Make intervals consitent with other dashboards 2019-10-29 15:48:49 +01:00
Gerhard Lazu d43ff6356b Compare rabbitmq-exporter to rabbitmq-prometheus 2019-10-29 13:00:53 +00:00
Gerhard Lazu 83a52e3dc6 Finish Erlang-Memory-Allocators dashboard v1
[#169264435]
2019-10-28 15:09:09 +00:00
Gerhard Lazu 57c5f17fd9 Add Erlang-Memory-Allocators Grafana dashboard WIP
[#169264435]
2019-10-28 09:55:13 +00:00
Gerhard Lazu 1d64fe9f67 Add RabbitMQ-Overview screenshots published to grafana.com
[#167836027]
2019-10-21 15:30:36 +01:00
Gerhard Lazu 33f03d9aa1 Bump Erlang/OTP & RabbitMQ versions to latest 2019-10-21 13:30:04 +01:00
Gerhard Lazu 98e0932d25 Bump Prometheus & Grafana versions to latest stable 2019-10-21 11:50:44 +01:00
Gerhard Lazu 5f99991063 Rename RabbitMQ-Raft to RabbitMQ-Quorum-Queues-Raft
It captures the Quorum-Queues Raft, so let's be specific, especially
since we know that there will be other Raft implementations in RabbitMQ,
not just Quorum Queues.

[#166926415]
2019-10-21 11:49:26 +01:00
Gerhard Lazu 73908bdf7a Display RabbitMQ & Erlang/OTP version on RabbitMQ-Overview dashboard
It is essential to know which RabbitMQ & Erlang/OTP version the cluster
is running, as well as how many nodes there are in the cluster. We now
have a table which lists this information, right under all singlestat
panels.

The singlestat panels have been re-organized to make room for 2 new
ones: Nodes & Publishers. Classic & Quorum Queues would be great to
have, as would VHosts. The last singlestats that I would add are Alarms
& Partitions. This would bring the total number of singlestat panels to
14 (we currently have 10). While 14 feel overwhelming, it captures all
the important information that I believe is worth knowing about any
RabbitMQ cluster.

All message-related sections now display 2 graph panels instead of 3.
While 3 panels look good on 27" screens, they don't work as well on 15"
screens, which is what the majority will be using. Also the 3rd panel
would always be for anti-pattern graphs (e.g. unroutable messages,
polling operations, etc.) and would be mostly empty in the majority of
cases. Fitting fewer panels per row not only helps focusing and
understanding what is being displayed, but it also makes it easier to
compare when viewing 2 panels side-by-side, on 27" screens. Nodes &
churn sections still have 3 panels, which works well when 1 panel is
more important than the others. The compromise that we need to make is
between giving enough horizontal space to equally important panels vs
making the dashboard page too long. RabbitMQ-Overview has always been a
comprehensive dashboard which captures a lot of imformation, it was
always tough balancing the important vs the complete.

[finishes #167836027]
2019-10-21 11:28:02 +01:00
Gerhard Lazu 9bdc2c61ea Do not limit max CPU utilisation to 100%
Multiple cores can go above this limit.

[#168734621]
2019-10-21 10:52:02 +01:00
Gerhard Lazu a0423f0aa5 Limit memory & disk units to 2 decimals on RabbitMQ Overview
9.313226 GiB is a lot harder to read than 9.31 GiB, and therefore less
useful. Observing other people use this made it obvious that limiting
the precision was the human-friendly thing to do.
2019-10-17 09:14:54 +01:00
Gerhard Lazu de8a51c6f3 Allow multi RabbitMQ cluster selection when comparing Erlang Distribution
Thanks @acogoluegnes!

[#168734621]
2019-10-14 16:34:53 +01:00
Gerhard Lazu 13d603c5f0 Add screenshots used for RabbitMQ-Overview on grafana.com
https://grafana.com/grafana/dashboards/10991

[finishes #165818656]
2019-10-14 13:58:21 +01:00
Gerhard Lazu 69679b9e67 Add descriptions to Grafana dashboards
This will be used in the Short Description when uploading to grafana.com

[#168734621]
2019-10-14 13:11:11 +01:00
Gerhard Lazu 37da59a715 Add screenshots used for Erlang-Distributions-Compare on grafana.com
https://grafana.com/grafana/dashboards/10988

[#168734621]
2019-10-14 12:14:48 +01:00
Gerhard Lazu e214677a2c Rename vm to host, fix CPU expressions
[#168734621]
2019-10-14 11:57:26 +01:00
Gerhard Lazu fbb075d83c Finish Erlang-Distributions-Compare GRafana dashboard
* explains source of metrics via row names
* makes tables slightly wider to mitigate long names line wrapping
* do not limit entries in tables, refresh resets table pagination

[finishes #168734621]
2019-10-14 10:27:55 +01:00
Gerhard Lazu 9fc0d79238 Make Erlang-Distribution-Compare dashboard 15"-friendly
The yardstick for all Grafana dashboards should be 1920 x 1200, the
screen format most common in our team. If the dashboards look good on
our screens, they will look good on other screens too. Smalle
resolutions won't look too crammed, and bigger resolutions can be split
in half (e.g. 27" iMacs).

Some take-aways from optimising the layout of this dashboard:

* limit horizontal graph panels to 3
* limit horizontal panels to 2 if the information is dense (e.g. table + graph)
* use the same width for graph panels that need comparing, stack vertically
2019-10-14 09:19:16 +01:00
Gerhard Lazu a356e2f630 Set all datasources to null, simplify dashboard tags
When exporting dashboards, all datasources are set to a dynamic
datasource, otherwise use the default local one (prometheus).
2019-10-14 09:18:43 +01:00
Gerhard Lazu 82c47bf352 Fix a couple more inconsistencies in Erlang-Distributions-Compare
[#168734621]
2019-10-04 22:23:43 +01:00
Gerhard Lazu 64d87d06fc Fix Erlang-Distributions-Compare network panels title 2019-10-04 22:11:12 +01:00
Gerhard Lazu e5fc8b18c8 Fix Erlang-Distributions-Compare title, reset time_options 2019-10-04 22:08:27 +01:00
Gerhard Lazu 13a2d411e2 Compare different Erlang Distributions
* tls, deflate, lz4, zstd, etc.

[#168734621]
2019-10-04 22:00:17 +01:00
Gerhard Lazu 59f7663486 Add screenshots that have been used for RabbitMQ-PerfTest
Available here: https://grafana.com/grafana/dashboards/6566
2019-10-04 21:57:02 +01:00
Gerhard Lazu 71692f2dbf Remove shared __requires & update-dashboards make target
__requires differs across dashboards

update-dashboards is not as useful anymore, vimdiffing most of the time.
2019-10-04 21:54:14 +01:00
Gerhard Lazu 19cbbbf755 Update tags for all Grafana dashboards 2019-10-03 17:39:19 +01:00
Gerhard Lazu 722ce8bf86 Add RabbitMQ-Perftest Grafana dashboard & wire all PerfTest instances
[#168734745]
2019-10-03 17:39:09 +01:00
Gerhard Lazu 402aa4722f Extract __requires from Grafana dashboards, template all datasources 2019-10-03 17:32:40 +01:00
Gerhard Lazu 59ef1f1fa2 Update OTP, RabbitMQ & PerfTest images to latest versions
Once 3.8.0 got released, master became 3.9.x, so we are now tracking
RabbitMQ 3.9 dev builds.
2019-10-02 13:11:21 +01:00
Gerhard Lazu dae49b5c08 Extract __inputs from Grafana dashboards
While __inputs are required for the dashboards to work in environments
where Prometheus is not the default datasource, it breaks the local
development flow. In other words,
9aa22e1895
prevents `make metrics overview` from working as designed.

We are going to add shortly a simple way of converting the local
dashboards into a format that can be imported in Grafana and will work
when Prometheus is not the default datasource (e.g. when using
https://github.com/coreos/kube-prometheus)

Long-term, these dashboards will be available via grafana.com, which is
the preferred way of consuming them.

cc @mkuratczyk
2019-10-02 12:51:33 +01:00
Gerhard Lazu f7866a1908 Add option to disable stats in management to overview example
Commented by default - we don't want to shock people just yet.
2019-09-26 17:35:06 +01:00
Gerhard Lazu c2aef07678 Bump OTP to latest stable & RabbitMQ to latest 3.8 dev 2019-09-26 17:33:39 +01:00
Michal Kuratczyk 9aa22e1895 Make the datasource configurable for all dashboards 2019-09-24 15:40:18 +02:00
Gerhard Lazu 7f049b9e05 Finish adding descriptions to all RabbitMQ-Overview panels
Thanks @michaelklishin!

[#167542609]
2019-09-06 18:52:45 +01:00
Gerhard Lazu 477d32e3cb Review incoming messages nodes panels with @michaelklishin
[#167542609]
2019-09-05 17:53:38 +01:00
Gerhard Lazu 235297eb6b Review nodes panels description with @michaelklishin
[#167542609]
2019-09-05 17:09:44 +01:00
Gerhard Lazu c66959ec4a Start adding missing descriptions to RabbitMQ-Overview panels
[#167542609]
2019-09-04 18:31:57 +01:00
Gerhard Lazu 7e9013e0ad Update Grafana dashboards to account for metric name changes
[#167846096]
2019-09-04 15:32:52 +01:00
Gerhard Lazu aaedcef4a5 Bump RabbitMQ & PerfTest Docker image versions
[#167846096]
2019-09-04 12:52:46 +01:00
Gerhard Lazu 98e488f1c4 Use standard naming for metrics expected from the client library
As described in
https://prometheus.io/docs/instrumenting/writing_clientlibs/#process-metrics.

Until prometheus.erl has the prometheus_process_collector functionality
built-in - this may not happen -, we are exposing a subset of those
metrics via rabbitmq_core_metrics_collector, so we are going to stick to
the expected naming conventions.

This commit supercedes the thought process captured in
1e5f4de4cb

[#167846096]
2019-09-03 15:31:55 +01:00
Gerhard Lazu 1e5f4de4cb Rename process-related metrics to stay closer to conventions
While `process_open_fds` would have been ideal, because the value is
cached within RabbitMQ, and computed differently across platforms, it is
important to keep the distinction from, say, what the kernel reports
just-in-time.

I am also capturing the Erlang context by adding `erlang_` to the
relevant metrics. The full context is: RabbitMQ observed this Erlang VM
process metric to be X, so this is why some metrics are prefixed with
`rabbitmq_erlang_process_`

Because there is a difference betwen what RabbitMQ limits are set to,
e.g. `rabbitmq_memory_used_limit_bytes`, vs. what RabbitMQ reports about
the Erlang process, e.g. `rabbitmq_erlang_process_memory_used_bytes`.

This is the best that we can do while staying honest about what is being
reported. cc @brian-brazil

[#167846096]
2019-09-03 12:30:48 +01:00
Gerhard Lazu b3336da844 Finish updating Erlang-Distribution dashboard to use new info metric
[#167846096]
2019-09-03 10:48:17 +01:00
Gerhard Lazu dbe8f331bc Bump Grafana Docker image to latest stable 2019-09-02 22:41:11 +01:00
Gerhard Lazu 6639f5f68f Start updating Erlang-Distribution dashboard to use new info metric
[#167846096]
2019-09-02 22:40:24 +01:00
Gerhard Lazu 48ee9875e7 Finish updating RabbitMQ-Raft dashboard to use the new info metric
[#167846096]
2019-09-02 21:37:26 +01:00
Gerhard Lazu eeaf49b2d5 Start updating RabbitMQ-Raft dashboard to use new info metric
[#167846096]
2019-09-02 18:15:21 +01:00
Gerhard Lazu 4df949fe32 Finish updating RabbitMQ-Overview dashboard to use the new info metric
[#167846096]
2019-08-27 18:43:52 +01:00
Gerhard Lazu ecd0581514 Bump Prometheus & Grafana Docker images to latest stable 2019-08-27 18:43:34 +01:00
Gerhard Lazu 2e686f1131 Continue updating RabbitMQ-Overview dashboard to use the new info metric
[#167846096]
2019-08-27 17:11:41 +01:00
Gerhard Lazu 9a4cba7c50 Start using the new rabbitmq_identity_info metric to filter by cluster
[#167846096]
2019-08-15 21:24:01 +01:00
Gerhard Lazu 052d92c74b Replace global labels with build_info & identity_info metrics
This started in the context of prometheus/docs#1414, specifically
https://github.com/prometheus/docs/pull/1414#issuecomment-520505757

Rather than labelling all metrics with the same label, we are
introducing 2 new metrics: rabbitmq_build_info & rabbitmq_identity_info.

I suspect that we may want to revert deadtrickster/prometheus.erl#91
when we agree that the proposed alternative is better.

We are yet to see through changes in Grafana dashboards. I am most
interested in how the updated queries will look like and, more
importantly, if we will have the same panels as we do now. More commits
to follow shortly, wanted to get this out the door first.

In summary, this commit changes:

    # TYPE erlang_mnesia_held_locks gauge
    # HELP erlang_mnesia_held_locks Number of held locks.
    erlang_mnesia_held_locks{node="rabbit@920f1e3272af",cluster="rabbit@920f1e3272af",rabbitmq_version="3.8.0-alpha.806",erlang_version="22.0.7"} 0
    # TYPE erlang_mnesia_lock_queue gauge
    # HELP erlang_mnesia_lock_queue Number of transactions waiting for a lock.
    erlang_mnesia_lock_queue{node="rabbit@920f1e3272af",cluster="rabbit@920f1e3272af",rabbitmq_version="3.8.0-alpha.806",erlang_version="22.0.7"} 0
    ...

To this:

    # TYPE erlang_mnesia_held_locks gauge
    # HELP erlang_mnesia_held_locks Number of held locks.
    erlang_mnesia_held_locks 0
    # TYPE erlang_mnesia_lock_queue gauge
    # HELP erlang_mnesia_lock_queue Number of transactions waiting for a lock.
    erlang_mnesia_lock_queue 0
    ...
    # TYPE rabbitmq_build_info untyped
    # HELP rabbitmq_build_info RabbitMQ & Erlang/OTP version info
    rabbitmq_build_info{rabbitmq_version="3.8.0-alpha.809",prometheus_plugin_version="3.8.0-alpha.809-2019.08.15",prometheus_client_version="4.4.0",erlang_version="22.0.7"} 1
    # TYPE rabbitmq_identity_info untyped
    # HELP rabbitmq_identity_info Node & cluster identity info
    rabbitmq_identity_info{node="rabbit@bc7aeb0c2564",cluster="rabbit@bc7aeb0c2564"} 1
    ...

[#167846096]
2019-08-15 16:00:29 +01:00
Gerhard Lazu 4aa3871194 Use different names for *_process_reductions_total metrics
It is invalid to have multiple metrics with the same name, TYPE & HELP,
but differing labels.

[#167846096]
2019-08-14 16:17:48 +01:00
Gerhard Lazu 3d741bb1cd Use past-tense in metrics that capture what happened 2019-08-13 17:44:47 +01:00
Gerhard Lazu d5c83792bc Increase Prometheus scrape to 15s & match across all metrics
We want to use a consistent range for all metrics that use rate() and a
safe value (4x the Prometheus scrape interval):
https://www.robustperception.io/what-range-should-i-use-with-rate

This also prompted a change in RabbitMQ's default
collect_statistics_interval, so that we don't update metrics
unnecessarily. We are OK if the Management UI doesn't update on every 5s
auto-refresh.

Related a929f22233

[#167846096]
2019-08-13 17:20:49 +01:00
Gerhard Lazu 3da98e74dc Review metric naming with @brian-brazil
Started as a Prometheus docs discussion in prometheus/docs#1414, mostly
based on https://prometheus.io/docs/instrumenting/writing_exporters/

Raft metrics are of type gauge, not counter. _If you care about the
absolute value rather than only how fast it's increasing, that's a
gauge_

All node_persister_metrics are now counters - some were gauges before.
They are now named using metric naming best practices:
https://prometheus.io/docs/practices/naming/

All metrics names that should have units, do. Some use microseconds,
others milliseconds and others bytes or ops (operations). We don't do
any unit conversion in the collector but simply expose the units that
are used when the metric value is written to ETS.

While some metrics such as io_sync_time_microseconds_total would be
better expressed as Sumarries, the refactoring required to achieve that
is not worth the effort. Will keep things simple & imperfect for now,
especially since we don't have a dashboard that helps visualise these
metrics.

The next step is to address global labels - will submit as a separate
PR.

[#167846096]
2019-08-13 15:57:24 +01:00
Gerhard Lazu b297d7b9eb Set cluster name via config
Now that there is a 3.8 alpha build that includes
rabbitmq/rabbitmq-server#2075, let's make use of it!

Without this, when a new cluster was started, some nodes ended up wtih
`rabbit@localhost` for the cluster label, instead of e.g. `rmq-gcp-38`.
The main suspect was a race condition, where the rabbitmq_prometheus app
starts before the cluster name is set via `rabbitmqctl
set_cluster_name`.

[finishes #167835770]
2019-08-13 15:50:01 +01:00
Gerhard Lazu 62dc0543cc Update RabbitMQ version to latest alpha 2019-08-08 18:38:56 +01:00
Gerhard Lazu 57b6092348 Remove duplicate filter
Thanks @mkuratczyk for spotting it!
2019-08-05 18:29:16 +01:00
Gerhard Lazu 4f54a512d3 Link to Monitoring with Prometheus & Grafana guide to RabbitMQ-Overview 2019-07-26 18:25:17 +01:00
Gerhard Lazu f08914dfc8 Update Nodes panels descriptions on RabbitMQ-Overview
Thanks @acoguelegnes for pointing out the broken link!

[#165818779]
2019-07-26 18:21:12 +01:00
Gerhard Lazu ab88238c09 Review RabbitMQ-Raft Dashboard with @michaelklishin 2019-07-26 17:02:01 +01:00
Gerhard Lazu f252a57bc4 Update metrics images to latest 2019-07-23 09:57:59 +01:00
Gerhard Lazu 9970059bc2 Bump RabbitMQ & PerfTest images to latest
PerfTest image includes rabbitmq/rabbitmq-perf-test#212
2019-07-22 13:59:49 +01:00
Gerhard Lazu c2a49d1513 Build on top of latest published RabbitMQ alpha version 2019-07-17 14:51:16 +01:00
Gerhard Lazu 4dec886e52 Add greedy-consumer to simulate unhealthy unack'ed messages
Re-order apps to nodes so that graphs show in a certain colour that
matches everything else. Little details like these matter.

[#165818779]
2019-06-27 19:26:05 +01:00
Gerhard Lazu cd99957d2d Do not abbreviate values for fds & sockets
Draw nulls as zeroes, otherwise the msgs unack'ed graph will look choppy
2019-06-27 19:24:50 +01:00
Gerhard Lazu c4dfdc6471 Do not allow entry commit latency to display below 0 on RabbitMQ Raft 2019-06-26 15:21:16 +01:00
Gerhard Lazu 825a5ca49b Fix messages redelivered critical threshold on RabbitMQ Overview 2019-06-26 15:16:54 +01:00
Gerhard Lazu 30eab6b0ac Build Docker image with latest alpha & make it easier next time 2019-06-26 11:35:17 +01:00
Gerhard Lazu 201365278d Update Docker image to latest 2019-06-25 11:47:57 +01:00
Gerhard Lazu d56ab528a7 Explain Raft panels & add link to QQ docs
[finishes #166819045]
2019-06-25 11:07:58 +01:00
Gerhard Lazu c92e551007 Improve Erlang Dist & Overview dashboards based on recent learnings
Learned a couple of new things while building RabbitMQ-Raft, applied
them here.
2019-06-24 18:29:20 +01:00
Gerhard Lazu 3cdf507c63 Visualise Ra entry commit latency & members with many log entries
Did a couple other improvements to all other panels, feels almost MVP.

[#166819045]
2019-06-24 18:28:01 +01:00
Gerhard Lazu 0a5b355ee3 Build Docker image with latest ra metrics tweaks
rabbitmq/ra#98

[#166819045]
2019-06-24 18:26:48 +01:00
Gerhard Lazu 5e280c0281 Add first version of RabbitMQ Raft metrics
Depends on https://github.com/rabbitmq/ra/tree/metrics_tweaks &
https://github.com/rabbitmq/rabbitmq-server/tree/qq_metrics_tweak

[#166819045]
2019-06-20 20:11:31 +01:00
Gerhard Lazu 31aa440bc4 Bump Erlang to latest stable & RabbitMQ to latest alpha 2019-06-20 20:10:02 +01:00