Commit Graph

205 Commits

Author SHA1 Message Date
Gerhard Lazu 4df949fe32 Finish updating RabbitMQ-Overview dashboard to use the new info metric
[#167846096]
2019-08-27 18:43:52 +01:00
Gerhard Lazu ecd0581514 Bump Prometheus & Grafana Docker images to latest stable 2019-08-27 18:43:34 +01:00
Gerhard Lazu 2e686f1131 Continue updating RabbitMQ-Overview dashboard to use the new info metric
[#167846096]
2019-08-27 17:11:41 +01:00
Gerhard Lazu 9a4cba7c50 Start using the new rabbitmq_identity_info metric to filter by cluster
[#167846096]
2019-08-15 21:24:01 +01:00
Gerhard Lazu 052d92c74b Replace global labels with build_info & identity_info metrics
This started in the context of prometheus/docs#1414, specifically
https://github.com/prometheus/docs/pull/1414#issuecomment-520505757

Rather than labelling all metrics with the same label, we are
introducing 2 new metrics: rabbitmq_build_info & rabbitmq_identity_info.

I suspect that we may want to revert deadtrickster/prometheus.erl#91
when we agree that the proposed alternative is better.

We are yet to see through changes in Grafana dashboards. I am most
interested in how the updated queries will look like and, more
importantly, if we will have the same panels as we do now. More commits
to follow shortly, wanted to get this out the door first.

In summary, this commit changes:

    # TYPE erlang_mnesia_held_locks gauge
    # HELP erlang_mnesia_held_locks Number of held locks.
    erlang_mnesia_held_locks{node="rabbit@920f1e3272af",cluster="rabbit@920f1e3272af",rabbitmq_version="3.8.0-alpha.806",erlang_version="22.0.7"} 0
    # TYPE erlang_mnesia_lock_queue gauge
    # HELP erlang_mnesia_lock_queue Number of transactions waiting for a lock.
    erlang_mnesia_lock_queue{node="rabbit@920f1e3272af",cluster="rabbit@920f1e3272af",rabbitmq_version="3.8.0-alpha.806",erlang_version="22.0.7"} 0
    ...

To this:

    # TYPE erlang_mnesia_held_locks gauge
    # HELP erlang_mnesia_held_locks Number of held locks.
    erlang_mnesia_held_locks 0
    # TYPE erlang_mnesia_lock_queue gauge
    # HELP erlang_mnesia_lock_queue Number of transactions waiting for a lock.
    erlang_mnesia_lock_queue 0
    ...
    # TYPE rabbitmq_build_info untyped
    # HELP rabbitmq_build_info RabbitMQ & Erlang/OTP version info
    rabbitmq_build_info{rabbitmq_version="3.8.0-alpha.809",prometheus_plugin_version="3.8.0-alpha.809-2019.08.15",prometheus_client_version="4.4.0",erlang_version="22.0.7"} 1
    # TYPE rabbitmq_identity_info untyped
    # HELP rabbitmq_identity_info Node & cluster identity info
    rabbitmq_identity_info{node="rabbit@bc7aeb0c2564",cluster="rabbit@bc7aeb0c2564"} 1
    ...

[#167846096]
2019-08-15 16:00:29 +01:00
Gerhard Lazu 4aa3871194 Use different names for *_process_reductions_total metrics
It is invalid to have multiple metrics with the same name, TYPE & HELP,
but differing labels.

[#167846096]
2019-08-14 16:17:48 +01:00
Gerhard Lazu 3d741bb1cd Use past-tense in metrics that capture what happened 2019-08-13 17:44:47 +01:00
Gerhard Lazu d5c83792bc Increase Prometheus scrape to 15s & match across all metrics
We want to use a consistent range for all metrics that use rate() and a
safe value (4x the Prometheus scrape interval):
https://www.robustperception.io/what-range-should-i-use-with-rate

This also prompted a change in RabbitMQ's default
collect_statistics_interval, so that we don't update metrics
unnecessarily. We are OK if the Management UI doesn't update on every 5s
auto-refresh.

Related a929f22233

[#167846096]
2019-08-13 17:20:49 +01:00
Gerhard Lazu 3da98e74dc Review metric naming with @brian-brazil
Started as a Prometheus docs discussion in prometheus/docs#1414, mostly
based on https://prometheus.io/docs/instrumenting/writing_exporters/

Raft metrics are of type gauge, not counter. _If you care about the
absolute value rather than only how fast it's increasing, that's a
gauge_

All node_persister_metrics are now counters - some were gauges before.
They are now named using metric naming best practices:
https://prometheus.io/docs/practices/naming/

All metrics names that should have units, do. Some use microseconds,
others milliseconds and others bytes or ops (operations). We don't do
any unit conversion in the collector but simply expose the units that
are used when the metric value is written to ETS.

While some metrics such as io_sync_time_microseconds_total would be
better expressed as Sumarries, the refactoring required to achieve that
is not worth the effort. Will keep things simple & imperfect for now,
especially since we don't have a dashboard that helps visualise these
metrics.

The next step is to address global labels - will submit as a separate
PR.

[#167846096]
2019-08-13 15:57:24 +01:00
Gerhard Lazu b297d7b9eb Set cluster name via config
Now that there is a 3.8 alpha build that includes
rabbitmq/rabbitmq-server#2075, let's make use of it!

Without this, when a new cluster was started, some nodes ended up wtih
`rabbit@localhost` for the cluster label, instead of e.g. `rmq-gcp-38`.
The main suspect was a race condition, where the rabbitmq_prometheus app
starts before the cluster name is set via `rabbitmqctl
set_cluster_name`.

[finishes #167835770]
2019-08-13 15:50:01 +01:00
Gerhard Lazu 62dc0543cc Update RabbitMQ version to latest alpha 2019-08-08 18:38:56 +01:00
Gerhard Lazu 57b6092348 Remove duplicate filter
Thanks @mkuratczyk for spotting it!
2019-08-05 18:29:16 +01:00
Gerhard Lazu 4f54a512d3 Link to Monitoring with Prometheus & Grafana guide to RabbitMQ-Overview 2019-07-26 18:25:17 +01:00
Gerhard Lazu f08914dfc8 Update Nodes panels descriptions on RabbitMQ-Overview
Thanks @acoguelegnes for pointing out the broken link!

[#165818779]
2019-07-26 18:21:12 +01:00
Gerhard Lazu ab88238c09 Review RabbitMQ-Raft Dashboard with @michaelklishin 2019-07-26 17:02:01 +01:00
Gerhard Lazu f252a57bc4 Update metrics images to latest 2019-07-23 09:57:59 +01:00
Gerhard Lazu 9970059bc2 Bump RabbitMQ & PerfTest images to latest
PerfTest image includes rabbitmq/rabbitmq-perf-test#212
2019-07-22 13:59:49 +01:00
Gerhard Lazu c2a49d1513 Build on top of latest published RabbitMQ alpha version 2019-07-17 14:51:16 +01:00
Gerhard Lazu 4dec886e52 Add greedy-consumer to simulate unhealthy unack'ed messages
Re-order apps to nodes so that graphs show in a certain colour that
matches everything else. Little details like these matter.

[#165818779]
2019-06-27 19:26:05 +01:00
Gerhard Lazu cd99957d2d Do not abbreviate values for fds & sockets
Draw nulls as zeroes, otherwise the msgs unack'ed graph will look choppy
2019-06-27 19:24:50 +01:00
Gerhard Lazu c4dfdc6471 Do not allow entry commit latency to display below 0 on RabbitMQ Raft 2019-06-26 15:21:16 +01:00
Gerhard Lazu 825a5ca49b Fix messages redelivered critical threshold on RabbitMQ Overview 2019-06-26 15:16:54 +01:00
Gerhard Lazu 30eab6b0ac Build Docker image with latest alpha & make it easier next time 2019-06-26 11:35:17 +01:00
Gerhard Lazu 201365278d Update Docker image to latest 2019-06-25 11:47:57 +01:00
Gerhard Lazu d56ab528a7 Explain Raft panels & add link to QQ docs
[finishes #166819045]
2019-06-25 11:07:58 +01:00
Gerhard Lazu c92e551007 Improve Erlang Dist & Overview dashboards based on recent learnings
Learned a couple of new things while building RabbitMQ-Raft, applied
them here.
2019-06-24 18:29:20 +01:00
Gerhard Lazu 3cdf507c63 Visualise Ra entry commit latency & members with many log entries
Did a couple other improvements to all other panels, feels almost MVP.

[#166819045]
2019-06-24 18:28:01 +01:00
Gerhard Lazu 0a5b355ee3 Build Docker image with latest ra metrics tweaks
rabbitmq/ra#98

[#166819045]
2019-06-24 18:26:48 +01:00
Gerhard Lazu 5e280c0281 Add first version of RabbitMQ Raft metrics
Depends on https://github.com/rabbitmq/ra/tree/metrics_tweaks &
https://github.com/rabbitmq/rabbitmq-server/tree/qq_metrics_tweak

[#166819045]
2019-06-20 20:11:31 +01:00
Gerhard Lazu 31aa440bc4 Bump Erlang to latest stable & RabbitMQ to latest alpha 2019-06-20 20:10:02 +01:00
Gerhard Lazu c4f28ed7e1 Clarify for which alarms publishers get blocked 2019-06-19 19:15:16 +01:00
Gerhard Lazu 4b78d41055 Improve node naming, standardise the colour pinning regex 2019-06-17 22:20:54 +01:00
Gerhard Lazu 6daccf9b88 Improve node colour pinning
* start from 0, not 1
* fix colour pinning for nodes with numbers - e.q. rmq-gcp-38
2019-06-17 19:04:28 +01:00
Gerhard Lazu d5b1a03648 Increase erlang_vm_dist_node_queue_size threshold to 64MB & expand info
[#166037004]
2019-06-17 17:20:04 +01:00
Gerhard Lazu 8a60eef9a3 Fix erlang_vm_dist_node_queue_size graph
It's not a rate, it's the actual buffered data

[#166037004]
2019-06-17 17:01:27 +01:00
Gerhard Lazu 22e59f3cf4 Use latest RabbitMQ 3.8.0 alpha release 2019-06-13 09:54:41 +01:00
Gerhard Lazu cf339a49e8 Respond to learnings from a LRE PromStack & Erlang Distribution metrics
re deadtrickster/prometheus.erl#94
re erlang/otp#2270

[#166574772]
2019-06-11 19:03:50 +01:00
Gerhard Lazu b795bd44a4 Add link to Publishers guide from Messages published / s panel 2019-06-10 16:19:08 +01:00
Gerhard Lazu 7339c2efe3 Update to latest RabbitMQ 3.8 alpha 2019-06-10 10:33:17 +01:00
Gerhard Lazu 4ea1e4b29d Update to latest perf-test
Fixes rabbitmq/rabbitmq-perf-test#207
2019-06-10 10:08:16 +01:00
Gerhard Lazu 8f061f31e6 Bump to latest RabbitMQ & PerfTest images
This will be helpful to continue rabbitmq/rabbitmq-perf-test#207
2019-06-06 17:31:44 +01:00
Gerhard Lazu 773b8f8670 Show legends on process states
It's hard to understand what the different colours mean otherwise. Also,
yellow is preferable to purple when it comes to displaying runnable
processes - those stuck in the run queue.

cc @michaelklishin
2019-06-03 22:00:43 +01:00
Gerhard Lazu 0945511e7f Capture learnings from ERL-959 into Erlang Distribution Grafana dashboard
It explains the correlation between inet packets & TCP packets, and why
the inet packet size varies when TLS is used for inter-node
communication.

[finishes 166419953]
2019-06-03 18:04:07 +01:00
Gerhard Lazu 90b5653dc1 Improve Memory available panel title & description
Thanks @essen!
2019-06-03 18:02:45 +01:00
Gerhard Lazu ebd4ffc67f Modify the default distribution link buffer
It makes a big difference for stable throughput. See screenshots from
https://bugs.erlang.org/browse/ERL-959

We need to test this in a real network - I'm thinking GCP -, outside of
Docker. The results will inform whether we should change the default -
which is 1436 bytes.

[#166419953]
2019-06-03 18:00:26 +01:00
Gerhard Lazu e7fa4a2753 Bump Erlang/OTP to 22.0.2 2019-06-03 17:59:57 +01:00
Gerhard Lazu cd0e804180 Use consistent naming for the Erlang Distribution cluster
[#165818813]
2019-05-30 15:31:52 +01:00
Gerhard Lazu 112254ed96 Enable filtering Erlang Distribution metrics in Grafana by cluster
[finishes #165818813]
2019-05-30 13:58:56 +01:00
Gerhard Lazu 9ddb7e6cd6 Enable filtering RabbitMQ Overview metrics in Grafana by cluster
[#165818813]
2019-05-30 13:57:12 +01:00
Gerhard Lazu 6eeeb6b0b4 Set up separate clusters for Overview & Distribution metrics
This will allow to simulate multiple clusters feeding metrics into
Prometheus & Grafana.

[#165818813]
2019-05-30 13:56:17 +01:00
Gerhard Lazu 931e07455e Extract metrics services into a separate docker-compose file
Add cadvisor & node-exporter & Docker metrics.
Inspired by https://github.com/stefanprodan/dockprom
There are no Grafana dashboards for these metrics yet. The dockprom ones
don't show any panels in Grafana 6.

[#165818813]
2019-05-30 13:53:02 +01:00
Gerhard Lazu 7b632674c9 Update Distribution & Overview dashboard tags 2019-05-30 09:37:24 +01:00
Gerhard Lazu 6e614301d0 Bump Grafana & Prometheus images to latest stable 2019-05-30 09:36:52 +01:00
Gerhard Lazu 21aa9e9e68 Awlays install Grafana plugins
Even though this slows down Grafana container startup, we need to ensure
that this plugin is present, otherwise the panels that track process
state won't work. This will be slow the first time the plugin is
downloaded, and slightly faster on subsequent runs.

[#166004512]
2019-05-30 09:35:12 +01:00
Gerhard Lazu 06bae443f6 Put more pressure on the distribution
This forces the tls_connection and tls_sender processes to be in
runnable & running states more often. waiting is the ideal state.

[#166004512]
2019-05-30 09:30:21 +01:00
Gerhard Lazu f06502ff5a Improve RabbitMQ Overview Grafana dashboard
* pin nodes to specific colours
* add message-related single-stats
* reshuffle rows
  * node metrics are most useful
  * queue, channel & connection churn are least useful
2019-05-29 18:21:25 +01:00
Gerhard Lazu 2645082738 Finish Erlang Distribution Grafana dashboard
Includes Erlang node to colour pinning

Adds a few make targets to help with docker-compose repetitive commands
& Grafana dashboard updates.

Split Overview & Distribution Docker deployments

re deadtrickster/prometheus.erl#92

[finishes #166004512]
2019-05-29 18:19:09 +01:00
Gerhard Lazu c4f0105bad Fix invalid docker-compose
An empty environment is not allowed
2019-05-20 22:21:06 +01:00
Gerhard Lazu 4e81af4cfc Pin RabbitMQ nodes to colours in all Grafana panesl
Regex is greedy, need to look into non-greedy matching, especially for
Erlang Distribution metrics.

[#166004512]
2019-05-20 22:19:43 +01:00
Gerhard Lazu 6a6877cc8b Do not install Grafana plugins all the time
Even if the plugins are stored on a volume, updating them can be slow.
2019-05-20 22:19:02 +01:00
Gerhard Lazu 204f170e7e Bump Grafana, Prometheus & PerfTest versions to latest 2019-05-20 22:18:32 +01:00
Gerhard Lazu d01f96a9af Bump Erlang/OTP to 22.0.1 in Docker image
Picking up the latest 3.8.0 alpha while at it.

It's time to start stress-testing OTP 22 re docker-library/rabbitmq#336

[#166037004]
2019-05-20 21:43:01 +01:00
Gerhard Lazu d1460d5b44 Stress Erlang Distribution metrics on OTP 21
We (+@essen) have answered a bunch of questions (see the story) and
improved the metrics + dashboard in the process. Added some improvements
to the RabbitMQ Overview metrics as well.

[#166004104]
2019-05-20 21:41:27 +01:00
Gerhard Lazu 1f333ebed6 Display the number of Erlang Distribution links
[#166004104]
2019-05-20 10:12:22 +01:00
Gerhard Lazu 691af35dfb Mirror slow-consumer-persistent queue to all 3 nodes
This puts load on the distribution and makes the Erlang-Distribution
dashboard show an interesting behaviour in TCP sockets. @dcorbacho
thinks so too.

re deadtrickster/prometheus.erl#92

[#166004512]
2019-05-15 18:49:58 +01:00
Gerhard Lazu f9ce43677b Review metrics with @dcorbacho
[accepts #165831668]
2019-05-15 17:06:42 +01:00
Gerhard Lazu ebde2ff663 Default Erlang Distribution Grafana dashboard to 10 minutes
It's the same as RabbitMQ Overview
2019-05-15 17:05:07 +01:00
Gerhard Lazu 7652799e05 Add Grafana dashboard for Erlang Distribution
Just the first version, imperfect in many ways, but better than nothing.

[#166004512]
2019-05-14 16:17:04 +01:00
Gerhard Lazu 6272f30724 Missed get empty Grafana panel
[#165831668]
2019-05-09 17:38:34 +01:00
Gerhard Lazu c596efb58e Review all metrics to ETS mappings
Clarify descriptions, improve metric names, fix typos etc. Follow-up to
deadtrickster/prometheus_rabbitmq_exporter#75.

Helpful metric descriptions

* https://www.rabbitmq.com/monitoring.html
* https://docs.signalfx.com/en/latest/integrations/integrations-reference/integrations.rabbitmq.html
* https://github.com/rabbitmq/rabbitmq-common/blob/master/include/rabbit_core_metrics.hrl
* https://github.com/rabbitmq/rabbitmq-common/blob/master/src/rabbit_core_metrics.erl

Thanks for the pair-up @michaelklishin!

[finishes #165831668]
2019-05-09 17:25:17 +01:00
Gerhard Lazu 8186847754 Increase msg redelivered / s thresholds to 20 & 100 2019-05-07 16:29:57 +01:00
Gerhard Lazu 982b1c798d Force connection, channel & queue churn 2019-05-07 16:25:49 +01:00
Gerhard Lazu 5b7a2edc62 Add redelivered thresholds & make unroutable panels red
Warn at 2 redelivered msg/s & critical at 10 redelivered msg/s
2019-05-07 15:40:37 +01:00
Gerhard Lazu ceffc587a1 Add consumer that nacks messages every 0.5s
re rabbitmq/rabbitmq-perf-test#204
2019-05-07 15:40:31 +01:00
Gerhard Lazu d78cb66435 Fix basic.get metrics on Grafana dashboard
Use 1m instead of $__interval for rates that track metrics with slow
rate of change. Using $__interval will miss changes.

Stop rounding, it skews values.

All `basic.get` metrics are bad. The 0 threshold and the red colour for
all lines is hopefully enought to convey this.

re rabbitmq/rabbitmq-perf-test#203

[finishes #165852775]
2019-05-07 13:03:41 +01:00
Gerhard Lazu ce89fdbcd1 Use latest PerfTest, with consumer rate fix
rabbitmq/rabbitmq-perf-test#202
2019-05-02 17:58:39 +01:00
Gerhard Lazu 9d6708ebab Use full descriptions in titles
Otherwise it's really hard to know what we are looking at when expanding
panels.

Also, pin to colours. Otherwise, rabbit@rabbitmq1 metrics in one panel
will appear yellow, and green in another panel. This is a one-off
which doesn't scale, should be automated in some way. Grafana doesn't
support pinning colors to labels 🤔
2019-05-02 17:33:47 +01:00
Gerhard Lazu d44c0edf35 Fix rate interval for / s metrics
This explains why we want rate() instead of irate() and a 1m interval
for smaller changes & $__interval for higher ones:
https://utcc.utoronto.ca/~cks/space/blog/sysadmin/PrometheusRateVsIrate

[finishes #164374397]
2019-05-02 14:43:09 +01:00
Gerhard Lazu 7206cd31f1 Expose Erlang distribution port 2019-05-01 18:18:16 +01:00
Gerhard Lazu 3ec84b3bbd Add object churn panels
queue_created is not currently exported via rabbitmq_prometheus.

[#164374397]
2019-05-01 18:16:49 +01:00
Gerhard Lazu a2e6687162 Bump prometheus.erl to v4.3.0
This includes the global_labels feature introduced in deadtrickster/prometheus.erl#91

To test, run `docker-compose up` in docker dir, then navigate to
localhost:15692/metrics & localhost:3000/dashboards (admin:admin) to see
the Grafana RabbitMQ Overview dashboard.
2019-05-01 12:58:35 +01:00
Gerhard Lazu c664fdc5e1 Make use of variable message size
re rabbitmq/rabbitmq-perf-test#200
2019-04-29 17:51:42 +01:00
Gerhard Lazu 58a8e5a011 Add vhosts instead of exchanges in global counts
Add nodes, alarms & partitions to global counts. These are too important
to not show them. Need to discuss how to expose these via metrics.

[#164374397]
2019-04-29 09:49:01 +01:00
Gerhard Lazu a2500b4784 Add global counts to RabbitMQ Overview dashboard
[#164374397]
2019-04-25 19:12:14 +01:00
Gerhard Lazu 2a645a4d8c Make all rates vary, force messages to be read from disk
Set memory high watermark to 256MiB to force trigger the memory alarm,
as well as ensure messages get paged to disk (forces disk reads).

Make all legends display as table so that values are easier to see when
toggling them.
2019-04-25 19:11:45 +01:00
Gerhard Lazu ba302951b9 Make use of PerfTest variable rate
Thanks @acogoluegnes! rabbitmq/rabbitmq-perf-test#195
2019-04-25 16:10:14 +01:00
Gerhard Lazu eb25d3ed87 Explain better why some messages were returned vs dropped
rabbitmq/rabbitmq-server#1988
2019-04-25 14:49:37 +01:00
Gerhard Lazu ca995223d9 Add panel for dropped messages / s
Acceptance for rabbitmq/rabbitmq-server#1988
2019-04-25 14:18:32 +01:00
Gerhard Lazu fcf278a7a2 Wording improvements
Thanks @MarcialRosales!

[#164374751]
2019-04-25 11:48:03 +01:00
Gerhard Lazu 4e25ce69c9 Build Docker image based on latest alpha release
Bump Erlang to v21.3.6
2019-04-25 11:47:30 +01:00
Gerhard Lazu cded1500fb Add persistent messages panels, improve panel grouping
[finishes #164374751]
2019-04-24 21:38:13 +01:00
Gerhard Lazu 9c3bbe8c76 Display more message metrics, simulate different types of publishers/consumers
[#164374751]
2019-04-24 18:38:13 +01:00
Gerhard Lazu efe803617b Set demo-friendly memory & disk-space limits
Capture limits in thresholds. Even if they are static and somewhat
specific to this RabbitMQ deployment, it's better to have them when
demo-ing the end-to-end Prometheus/Grafana experience.

[#164374751]
2019-04-23 18:56:30 +01:00
Gerhard Lazu ced5ee74ed Bump to latest rabbitmq-prometheus Docker image 2019-04-23 18:56:04 +01:00
Gerhard Lazu 9f8852b0c1 Make RabbitMQ container fd limit better-suited for demos
1k feels right for a demo environment, where we want to simulate hitting
threshold limits.
2019-04-23 14:06:57 +01:00
Gerhard Lazu ec03866172 Publish messages as mandatory, with publisher confirms
This lights up `Published confirmed / s` Grafana panel.

To light up `Published unroutable / s`, unbind all queues from the
direct exchange.

[#164374751]
2019-04-23 13:48:40 +01:00
Gerhard Lazu f3cae13394 Bump Docker image version used in docker-compose 2019-04-23 13:44:14 +01:00
Gerhard Lazu 0441fae092 Lower docker-compose file version
Not using anything that requires Docker 18.06, dropping to 18.02.
Thanks @MarcialRosales!
2019-04-23 13:42:05 +01:00
Gerhard Lazu 8ec633926b Use nc to check that all nodes are listening on AMQP
Related to rabbitmq/rabbitmq-perf-test#191
2019-04-23 13:41:15 +01:00
Gerhard Lazu dbb1bee383 Build Docker image with the latest management & management_agent
This has support for disabling metrics_collector, as captured in
rabbitmq/rabbitmq-management-agent#78 & rabbitmq/rabbitmq-management#691

Since we want management to be enabled, this doesn't help our use-case,
but this option is perfect for users that want metrics, but don't want
to pay the overhead of Management - especially metric aggregations.

[#164376052]
2019-04-22 17:47:22 +01:00
Gerhard Lazu 789f6c041e Preconfigure Grafana on boot
After running `docker-compose up`, open Grafana via
http://localhost:3000 and login with user admin & password admin. After
logging in, you will see a RabbitMQ Overview dashboard pre-loaded (/・0・)

Thanks @cirocosta! https://github.com/cirocosta/sample-grafana

cc @MarcialRosales

[finishes #164374321]
2019-04-16 15:48:44 +01:00
Gerhard Lazu 5329f9f836 Increse PerfTest publish rate to 10 msg/s per publisher
Wanted to put a bit more load on the RabbitMQ nodes
2019-04-16 15:48:44 +01:00
Gerhard Lazu 0a9a085e91 Make PerfTest wait longer for RabbitMQ nodes to start
If all 3 RabbitMQ nodes take 15s to start, PerfTest container will crash
and not be auto-restarted.
2019-04-16 15:48:43 +01:00
Gerhard Lazu e8e224cc57 Expose management & prometheus ports for all RabbitMQ nodes
* rabbitmq1 - http://localhost:15672 & http://localhost:15692/metrics
* rabbitmq2 - http://localhost:25672 & http://localhost:25692/metrics
* rabbitmq3 - http://localhost:35672 & http://localhost:35692/metrics

[#164374321]
2019-04-16 15:48:43 +01:00
Gerhard Lazu d61c58d0d6 Capture a 3-node RabbitMQ & Prometheus + Grafana stack
cd docker && docker-compose up

[#164374321]
2019-04-15 19:18:45 +01:00