Rework of distributed multisite setup via Ansible #45

ganto · 2017-01-30T17:30:06Z

Since nearly half a year, the checkmk_server role supports automated setup of distributed monitoring sites. I extensively run this role in a large environment with two master sites each having multiple slave sites and several hundered monitoring targets each having from a few dozens to a few hundered service checks. Although the role already assists quite well adding new targets or generating monitoring rules, the distributed multisite setup is still a bit clumsy. Especially when adding a new slave site, a lot of manual definition work is required (in checkmk_server__distributed_sites) which is error-prone and already requires a good understanding about the Check_MK and Ansible role internals. It also doesn't help, that I didn't properly document it yet, as I was always looking for a way to simplify the configuration.

Further there are some limitations which I documented in the following issues:

Wrong credentials with distributed multisite setup #41: Wrong credentials with distributed multisite setup
Monitoring rule updates are not synchronized to slave sites #44: Monitoring rule updates are not synchronized to slave sites

And I plan some extensions such as (#42: Support stunnel for protecting livestatus queries) or automated setup of multiple sites per server which are simply not possible with the current role layout.

TODO
All this made me think, that I need a better way to setup distributed sites and I came along with the following idea:

Instead of attaching the slave site setup to the monitoring server running the site, I plan to move the logic to the master site setup. In this way, a distributed setup is configured and push from a single configuration target (the master server) and it's much easier to pass all required information to a slave site.

In the following weeks I will try to change the role logic in the proposed way. This should not only make it easier to fix the mentioned issues, but hopefully also allow for an easier implementation to support multiple monitoring sites on the same server (which currently has to be done manually).

The text was updated successfully, but these errors were encountered:

ganto added the enhancement label Jan 30, 2017

ganto self-assigned this Jan 30, 2017

ganto added this to the v0.1.0 milestone Mar 19, 2017

ganto mentioned this issue Apr 13, 2017

[WIP] Complete rework of the role to simplify distributed site setup #53

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rework of distributed multisite setup via Ansible #45

Rework of distributed multisite setup via Ansible #45

ganto commented Jan 30, 2017 •

edited

Loading

Rework of distributed multisite setup via Ansible #45

Rework of distributed multisite setup via Ansible #45

Comments

ganto commented Jan 30, 2017 • edited Loading

ganto commented Jan 30, 2017 •

edited

Loading