Monitoring - checks
To monitor our infrastructure, one of the pillar would be a Nagios-descendant product, like Sensu, Icinga or Shinken.
Those software will allow us to run checks remotely or on premise (NRPE checks, and we've already some written).
Sensu offers to create observability pipelines and to configure them as code.
Plan
Plan is to evaluate if the open source version of Sensu allows us to build correct observability pipeline and to use them to:
- fire notifications to our notifications center when something is wrong
- prepare a dashboard of what's ok / not ok in our infrastructure
To do so, we're first launching a small Sensu instance on Dwellers and we'll implement a dozen of checks.