It should be possible using script to execute on alarm = /your/custom/remediation-script…netdata.cloud/…/agent-notifications-reference. I have not experimented with this yet, but soon will (implementing a custom notification channel for specific alarms)
restarting a service if it isn’t answering requests
I’d rather find the root cause of the downtime/malfunction instead of blindly restarting the service, just my 2 cents.
Monitoring software for a wide array of hw and sw
I’m looking into setting up some monitoring combined with simple automation for my selfhosting. Currently I was thinking about using Zabbix....