-
Notifications
You must be signed in to change notification settings - Fork 17
Do not compute flapping state when host / parent is not up #192
Comments
There are not much people using flapping so it is difficult to have an idea about that. |
Really useful when a service moves up and down around the threshold, to avoid notification spamming.
An idea perhaps, when the engine checks a service which returns a non-OK status, it also checks / knows the related host status, as it will not notify if the host is non-UP. I may then be tempted to say that the host status is known and that it may then be feasible to avoid storing in the last (21) service status used for flapping computation a non-OK status when the related host is non-UP. Just my 2 cents :) ... |
Something like :
Here ? centreon-engine/src/service.cc Lines 1952 to 1953 in 2a0f1fb
We would then be sure not to modify the flapping status of a service at all while the host is down. |
Hi @cpbn, The calculation of the flapping of a service is independent of the host as you explained. More generally, what would be the principle of notification of services if the host is in a flapping state? To disable the notification of all services of the host? |
Hi @lpinsivy, thank you for asking. Simply notify as usual :
If host becomes Thank you again 👍 |
Note that we have the same issue with (parent / child) hosts themselves.
So, as proposed above for services, we should not compute the child host flapping status when the parent host is Thank you again 👍 |
See #524 for a fix 👍 |
Closing this, as it should be solved by #557. |
Hello,
A service is considered as flapping after its state has changed too many times among the last (21) checks. Perfect.
Let's now take a service attached to a host which is itself flapping (or at least changing state regularly).
The service will then frequently switch from OK to UNKNOWN (or any other NOK) state, to OK again etc...
Which will finally bring it to a flapping state, leading to a lot of unwanted and wrong flapping notifications, making flapping detection counter-productive...
According to me, service state change should then not be taken into account for flapping computation when related host is not up.
Unfortunately, sounds like this is not the case for now.
Could we then think about implementing this rule ?
In addition, note that we also receive the service FLAPPING notification even if the related host is not up, whereas other notifications types are muted when host is not up. Anyway this should not occur anymore if we apply the rule above.
Many thanks 👍
Edit : see #524 for a fix.
The text was updated successfully, but these errors were encountered: