Recently some of my peers were discussing issues that their customers are encountering with manual reset monitors. I thought the main points of the discussion were worth sharing.
What is a Manual Reset Monitor?
It is a type of monitor that detects an error condition, turns “red”, and generates an alert. What it does not do is detect the corresponding “success” condition, so someone must manually go into the OpsMgr console and reset the monitor from “red” back to “green” (healthy). A more useful type of monitor goes “red” when an error occurs and returns to “green” when the error is no longer occurring, which is what most well-written monitors do.
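The difference between the two behaviors can be sketched as a small state machine. This is only an illustrative model in Python, not OpsMgr code: the class names, the "error"/"success" event strings, and the assumption that an alert is raised only when the state changes from green to red are all hypothetical simplifications.

```python
from dataclasses import dataclass, field

GREEN, RED = "green", "red"


@dataclass
class ManualResetMonitor:
    """Hypothetical model: detects errors only; a person must reset it."""
    state: str = GREEN
    alerts: list = field(default_factory=list)

    def on_event(self, event: str) -> None:
        # Only the error condition is detected; "success" events are ignored.
        # Assumption: an alert is raised only on the green -> red transition.
        if event == "error" and self.state == GREEN:
            self.state = RED
            self.alerts.append("error detected")

    def reset(self) -> None:
        # Stands in for an operator resetting health in the console.
        self.state = GREEN


@dataclass
class TwoStateMonitor:
    """Hypothetical model: detects both the error and the success condition."""
    state: str = GREEN
    alerts: list = field(default_factory=list)

    def on_event(self, event: str) -> None:
        if event == "error" and self.state == GREEN:
            self.state = RED
            self.alerts.append("error detected")
        elif event == "success":
            # The success condition returns the monitor to healthy on its own.
            self.state = GREEN


def replay(monitor, events):
    """Feed a sequence of events to a monitor and return it."""
    for event in events:
        monitor.on_event(event)
    return monitor


# The error occurs, clears, and then recurs.
events = ["error", "success", "error"]
manual = replay(ManualResetMonitor(), events)
auto = replay(TwoStateMonitor(), events)
```

Under these assumptions, the manual reset monitor is still red from the first error when the second one arrives, so it records a single alert, while the two-state monitor returns to green in between and alerts on both occurrences.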
What challenges do they present?
For administrators that rarely use the OpsMgr Console:
For administrators that do use the OpsMgr Console:
What are an MP author’s alternatives?
The top priority of a monitoring product is generally to notify the appropriate administrator when a problem occurs. Health state (and therefore monitors) is valuable, but it should generally not come at the expense of alerting on an error condition or on a recurrence of that condition. There are several alternatives available to an MP author that require little additional MP development time.
What realistic workarounds are available once an MP containing manual reset monitors has already been developed?