Maintenance
Maintenance is a way to mute alerting on specific metrics or triggers. It is useful during planned work: for example, when you are going to move a server from one data center to another and don't want Moira to disturb you.
Examples
When you switch a metric or trigger into maintenance, Moira mutes all state changes during that period. When maintenance ends, you receive a notification for every metric whose state differs from its state before maintenance.
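The rule above can be sketched in a few lines of Python. This is only an illustration of the described behavior, not Moira's actual implementation; the function name `state_change_after_maintenance` and the representation of transitions as `(old, new)` pairs are assumptions made for this example.

```python
from typing import Iterable, Optional, Tuple


def state_change_after_maintenance(
    state_before: str,
    transitions: Iterable[Tuple[str, str]],
) -> Optional[Tuple[str, str]]:
    """Return (before, after) if a notification is due, otherwise None.

    Hypothetical helper: every transition that happens during maintenance
    is muted, and only the states at the two ends are compared.
    """
    current = state_before
    for old, new in transitions:
        if old != current:
            raise ValueError(f"transition starts from {old}, expected {current}")
        current = new  # muted: no notification while maintenance is active

    if current == state_before:
        return None  # state is the same as before maintenance
    return (state_before, current)


# The metric returns to OK, so nothing is reported.
print(state_change_after_maintenance(
    "OK", [("OK", "WARN"), ("WARN", "ERROR"), ("ERROR", "OK")]
))  # None

# The metric ends in ERROR, so one change is reported.
print(state_change_after_maintenance(
    "OK", [("OK", "WARN"), ("WARN", "ERROR"), ("ERROR", "OK"), ("OK", "ERROR")]
))  # ('OK', 'ERROR')
```

Only the first and last states matter in this sketch; the number of intermediate transitions has no effect on the result.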
Example 1. Maintenance metric, alert will not be sent
metric awesomeMetric1 is in OK state;
Rick switches the metric into maintenance for an hour;
within the hour, the metric changes its state several times: OK→WARN, WARN→ERROR, ERROR→OK;
after the one-hour maintenance ends, the metric is in OK state;
Moira checks if the metric state changed during maintenance:
  awesomeMetric1 state before maintenance: OK;
  awesomeMetric1 state after maintenance: OK;
nothing to notify about: the state is the same as it was before the maintenance period.
Example 2. Maintenance metric, alert will be sent
metric awesomeMetric2 is in OK state;
Rick switches the metric into maintenance for an hour;
within the hour, the metric changes its state several times: OK→WARN, WARN→ERROR, ERROR→OK, OK→ERROR;
after the one-hour maintenance ends, the metric is in ERROR state;
Moira checks if the metric state changed during maintenance:
  awesomeMetric2 state before maintenance: OK;
  awesomeMetric2 state after maintenance: ERROR;
Moira sends a message to the user: the state differs from the state before the maintenance period.
Example 3. Maintenance trigger, alert will be sent
metric awesomeMetric1 is in WARN state;
metric awesomeMetric2 is in OK state;
Rick switches the trigger with these metrics into maintenance for an hour;
within the hour, metric awesomeMetric2 changes its state several times: OK→WARN, WARN→ERROR, ERROR→OK, OK→ERROR;
after the one-hour maintenance ends, metric awesomeMetric2 is in ERROR state;
Moira checks if the metric states changed during maintenance:
  awesomeMetric1 state before maintenance: WARN;
  awesomeMetric1 state after maintenance: WARN;
  awesomeMetric2 state before maintenance: OK;
  awesomeMetric2 state after maintenance: ERROR;
Moira sends a message about metric awesomeMetric2 to the user: its state differs from the state before the maintenance period.
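For a trigger with several metrics, the same comparison is made for each metric separately. The following Python sketch illustrates that idea under stated assumptions: the function name `metrics_to_notify` and the dictionaries mapping metric names to states are hypothetical and are not part of Moira's API.

```python
from typing import Dict, Tuple


def metrics_to_notify(
    before: Dict[str, str],
    after: Dict[str, str],
) -> Dict[str, Tuple[str, str]]:
    """Map each changed metric to its (before, after) states.

    Hypothetical helper: `before` and `after` hold per-metric states at the
    start and at the end of trigger maintenance. Metrics missing from
    `after` are skipped in this sketch.
    """
    return {
        name: (old_state, after[name])
        for name, old_state in before.items()
        if name in after and after[name] != old_state
    }


before = {"awesomeMetric1": "WARN", "awesomeMetric2": "OK"}
after = {"awesomeMetric1": "WARN", "awesomeMetric2": "ERROR"}

# Only awesomeMetric2 changed, so it is the only metric reported.
print(metrics_to_notify(before, after))  # {'awesomeMetric2': ('OK', 'ERROR')}
```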