Metrics and Alert rules provided by Hardware Observer for PowerEdge RAID

Metrics

The details of the metrics exposed by hardware-observer for PowerEdge RAID Controllers are as follows:

Metric Name Description Labels
poweredgeraid_controllers number of controllers hostname
poweredgeraid_virtual_drive number of virtual drives controller_id
perccli_command_success Indicates the if command is successful or not controller_id
perccli_command_ctrl_success Indicates the if command for each controller is successful or not controller_id
poweredgeraid_virtual_drive_info Indicates the state of virtual drive controller_id, device_group, virtual_drive_id, state, cache_policy
poweredgeraid_physical_device_info Shows the information about the physical device controller_id, enclosure_id, slot_id, size, media_type, state

Alerts

The details of the alets that are provided by hardware-observer for PowerEdge RAID Controllers are as follows:

Alert Rule Name Description Alert Condition Severity
PerccliCommandFailed Failed to run perccli or controller not available. perccli_command_success == 1 critical
PowerEdgeRAIDControllerNotFound PowerEdge RAID controller not found. poweredgeraid_controllers == 1 warning
PowerEdgeRAIDControllerSuccess PowerEdge RAID controller command not successful. perccli_command_ctrl_success == 0 critical
PowerEdgeRAIDVirtualDriveNotOptimal PowerEdge RAID virtual drives are not in optimal state. poweredgeraid_virtual_info{state != “Optl”} == 1 warning