Metrics and Alert rules provided by Hardware Observer for IPMI

Metrics

The details of the metrics exposed by Hardware Observer for IPMI are as follows:

| Metric Name | Description | Labels |
| --- | --- | --- |
| ipmi_generic_sensor_value | Sensor values from IPMI sensors that aren’t covered by any of the other sensor metrics. | id, name, state, unit, event, type |
| ipmi_fan_speed_rpm | Fan speed, in RPM. | id, name, state, unit, event |
| ipmi_fan_speed_ratio | Fan speed, as a percentage of maximum speed. | id, name, state, unit, event |
| ipmi_temperature_celsius | Temperature, in degrees Celsius, as recorded by IPMI sensors. | id, name, state, unit, event |
| ipmi_voltage_volts | Voltage from voltage sensors, in volts. | id, name, state, unit, event |
| ipmi_current_amperes | Current from current sensors, in amperes. | id, name, state, unit, event |
| ipmi_power_watts | Power from power sensors, in watts. | id, name, state, unit, event |
| ipmimonitoring_command_success | Indicates whether the ipmimonitoring command succeeded (1.0 = successful, 0.0 = unsuccessful). | |
| ipmi_sel_state | Event state from an IPMI SEL entry, mapped to a number (0: NOMINAL, 1: WARNING, 2: CRITICAL). | id, date, time, name, type, event |
| ipmi_sel_command_success | Indicates whether the ipmi-sel command succeeded (1.0 = successful, 0.0 = unsuccessful). | |
| ipmi_dcmi_power_cosumption_watts | Current power consumption in watts, as given by ipmi-dcmi. | |
| ipmi_dcmi_command_success | Indicates whether the ipmi-dcmi command succeeded (1.0 = successful, 0.0 = unsuccessful). | |
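
As an illustration of how these metrics might be consumed, the sketch below parses a few samples in the Prometheus text exposition format and translates the numeric ipmi_sel_state value back into the state names listed above. The sample scrape output, its label values, and the helper names (parse_samples, sel_state_name) are assumptions made for this example; they are not part of Hardware Observer. The parser is deliberately simple and skips any line it does not recognise, such as samples that carry a trailing timestamp.

```python
import re

# Mapping documented for ipmi_sel_state: 0 NOMINAL, 1 WARNING, 2 CRITICAL.
SEL_STATES = {0: "NOMINAL", 1: "WARNING", 2: "CRITICAL"}

# One sample line: metric_name{label="value",...} value
SAMPLE_RE = re.compile(
    r'^(?P<name>[a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(?P<labels>.*)\})?\s+(?P<value>\S+)$'
)
LABEL_RE = re.compile(r'(\w+)="((?:[^"\\]|\\.)*)"')


def parse_samples(text):
    """Parse text-format samples into (name, labels, value) tuples."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        match = SAMPLE_RE.match(line)
        if match is None:
            continue  # ignore lines this simple parser does not understand
        labels = dict(LABEL_RE.findall(match.group("labels") or ""))
        samples.append((match.group("name"), labels, float(match.group("value"))))
    return samples


def sel_state_name(value):
    """Translate a numeric ipmi_sel_state value into its documented name."""
    return SEL_STATES.get(int(value), "UNKNOWN")


# Hypothetical scrape output; every label value here is invented.
EXAMPLE = '''
# HELP ipmi_sel_state Event state from IPMI SEL entry
ipmi_sel_state{id="12",date="Jan-01-2024",time="00:00:00",name="PSU1",type="Power Supply",event="Power Supply AC lost"} 2.0
ipmi_sel_command_success 1.0
'''

for name, labels, value in parse_samples(EXAMPLE):
    if name == "ipmi_sel_state":
        print(labels["name"], sel_state_name(value))
```

Keeping the state mapping in a single dictionary means any value outside the documented range is reported as UNKNOWN rather than raising an error.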

Alerts

The details of the alerts that are provided by Hardware Observer for IPMI are as follows:

| Alert Rule Name | Description | Severity |
| --- | --- | --- |
| IPMIDCMICommandFailed | Failed to run ipmi-dcmi. | critical |
| IPMIDCMIPowerConsumptionOutstanding | IPMI DCMI power consumption is high. | warning |
| IPMISELCommandFailed | Failed to run ipmi-sel. | critical |
| IPMISELStateWarning | IPMI system event log in warning state. | warning |
| IPMISELStateCritical | IPMI system event log in critical state. | critical |
| IPMIMonitoringCommandFailed | Failed to run ipmimonitoring. | critical |
| IPMITemperatureStateNotOk | Temperature in warning or critical state. | warning / critical |
| IPMIPowerStateNotOk | Power in warning or critical state. | warning / critical |
| IPMIVoltageStateNotOk | Voltage in warning or critical state. | warning / critical |
| IPMICurrentStateNotOk | Current in warning or critical state. | warning / critical |
| IPMIFanSpeedStateNotOk | Fan speed in warning or critical state. | warning / critical |
| IPMISensorStateNotOk | IPMI sensor value in warning or critical state. | warning / critical |
| IPMISELDStateWarning | IPMISELD service is not active. | warning |
| IPMISELDStateCritical | IPMI system event log in critical state. | critical |
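
To show how the command-failure alerts relate to the command-success metrics listed earlier, here is a minimal sketch that reports which of those alerts would be expected for a given set of metric values. The metric and alert names come from this page, but the pairing between them and the "value equals 0.0" condition are assumptions inferred from the descriptions. The rule expressions actually shipped with Hardware Observer may differ, for example by requiring the condition to hold for some duration. The function name failed_command_alerts and the sample values are hypothetical.

```python
# Pairing of metrics to alerts is an assumption based on their descriptions,
# not a copy of the shipped alert rule expressions.
COMMAND_ALERTS = {
    "ipmimonitoring_command_success": "IPMIMonitoringCommandFailed",
    "ipmi_sel_command_success": "IPMISELCommandFailed",
    "ipmi_dcmi_command_success": "IPMIDCMICommandFailed",
}


def failed_command_alerts(metric_values):
    """Return alert names whose command-success metric reports 0.0."""
    firing = []
    for metric, alert in COMMAND_ALERTS.items():
        value = metric_values.get(metric)
        # A missing metric is treated as "no data", not as a failure.
        if value is not None and value == 0.0:
            firing.append(alert)
    return firing


# Hypothetical values, as they might be read from a single scrape.
values = {
    "ipmimonitoring_command_success": 1.0,
    "ipmi_sel_command_success": 0.0,
    "ipmi_dcmi_command_success": 1.0,
}
print(failed_command_alerts(values))
```

A missing metric is left out of the result on purpose: an absent sample usually points to a scrape problem rather than a failed IPMI command, so it is better handled separately.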