Metrics
The details of the metrics exposed by Hardware Observer for IPMI are as follows:
Metric Name | Description | Labels |
---|---|---|
ipmi_generic_sensor_value | Sensor values from IPMI sensors that aren’t covered by any of the other sensor metrics. | id, name, state, unit, event, type |
ipmi_fan_speed_rpm | Fan speed, in RPM. | id, name, state, unit, event |
ipmi_fan_speed_ratio | Fan speed, as a percentage of maximum speed. | id, name, state, unit, event |
ipmi_temperature_celsius | Temperature recorded by IPMI sensors, in degrees Celsius. | id, name, state, unit, event |
ipmi_voltage_volts | Voltage from voltage sensors, in volts. | id, name, state, unit, event |
ipmi_current_amperes | Current from current sensors, in amperes. | id, name, state, unit, event |
ipmi_power_watts | Power from power sensors, in watts. | id, name, state, unit, event |
ipmimonitoring_command_success | Indicates whether the ipmimonitoring command succeeded (1.0 = successful, 0.0 = unsuccessful). | |
ipmi_sel_state | Event state from IPMI SEL entry, mapped to a number (0: NOMINAL, 1: WARNING, 2: CRITICAL) | id, date, time, name, type, event |
ipmi_sel_command_success | Indicates whether the ipmi-sel command succeeded (1.0 = successful, 0.0 = unsuccessful). | |
ipmi_dcmi_power_cosumption_watts | Current power consumption in watts, as given by ipmi-dcmi | |
ipmi_dcmi_command_success | Indicates whether the ipmi-dcmi command succeeded (1.0 = successful, 0.0 = unsuccessful). | |
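
The `*_command_success` metrics can be used to check that each collector ran before trusting the sensor values. The following is a minimal Python sketch, assuming the metrics are exposed in the standard Prometheus text format. The sample payload, its label values (such as `state="Nominal"`), and the helper names `parse_samples` and `failed_commands` are hypothetical and only serve the illustration.

```python
import re

# Hypothetical exporter output; names follow the table above, values are made up.
SAMPLE = """\
# HELP ipmi_sel_command_success Whether the ipmi-sel command succeeded
ipmimonitoring_command_success 1.0
ipmi_sel_command_success 0.0
ipmi_dcmi_command_success 1.0
ipmi_temperature_celsius{id="1",name="CPU Temp",state="Nominal",unit="C",event="OK"} 42.0
"""

# One sample per line: metric name, optional {labels}, then the value.
LINE_RE = re.compile(
    r'^(?P<name>[A-Za-z_:][A-Za-z0-9_:]*)(?:\{(?P<labels>.*)\})?\s+(?P<value>\S+)$'
)
LABEL_RE = re.compile(r'(\w+)="((?:[^"\\]|\\.)*)"')


def parse_samples(text):
    """Return a list of (metric_name, labels_dict, value) tuples."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        match = LINE_RE.match(line)
        if match is None:
            continue  # ignore lines this simplified parser cannot read
        labels = dict(LABEL_RE.findall(match.group("labels") or ""))
        samples.append((match.group("name"), labels, float(match.group("value"))))
    return samples


def failed_commands(samples):
    """Names of the *_command_success metrics that report 0.0 (unsuccessful)."""
    return sorted(
        name
        for name, _labels, value in samples
        if name.endswith("_command_success") and value == 0.0
    )


if __name__ == "__main__":
    print(failed_commands(parse_samples(SAMPLE)))
```

With this sample, only `ipmi_sel_command_success` is reported as failed, which corresponds to the condition described by the `IPMISELCommandFailed` alert.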
Alerts
The details of the alerts provided by Hardware Observer for IPMI are as follows:
Alert Rule Name | Description | Severity |
---|---|---|
IPMIDCMICommandFailed | Failed to run ipmi-dcmi. | critical |
IPMIDCMIPowerConsumptionOutstanding | IPMI DCMI power consumption is high. | warning |
IPMISELCommandFailed | Failed to run ipmi-sel. | critical |
IPMISELStateWarning | IPMI system event log in warning state. | warning |
IPMISELStateCritical | IPMI system event log in critical state. | critical |
IPMIMonitoringCommandFailed | Failed to run ipmimonitoring. | critical |
IPMITemperatureStateNotOk | Temperature in warning or critical state. | warning / critical |
IPMIPowerStateNotOk | Power in warning or critical state. | warning / critical |
IPMIVoltageStateNotOk | Voltage in warning or critical state. | warning / critical |
IPMICurrentStateNotOk | Current in warning or critical state. | warning / critical |
IPMIFanSpeedStateNotOk | Fan speed in warning or critical state. | warning / critical |
IPMISensorStateNotOk | IPMI sensor value in warning or critical state. | warning / critical |
IPMISELDStateWarning | IPMISELD service is not active. | warning |
IPMISELDStateCritical | IPMI system event log in critical state. | critical |
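
As an illustration of how the numeric `ipmi_sel_state` value relates to the SEL alerts, here is a short Python sketch. The numeric mapping (0: NOMINAL, 1: WARNING, 2: CRITICAL) comes from the metrics table. The pairing of each state with an alert name and severity is an assumption inferred from the alert names and descriptions, not the actual rule expressions shipped with Hardware Observer, and `sel_alert_for` is a hypothetical helper.

```python
# Numeric states as documented for the ipmi_sel_state metric.
SEL_STATES = {0: "NOMINAL", 1: "WARNING", 2: "CRITICAL"}

# Assumed pairing of state to (alert rule name, severity), inferred from the table.
SEL_ALERTS = {
    "WARNING": ("IPMISELStateWarning", "warning"),
    "CRITICAL": ("IPMISELStateCritical", "critical"),
}


def sel_alert_for(value):
    """Return (alert_name, severity) for an ipmi_sel_state value, or None if nominal."""
    state = SEL_STATES.get(int(value))
    if state is None:
        raise ValueError(f"unexpected ipmi_sel_state value: {value!r}")
    return SEL_ALERTS.get(state)


if __name__ == "__main__":
    for value in (0.0, 1.0, 2.0):
        print(value, sel_alert_for(value))
```

A nominal entry maps to no alert, while values of 1 and 2 map to the warning and critical SEL alerts respectively.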