Metrics
The details of the metrics exposed by Hardware Observer for IPMI are as follows:
| Metric Name | Description | Labels |
|---|---|---|
| ipmi_generic_sensor_value | Sensor values from ipmi sensors that aren’t covered in any of the other sensor metrics. | id, name, state, unit, event, type |
| ipmi_fan_speed_rpm | Fan speed measure, in rpm. | id, name, state, unit, event |
| ipmi_fan_speed_ratio | Fan speed measure, as a percentage of maximum speed. | id, name, state, unit, event |
| ipmi_temperature_celsius | Temperature in celsius as recorded by ipmi sensors. | id, name, state, unit, event |
| ipmi_voltage_volts | Voltage measure from voltage sensors | id, name, state, unit, event |
| ipmi_current_amperes | Current measure from current sensors, in Amperes | id, name, state, unit, event |
| ipmi_power_watts | Power measure from power sensors, in watts. | id, name, state, unit, event |
| ipmimonitoring_command_success | Indicates if the ipmimonitoring command succeeded or not(1.0 = successful, 0.0 = unsuccessful) | |
| ipmi_sel_state_nominal | The ID for IPMI SEL nominal event | id, date, time, name, type, event |
| ipmi_sel_state_warning | The ID for IPMI SEL warning event | id, date, time, name, type, event |
| ipmi_sel_state_critical | The ID for IPMI SEL critical event | id, date, time, name, type, event |
| ipmi_sel_command_success | Indicates if the ipmi sel command succeeded or not(1.0 = successful, 0.0 = unsuccessful) | |
| ipmi_dcmi_power_cosumption_watts | Current power consumption in watts, as given by ipmi-dcmi | |
| ipmi_dcmi_command_success | Indicates if the ipmi dcmi command is successful or not(1.0 = successful, 0.0 = unsuccessful) |
Alerts
The details of the alerts that are provided by Hardware Observer for IPMI are as follows:
| Alert Rule Name | Description | Severity |
|---|---|---|
| IPMIDCMICommandFailed | Failed to run ipmi_dcmi. | critical |
| IPMIDCMIPowerConsumptionOutstanding | IPMI DCMI power consumption is high. | warning |
| IPMISELCommandFailed | Failed to run ipmi-sel. | critical |
| IPMISELStateWarning | IPMI system event log in warning state. | warning |
| IPMISELStateCritical | IPMI system event log in critical state. | critical |
| IPMIMonitoringCommandFailed | Failed to run ipmimonitoring. | critical |
| IPMITemperatureStateNotOk | Temperature in warning or critical state | warning / critical |
| IPMIPowerStateNotOk | Power in warning or critical state | warning / critical |
| IPMIVoltageStateNotOk | Voltage in warning or critical state | warning / critical |
| IPMICurrentStateNotOk | Current in warning or critical state | warning / critical |
| IPMIFanSpeedStateNotOk | Fan speed in warning or critical state | warning / critical |
| IPMISensorStateNotOk | IPMI sensor value in warning or critical state | warning / critical |
| IPMISELDStateWarning | IPMISELD service is not active | warning |
| IPMISELDStateCritical | IPMI system event log in critical state | critical |