Description
[Using SMA-MIB.mib] It monitors the Voltaire InfiniBand Subnet Management Agent (SMA) System Module attributes like sysModuleState, sysModuleTempValue, sysModuleTempState, sysModuleRate, sysModulePowerConsumption and also monitors the state of a remote action. Tested on MellanoxVoltaire-4036E-Infiniband [SysObjID: 1.3.6.1.4.1.5206.1.24].
Prerequisites
SNMP should be enabled in end device and device should support SMA-MIB OIDs and SNMP credentials should be attached against the device in portal.
How to Apply: This template is All instance selection based. It will not ask user to select any instance(s) while assigning it to a device.
Metric Parameters
Parameter | Description |
---|---|
Frequency | Warning Threshold | If the metric value satisfies the condition defined along with Warning Threshold value, then a notification is sent to the user. |
Critical Threshold | If the metric value satisfies the condition defined along with Critical Threshold value, then a notification is sent to the user. |
Alert | The alert value can be set to either Yes or No. If it is Yes, then an alert message is sent to the user. |
Metrics
vol.infiniband.sma.remote.state
Metric Details
Applicable for | Device |
SNMP OID | 1.3.6.1.4.1.5206.2.1.0, 1.3.6.1.4.1.5206.2.4.0, 1.3.6.1.4.1.5206.2.6.0, 1.3.6.1.4.1.5206.2.7.0, 1.3.6.1.4.1.5206.2.8.0 |
Expression | remoteState |
Description | Queries the state of a remote action. [OIDs: 1.3.6.1.4.1.5206.2.7.0, 1.3.6.1.4.1.5206.2.8.0, 1.3.6.1.4.1.5206.2.9.0] |
Category | SNMP monitors |
Collector Type | Gateway |
Monitor Name | Voltaire InfiniBand SMA Remote Action State |
Unit |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 5 | 1 – 1440 (mins) |
Filter | ||
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | NOT_EQUAL | Ends with, ==, !=, >=, <=, >, <, In Range, Out of range, Equals, Not equals, Equals Ignore Case, Not Equals Ignore Case, Contains, Not contains, Regex match, Regex no match, In string list, Not in string list, In List, Not in list, Starts with |
Critical Threshold | 1 | [{"1":"success"},{"2":"ftpExecutionFailed"},{"3":"linuxCmdFailed"},{"4":"invalidRemotePath"},{"5":"invalidFileName"},{"6":"localRepositoryFull"},{"7":"localFileDoesNotExist"},{"8":"invalidFileContents"},{"9":"successToOverwriteFile"},{"10":"unknownError"},{"11":"errNoRemotePathInput"},{"12":"errNoFileNameInput"},{"13":"errcorruptedRepostoryFile"},{"14":"errPlatforTypeNotSupported"},{"15":"errSmbNotInActiveMode"},{"16":"errFailtoSyncSmb"},{"17":"errFailtoUpgradeSystem"},{"18":"errFailtoExportLogs"},{"19":"ftpUpgradeSoftwareInProgress"}] |
Critical Repeat Count | 1 | 1-12 |
Alert | Yes | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
vol.infiniband.sma.sys.module.state
Metric Details
Applicable for | Device |
SNMP OID | 1.3.6.1.4.1.5206.3.29.1.6, 1.3.6.1.4.1.5206.3.29.1.1, 1.3.6.1.4.1.5206.3.29.1.2 |
Expression | sysModuleState |
Description | State of the module. Possible values "1=>notPresent, 2=>ok, 3=>fault, 4=>dcFault, 5=>acFault, 6=>unknown, 7=>io-fault". [OID: 1.3.6.1.4.1.5206.3.29.1.6] |
Category | SNMP monitors |
Collector Type | Gateway |
Monitor Name | Voltaire InfiniBand SMA System Module Monitors |
Unit |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 5 | 1 – 1440 (mins) |
Filter | NULL | Not Applicable |
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | NOT_IN_LIST | Ends with, ==, !=, >=, <=, >, <, In Range, Out of range, Equals, Not equals, Equals Ignore Case, Not Equals Ignore Case, Contains, Not contains, Regex match, Regex no match, In string list, Not in string list, In List, Not in list, Starts with |
Critical Threshold | 1,2 | [{"1":"notPresent"},{"2":"ok"},{"3":"fault"},{"4":"dcFault"},{"5":"acFault"},{"6":"unknown"},{"7":"io-fault"}] |
Critical Repeat Count | 1 | 1-12 |
Alert | Yes | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
vol.infiniband.sma.sys.module.temp.value
Metric Details
Applicable for | Device |
SNMP OID | 1.3.6.1.4.1.5206.3.29.1.7 |
Expression | NULL |
Description | A module holds multiple heat sensors. This metric holds the maximum temperature measured across the module. [OID: 1.3.6.1.4.1.5206.3.29.1.7] |
Category | SNMP monitors |
Collector Type | Gateway |
Monitor Name | Voltaire InfiniBand SMA System Module Monitors |
Unit | C |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 5 | 1 – 1440 (mins) |
Filter | NULL | Not Applicable |
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
vol.infiniband.sma.sys.module.temp.state
Metric Details
Applicable for | Device |
SNMP OID | 1.3.6.1.4.1.5206.3.29.1.8 |
Expression | NULL |
Description | State of module according to temperature. Possible values are "1: alarm, 2: warning, 3: normal, 4: sensorFault, 5: notAvalible". [OID: 1.3.6.1.4.1.5206.3.29.1.8] |
Category | SNMP monitors |
Collector Type | Gateway |
Monitor Name | Voltaire InfiniBand SMA System Module Monitors |
Unit |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 5 | 1 – 1440 (mins) |
Filter | NULL | Not Applicable |
Warning Operator | EQUAL | Ends with, ==, !=, >=, <=, >, <, In Range, Out of range, Equals, Not equals, Equals Ignore Case, Not Equals Ignore Case, Contains, Not contains, Regex match, Regex no match, In string list, Not in string list, In List, Not in list, Starts with |
Warning Threshold | 2 | [{"1":"alarm"},{"2":"warning"},{"3":"normal"},{"4":"sensorFault"},{"5":"notAvalible"}] |
Warning Repeat Count | 1 | 1-12 |
Critical Operator | IN_LIST | Ends with, ==, !=, >=, <=, >, <, In Range, Out of range, Equals, Not equals, Equals Ignore Case, Not Equals Ignore Case, Contains, Not contains, Regex match, Regex no match, In string list, Not in string list, In List, Not in list, Starts with |
Critical Threshold | 1,4 | [{"1":"alarm"},{"2":"warning"},{"3":"normal"},{"4":"sensorFault"},{"5":"notAvalible"}] |
Critical Repeat Count | 1 | 1-12 |
Alert | Yes | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
vol.infiniband.sma.sys.module.power.usage
Metric Details
Applicable for | Device |
SNMP OID | 1.3.6.1.4.1.5206.3.29.1.12 |
Expression | NULL |
Description | The system DC power consumption is Watt. [OID: 1.3.6.1.4.1.5206.3.29.1.12] |
Category | SNMP monitors |
Collector Type | Gateway |
Monitor Name | Voltaire InfiniBand SMA System Module Monitors |
Unit | W |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 5 | 1 – 1440 (mins) |
Filter | NULL | Not Applicable |
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
vol.infiniband.sma.sys.module.fan.rate
Metric Details
Applicable for | Device |
SNMP OID | 1.3.6.1.4.1.5206.3.29.1.11 |
Expression | NULL |
Description | The fan rate of the module. [OID: 1.3.6.1.4.1.5206.3.29.1.11] |
Category | SNMP monitors |
Collector Type | Gateway |
Monitor Name | Voltaire InfiniBand SMA System Module Monitors |
Unit |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 5 | 1 – 1440 (mins) |
Filter | NULL | Not Applicable |
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph