Description
This template monitors DataNode-related metrics. It applies to devices that run the HDFS application.
Prerequisites
Java must be installed on the device. The Gateway must be up and running, and the device must be reachable from the Gateway. The device must be in the managed state.
Metric Parameters
Parameter | Description |
---|---|
Frequency | The interval, in minutes, at which the metric is collected. |
Warning Threshold | If the metric value satisfies the condition defined along with Warning Threshold value, then a notification is sent to the user. |
Critical Threshold | If the metric value satisfies the condition defined along with Critical Threshold value, then a notification is sent to the user. |
Alert | The alert value can be set to either Yes or No. If it is Yes, then an alert message is sent to the user. |
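The threshold parameters above can be illustrated with a minimal sketch. It assumes that a rule fires only when the last Repeat Count samples all satisfy the operator, and that Critical takes precedence over Warning. The operator names, the `Rule` class, and the `evaluate` function are hypothetical and are not part of the product.

```python
import operator
from dataclasses import dataclass
from typing import Callable, Dict, Optional, Sequence

# Hypothetical operator names; the product's actual operator list is not documented here.
OPERATORS: Dict[str, Callable[[float, float], bool]] = {
    "GREATER_THAN": operator.gt,
    "GREATER_THAN_EQUAL": operator.ge,
    "LESS_THAN": operator.lt,
    "LESS_THAN_EQUAL": operator.le,
    "EQUAL": operator.eq,
    "NOT_EQUAL": operator.ne,
}


@dataclass
class Rule:
    op: str                # key into OPERATORS
    threshold: float       # value the metric is compared against
    repeat_count: int = 1  # assumed: consecutive breaching samples required


def evaluate(samples: Sequence[float],
             warning: Optional[Rule],
             critical: Optional[Rule]) -> str:
    """Return "Critical", "Warning", or "Ok" based on the most recent samples."""
    # Check Critical first so that it wins when both rules match (an assumption).
    for state, rule in (("Critical", critical), ("Warning", warning)):
        if rule is None:
            continue
        needed = max(1, rule.repeat_count)
        recent = samples[-needed:]
        compare = OPERATORS[rule.op]
        if len(recent) == needed and all(compare(v, rule.threshold) for v in recent):
            return state
    return "Ok"


if __name__ == "__main__":
    # Example: alert when remaining space stays below a limit for two polls.
    warn = Rule("LESS_THAN", 100.0, repeat_count=2)
    crit = Rule("LESS_THAN", 50.0, repeat_count=2)
    print(evaluate([120.0, 90.0, 80.0], warn, crit))
```

Whether a notification is then sent would depend on the Alert setting; the sketch only computes the state.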
Metrics
hdfs.datanode.dfs.remaining
Metric Details
Applicable for | Device |
Description | The remaining disk space. |
Category | Application |
Collector Type | Gateway |
Monitor Name | HDFS DataNode Monitor |
Unit | Bytes |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 2 | 1 – 1440 (mins) |
Filter | ||
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
hdfs.datanode.dfs.capacity
Metric Details
Applicable for | Device |
Description | Total configured HDFS storage capacity. |
Category | Application |
Collector Type | Gateway |
Monitor Name | HDFS DataNode Monitor |
Unit | Bytes |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 2 | 1 – 1440 (mins) |
Filter | ||
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
hdfs.datanode.dfs.used
Metric Details
Applicable for | Device |
Description | Total HDFS storage used. |
Category | Application |
Collector Type | Gateway |
Monitor Name | HDFS DataNode Monitor |
Unit | Bytes |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 2 | 1 – 1440 (mins) |
Filter | ||
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
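Because the capacity, used, and remaining metrics are all reported in bytes, a percentage is often easier to reason about when choosing thresholds. The following is a minimal sketch, not part of the template itself; `utilization_percent` and `format_bytes` are hypothetical helper names, and the same calculation could be applied to `hdfs.datanode.cache.used` and `hdfs.datanode.cache.capacity`.

```python
def utilization_percent(used_bytes: float, capacity_bytes: float) -> float:
    """Return used space as a percentage of capacity (0.0 if capacity is not positive)."""
    if capacity_bytes <= 0:
        return 0.0
    return 100.0 * used_bytes / capacity_bytes


def format_bytes(num_bytes: float) -> str:
    """Render a byte count using binary units (KiB, MiB, ...)."""
    units = ["B", "KiB", "MiB", "GiB", "TiB", "PiB"]
    value = float(num_bytes)
    for unit in units:
        # Stop at the first unit that keeps the value below 1024, or at the largest unit.
        if abs(value) < 1024.0 or unit == units[-1]:
            return f"{value:.2f} {unit}"
        value /= 1024.0
    return f"{value:.2f} {units[-1]}"  # not reached; keeps the return type explicit


if __name__ == "__main__":
    used = 250 * 1024**3       # sample value standing in for hdfs.datanode.dfs.used
    capacity = 1000 * 1024**3  # sample value standing in for hdfs.datanode.dfs.capacity
    print(format_bytes(used), "of", format_bytes(capacity))
    print(f"{utilization_percent(used, capacity):.1f}% used")
```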
hdfs.datanode.cache.capacity
Metric Details
Applicable for | Device |
Description | The capacity of the HDFS cache on this DataNode. |
Category | Application |
Collector Type | Gateway |
Monitor Name | HDFS DataNode Monitor |
Unit | Bytes |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 2 | 1 – 1440 (mins) |
Filter | ||
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
hdfs.datanode.cache.used
Metric Details
Applicable for | Device |
Description | The total cache used. |
Category | Application |
Collector Type | Gateway |
Monitor Name | HDFS DataNode Monitor |
Unit | Bytes |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 2 | 1 – 1440 (mins) |
Filter | ||
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
hdfs.datanode.last.volume.failure.date
Metric Details
Applicable for | Device |
Description | The date/time of the last volume failure in milliseconds since epoch. |
Category | Application |
Collector Type | Gateway |
Monitor Name | HDFS DataNode Monitor |
Unit | ms |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 2 | 1 – 1440 (mins) |
Filter | ||
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
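Since `hdfs.datanode.last.volume.failure.date` is reported in milliseconds since the epoch, the raw value usually needs converting before a person can read it. A minimal sketch of that conversion follows; `volume_failure_time` is a hypothetical helper, and treating a value of 0 as "no failure recorded" is an assumption rather than documented behavior.

```python
from datetime import datetime, timezone
from typing import Optional


def volume_failure_time(ms_since_epoch: int) -> Optional[datetime]:
    """Convert a millisecond epoch timestamp to a UTC datetime."""
    if ms_since_epoch <= 0:
        # Assumption: a non-positive value means no volume failure has been recorded.
        return None
    return datetime.fromtimestamp(ms_since_epoch / 1000.0, tz=timezone.utc)


if __name__ == "__main__":
    sample = 1_700_000_000_000  # illustrative value, not real monitor output
    when = volume_failure_time(sample)
    print(when.isoformat() if when else "no failure recorded")
```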
hdfs.datanode.estimated.capacity.lost.total
Metric Details
Applicable for | Device |
Description | The estimated capacity lost in bytes. |
Category | Application |
Collector Type | Gateway |
Monitor Name | HDFS DataNode Monitor |
Unit | Bytes |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 2 | 1 – 1440 (mins) |
Filter | ||
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
hdfs.datanode.num.blocks.cached
Metric Details
Applicable for | Device |
Description | The number of blocks cached. |
Category | Application |
Collector Type | Gateway |
Monitor Name | HDFS DataNode Monitor |
Unit | count |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 2 | 1 – 1440 (mins) |
Filter | ||
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
hdfs.datanode.num.blocks.failed.to.cache
Metric Details
Applicable for | Device |
Description | The total number of blocks the DataNode failed to cache. |
Category | Application |
Collector Type | Gateway |
Monitor Name | HDFS DataNode Monitor |
Unit | count |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 2 | 1 – 1440 (mins) |
Filter | ||
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
hdfs.datanode.num.blocks.failed.to.uncache
Metric Details
Applicable for | Device |
Description | The total number of blocks the DataNode failed to uncache. |
Category | Application |
Collector Type | Gateway |
Monitor Name | HDFS DataNode Monitor |
Unit | count |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 2 | 1 – 1440 (mins) |
Filter | ||
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph
hdfs.datanode.num.failed.volumes
Metric Details
Applicable for | Device |
Description | Total number of failed volumes. |
Category | Application |
Collector Type | Gateway |
Monitor Name | HDFS DataNode Monitor |
Unit | count |
Possible Inputs
Metric | Input Value | Range of Values |
---|---|---|
Frequency | 2 | 1 – 1440 (mins) |
Filter | ||
Warning Operator | ||
Warning Threshold | ||
Warning Repeat Count | ||
Critical Operator | ||
Critical Threshold | ||
Critical Repeat Count | ||
Alert | No | Yes/No |
Graph (Yes/No) | Yes | Yes/No |
Sample Output
No graph