The NameNode is the centerpiece of an HDFS file system. It keeps the directory tree of all files in the file system, and tracks where across the cluster the file data is kept. It does not store the data of these files itself.

Prerequisite

For Virtual Machines, install the Linux Agent.

Configuring the credentials

Configure the credentials in the directory /opt/opsramp/agent/conf/app.d/creds.yaml

hdfs-namenode:
- name: hdfs-namenode
  user: <username>
  pwd: <Password>
  encoding-type: plain
  labels:
    key1: val1
    key2: val2

Configure the application

Virtual machine

Configure the application in the directory /opt/opsramp/agent/conf/app/discovery/auto-detection.yaml

- name: hdfs-namenode
  instance-checks:
    process-check:
      - namenode
    port-check:
      - 50070
      - 9870
    uri-check:
      - "jmx?qry=Hadoop:service=NameNode,name=FSNamesystem*"
      - "jmx?qry=Hadoop:service=NameNode,name=NameNodeInfo"

Docker environment

Configure the application in the directory /opt/opsramp/agent/conf/app/discovery/auto-container-detection.yaml

- name: hdfs-namenode
  container-checks:
    image-check:
      - namenode
    port-check:
      - 50070
      - 9870

Kubernetes environment

Configure the application in config.yaml

- name: hdfs-namenode
  container-checks:
    image-check:
      - namenode
    port-check:
      - 50070
      - 9870

Validate

Go to Resources under the Infrastructure tab to check if your resources are onboarded and the metrics are collected.

Metrics

OpsRamp MetricMetric Display NameUnitDescription
hdfs_namenode_capacity_totalCapacity TotalbytesTotal disk capacity
hdfs_namenode_capacity_usedCapacity UsedbytesDisk usage
hdfs_namenode_capacity_used_percentCapacity Used PercentpercentDisk usage
hdfs_namenode_capacity_remainingCapacity RemainingbytesRemaining disk space left
hdfs_namenode_capacity_remaining_percentCapacity Remaining PercentpercentRemaining disk space left
hdfs_namenode_total_loadTotal LoadTotal load on the file system
hdfs_namenode_fs_lock_queue_lengthFs Lock Queue LengthLock queue length
hdfs_namenode_blocks_totalBlocks TotalBlocksTotal number of blocks
hdfs_namenode_max_objectsMax ObjectsMaximum number of files that HDFS supports
hdfs_namenode_files_totalFiles TotalFilesTotal number of files
hdfs_namenode_pending_replication_blocksPending Replication BlocksBlocksNumber of blocks pending replication
hdfs_namenode_under_replicated_blocksUnder Replicated BlocksBlocksNumber of under replicated blocks
hdfs_namenode_scheduled_replication_blocksScheduled Replication BlocksBlocksNumber of blocks scheduled for replication
hdfs_namenode_pending_deletion_blocksPending Deletion BlocksBlocksNumber of pending deletion blocks
hdfs_namenode_num_live_data_nodesNum Live Data NodesNodesTotal number of live data nodes
hdfs_namenode_num_dead_data_nodesNum Dead Data NodesNodesTotal number of dead data nodes
hdfs_namenode_num_decom_live_data_nodesNum Decom Live Data NodesNodesNumber of decommissioning live data nodes
hdfs_namenode_num_decom_dead_data_nodesNum Decom Dead Data NodesNodesNumber of decommissioning dead data nodes
hdfs_namenode_volume_failures_totalVolume Failures TotalTotal volume failures
hdfs_namenode_estimated_capacity_lost_totalEstimated Capacity Lost TotalbytesEstimated capacity lost
hdfs_namenode_num_decommissioning_data_nodesNum Decommissioning Data NodesNodesNumber of decommissioning data nodes
hdfs_namenode_num_stale_data_nodesNum Stale Data NodesNodesNumber of stale data nodes
hdfs_namenode_num_stale_storagesNum Stale StoragesNumber of stale storages
hdfs_namenode_missing_blocksMissing BlocksBlocksNumber of missing blocks
hdfs_namenode_corrupt_blocksCorrupt BlocksBlocksNumber of corrupt blocks
hdfs_namenode_num_failed_data_nodesNum Failed Data NodesTotal number of failed data nodes
hdfs_namenode_nondfs_used_percentNondfs Used PercentPercentTotal space used by NonDfs