Introduction
Use Azure Batch to run large-scale parallel and high-performance computing (HPC) batch jobs efficiently in Azure.
Azure Batch does the following:
- Creates and manages a pool of compute nodes (virtual machines).
- Installs the applications you want to run.
- Schedules jobs to run on the nodes.
There is no cluster or job scheduler software to install, manage, or scale.
Note
Use the OpsRamp Azure public cloud integration to discover and collect metrics against the Azure service.Setup
To set up the OpsRamp Azure integration and discover the Azure service,
go to Azure Integration Discovery Profile and select Batch Accounts
Metrics
OpsRamp Metric | Metric Display Name | Unit | Aggregation Type | Description |
---|---|---|---|---|
azure_core_count | Dedicated Core Count | Count | Total | Total number of dedicated cores in the batch account. |
azure_total_node_count | Dedicated Node Count | Count | Total | Total number of dedicated nodes in the batch account. |
azure_low_priority_core_count | LowPriority Core Count | Count | Total | Total number of low priority cores in the batch account. |
azure_total_low_priority_node_count | Low Priority Node Count | Count | Total | Total number of low priority nodes in the batch account. |
azure_creating_node_count | Creating Node Count | Count | Total | Number of nodes being created. |
azure_starting_node_count | Starting Node Count | Count | Total | Number of nodes starting. |
azure_waiting_for_start_task_node_count | Waitng For Start Task Node Count | Count | Total | Number of nodes waiting for the Start Task to complete. |
azure_start_task_failed_node_count | Start Task Failed Node Count | Count | Total | Number of nodes where the Start Task has failed. |
azure_idle_node_count | Idle Node Count | Count | Total | Number of idle nodes. |
azure_offline_node_count | Offline Node Count | Count | Total | Number of offline nodes. |
azure_rebooting_node_count | Rebooting Node Count | Count | Total | Number of rebooting nodes. |
azure_remaining_node_count | Reimaging Node Count | Count | Total | Number of reimaging nodes. |
azure_running_node_count | Running Node Count | Count | Total | Number of running nodes. |
azure_leaving_pool_node_count | Leaving Pool Node Count | Count | Total | Number of nodes leaving the pool. |
azure_unusable_node_count | Unusable Node Count | Count | Total | Number of unusable nodes. |
azure_preempted_node_count | Preempted Node Count | Count | Total | Number of preempted nodes. |
azure_task_start_event | Task Start Events | Count | Total | Number of tasks that have started. |
azure_task_complete_event | Task Complete Events | Count | Total | Total number of tasks that have completed. |
azure_task_fail_event | Task Fail Events | Count | Total | Total number of tasks that have completed in a failed state. |
azure_pool_create_event | Pool Create Events | Count | Total | Total number of pools that have been created. |
azure_pool_resize_start_event | Pool Resize Start Events | Count | Total | Total number of pool resizes that have started. |
azure_pool_resize_complete_event | Pool Resize Complete Events | Count | Total | Total number of pool resizes that have completed. |
azure_pool_delete_start_event | Pool Delete Start Events | Count | Total | Total number of pool deletes that have started. |
azure_pool_delete_complete_event | Pool Delete Complete Events | Count | Total | Total number of pool deletes that have completed. |
azure_batchaccount_job_delete_complete_event | Job Delete Complete Events | Count | Total | Total number of jobs that have been successfully deleted. |
azure_batchaccount_job_delete_start_event | Job Delete Start Events | Count | Total | Total number of jobs that have been requested to be deleted. |
azure_batchaccount_job_disable_complete_event | Job Disable Complete Events | Count | Total | Total number of jobs that have been successfully disabled. |
azure_batchaccount_job_disable_start_event | Job Disable Start Events | Count | Total | Total number of jobs that have been requested to be disabled. |
azure_batchaccount_job_start_event | Job Start Events | Count | Total | Total number of jobs that have been successfully started. |
azure_batchaccount_job_terminate_complete_event | Job Terminate Complete Events | Count | Total | Total number of jobs that have been successfully terminated. |
azure_batchaccount_job_terminate_start_event | Job Terminate Start Events | Count | Total | Total number of jobs that have been requested to be terminated. |
cloud_instance_state | Status/State | n/a | n/a | Status/State |
Event support
- Supported
- Configurable in OpsRamp Azure Integration Discovery Profile.