Metric Name | Metric Meaning | Metric Description | Unit | Dimension | Statistical Granularity |
GroupCpuUsage | CPU utilization | Resource group CPU utilization | % | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupGpuUtil | GPU utilization | Resource group GPU utilization | % | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupLanInTraffic | Private network inbound bandwidth | Resource group private network inbound bandwidth | Mbps | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupLanOutTraffic | Private network outbound bandwidth | Resource group private network outbound bandwidth | Mbps | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupMemUsage | Memory utilization | Resource group memory utilization | % | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupWanInTraffic | Public network inbound bandwidth | Resource group public network inbound bandwidth | Mbps | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupWanOutTraffic | Public network outbound bandwidth | Resource group public network outbound bandwidth | Mbps | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsCpuUsage | CPU utilization | Resource (node) CPU utilization | % | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsGpuUtil | GPU utilization | Resource (node) GPU utilization | % | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsMemUsage | Memory utilization | Resource (node) memory utilization | % | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsLanInTraffic | Private network inbound bandwidth | Resource (node) private network inbound bandwidth | Mbps | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsLanOutTraffic | Private network outbound bandwidth | Resource (node) private network outbound bandwidth | Mbps | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsWanInTraffic | Public network inbound bandwidth | Resource (node) public network inbound bandwidth | Mbps | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsWanOutTraffic | Public network outbound bandwidth | Resource (node) public network outbound bandwidth | Mbps | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupWanOutratio | Public network bandwidth utilization | Public network bandwidth utilization | % | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupFp16EngineActivity | FP16 active time ratio | FP16 active time ratio | % | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupFp32EngineActivity | FP32 active time ratio | FP32 active time ratio | % | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupFp64EngineActivity | FP64 active time ratio | FP64 active time ratio | % | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupGpuDecUtil | GPU decode utilization | GPU decode utilization | % | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupGpuEccPersistent | Persistent ECC errors | Persistent ECC errors | Count | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupGpuEccVolatile | Volatile ECC errors | Volatile ECC errors | Count | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupGpuEncUtil | GPU encode utilization | GPU encode utilization | % | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupGpuPowerUsage | GPU power consumption | GPU power consumption | W | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupGpuRetiredPages | Retired memory pages | Retired memory pages | Count | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupGpuTemperature | GPU temperature | GPU temperature | °C | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
GroupGpuXidErrors | GPU Xid error count | GPU Xid error count | Count | ResourceGroupId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsWanOutratio | Public network bandwidth utilization | Public network bandwidth utilization | % | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsGpuDecUtil | GPU decode utilization | GPU decode utilization | % | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsGpuEccPersistent | Persistent ECC errors | Persistent ECC errors | Count | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsGpuEccVolatile | Volatile ECC errors | Volatile ECC errors | Count | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsGpuEncUtil | GPU encode utilization | GPU encode utilization | % | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsGpuPowerUsage | GPU power consumption | GPU power consumption | W | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsGpuRetiredPages | Retired memory pages | Retired memory pages | Count | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsGpuTemperature | GPU temperature | GPU temperature | °C | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsGpuXidErrors | GPU Xid error count | GPU Xid error count | Count | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsFp16EngineActivity | FP16 active time ratio | FP16 active time ratio | % | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsFp32EngineActivity | FP32 active time ratio | FP32 active time ratio | % | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
InsFp64EngineActivity | FP64 active time ratio | FP64 active time ratio | % | ResourceId AppId | [ 10s, avg ] [ 60s, avg ] [ 300s, avg ] [ 3600s, avg ] [ 86400s, avg ] |
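
The table above follows two regular patterns: metrics prefixed with `Group` use the `ResourceGroupId` dimension, metrics prefixed with `Ins` use `ResourceId`, and every metric supports the same five granularities. The Python sketch below encodes those patterns as small helpers. The function names and the `max_points` cap are hypothetical, introduced only for illustration; they are not part of any SDK.

```python
# Statistical granularities (in seconds) listed for every metric in the table.
SUPPORTED_PERIODS = (10, 60, 300, 3600, 86400)


def required_dimension(metric_name: str) -> str:
    """Return the resource dimension a metric expects, based on its name prefix."""
    if metric_name.startswith("Group"):
        return "ResourceGroupId"
    if metric_name.startswith("Ins"):
        return "ResourceId"
    raise ValueError(f"unrecognized metric name: {metric_name}")


def pick_period(window_seconds: int, max_points: int = 1000) -> int:
    """Pick the finest supported period that yields at most max_points samples.

    max_points is an assumed client-side limit, not a documented quota.
    """
    if window_seconds <= 0:
        raise ValueError("window_seconds must be positive")
    for period in SUPPORTED_PERIODS:
        if window_seconds / period <= max_points:
            return period
    # Even the coarsest granularity exceeds the cap; fall back to it anyway.
    return SUPPORTED_PERIODS[-1]
```

For example, a one-hour window fits at the 10-second granularity, while a one-day window under the assumed cap of 1000 points resolves to 300 seconds.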

Parameter Name | Dimension Name | Dimension Explanation | Format |
Instances.N.Dimensions.0.Name | AppId | Dimension name of the account APPID | Enter the String-type dimension name: AppId (filled in automatically during SDK calls; no need to pass it) |
Instances.N.Dimensions.0.Value | AppId | Account APPID | Enter the ID, for example: 1231231231 (filled in automatically during SDK calls; no need to pass it) |
Instances.N.Dimensions.1.Name | ResourceGroupId | Dimension name of the resource group ID | Enter the String-type dimension name: ResourceGroupId |
Instances.N.Dimensions.1.Value | ResourceGroupId | Resource group ID | Enter the ID, for example: trsg-b564kzx2 |
Instances.N.Dimensions.2.Name | ResourceId | Dimension name of a specific resource ID in the resource group | Enter the String-type dimension name: ResourceId |
Instances.N.Dimensions.2.Value | ResourceId | A specific resource ID in the resource group | Enter the ID, for example: sm-54wplvsv |
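
The parameter names in this table follow a flat `Instances.N.Dimensions.M.Name` / `.Value` pattern. As a minimal sketch of how such keys could be generated, the Python helper below expands a list of instances into that form. `flatten_dimensions` is a hypothetical helper, not an SDK function, and it assumes `N` is zero-based like the dimension index. It keeps `AppId` at index 0 to match the table; the table notes that SDK calls fill in `AppId` automatically, and it does not state whether the other indices shift when `AppId` is omitted.

```python
from typing import Dict, List, Tuple

Dimension = Tuple[str, str]  # (dimension name, dimension value)


def flatten_dimensions(instances: List[List[Dimension]]) -> Dict[str, str]:
    """Expand instances into flat Instances.N.Dimensions.M.Name/Value keys.

    N is the position of the instance and M the position of the dimension,
    both counted from zero (zero-based N is an assumption).
    """
    params: Dict[str, str] = {}
    for n, dimensions in enumerate(instances):
        for m, (name, value) in enumerate(dimensions):
            prefix = f"Instances.{n}.Dimensions.{m}"
            params[f"{prefix}.Name"] = name
            params[f"{prefix}.Value"] = value
    return params


# One instance, using the example IDs from the table above.
example = flatten_dimensions([
    [
        ("AppId", "1231231231"),
        ("ResourceGroupId", "trsg-b564kzx2"),
        ("ResourceId", "sm-54wplvsv"),
    ]
])
```

Here `example["Instances.0.Dimensions.1.Value"]` holds the resource group ID, and `example["Instances.0.Dimensions.2.Name"]` holds `ResourceId`.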