tencent cloud

Tencent Cloud Observability Platform

Release Notes and Announcements
Release Notes
Product Introduction
Overview
Strengths
Basic Features
Basic Concepts
Use Cases
Use Limits
Purchase Guide
Tencent Cloud Product Monitoring
Application Performance Management
Mobile App Performance Monitoring
Real User Monitoring
Cloud Automated Testing
Prometheus Monitoring
Grafana
EventBridge
PTS
Quick Start
Monitoring Overview
Instance Group
Tencent Cloud Product Monitoring
Application Performance Management
Real User Monitoring
Cloud Automated Testing
Performance Testing Service
Prometheus Getting Started
Grafana
Dashboard Creation
EventBridge
Alarm Service
Cloud Product Monitoring
Tencent Cloud Service Metrics
Operation Guide
CVM Agents
Cloud Product Monitoring Integration with Grafana
Troubleshooting
Practical Tutorial
Application Performance Management
Product Introduction
Access Guide
Operation Guide
Practical Tutorial
Parameter Information
FAQs
Mobile App Performance Monitoring
Overview
Operation Guide
Access Guide
Practical Tutorial
Tencent Cloud Real User Monitoring
Product Introduction
Operation Guide
Connection Guide
FAQs
Cloud Automated Testing
Product Introduction
Operation Guide
FAQs
Performance Testing Service
Overview
Operation Guide
Practice Tutorial
JavaScript API List
FAQs
Prometheus Monitoring
Product Introduction
Access Guide
Operation Guide
Practical Tutorial
Terraform
FAQs
Grafana
Product Introduction
Operation Guide
Guide on Grafana Common Features
FAQs
Dashboard
Overview
Operation Guide
Alarm Management
Console Operation Guide
Troubleshooting
FAQs
EventBridge
Product Introduction
Operation Guide
Practical Tutorial
FAQs
Report Management
FAQs
General
Alarm Service
Concepts
Monitoring Charts
CVM Agents
Dynamic Alarm Threshold
CM Connection to Grafana
Documentation Guide
Related Agreements
Application Performance Management Service Level Agreement
APM Privacy Policy
APM Data Processing And Security Agreement
RUM Service Level Agreement
Mobile Performance Monitoring Service Level Agreement
Cloud Automated Testing Service Level Agreement
Prometheus Service Level Agreement
TCMG Service Level Agreements
PTS Service Level Agreement
PTS Use Limits
Cloud Monitor Service Level Agreement
API Documentation
History
Introduction
API Category
Making API Requests
Monitoring Data Query APIs
Alarm APIs
Legacy Alert APIs
Notification Template APIs
TMP APIs
Grafana Service APIs
Event Center APIs
TencentCloud Managed Service for Prometheus APIs
Monitoring APIs
Data Types
Error Codes
Glossary

Online Services

PDF
フォーカスモード
フォントサイズ
最終更新日: 2025-05-26 17:06:15

Namespace

Namespace = QCE/TI_MODEL

Monitoring Metrics

Metric Name
Metric Meaning
Description
Unit
Dimension
Statistical Rule
[period, statType]
Apicallerrortotal
Failed API call volume
Failed API call volume
Count
Source
SubUin
ServiceGroupId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
Apicalllimittotal
Total number of being restricted requests
Total amount of being restricted API calls
Count
Source
SubUin
ServiceGroupId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
Apicallsuccesstotal
Total amount of successful calls
Total amount of successful API calls
Count
SubUin
ServiceGroupId
Source
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
Apicalltotal
Total API calls
Total API calls
Count
Source
SubUin
ServiceGroupId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
Apiresponsetime
Average Response Time
Average Response Time
ms
ServiceGroupId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsClientDataReadBandwidth
turocfs single-node server read bandwidth
turocfs single-node server read bandwidth
KBytes/s
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsClientDataWriteBandwidth
turocfs single-node server write bandwidth
turocfs single-node server write bandwidth
KBytes/s
Source
SubUin
InstanceId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsDataReadIoBytes
cfs server read bandwidth
cfs Server Read Bandwidth
KBytes/s
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsDataReadIoLatency
cfs Read Latency
cfs Read Latency
ms
Source
SubUin
InstanceId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsDataWriteIoBytes
cfs Server Write Bandwidth
cfs server write bandwidth
KBytes/s
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsDataWriteIoLatency
cfs Write Latency
cfs Write Latency
ms
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsStrageUsageGb
cfs storage data capacity
cfs storage data capacity
GBytes
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Cpuutil
CPU utilization
CPU utilization
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskIoUtil
Disk ioutil
Disk ioutil
%
Source
SubUin
InstanceId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskIoWait
Disk iowait
Disk iowait
%
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskReadByte
Disk Read Bandwidth
Disk Read Bandwidth
MBytes/s
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskReadIops
Disk read iops
Disk read iops
Count
SubUin
InstanceId
Source
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskUsageRadio
System Disk Partition Utilization
System disk partition utilization
%
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskWriteByte
Disk Write Bandwidth
Disk Write Bandwidth
MBytes/s
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskWriteIops
Disk write iops
Disk write iops
Count
Source
SubUin
InstanceId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Gpumemutil
GPU vRAM utilization
GPU vRAM utilization
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Gpuutil
GPU utilization
GPU utilization
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instancecpuutil
CPU utilization
CPU utilization
%
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instancegpumemutil
GPU vRAM utilization
GPU vRAM utilization
%
SubUin
InstanceId
Source
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instancegpuutil
GPU utilization
GPU utilization
%
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instancehttpqps
http call qps
http requests per second of the instance
Count/s
InstanceId
Source
SubUin
[ 10s, max ]
[ 60s, max ]
[ 300s, max ]
[ 3600s, max ]
[ 86400s, max ]
Instancehttpqpslimit
http call being restricted qps
http requests per second of the instance being restricted
Count/s
Source
SubUin
InstanceId
[ 10s, max ]
[ 60s, max ]
[ 300s, max ]
[ 3600s, max ]
[ 86400s, max ]
Instancememutil
Memory utilization
Memory utilization
%
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instancememvalue
Memory usage
Memory usage
MBytes
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instancenetworkibytes
Network Inbound Traffic
Network Inbound Traffic
MBytes
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instanceready
Number of instances running
Number of instances running
Count
AppId
Source
SubUin
TaskId
[ 10s, last ]
[ 60s, last ]
[ 300s, last ]
[ 3600s, last ]
[ 86400s, last ]
InstanceTiemsCurrentRequests
Number of concurrent requests
Number of concurrent requests
Count
InstanceId
Source
SubUin
[ 10s, max ]
[ 60s, max ]
[ 300s, max ]
[ 3600s, max ]
[ 86400s, max ]
Instancetotal
Number of Instances
Number of Instances
Count
AppId
Source
SubUin
TaskId
[ 10s, last ]
[ 60s, last ]
[ 300s, last ]
[ 3600s, last ]
[ 86400s, last ]
Memutil
Memory utilization
Memory utilization
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Memvalue
Memory Usage
Memory Usage
MBytes
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Networkreceivebytes
Network Inbound Traffic
Network Inbound Traffic
MBytes
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsClientDataReadBandwidth
turocfs single-node server read bandwidth
turocfs single-node server read bandwidth
KBytes/s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsClientDataWriteBandwidth
turocfs single-node server write bandwidth
turocfs single-node server write bandwidth
KBytes/s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsDataReadIoBytes
cfs Server Read Bandwidth
cfs Server Read Bandwidth
KBytes/s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsDataReadIoLatency
cfs Read Latency
cfs Read Latency
ms
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsDataWriteIoBytes
cfs server write bandwidth
cfs server write bandwidth
KBytes/s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsDataWriteIoLatency
cfs Write Latency
cfs Write Latency
ms
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsStrageUsageGb
cfs storage data capacity
cfs storage data capacity
GBytes
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskIoUtil
Disk ioutil
Disk ioutil
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskIoWait
Disk iowait
Disk iowait
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskReadByte
Disk Read Bandwidth
Disk Read Bandwidth
MBytes/s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskReadIops
Disk read iops
Disk read iops
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskUsageRadio
System disk partition utilization
System disk partition utilization
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskWriteByte
Disk Write Bandwidth
Disk Write Bandwidth
MBytes/s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskWriteIops
Disk write iops
Disk write iops
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Servicehttpqps
http call qps
http requests per second of the service
Count/s
AppId
Source
SubUin
TaskId
[ 10s, max ]
[ 60s, max ]
[ 300s, max ]
[ 3600s, max ]
[ 86400s, max ]
Servicehttpqpslimit
http call being restricted qps
http requests per second of the service being restricted
Count/s
AppId
Source
SubUin
TaskId
[ 10s, max ]
[ 60s, max ]
[ 300s, max ]
[ 3600s, max ]
[ 86400s, max ]
ServiceTiemsCurrentRequests
Number of concurrent requests
Number of concurrent requests
Count
AppId
Source
SubUin
TaskId
[ 10s, max ]
[ 60s, max ]
[ 300s, max ]
[ 3600s, max ]
[ 86400s, max ]
ServiceGpuMemValue
GPU memory usage
GPU memory usage
MBytes
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsTokenThroughput
Tokens processed per minute
Tokens processed per minute
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsTokenThroughputInput
Tokens processed per minute (input)
Tokens processed per minute (input)
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsTokenThroughputOutput
Tokens processed per minute (output only)
Tokens processed/generated per minute
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsFirstTokenLatency
First Token latency
First Token Latency
s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsNonFirstTokenLatency
Subsequent Token Latency
Subsequent Token LaActive Time Ratiotency
s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsProcessingRequestCount
Number of requests in processing
Number of requests being processed
Count
AppId
Source
SubUin
TaskId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
ServiceEmsQueuingRequestCount
Number of requests queuing
Number of requests in queue
Count
AppId
Source
SubUin
TaskId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
ServiceEmsTotalProcessedTokens
Total amount of processed tokens
Total amount of processed tokens
Count
AppId
Source
SubUin
TaskId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
ServiceEmsTotalProcessedTokensInput
Total amount of processed tokens, only input
input token total amount
Count
AppId
Source
SubUin
TaskId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
ServiceEmsTotalProcessedTokensOutput
processed Token total amount
Generate Token total amount
Count
AppId
Source
SubUin
TaskId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
ServiceEmsAverageLengthInput
input average length (Token)
input average length (Token)
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsAverageLengthOutput
Output average length (Token)
Output average length (Token)
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuMemValue
GPU memory usage
GPU memory usage
MBytes
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsTokenThroughput
Tokens processed per minute
Tokens processed per minute
Count
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsTokenThroughputInput
Tokens processed per minute (input)
Tokens processed per minute (input)
Count
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsTokenThroughputOutput
Tokens processed per minute (output only)
Tokens processed per minute (output only)
Count
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsFirstTokenLatency
First Token Latency
First Token Latency
s
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsNonFirstTokenLatency
Subsequent Token Latency
Subsequent Token Latency
s
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsProcessingRequestCount
Number of requests being processed
Number of requests being processed
Count
AppId
InstanceId
Source
SubUin
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
EmsQueuingRequestCount
Number of requests in queue
Number of requests in queue
Count
AppId
InstanceId
Source
SubUin
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
EmsTotalProcessedTokens
Total amount of processed tokens
Total amount of processed tokens
Count
AppId
InstanceId
Source
SubUin
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
EmsTotalProcessedTokensInput
Total amount of processed tokens, only input
Total amount of processed tokens, only input
Count
AppId
InstanceId
Source
SubUin
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
EmsTotalProcessedTokensOutput
processed Token total amount
processed Token total amount
Count
AppId
InstanceId
Source
SubUin
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
EmsAverageLengthInput
input average length (Token)
input average length (Token)
Count
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsAverageLengthOutput
Output average length (Token)
Output average length (Token)
Count
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Fp16EngineActivity
FP16 Active Time Ratio
FP16 Active Time Ratio
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Fp32EngineActivity
FP32 Active Time Ratio
FP32 Active Time Ratio
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Fp64EngineActivity
FP64 Active Time Ratio
FP64 Active Time Ratio
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
NvlinkBandwidth
nvlink transmission rate
nvlink transmission rate
Bytes/s
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
PcieBandwidth
PCIe bus transmission rate
PCIe bus transmission rate
Bytes/s
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
SmActivity
SM active state time ratio
SM active state time ratio
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
TensorActivity
Tensor active state time ratio
Tensor active state time ratio
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DcgmFiDevFbUsed
GPU memory usage
GPU memory usage
MBytes
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DcgmFiDevGpuUtil
GPU usage
GPU usage
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DcgmFiDevMemCopyUtil
GPU memory usage
GPU memory usage
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuDecUtil
GPU decode usage
GPU decode usage
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuEncUtil
GPU encoder usage
GPU encoder usage
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuMemoryClock
GPU Memory frequency
GPU Memory frequency
S
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuNvlinkRxMb
nvlink amount of data received
nvlink amount of data received
Mbps
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuNvlinkTxMb
nvlink amount of data sent
nvlink amount of data sent
Mbps
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuPcieRxMb
pcie amount of data received
pcie amount of data received
Mbps
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuPcieTxMb
pcie amount of data sent
pcie amount of data sent
Mbps
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuSmClock
SM clock frequency
SM clock frequency
S
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]

Overview of Parameters Corresponding to Each Dimension

Parameter Name
Dimension Name
Dimension Explanation
Format
Instances.N.Dimensions.0.Name
AppId
basic account information APPID dimension name
Enter the dimension name of String type: AppId (automatically selects during SDK call, no need to pass parameters)
Instances.N.Dimensions.0.Value
AppId
basic account information APPID
Enter the ID, for example: 1231231231 (automatically selects during SDK call, no need to pass parameters)
Instances.N.Dimensions.1.Name
SubUin
sub-account ID dimension name
Enter the dimension name of String type: SubUin
Instances.N.Dimensions.1.Value
SubUin
sub-account ID
Enter the ID, for example: 100001231231
Instances.N.Dimensions.2.Name
Source
Create Source dimension name
Enter the dimension name of String type: Source
Instances.N.Dimensions.2.Value
Source
Create Source
Input source, for example: normal (use this value by default)
Instances.N.Dimensions.3.Name
InstanceId
online service instance ID dimension name
Enter the dimension name of String type: InstanceId
Instances.N.Dimensions.3.Value
InstanceId
online service instance ID
Enter the specific instance ID, for example: ms-2tgmq6ms-1-5f96656956-272wq
Instances.N.Dimensions.4.Name
TaskId
online service ID dimension name
Enter the dimension name of String type: TaskId
Instances.N.Dimensions.4.Value
TaskId
online service ID
Enter the ID, for example: ms-2tgmq6ms-1
Instances.N.Dimensions.5.Name
ServiceGroupId
online service service group ID dimension name
Enter the dimension name of String type: ServiceGroupId
Instances.N.Dimensions.5.Value
ServiceGroupId
online service service group ID
Enter the ID, for example: ms-2tgmq6ms
Instances.N.Dimensions.6.Name
InstanceGpuNum
GPU Card Number used by online service instances (only for full GPU card tasks)
Enter the dimension name of String type: InstanceGpuNum
Instances.N.Dimensions.6.Value
InstanceGpuNum
GPU Card Number used by online service instances (only for full GPU card tasks)
Concatenate the instance ID with the GPU card number/avg. Enter the specific instance ID, for example: ms-2tgmq6ms-1-5f96656956-272wq-0

Input Parameters

Query online service metric monitoring data. Values are as follows:
&Namespace=QCE/TI_MODEL
&Instances.N.Dimensions.0.Name=AppId
&Instances.N.Dimensions.0.Value=specific account ID
&Instances.N.Dimensions.1.Name=SubUin
&Instances.N.Dimensions.1.Value=specific sub-account ID
&Instances.N.Dimensions.2.Name=Source
&Instances.N.Dimensions.2.Value=specific creation source
&Instances.N.Dimensions.3.Name=InstanceId
&Instances.N.Dimensions.3.Value=online service instance ID
&Instances.N.Dimensions.4.Name=TaskId
&Instances.N.Dimensions.4.Value=specific online service ID
&Instances.N.Dimensions.5.Name=ServiceGroupId
&Instances.N.Dimensions.5.Value=specific online service group ID
&Instances.N.Dimensions.6.Name=InstanceGpuNum
&Instances.N.Dimensions.6.Value=GPU Card Number used by online service instances

ヘルプとサポート

この記事はお役に立ちましたか?

フィードバック