tencent cloud

腾讯云可观测平台

动态与公告
产品动态
产品简介
产品概述
产品优势
基本功能
基本概念
应用场景
使用限制
购买指南
云产品监控
应用性能监控
终端性能监控
前端性能监控
云拨测
Prometheus 监控服务
Grafana 服务
事件总线
云压测
快速入门
监控概览
实例分组
云产品监控
应用性能监控
云拨测
云压测
Prometheus 监控服务
Grafana 服务
创建 Dashboard
事件总线
告警服务
云产品监控
云产品监控指标
控制台操作指南
云服务器监控组件
云产品监控对接 Grafana
故障处理
实践教程
应用性能监控
应用性能监控简介
接入指南
控制台操作指南
实践教程
参考信息
常见问题
终端性能监控
终端性能监控概述
控制台操作指南
接入指南
实践教程
前端性能监控
前端性能监控简介
控制台操作指南
接入指南
常见问题
云拨测
产品简介
控制台操作指南
常见问题
云压测
云压测概述
控制台操作指南
实践教程
JavaScript API 列表
常见问题
Prometheus 监控
Prometheus 监控简介
接入指南
控制台操作指南
实践教程
Terraform
常见问题
Grafana 服务
产品简介
控制台操作指南
Grafana 平台常用功能指引
常见问题
Dashboard
什么是 Dashboard
控制台操作指南
告警管理
控制台操作指南
故障处理
常见问题
事件总线
事件总线简介
控制台操作指南
实践教程
常见问题
报表管理
常见问题
腾讯云可观测平台常见问题
告警服务相关
一般性问题
监控图表相关
云服务器监控组件相关
动态阈值告警相关
云监控对接 Grafana 相关
文档阅读指南
相关协议
应用性能监控服务等级协议
APM 隐私协议
APM 数据处理和安全协议
前端性能监控服务等级协议
终端性能监控服务等级协议
云拨测服务等级协议
Prometheus 监控服务服务等级协议
Grafana 服务服务等级协议
云压测服务等级协议
云压测使用限制
Cloud Monitor Service Level Agreement
API 文档
History
Introduction
API Category
Making API Requests
Monitoring Data Query APIs
Alarm APIs
Legacy Alert APIs
Notification Template APIs
TMP APIs
Grafana Service APIs
Event Center APIs
TencentCloud Managed Service for Prometheus APIs
Monitoring APIs
Data Types
Error Codes
词汇表

在线服务

PDF
聚焦模式
字号
最后更新时间: 2025-05-22 16:59:39

命名空间

Namespace = QCE/TI_MODEL

监控指标

指标英文名
指标中文名
说明
单位
维度
统计规则
[period, statType]
Apicallerrortotal
接口失败调用量
接口失败调用量
Count
Source
SubUin
ServiceGroupId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
Apicalllimittotal
被限制请求总数
接口调用被限制总量
Count
Source
SubUin
ServiceGroupId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
Apicallsuccesstotal
调用成功总量
接口调用成功总量
Count
SubUin
ServiceGroupId
Source
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
Apicalltotal
接口调用总量
接口调用总量
Count
Source
SubUin
ServiceGroupId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
Apiresponsetime
平均响应时间
平均响应时间
ms
ServiceGroupId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsClientDataReadBandwidth
turocfs 单节点服务端读带宽
turocfs 单节点服务端读带宽
KBytes/s
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsClientDataWriteBandwidth
turocfs 单节点服务端写带宽
turocfs 单节点服务端写带宽
KBytes/s
Source
SubUin
InstanceId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsDataReadIoBytes
cfs 服务端读带宽
cfs 服务端读带宽
KBytes/s
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsDataReadIoLatency
cfs 读延迟
cfs 读延迟
ms
Source
SubUin
InstanceId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsDataWriteIoBytes
cfs 服务端写带宽
cfs 服务端写带宽
KBytes/s
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsDataWriteIoLatency
cfs 写延迟
cfs 写延迟
ms
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
CfsStrageUsageGb
cfs 存储数据容量
cfs 存储数据容量
GBytes
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Cpuutil
CPU 利用率
CPU 利用率
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskIoUtil
磁盘 ioutil
磁盘 ioutil
%
Source
SubUin
InstanceId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskIoWait
磁盘 iowait
磁盘 iowait
%
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskReadByte
磁盘读取带宽
磁盘读取带宽
MBytes/s
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskReadIops
磁盘读取 iops
磁盘读取 iops
Count
SubUin
InstanceId
Source
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskUsageRadio
系统盘分区利用率
系统盘分区利用率
%
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskWriteByte
磁盘写入带宽
磁盘写入带宽
MBytes/s
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DiskWriteIops
磁盘写入 iops
磁盘写入 iops
Count
Source
SubUin
InstanceId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Gpumemutil
GPU 显存利用率
GPU 显存利用率
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Gpuutil
GPU 利用率
GPU 利用率
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instancecpuutil
CPU 利用率
CPU 利用率
%
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instancegpumemutil
GPU 显存利用率
GPU 显存利用率
%
SubUin
InstanceId
Source
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instancegpuutil
GPU 利用率
GPU 利用率
%
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instancehttpqps
http 调用 qps
实例的 http 每秒请求数
Count/s
InstanceId
Source
SubUin
[ 10s, max ]
[ 60s, max ]
[ 300s, max ]
[ 3600s, max ]
[ 86400s, max ]
Instancehttpqpslimit
http 调用被限制 qps
实例的 http 每秒被限制请求数
Count/s
Source
SubUin
InstanceId
[ 10s, max ]
[ 60s, max ]
[ 300s, max ]
[ 3600s, max ]
[ 86400s, max ]
Instancememutil
内存利用率
内存利用率
%
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instancememvalue
内存使用量
内存使用量
MBytes
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instancenetworkibytes
网络入流量
网络入流量
MBytes
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Instanceready
实例运行数量
实例运行数量
Count
AppId
Source
SubUin
TaskId
[ 10s, last ]
[ 60s, last ]
[ 300s, last ]
[ 3600s, last ]
[ 86400s, last ]
InstanceTiemsCurrentRequests
并发请求数
并发请求数
Count
InstanceId
Source
SubUin
[ 10s, max ]
[ 60s, max ]
[ 300s, max ]
[ 3600s, max ]
[ 86400s, max ]
Instancetotal
实例数量
实例数量
Count
AppId
Source
SubUin
TaskId
[ 10s, last ]
[ 60s, last ]
[ 300s, last ]
[ 3600s, last ]
[ 86400s, last ]
Memutil
内存利用率
内存利用率
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Memvalue
内存用量
内存用量
MBytes
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Networkreceivebytes
网络入流量
网络入流量
MBytes
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsClientDataReadBandwidth
turocfs 单节点服务端读带宽
turocfs 单节点服务端读带宽
KBytes/s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsClientDataWriteBandwidth
turocfs 单节点服务端写带宽
turocfs 单节点服务端写带宽
KBytes/s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsDataReadIoBytes
cfs 服务端读带宽
cfs 服务端读带宽
KBytes/s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsDataReadIoLatency
cfs 读延迟
cfs 读延迟
ms
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsDataWriteIoBytes
cfs 服务端写带宽
cfs 服务端写带宽
KBytes/s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsDataWriteIoLatency
cfs 写延迟
cfs 写延迟
ms
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceCfsStrageUsageGb
cfs 存储数据容量
cfs 存储数据容量
GBytes
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskIoUtil
磁盘 ioutil
磁盘 ioutil
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskIoWait
磁盘 iowait
磁盘 iowait
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskReadByte
磁盘读取带宽
磁盘读取带宽
MBytes/s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskReadIops
磁盘读取 iops
磁盘读取 iops
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskUsageRadio
系统盘分区利用率
系统盘分区利用率
%
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskWriteByte
磁盘写入带宽
磁盘写入带宽
MBytes/s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceDiskWriteIops
磁盘写入 iops
磁盘写入 iops
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Servicehttpqps
http 调用 qps
服务的 http 每秒请求数
Count/s
AppId
Source
SubUin
TaskId
[ 10s, max ]
[ 60s, max ]
[ 300s, max ]
[ 3600s, max ]
[ 86400s, max ]
Servicehttpqpslimit
http 调用被限制qps
服务的 http 每秒被限制请求数
Count/s
AppId
Source
SubUin
TaskId
[ 10s, max ]
[ 60s, max ]
[ 300s, max ]
[ 3600s, max ]
[ 86400s, max ]
ServiceTiemsCurrentRequests
并发请求数
并发请求数
Count
AppId
Source
SubUin
TaskId
[ 10s, max ]
[ 60s, max ]
[ 300s, max ]
[ 3600s, max ]
[ 86400s, max ]
ServiceGpuMemValue
显存使用量
显存使用量
MBytes
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsTokenThroughput
每分钟处理 Token 数
每分钟处理 Token 数
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsTokenThroughputInput
每分钟处理 Token 数,仅输入
每分钟处理输入 Token 数
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsTokenThroughputOutput
每分钟处理 Token 数,仅输出
每分钟处理生成Token 数
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsFirstTokenLatency
首 Token 时延
首 Token 时延
s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsNonFirstTokenLatency
非首 Token 时延
非首 Token 时延
s
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsProcessingRequestCount
处理中请求数
处理中请求数
Count
AppId
Source
SubUin
TaskId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
ServiceEmsQueuingRequestCount
排队中请求数
排队中请求数
Count
AppId
Source
SubUin
TaskId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
ServiceEmsTotalProcessedTokens
已处理 Token 总量
已处理 Token 总量
Count
AppId
Source
SubUin
TaskId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
ServiceEmsTotalProcessedTokensInput
已处理 Token 总量,仅输入
输入token 总量
Count
AppId
Source
SubUin
TaskId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
ServiceEmsTotalProcessedTokensOutput
已处理 Token 总量,仅输出
生成 Token 总量
Count
AppId
Source
SubUin
TaskId
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
ServiceEmsAverageLengthInput
输入平均长度(Token)
输入平均长度(Token)
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
ServiceEmsAverageLengthOutput
输出平均长度(Token)
输出平均长度(Token)
Count
AppId
Source
SubUin
TaskId
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuMemValue
显存使用量
显存使用量
MBytes
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsTokenThroughput
每分钟处理 Token 数
每分钟处理 Token 数
Count
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsTokenThroughputInput
每分钟处理 Token 数,仅输入
每分钟处理 Token 数,仅输入
Count
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsTokenThroughputOutput
每分钟处理 Token 数,仅输出
每分钟处理 Token 数,仅输出
Count
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsFirstTokenLatency
首 Token 时延
首 Token 时延
s
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsNonFirstTokenLatency
非首 Token 时延
非首 Token 时延
s
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsProcessingRequestCount
处理中请求数
处理中请求数
Count
AppId
InstanceId
Source
SubUin
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
EmsQueuingRequestCount
排队中请求数
排队中请求数
Count
AppId
InstanceId
Source
SubUin
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
EmsTotalProcessedTokens
已处理 Token 总量
已处理 Token 总量
Count
AppId
InstanceId
Source
SubUin
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
EmsTotalProcessedTokensInput
已处理 Token 总量,仅输入
已处理 Token 总量,仅输入
Count
AppId
InstanceId
Source
SubUin
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
EmsTotalProcessedTokensOutput
已处理 Token 总量,仅输出
已处理 Token 总量,仅输出
Count
AppId
InstanceId
Source
SubUin
[ 10s, sum ]
[ 60s, sum ]
[ 300s, sum ]
[ 3600s, sum ]
[ 86400s, sum ]
EmsAverageLengthInput
输入平均长度(Token)
输入平均长度(Token)
Count
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
EmsAverageLengthOutput
输出平均长度 (Token)
输出平均长度 (Token)
Count
AppId
InstanceId
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Fp16EngineActivity
FP16活跃时间比
FP16活跃时间比
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Fp32EngineActivity
FP32活跃时间比
FP32活跃时间比
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
Fp64EngineActivity
FP64活跃时间比
FP64活跃时间比
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
NvlinkBandwidth
nvlink 传输速率
nvlink 传输速率
Bytes/s
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
PcieBandwidth
PCIe 总线传输速率
PCle 总线传输速率
Bytes/s
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
SmActivity
SM 活跃状态时间比
SM 活跃状态时间比
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
TensorActivity
Tensor 活跃状态时间比
Tensor 活跃状态时间比
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DcgmFiDevFbUsed
显存使用量
显存使用量
MBytes
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DcgmFiDevGpuUtil
GPU 使用率
GPU 使用率
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
DcgmFiDevMemCopyUtil
GPU 显存使用率
GPU 显存使用率
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuDecUtil
GPU 解码使用率
GPU 解码使用率
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuEncUtil
GPU 编码器使用率
GPU 编码器使用率
%
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuMemoryClock
GPU 显存频率
GPU 显存频率
S
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuNvlinkRxMb
nvlink 接收数据量
nvlink 接收数据量
Mbps
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuNvlinkTxMb
nvlink 发送数据量
nvlink 发送数据量
Mbps
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuPcieRxMb
pcie 接收数据量
pcie 接收数据量
Mbps
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuPcieTxMb
pcie 发送数据量
pcie 发送数据量
Mbps
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]
GpuSmClock
SM 时钟频率
SM 时钟频率
S
Appld
InstanceGpuNum
Source
SubUin
[ 10s, avg ]
[ 60s, avg ]
[ 300s, avg ]
[ 3600s, avg ]
[ 86400s, avg ]

各维度对应参数总览

参数名称
维度名称
维度解释
格式
Instances.N.Dimensions.0.Name
AppId
账号基本信息 APPID 的维度名称
输入 String 类型维度名称:AppId(SDK 调用时会自动获取,无需传参)
Instances.N.Dimensions.0.Value
AppId
账号基本信息 APPID
输入 ID,例如:1231231231(SDK 调用时会自动获取,无需传参)
Instances.N.Dimensions.1.Name
SubUin
子账号 ID 的维度名称
输入 String 类型维度名称:SubUin
Instances.N.Dimensions.1.Value
SubUin
子账号 ID
输入 ID,例如:100001231231
Instances.N.Dimensions.2.Name
Source
创建来源的维度名称
输入 String 类型维度名称:Source
Instances.N.Dimensions.2.Value
Source
创建来源
输入来源,例如:normal(默认请使用此值)
Instances.N.Dimensions.3.Name
InstanceId
在线服务实例 ID 的维度名称
输入 String 类型维度名称:InstanceId
Instances.N.Dimensions.3.Value
InstanceId
在线服务实例 ID
输入具体实例 ID,例如: ms-2tgmq6ms-1-5f96656956-272wq
Instances.N.Dimensions.4.Name
TaskId
在线服务 ID 的维度名称
输入 String 类型维度名称:TaskId
Instances.N.Dimensions.4.Value
TaskId
在线服务 ID
输入 ID,例如:ms-2tgmq6ms-1
Instances.N.Dimensions.5.Name
ServiceGroupId
在线服务服务组 ID 的维度名称
输入 String 类型维度名称:ServiceGroupId
Instances.N.Dimensions.5.Value
ServiceGroupId
在线服务服务组 ID
输入 ID,例如:ms-2tgmq6ms
Instances.N.Dimensions.6.Name
InstanceGpuNum
在线服务实例使用的 GPU 卡号(仅限 GPU 整卡任务)的维度名称
输入 String 类型维度名称:InstanceGpuNum
Instances.N.Dimensions.6.Value
InstanceGpuNum
在线服务实例使用的GPU卡号(仅限 GPU 整卡任务)
实例 ID 拼接 GPU 卡号/avg,输入具体实例 ID,例如:ms-2tgmq6ms-1-5f96656956-272wq-0

入参说明

查询在线服务指标监控数据,取值如下:
&Namespace=QCE/TI_MODEL
&Instances.N.Dimensions.0.Name=AppId
&Instances.N.Dimensions.0.Value=具体的账号 ID
&Instances.N.Dimensions.1.Name=SubUin
&Instances.N.Dimensions.1.Value=具体的子账号 ID
&Instances.N.Dimensions.2.Name=Source
&Instances.N.Dimensions.2.Value=具体的创建来源
&Instances.N.Dimensions.3.Name=InstanceId
&Instances.N.Dimensions.3.Value=在线服务实例 ID
&Instances.N.Dimensions.4.Name=TaskId
&Instances.N.Dimensions.4.Value=具体的在线服务 ID
&Instances.N.Dimensions.5.Name=ServiceGroupId
&Instances.N.Dimensions.5.Value=具体的在线服务服务组 ID
&Instances.N.Dimensions.6.Name=InstanceGpuNum
&Instances.N.Dimensions.6.Value=在线服务实例使用的 GPU 卡号

帮助和支持

本页内容是否解决了您的问题?

填写满意度调查问卷,共创更好文档体验。

文档反馈