tencent cloud

Elastic MapReduce

Release Notes and Announcements
Release Notes
Announcements
Security Announcements
Product Introduction
Overview
Strengths
Architecture
Features
Use Cases
Constraints and Limits
Technical Support Scope
Product release
Purchase Guide
EMR on CVM Billing Instructions
EMR on TKE Billing Instructions
EMR Serverless HBase Billing Instructions
Getting Started
EMR on CVM Quick Start
EMR on TKE Quick Start
EMR on CVM Operation Guide
Planning Cluster
Administrative rights
Configuring Cluster
Managing Cluster
Managing Service
Monitoring and Alarms
TCInsight
EMR on TKE Operation Guide
Introduction to EMR on TKE
Configuring Cluster
Cluster Management
Service Management
Monitoring and Ops
Application Analysis
EMR Serverless HBase Operation Guide
EMR Serverless HBase Product Introduction
Quotas and Limits
Planning an Instance
Managing an Instance
Monitoring and Alarms
Development Guide
EMR Development Guide
Hadoop Development Guide
Spark Development Guide
Hbase Development Guide
Phoenix on Hbase Development Guide
Hive Development Guide
Presto Development Guide
Sqoop Development Guide
Hue Development Guide
Oozie Development Guide
Flume Development Guide
Kerberos Development Guide
Knox Development Guide
Alluxio Development Guide
Kylin Development Guide
Livy Development Guide
Kyuubi Development Guide
Zeppelin Development Guide
Hudi Development Guide
Superset Development Guide
Impala Development Guide
Druid Development Guide
TensorFlow Development Guide
Kudu Development Guide
Ranger Development Guide
Kafka Development Guide
Iceberg Development Guide
StarRocks Development Guide
Flink Development Guide
JupyterLab Development Guide
MLflow Development Guide
Practical Tutorial
Practice of EMR on CVM Ops
Data Migration
Practical Tutorial on Custom Scaling
API Documentation
History
Introduction
API Category
Cluster Resource Management APIs
Cluster Services APIs
User Management APIs
Data Inquiry APIs
Scaling APIs
Configuration APIs
Other APIs
Serverless HBase APIs
YARN Resource Scheduling APIs
Making API Requests
Data Types
Error Codes
FAQs
EMR on CVM
Service Level Agreement
Contact Us

Node Monitoring Metrics

PDF
Modo Foco
Tamanho da Fonte
Última atualização: 2026-01-13 17:37:54

Node - CPU

Title
Metric
Unit
Description
CPU utilization
idle
%
Percentage of CPU idle time
irq
%
Percentage of interrupts
nice
%
Percentage of CPU utilization under nice priority
steal
%
Percentage of wait time by virtual CPUs for physical CPUs
softirq
%
Percentage of CPU soft interrupts
guest
%
Percentage of time spent running virtual processors
system
%
CPU utilization in kernel mode
user
%
CPU utilization in user mode
iowait
%
Percentage of CPU idleness due to process I/O waits
Load
1m
%
1-minute load
5m
%
5-minute load
15m
%
15-minute load
Cores
cpu_count
-
Number of CPU cores
Service CPU usage percentage
${service}
%
Component service CPU usage/Node CPU usage percentage

Node - memory

Title
Metric
Unit
Description
Memory utilization
MemTotal
GB
Total memory size
MemFree
GB
Total free memory size
MemAvailable
GB
Total available memory size
Buffers
GB
Total memory size used by buffers
Cached
GB
Total memory size used by file cache
SwapCached
GB
Total swap memory size by anonymous page writes
SwapFree
GB
Total available swap size
AnonPages
GB
Total unmapped memory size
SwapTotal
GB
Total swap size
Dirty
GB
Total memory size to write to disk
Writeback
GB
Total memory size being written back to disk
HardwareCorrupted
GB
Total unavailable memory size due to memory hardware failure
Shmem
GB
Total shared memory size
MemUsed
GB
Total used memory size
Percentage of used memory
available_percent
%
Percentage of available memory size out of total memory
used_percent
%
Percentage of used memory size out of total memory
Service memory usage percentage (RSS-XMX)
${service}
%
Component Service (RSS-XMX)/XMX Memory Percentage
Service Memory Usage
${service}
MBytes
Component Service RSS Memory Usage
Service Memory Usage Percentage
${service}
%
Component Service RSS Memory Usage Percentage Node Memory

Node - disk

Title
Metric
Unit
Description
Device read/write rate
Read
MB/s
Data read per second
Write
MB/s
Data written per second
Device IOPS
all
count/s
Number of I/O operations in progress on current device
I/O operation time
Read
ms
Average wait time per device I/O read operation
Write
ms
Average wait time per device I/O write operation
IO
ms
Average processing time per I/O request
Device read/write QPS
Read
count/s
Read QPS
Write
count/s
Write QPS
Merge-Read
count/s
Merged read QPS
Merge-Write
count/s
Merged write QPS
I/O device utilization
all
%
Disk busyness
Disk space
Free
GB
Free disk storage space
Available
GB
Available disk storage space (for unprivileged users)
Total
GB
Total disk storage space
Disk space utilization
Used
%
Disk space utilization
INODES
Free
-
Number of remaining disk inodes
Total
-
Total number of disk inodes
Inode utilization
Used
%
Disk inode utilization
Single disk space utilization
Used(*) (* is the disk name)
%
Single disk space utilization
Single IO device usage
* (* is the disk name)
%
Single IO device usage

Node - file handle

Title
Metric
Unit
Description
File handle
allocated
-
Number of allocated file handles
maximum
-
Maximum number of file handles
System interrupt
intr_total
count/s
Number of system interrupts
System context switch
context_switches_total
count/s
Number of system context switches
System process
forks_total
-
Number of new system processes
procs_running
-
Number of running system processes
procs_blocked
-
Number of blocked system processes
procs_total
-
Total number of system processes
thrds_total
-
Total number of system threads
Agent version
AgentVersion
version
Agent version

Node - network

Title
Metric
Unit
Description
TCP LISTEN exception
ListenDrops
count/s
Number of incoming connections (SYN packets) dropped for any reason
ListenOverflows
count/s
Number of occurrences where the upper limit of the Accept queue is exceeded after the last step of three-way handshake is completed
TCPSyncookies
SyncookiesFailed
count/s
Number of packets received with invalid SYN Cookie information
SyncookiesRecv
count/s
Number of packets received with valid SYN Cookie information
SyncookiesSent
count/s
Number of SYN/ACK packets sent through SYN Cookie
TCP connection abort exception
TCPAbortOnTimeout
count/s
Number of connections closed because the attempts of retransmissions of various timers (RTO/PTO/keepalive) exceed the upper limit
TCPAbortOnData
count/s
Number of sockets closed due to unknown data received
TCPAbortOnClose
count/s
Number of sockets closed when the user-mode program has data in the buffer
TCPAbortOnMemory
count/s
Number of connections closed due to memory issues
TCPAbortOnLinger
count/s
Number of connections suspended in lingering status after being closed
TCPAbortFailed
count/s
Number of failed attempts to close connection
TCP connection establishment
ActiveOpens
count/s
Number of actively established TCP connections
CurrEstab
count/s
Number of TCP connections currently established
PassiveOpens
count/s
Number of passively established TCP connections
AttemptFails
count/s
Number of connection establishment failures
EstabResets
count/s
Number of reset connections
TCP packet
InSegs
count/s
Number of received packets, including erroneous ones
OutSegs
count/s
Number of sent packets
RetransSegs
count/s
Number of received TCP packets
InErrs
count/s
Number of retransmitted packets
OutRsts
count/s
Number of sent RST packets
TCP retransmission rate
RetransSegsRate
%
Retransmission rate at TCP layer
ResetRate
%
RESET sending frequency
InErrRate
%
Percentage of erroneous packets
TCP TIME-WAIT
TW
count/s
Number of sockets ending TIME_WAIT status after normal timeout
TWKilled
count/s
Number of sockets ending TIME_WAIT status through tcp_tw_recycle mechanism
TCPTimeWaitOverflow
count/s
Number of TIME_WAIT sockets unable to be allocated due to limit exceeding
TWRecycled
count/s
Number of sockets ending TIME_WAIT status through tcp_tw_reuse mechanism
TCP RTO
TCPTimeouts
count/s
Number of first RTO timer timeouts
TCPSpuriousRTOs
count/s
Number of spurious timeouts detected through F-RTO mechanism
TCPLossProbes
count/s
Number of Tail Loss Probe (TLP) packets sent due to Probe Timeout (PTO)
TCPLossProbeRecovery
count/s
Number of lost packets just repaired by TLP probes
TCPRenoRecoveryFail
count/s
Number of connections that enter the Recovery phase and then undergo RTO (SACK option not supported by the opposite)
TCPSackRecoveryFail
count/s
Number of connections that enter the Recovery phase and then undergo RTO (SACK option supported by the opposite)
TCPRenoFailures
count/s
Number of failures that enter the TCP_CA_Disorder phase and then undergo RTO (SACK option not supported by the opposite)
TCPSackFailures
count/s
Number of failures that enter the TCP_CA_Disorder phase and then undergo RTO (SACK option supported by the opposite)
TCPLossFailures
count/s
Number of connections that enter the TCP_CA_Loss phase and then undergo RTO timeout
TCP RTO constant
RtoAlgorithm
1/s
Number of delayed algorithms for forwarding unanswered objects
RtoMax
1
Maximum number of retransmissions due to TCP latency
RtoMin
1
Minimum number of retransmissions due to TCP latency
TCP retransmission
TCPLostRetransmit
count/s
Number of SKB retransmissions due to loss
TCPFastRetrans
count/s
Number of fast SKB retransmissions
TCPForwardRetrans
count/s
Number of regular SKB retransmissions
TCPSlowStartRetrans
count/s
Number of SKB retransmissions with successful slow start
TCPRetransFail
count/s
Number of failed retransmission attempts
UDP datagram
OutDatagrams
count/s
Number of sent UDP datagrams
InDatagrams
count/s
Number of received UDP datagrams
ENI data receiving/sending rate
eth0-receive_bytes
MB/s
Volume of data received by ENI
eth0-transmit_bytes
MB/s
Volume of data sent by ENI
ENI packet receiving/sending rate
eth0-receive_drop
count/s
Volume of data received and then dropped by ENI
eth0-receive_errs
count/s
Volume of data failed to be received by ENI
eth0-transmit_drop
count/s
Volume of data sent and then dropped by ENI
eth0-transmit_errs
count/s
Volume of data failed to be sent by ENI
eth0-transmit_packets
count/s
Number of packets sent by ENI
TCP socket
TCP_inuse
count
Number of TCP sockets in use (listening)
TCP_orphan
count
Number of TCP connections waiting to be closed
TCP_tw
count
Number of TCP sockets to be terminated
TCP_alloc
count
Number of TCP sockets allocated (established, sk_buff obtained)
sockets_used
count
Total number of used sockets
TCP4 link status
ESTABLISHED
count
Number of TCP connections in Established status
SYN-SENT
count
Number of TCP connections in SYN-SENT status
SYN-RECV
count
Number of TCP connections in SYN-RECV status
FIN-WAIT1
count
Number of TCP connections in FIN-WAIT1 status
FIN-WAIT2
count
Number of TCP connections in FIN-WAIT2 status
TIME-WAIT
count
Number of TCP connections in TIME-WAIT status
CLOSE
count
Number of TCP connections in CLOSE status
CLOSE-WAIT
count
Number of TCP connections in CLOSE-WAIT status
LAST-ACK
count
Number of TCP connections in LAST-ACK status
LISTEN
count
Number of TCP connections in LISTEN status
CLOSING
count
Number of TCP connections in CLOSING status
TCP6 link status
ESTABLISHED
count
TCP6 link status in ESTABLISHED status
FIN-WAIT2
count
TCP6 link status in FIN-WAIT2 status
FIN-WAIT1
count
TCP6 link status in FIN-WAIT1 status
CLOSE-WAIT
count
TCP6 link status in CLOSE-WAIT status
LISTEN
count
TCP6 link status in LISTEN status
LAST-ACK
count
TCP6 link status in LAST-ACK status
CLOSING
count
TCP6 link status in CLOSING status
CLOSE
count
TCP6 link status in CLOSE status
SYN-SENT
count
TCP6 link status in SYN-SENT status
SYN-RECV
count
TCP6 link status in SYN-RECV status
TIME-WAIT
count
TCP6 link status in TIME-WAIT status

Node - event

Title
Metric
Unit
Description
CPU utilization
used
%
1 - (percentage of CPU idle time)
15-minute CPU load
15m
-
15-minute load
1-minute CPU load
1m
-
1-minute load
5-minute CPU load
5m
-
5-minute load
Disk IOPS
all
-
Number of I/O operations in progress on current device
Disk I/O operation time
IO
-
Average processing time per I/O request
Disk space utilization
Used
-
Disk space utilization
Disk I/O device utilization
all
-
Disk busyness
Memory utilization
used_percent
-
Percentage of used memory size out of total memory
Outbound network traffic rate
*-transmit_bytes
-
Volume of data sent by ENI
Inbound network traffic rate
*-receive_bytes
-
Volume of data received by ENI
TCP connections
CurrEstab
-
Number of TCP connections currently established

Ajuda e Suporte

Esta página foi útil?

comentários