Title | Metric | Unit | Description |
Cluster storage capacity | CapacityTotal | GB | Total cluster storage capacity |
| CapacityUsed | GB | Used cluster storage capacity |
| CapacityRemaining | GB | Remaining cluster storage capacity |
| CapacityUsedNonDFS | GB | Non-HDFS used cluster capacity |
Cluster load | TotalLoad | 1 | Current connections |
Total files in cluster | FilesTotal | - | Total number of files |
Blocks | BlocksTotal | - | Total number of blocks |
| PendingReplicationBlocks | - | Number of blocks waiting to be backed up |
| UnderReplicatedBlocks | - | Number of blocks with insufficient replicas |
| CorruptBlocks | - | Number of corrupted blocks |
| ScheduledReplicationBlocks | - | Number of blocks arranged for backup |
| PendingDeletionBlocks | - | Number of blocks waiting to be deleted |
| ExcessBlocks | - | Number of excess blocks |
| PostponedMisreplicatedBlocks | - | Number of abnormal blocks postponed to be processed |
Block capacity | BlockCapacity | - | Block capacity |
Cluster data node | NumLiveDataNodes | - | Number of live data nodes |
| NumDeadDataNodes | - | Number of data nodes marked as dead |
| NumDecomLiveDataNodes | - | Number of decommissioned live nodes |
| NumDecomDeadDataNodes | - | Number of decommissioned dead nodes |
| NumDecommissioningDataNodes | - | Number of decommissioning nodes |
| NumStaleDataNodes | - | Number of DataNodes marked as stale |
HDFS storage space utilization | CapacityUsedRate | - | HDFS cluster storage space utilization |
Snapshots | Snapshots | - | Number of snapshots |
Disk failure | VolumeFailuresTotal | - | Total number of volume failures across all DataNodes |
Number of blocks | MissingReplicatedBlocks | count | Number of lost replica blocks |
Slow HDFS DataNode | SlowDataNode | - | Slow DataNode IP address |
Slow HDFS Disk | SlowDisks | - | The delay of a DataNode's read/write/metadata operation on a mount point |
Title | Metric | Unit | Description |
JVM memory | MemNonHeapUsedM | MB | Size of NonHeapMemory currently used by JVM |
| MemNonHeapCommittedM | MB | Size of NonHeapCommittedM configured by JVM |
| MemHeapUsedM | MB | Size of HeapMemory currently used by JVM |
| MemHeapCommittedM | MB | Committed size of JVM HeapMemory |
| MemHeapMaxM | MB | Size of HeapMemory configured by JVM |
| MemMaxM | MB | Maximum size of memory available to JVM runtime |
Heap memory utilization | MemHeapUsedRate | % | The percentage of the number of HeapMemory currently used by the JVM to the number of HeapMemory configured by the JVM |
Block reporting latency | BlockReportAvgTime | count/s | Average latency of processing DataNode blocks per second |
JVM threads | ThreadsNew | - | Number of threads in NEW status |
| ThreadsRunnable | - | Number of threads in RUNNABLE status |
| ThreadsBlocked | - | Number of threads in BLOCKED status |
| ThreadsWaiting | - | Number of threads in WAITING status |
| ThreadsTimedWaiting | - | Number of threads in TIMED WAITING status |
| ThreadsTerminated | - | Number of threads in Terminated status |
JVM logs | LogFatal | - | Number of FATAL-level logs |
| LogError | - | Number of ERROR-level logs |
| LogWarn | - | Number of WARN-level logs |
| LogInfo | - | Number of INFO-level logs |
GC count | YGC | - | Young GC count |
| FGC | - | Full GC count |
GC time | FGCT | s | Full GC time |
| GCT | s | Garbage collection time |
| YGCT | s | Young GC time |
Memory zone proportion | S0 | % | Percentage of used Survivor 0 memory |
| S1 | % | Percentage of used Survivor 1 memory |
| E | % | Percentage of used Eden memory |
| O | % | Percentage of used Old memory |
| M | % | Percentage of used Metaspace memory |
| CCS | % | Percentage of used compressed class space memory |
Storages marked as content stale | NumStaleStorages | - | Number of DataNode storages marked as content stale |
Pending block-related messages for later processing on the standby NameNode | PendingDataNodeMessageCount | count/s | Number of DataNode requests queued on the standby NameNode |
Missing blocks | NumberOfMissingBlocks | - | Number of missing blocks |
| NumberOfMissingBlocksWithReplicationFactorOne | - | Number of missing blocks (rf = 1) |
Snapshot operation | AllowSnapshotOps | count/s | Number of AllowSnapshot operations executed per second |
| DisallowSnapshotOps | count/s | Number of DisallowSnapshot operations executed per second |
| CreateSnapshotOps | count/s | Number of CreateSnapshot operations executed per second |
| DeleteSnapshotOps | count/s | Number of DeleteSnapshot operations executed per second |
| ListSnapshottableDirOps | count/s | Number of ListSnapshottableDir operations executed per second |
| SnapshotDiffReportOps | count/s | Number of SnapshotDiffReportOps operations executed per second |
| RenameSnapshotOps | count/s | Number of RenameSnapshotOps operations executed per second |
File operation | CreateFileOps | count/s | Number of CreateFile operations executed per second |
| GetListingOps | count/s | Number of GetListing operations executed per second |
| TotalFileOps | count/s | Number of TotalFileOps operations executed per second |
| DeleteFileOps | count/s | Number of DeleteFile operations executed per second |
| FileInfoOps | count/s | Number of FileInfo operations executed per second |
| GetAdditionalDatanodeOps | count/s | Number of GetAdditionalDatanode operations executed per second |
| CreateSymlinkOps | count/s | Number of CreateSymlink operations executed per second |
| GetLinkTargetOps | count/s | Number of GetLinkTarget operations executed per second |
| FilesInGetListingOps | count/s | Number of FilesInGetListing operations executed per second |
File statistics | FilesDeleted | count | Number of deleted or renamed files and folders |
| FilesCreated | count | Number of created files and folders |
| FilesAppended | count | Number of appended files |
Number of files | FilesTotal | count | Number of files |
Transaction operation | TransactionsNumOps | count/s | Number of journal transaction operations processed per second |
| TransactionsBatchedInSync | count/s | Number of journal transaction operations batch processed per second |
Image operation | GetEditNumOps | count/s | Number of GetEditNumOps operations executed per second |
| GetImageNumOps | count/s | Number of GetImageNumOps operations executed per second |
| PutImageNumOps | count/s | Number of PutImageNumOps operations executed per second |
Sync operation | SyncsNumOps | count/s | Number of journal sync operations processed per second |
Block operation | BlockReceivedAndDeletedOps | count/s | Number of BlockReceivedAndDeletedOps operations executed per second |
| BlockOpsQueued | count/s | Number of processed DataNode block reporting operations |
Cache reporting | CacheReportNumOps | count/s | Number of CacheReport operations processed per second |
Block reporting | BlockReportNumQps | count/s | Number of DataNode block reporting operations processed per second |
Sync operation latency | SyncsAvgTime | ms | Average latency of processing journal sync operations |
Cache reporting latency | CacheReportAvgTime | ms | Average latency of cache reporting |
Image operation latency | GetEditAvgTime | ms | Average latency of reading Edit files |
| GetImageAvgTime | ms | Average latency of reading image files |
| PutImageAvgTime | ms | Average latency of writing image files |
Transaction operation latency | TransactionsAvgTime | ms | Average latency of processing journal transaction operations |
Start time | StartTime | ms | Process start time |
Active/Standby status | State | 1 | NameNode HA status |
Active/Standby status | State | 1: Active 0: Standby | NameNode active/standby status |
Threads | PeakThreadCount | - | Peak number of threads |
| ThreadCount | - | Number of threads |
| DaemonThreadCount | - | Number of backend threads |
Transactions since the last checkpoint | SinceLastCheckpoint | count | Total number of transactions since the last checkpoint |
Checkpoint time | LastCheckpoint | time | Time since the last checkpoint |
Length of the queue waiting for file locks | LockQueueLength | count | LockQueueLength - length of the queue waiting for file locks |
Number of BLOCKS | BlocksTotal | count | Number of blocks |
| PendingReplicationBlocks | count | Number of blocks waiting to be backed up |
| UnderReplicatedBlocks | count | Number of blocks with insufficient replicas |
| CorruptBlocks | count | Number of bad blocks |
| ScheduledReplicationBlocks | count | Number of blocks arranged to be backed up |
| PendingDeletionBlocks | count | Number of blocks waiting to be deleted |
| ExcessBlocks | count | Number of redundant blocks |
| PostponedMisreplicatedBlocks | count | Number of abnormal blocks postponed for processing |
Number of files being written | NumFilesUnderConstruction | count | Number of files being written |
Data traffic QPS (Client RPC) | ReceivedBytes | Bytes/s | Data reception rate |
| SentBytes | Bytes/s | Data transmission rate |
QPS(Client RPC) | RpcQueueTimeNumOps | count/s | RPC call rate |
Request processing delay (Client RPC) | RpcQueueTimeAvgTime | ms | RPC average latency time |
| RpcProcessingTimeAvgTime | ms | RPC request average processing time |
Verification and authorization (Client RPC) | RpcAuthenticationFailures | count | RPC verification failure count |
| RpcAuthenticationSuccesses | count | RPC verification success count |
| RpcAuthorizationFailures | count | RPC authorization failure count |
| RpcAuthorizationSuccesses | count | RPC authorization success count |
Current connections (Client RPC) | NumOpenConnections | count | Current number of connections |
RPC processing queue length (Client RPC) | CallQueueLength | count | Current RPC processing queue length |
Data traffic (Service RPC) | ReceivedBytes | Bytes/s | Service RPC data reception rate |
| SentBytes | Bytes/s | Service RPC data transmission rate |
QPS(Service RPC) | RpcQueueTimeNumOps | count/s | Service RPC call rate |
Request processing latency (Service RPC) | RpcQueueTimeAvgTime | ms | Average latency time of Service RPC |
| RpcProcessingTimeAvgTime | ms | Service RPC request average processing time |
Verification and authorization (Service RPC) | RpcAuthenticationFailures | count | Service RPC verification failure count |
| RpcAuthenticationSuccesses | count | Service RPC verification success count |
| RpcAuthorizationFailures | count | Service RPC authorization failure count |
| RpcAuthorizationSuccesses | count | Service RPC authorization success count |
Current connections (Service RPC) | NumOpenConnections | count | Current number of Service RPC connections |
RPC processing queue length (Service RPC) | CallQueueLength | count | Current Service RPC processing queue length |
Average RPC time (NameNode RPC) | CompleteAvgTime | ms | Average latency of Complete requests |
| CreateAvgTime | ms | Average latency of Create requests |
| RenameAvgTime | ms | Average latency of Rename requests |
| AddBlockAvgTime | ms | Average latency of AddBlock requests |
| GetListingAvgTime | ms | Average latency of GetListing requests |
| GetFileInfoAvgTime | ms | Average latency of GetFileInfo requests |
| SendHeartbeatAvgTime | ms | Average latency of SendHeartbeat requests |
Average RPC time (NameNode RPC) | RegisterDatanodeAvgTime | ms | Average latency of RegisterDatanode requests |
| BlockReportAvgTime | ms | Average latency of BlockReport requests |
| DeleteAvgTime | ms | Average latency of Delete requests |
| RenewLeaseAvgTime | ms | Average latency of RenewLease requests |
| BlockReceivedAndDeletedAvgTime | ms | Average latency of BlockReceivedAndDeleted requests |
| FsyncAvgTime | ms | Average latency of fsync requests |
| VersionRequestAvgTime | ms | Average latency of VersionRequest requests |
Average RPC time (NameNode RPC) | ListEncryptionZonesAvgTime | ms | Average latency of ListEncryptionZones requests |
| SetPermissionAvgTime | ms | Average latency of SetPermission requests |
| SetTimesAvgTime | ms | Average latency of SetTimes requests |
| SetSafeModeAvgTime | ms | Average latency of SetSafeMode requests |
| MkdirsAvgTime | ms | Average latency of Mkdirs requests |
| GetServerDefaultsAvgTime | ms | Average latency of GetServerDefaults requests |
| GetBlockLocationsAvgTime | ms | Average latency of GetBlockLocations requests |
RPC statistics (NameNode RPC) | CompleteNumOps | count/s | Number of Complete calls per second |
| CreateNumOps | count/s | Number of Create calls per second |
| RenameNumOps | count/s | Number of Rename calls per second |
| AddBlockNumOps | count/s | Number of AddBlock calls per second |
| GetListingNumOps | count/s | Number of GetListing calls per second |
| GetFileInfoNumOps | count/s | Number of GetFileInfo calls per second |
| SendHeartbeatNumOps | count/s | Number of SendHeartbeat calls per second |
RPC statistics (NameNode RPC) | RegisterDatanodeNumOps | count/s | Number of RegisterDatanode calls per second |
| BlockReportNumOps | count/s | Number of BlockReport calls per second |
| DeleteNumOps | count/s | Number of Delete calls per second |
| RenewLeaseNumOps | count/s | Number of RenewLease calls per second |
| BlockReceivedAndDeletedNumOps | count/s | Number of BlockReceivedAndDeleted calls per second |
| FsyncNumOps | count/s | Number of fsync calls per second |
| VersionRequestNumOps | count/s | Number of VersionRequest calls per second |
RPC statistics (NameNode RPC) | ListEncryptionZonesNumOps | count/s | Number of ListEncryptionZones calls per second |
| SetPermissionNumOps | count/s | Number of SetPermission calls per second |
| SetTimesNumOps | count/s | Number of SetTimes calls per second |
| SetSafeModeNumOps | count/s | Number of SetSafeMode calls per second |
| MkdirsNumOps | count/s | Number of Mkdirs calls per second |
| GetServerDefaultsNumOps | count/s | Number of GetServerDefaults calls per second |
| GetBlockLocationsNumOps | count/s | Number of GetBlockLocations calls per second |
Number of blocks | MissingReplicatedBlocks | count | Number of lost replica blocks |
Number of audit log writing to ES failures | WriteEsFailed | Count | Number of audit log writing to ES failures |
Number of audit log writing to ES successes | WriteEsSuccess | Count | Number of audit log writing to ES successes |
Average lock waiting time | RpcLockWaitTimeAvgTime | ms | Average lock waiting time |
Average block obtaining time | GetBlocksAvgTime | ms | Average block obtaining time |
Average lock holding time of all lock operations | Read | us | Average lock holding time of all read lock operations |
| Write | us | Average lock holding time of all write lock operations |
Number of data catalogs | active | Count | Number of NameNode data catalogs in use |
| failed | Count | Number of catalogs with access failures or corruption |
Title | Metric | Unit | Description |
Xceivers | XceiverCount | - | Number of Xceivers |
Data read/write rate | BytesWrittenMB | Bytes/s | DataNode byte write rate |
| BytesReadMB | Bytes/s | DataNode byte read rate |
| RemoteBytesReadMB | Bytes/s | Remote client byte read rate |
| RemoteBytesWrittenMB | Bytes/s | Remote client byte write rate |
Client connections | WritesFromRemoteClient | - | Remote client write QPS |
| WritesFromLocalClient | - | Local client write QPS |
| ReadsFromRemoteClient | - | Remote client read QPS |
| ReadsFromLocalClient | - | Local client read QPS |
Block verification failure | BlockVerificationFailures | count/s | Number of block verification failures |
Disk failure | VolumeFailures | count/s | Number of disk failures |
Network error | DatanodeNetworkErrors | count/s | Network error statistics |
Heartbeat latency | HeartbeatsAvgTime | ms | Average heartbeat time |
Heartbeat QPS | HeartbeatsNumOps | count/s | Heartbeat QPS |
Packet transfer RT | SendDataPacketTransferNanosAvgTime | ms | Average time of sending packets |
Block operation | ReadBlockOpNumOps | count/s | Block read OPS from DataNode |
| WriteBlockOpNumOps | count/s | Block write OPS to DataNode |
| BlockChecksumOpNumOps | count/s | Checksum OPS by DataNode |
| CopyBlockOpNumOps | count/s | Block copying OPS |
| ReplaceBlockOpNumOps | count/s | Block replacement OPS |
| BlockReportsNumOps | count/s | Block reporting OPS |
| IncrementalBlockReportsNumOps | count/s | Incremental block reporting OPS |
| CacheReportsNumOps | count/s | Cache reporting OPS |
| PacketAckRoundTripTimeNanosNumOps | count/s | Number of ACK round trips processed per second |
Fsync operation | FsyncNanosNumOps | count/s | Number of fsync operations processed per second |
Flush operation | FlushNanosNumOps | count/s | Number of flush operations processed per second |
Block operation latency statistics | ReadBlockOpAvgTime | ms | Average block read time |
| WriteBlockOpAvgTime | ms | Average block write time |
| BlockChecksumOpAvgTime | ms | Average block check time |
| CopyBlockOpAvgTime | ms | Average block copy time |
| ReplaceBlockOpAvgTime | ms | Average block replacement time |
| BlockReportsAvgTime | ms | Average block reporting time |
| IncrementalBlockReportsAvgTime | ms | Average time of incremental block reporting |
| CacheReportsAvgTime | ms | Average time of cache reporting |
| PacketAckRoundTripTimeNanosAvgTime | ms | Average time of ACK round trip processing |
Flush latency | FlushNanosAvgTime | ns | Average flush time |
Fsync latency | FsyncNanosAvgTime | ns | Average fsync time |
RamDisk Blocks | RamDiskBlocksWrite | blocks/s | Total number of blocks written to memory |
| RamDiskBlocksWriteFallback | blocks/s | Total number of blocks failed to be written to memory (failover to disk) |
| RamDiskBlocksDeletedBeforeLazyPersisted | blocks/s | Total number of blocks deleted before the application is saved to the disk |
| RamDiskBlocksReadHits | blocks/s | Number of blocks read from memory |
| RamDiskBlocksEvicted | blocks/s | Total number of blocks cleared in memory |
| RamDiskBlocksEvictedWithoutRead | blocks/s | Total number of blocks retrieved from memory |
| RamDiskBlocksLazyPersisted | blocks/s | Number of disk writes by lazy writer |
| RamDiskBytesLazyPersisted | Bytes/s | Total number of bytes written to disk by lazy writer |
RamDisk write speed | RamDiskBytesWrite | Bytes/s | Total number of bytes written to memory |
JVM memory | MemNonHeapUsedM | MB | Size of NonHeapMemory currently used by JVM |
| MemNonHeapCommittedM | MB | Size of NonHeapCommittedM configured by JVM |
| MemHeapUsedM | MB | Size of HeapMemory currently used by JVM |
| MemHeapCommittedM | MB | Committed size of JVM HeapMemory |
| MemHeapMaxM | MB | Size of HeapMemory configured by JVM |
| MemMaxM | MB | Maximum size of memory available to JVM runtime |
Heap memory utilization | MemHeapUsedRate | % | Percentage of HeapMemory currently used by the JVM relative to the amount of HeapMemory configured for the JVM |
JVM threads | ThreadsNew | - | Number of threads in NEW status |
| ThreadsRunnable | - | Number of threads in RUNNABLE status |
| ThreadsBlocked | - | Number of threads in BLOCKED status |
| ThreadsWaiting | - | Number of threads in WAITING status |
| ThreadsTimedWaiting | - | Number of threads in TIMED WAITING status |
| ThreadsTerminated | - | Number of threads in Terminated status |
Heap memory utilization | MemHeapUsedRate | % | The percentage of the number of HeapMemory currently used by the JVM to the number of HeapMemory configured by the JVM |
JVM logs | LogFatal | - | Number of Fatal logs |
| LogError | - | Number of Error logs |
| LogWarn | - | Number of Warn logs |
| LogInfo | - | Number of Info logs |
GC count | YGC | - | Young GC count |
| FGC | - | Full GC count |
GC time | FGCT | s | Full GC time |
| GCT | s | Garbage collection time |
| YGCT | s | Young GC time |
Memory zone proportion | S0 | % | Percentage of used Survivor 0 memory |
| E | % | Percentage of used Eden memory |
| CCS | % | Percentage of used compressed class space memory |
| S1 | % | Percentage of used Survivor 1 memory |
| O | % | Percentage of used Old memory |
| M | % | Percentage of used Metaspace memory |
Data rate | ReceivedBytes | Bytes/s | Data receiving rate |
| SentBytes | Bytes/s | Data sending rate |
QPS | RpcQueueTimeNumOps | count/s | RPC call rate |
Request processing latency | RpcQueueTimeAvgTime | ms | Average RPC latency |
| RpcProcessingTimeAvgTime | count/s | Average RPC request processing time |
Authentication and authorization | RpcAuthenticationFailures | count/s | Number of RPC authentication failures |
| RpcAuthenticationSuccesses | count/s | Number of RPC authentication successes |
| RpcAuthorizationFailures | count/s | Number of RPC authorization failures |
| RpcAuthorizationSuccesses | count/s | Number of RPC authorization successes |
Current connections | NumOpenConnections | - | Number of current connections |
Length of RPC processing queue | CallQueueLength | 1 | Length of current RPC processing queue |
CPU time | CurrentThreadSystemTime | ms | System time |
| CurrentThreadUserTime | ms | User time |
Start time | StartTime | s | Process start time |
Threads | PeakThreadCount | - | Peak number of threads |
| DaemonThreadCount | - | Number of backend threads |
Read/Write latency | write | ms | Write time |
| read | ms | Read time |
Packet transfer QPS | DataPacketOps | count/s | Packet transfer QPS |
Blocks | Related to disk information, such as `/data/qcloud/data/hdfs` | - | Blocks |
Used disk capacity | Related to disk information, such as `/data/qcloud/data/hdfs` | GB | Used disk capacity |
Free disk capacity | Related to disk information, such as `/data/qcloud/data/hdfs` | GB | Free disk capacity |
Reserved disk capacity | Related to disk information, such as `/data/qcloud/data/hdfs` | GB | Reserved disk capacity |
Title | Metric | Unit | Description |
JVM memory | MemNonHeapUsedM | MB | Size of NonHeapMemory currently used by JVM |
| MemNonHeapCommittedM | MB | Size of NonHeapCommittedM configured by JVM |
| MemHeapUsedM | MB | Size of HeapMemory currently used by JVM |
| MemHeapCommittedM | MB | Committed size of JVM HeapMemory |
| MemHeapMaxM | MB | Size of HeapMemory configured by JVM |
| MemMaxM | MB | Maximum size of memory available to JVM runtime |
Heap memory utilization | MemHeapUsedRate | % | The percentage of the number of HeapMemory currently used by the JVM to the number of HeapMemory configured by the JVM |
JVM threads | ThreadsNew | - | Number of threads in NEW status |
| ThreadsRunnable | - | Number of threads in RUNNABLE status |
| ThreadsBlocked | - | Number of threads in BLOCKED status |
| ThreadsWaiting | - | Number of threads in WAITING status |
| ThreadsTimedWaiting | - | Number of threads in TIMED WAITING status |
| ThreadsTerminated | - | Number of threads in Terminated status |
JVM logs | LogFatal | - | Number of FATAL-level logs |
| LogError | - | Number of ERROR-level logs |
| LogWarn | - | Number of WARN-level logs |
| LogInfo | - | Number of INFO-level logs |
GC count | YGC | - | Young GC count |
| FGC | - | Full GC count |
GC time | FGCT | s | Full GC time |
| GCT | s | Garbage collection time |
| YGCT | s | Young GC time |
Memory zone proportion | S0 | % | Percentage of used Survivor 0 memory |
| E | % | Percentage of used Eden memory |
| CCS | % | Percentage of used compressed class space memory |
| S1 | % | Percentage of used Survivor 1 memory |
| O | % | Percentage of used Old memory |
| M | % | Percentage of used Metaspace memory |
Data traffic | ReceivedBytes | Bytes/s | Data receiving rate |
| SentBytes | Bytes/s | Data sending rate |
Request processing latency | RpcQueueTimeAvgTime | ms | Average RPC latency |
Authentication and authorization | RpcAuthenticationFailures | count/s | Number of RPC authentication failures |
| RpcAuthenticationSuccesses | count/s | Number of RPC authentication successes |
| RpcAuthorizationFailures | count/s | Number of RPC authorization failures |
| RpcAuthorizationSuccesses | count/s | Number of RPC authorization successes |
Current connections | NumOpenConnections | - | Number of current connections |
Length of RPC processing queue | CallQueueLength | 1 | Length of current RPC processing queue |
CPU time | CurrentThreadSystemTime | ms | System time |
| CurrentThreadUserTime | ms | User time |
Start time | StartTime | s | Process start time |
Threads | PeakThreadCount | - | Peak number of threads |
| DaemonThreadCount | - | Number of backend threads |
Title | Metric | Unit | Description |
GC count | YGC | - | Young GC count |
| FGC | - | Full GC count |
GC time | FGCT | s | Full GC time |
| GCT | s | Garbage collection time |
| YGCT | s | Young GC time |
Memory zone proportion | S0 | % | Percentage of used Survivor 0 memory |
| E | % | Percentage of used Eden memory |
| CCS | % | Percentage of used compressed class space memory |
| S1 | % | Percentage of used Survivor 1 memory |
| O | % | Percentage of used Old memory |
| M | % | Percentage of used Metaspace memory |
Title | Metric | Unit | Description |
GC count | YGC | count | Young GC count |
| FGC | count | Full GC count |
GC Time | FGCT | s | Time consumed by Full GC |
| GCT | s | Time consumed to collect garbage |
| YGCT | s | Time consumed by Young GC |
The Proportion of Memory Area | S0 | % | The proportion of the memory usage of Survivor 0 area |
| E | % | The proportion of the memory usage of Eden area |
| CCS | % | The proportion of the memory usage of Compressed Class Space area |
| S1 | % | The proportion of the memory usage of Survivor 1 area |
| O | % | The proportion of the memory usage of Old area |
| M | % | The proportion of the memory usage of Metaspace area |
Number of JVM threads | ThreadsNew | count | Number of threads in NEW status |
| ThreadsRunnable | count | Number of threads in RUNNABLE status |
| ThreadsBlocked | count | Number of threads in BLOCKED status |
| ThreadsWaiting | count | Number of threads in WAITING status |
| ThreadsTimedWaiting | count | Number of threads in TIMED WAITING status |
| ThreadsTerminated | count | Number of threads in Terminated status |
Number of JVM logs | LogFatal | count | Number of FATAL-level logs |
| LogError | count | Number of ERROR-level logs |
| LogWarn | count | Number of WARN-level logs |
| LogInfo | count | Number of INFO-level logs |
JVM memory | MemNonHeapUsedM | MB | Non-heap memory size used by the process |
| MemNonHeapCommittedM | MB | Non-heap memory size of the commit process |
| MemHeapUsedM | MB | Heap memory size used by the process |
| MemHeapCommittedM | MB | Heap memory size of the commit process |
| MemHeapMaxM | MB | Maximum heap memory size of the process |
| MemMaxM | MB | Maximum memory size of the process |
Daemon threads of the process | DaemonThreadCount | count | Daemon threads of the process |
| ThreadCount | count | Threads of the process |
| PeakThreadCount | count | Peak threads of the process |
Data traffic | ReceivedBytes | Bytes/s | Data reception rate |
| SentBytes | Bytes/s | Data transmission rate |
QPS | RpcQueueTimeNumOps | count/s | RPC call rate |
Request processing latency | RpcQueueTimeAvgTime | ms | Average latency time of RPC |
| RpcProcessingTimeAvgTime | ms | RPC request average processing time |
Verification and authorization | RpcAuthenticationFailures | count | RPC verification failure count |
| RpcAuthenticationSuccesses | count | RPC verification success count |
| RpcAuthorizationFailures | count | RPC authorization failure count |
| RpcAuthorizationSuccesses | count | RPC authorization success count |
Current number of connections | NumOpenConnections | count | Current number of connections |
RPC processing queue length | CallQueueLength | count | Current RPC processing queue length |
Heap memory utilization | MemHeapUsedRate | % | The proportion of heap memory usage |
Number of Files | NumFiles | count | Current number of files and directories |
Number of NameServices | NumNameservices | count | Number of NameServices |
Number of NameNodes | NumExpiredNamenodes | count | Number of Expired NameNodes |
| NumNamenodes | count | Number of NameNodes |
Number of Router operations | ProcessingOp | count/s | Number of operations processed internally by Router per second |
| ProxyOp | count/s | Number of operations proxied to Namenode by Router per second |
Number of failed requests | RouterFailureStateStore | count/s | Number of failed requests per second due to status storage unavailability |
| RouterFailureReadOnly | count/s | Number of failed requests per second due to read-only mount point |
| RouterFailureLocked | count/s | Number of failed requests per second due to path locking |
| RouterFailureSafemode | count/s | Number of failed requests per second due to security mode |
Average Router operation processing time | ProcessingAvgTime | ns | Average time for Router to process operations |
| ProxyAvgTime | ns | Average time for operations proxied to Namenode by Router |
StateStore transaction operations | ReadsNumOps | count/s | Number of GET transactions per second |
| WritesNumOps | count/s | Number of PUT transactions per second |
| RemovesNumOps | count/s | Number of REMOVE transactions per second |
| FailuresNumOps | count/s | Number of failed transactions per second |
Average time for StateStore transaction operations | ReadsAvgTime | ms | Average time for state store GET transactions |
| WritesAvgTime | ms | Average time for State Store PUT transactions |
| RemovesAvgTime | ms | Average time for State Store REMOVE transactions |
| FailuresAvgTime | ms | Average time for State Store failing transactions |
피드백