Title | Metric | Unit | Description |
Nodes | Active | - | Number of active nodes |
| Total | - | Total number of nodes |
| Failed | - | Number of failed nodes |
Query | RunningQueries | - | Total number of running queries |
| QueuedQueries | - | Total number of waiting queries |
Query frequency | FailedQueries | count/min | Total number of failed queries |
| AbandonedQueries | count/min | Total number of abandoned queries |
| CanceledQueries | count/min | Total number of canceled queries |
| CompletedQueries | count/min | Total number of completed queries |
| StartedQueries | count/min | Total number of started queries |
Data volume input/output per minute | InputDataSizeOneMinute | GB/min | Data input rate |
| OutputDataSizeOneMinute | GB/min | Data output rate |
Title | Metric | Unit | Description |
GC count | YGC | - | Young GC count |
| FGC | - | Full GC count |
GC time | FGCT | s | Full GC time |
| GCT | s | Garbage collection time |
| YGCT | s | Young GC time |
Memory zone proportion | S0 | % | Percentage of used Survivor 0 memory |
| E | % | Percentage of used Eden memory |
| CCS | % | Percentage of used compressed class space memory |
| S1 | % | Percentage of used Survivor 1 memory |
| O | % | Percentage of used Old memory |
| M | % | Percentage of used Metaspace memory |
JVM memory | MemNonHeapUsedM | MB | Size of NonHeapMemory currently used by JVM |
| MemNonHeapCommittedM | MB | Size of NonHeapMemory currently committed by JVM |
| MemHeapUsedM | MB | Size of HeapMemory currently used by JVM |
| MemHeapCommittedM | MB | Size of HeapMemory currently committed by JVM |
| MemHeapMaxM | MB | Size of HeapMemory configured by JVM |
| MemHeapInitM | MB | Size of initial JVM HeapMem |
| MemNonHeapInitM | MB | Size of initial JVM NonHeapMem |
Heap memory utilization | MemHeapUsedRate | % | Percentage of HeapMemory currently used by the JVM relative to the amount of HeapMemory configured for the JVM |
Data input/output rate | InputDataSize.OneMinute.Rate | GB/min | Data input rate |
| OutputDataSize.OneMinute.Rate | GB/min | Data output rate |
Worker threads | PeakThreadCount | - | Peak number of threads |
| ThreadCount | - | Number of threads |
| DaemonThreadCount | - | Number of backend threads |
Process execution duration | Uptime | s | Process execution duration |
File descriptors | MaxFileDescriptorCount | - | Maximum number of file descriptors |
| OpenFileDescriptorCount | - | Number of opened file descriptors |
task failure count | FailedTasksOneMinuteRate | count/min | Average Task failure count, minute-level dimension |
task data input volume | InputDataSizeOneMinuteRate | bytes/min | Average Task input data volume, minute-level dimension |
task data input line count | InputPositionsOneMinuteRate | count/min | Average Task input data row count, minute-level dimension |
task data output volume | OutputDataSizeOneMinuteRate | bytes/min | Average Task output data volume, minute-level dimension |
task data output line count | OutputPositionsOneMinuteRate | count/min | Average Task output data row count, minute-level dimension |
Task Notification Executor | ActiveCount | count | Number of notifications for tasks being executed |
| QueuedTaskCount | count | Number of notifications for tasks to be executed |
Task Executor Split | WaitingSplits | count | Number of Splits waiting for TaskExecutor |
| TotalSplits | count | Total number of Splits in TaskExecutor |
| RunningSplits | count | Number of ongoing Splits in TaskExecutor |
| BlockedSplits | count | Number of blocked Splits in TaskExecutor |
Task Executor Time | BlockedQuantaWallTimeOneMinuteAvg | μs | Quanta Blocked full time |
| SplitQueuedTimeOneMinuteAvg | μs | Average waiting time for Splits |
| SplitWallTimeOneMinuteAvg | μs | Split full duration |
| UnblockedQuantaWallTimeOneMinuteAvg | μs | Quanta Unblocked full time |
Input Page Size | OneMinuteAvg | Bytes | Average input Page size, minute-level dimension |
| OneMinuteMax | Bytes | Maximum input size in 1 min |
| OneMinuteCount | Bytes | Page size per minute |
Memory Pool | Free | Bytes | Available memory size |
| Max | Bytes | Maximum capacity of the memory pool |
| Reserved | Bytes | Reserved but not yet used memory size |
| ReservedRevocable | Bytes | Reserved but reclaimable memory size |
Title | Metric | Unit | Description |
GC count | YGC | - | Young GC count |
| FGC | - | Full GC count |
GC time | FGCT | s | Full GC time |
| GCT | s | Garbage collection time |
| YGCT | s | Young GC time |
Memory zone proportion | S0 | % | Percentage of used Survivor 0 memory |
| E | % | Percentage of used Eden memory |
| CCS | % | Percentage of used compressed class space memory |
| S1 | % | Percentage of used Survivor 1 memory |
| O | % | Percentage of used Old memory |
| M | % | Percentage of used Metaspace memory |
JVM memory | MemNonHeapUsedM | MB | Size of NonHeapMemory currently used by JVM |
| MemNonHeapCommittedM | MB | Size of NonHeapMemory currently committed by JVM |
| MemHeapUsedM | MB | Size of HeapMemory currently used by JVM |
| MemHeapCommittedM | MB | Size of HeapMemory currently committed by JVM |
| MemHeapMaxM | MB | Size of HeapMemory configured by JVM |
| MemHeapInitM | MB | Size of initial JVM HeapMem |
| MemNonHeapInitM | MB | Size of initial JVM NonHeapMem |
Worker threads | PeakThreadCount | - | Peak number of threads |
| ThreadCount | - | Number of threads |
| DaemonThreadCount | - | Number of backend threads |
Process execution duration | Uptime | s | Process execution duration |
Process start time | StartTime | s | Process start time |
File descriptors | MaxFileDescriptorCount | - | Maximum number of file descriptors |
| OpenFileDescriptorCount | - | Number of opened file descriptors |
Node Status | ActiveNodeCount | Count | Number of Active Nodes |
| InactiveNodeCount | Count | Number of Inactive Nodes |
| ShuttingDownNodeCount | Count | Number of ShuttingDown Nodes |
Cluster Memory | ClusterMemory | Bytes | Cluster Memory |
| ClusterTotalMemoryReservation | Bytes | Total Reserved Memory of the Cluster |
| ClusterUserMemoryReservation | Bytes | user Reserved Memory of the Cluster |
Leaked Queries | NumberOfLeakedQueries | count | Total number of memory leak queries in a cluster |
Queries Killed | QueriesKilledDueToOutOfMemory | count | Total number of oom killed queries |
Tasks Killed | TasksKilledDueToOutOfMemory | count | Total number of oom-killed tasks |
Cluster CPU cores | TotalAvailableProcessors | Cores | Number of available processor cores in the cluster |
Assigned Queries | AssignedQueries | count | Number of Queries |
Node Manager | BlockedNodes | count | Number of block nodes in the cluster |
| Nodes | count | Number of cluster nodes |
Cluster Memory Pool | ReservedDistributed | bytes | Reserved Distributed Memory of the Cluster |
| ReservedRevocableDistributed | bytes | Reserved Revocable Distributed Memory of the Cluster |
| TotalDistributed | bytes | Total Distributed Memory |
| FreeDistributed | bytes | Distributed memory of the cluster is available. |
Memory Pool | Free | Bytes | Available memory size |
| Max | Bytes | Maximum capacity of the memory pool |
| Reserved | Bytes | Reserved but not yet used memory size |
| ReservedRevocable | Bytes | Reserved but reclaimable memory size |
Required Workers | RequiredWorkers | count | Query the number of workers |
Query Execution | ExecutorActiveCount | count | Number of active queries |
| QueuedTaskCount | count | Number of pending tasks in queue |
| TaskCount | count | Number of tasks |
Queued Queries | QueuedQueries | count | Total number of queries waiting in queue |
Running Queries | RunningQueries | count | Number of ongoing queries |
Abandoned Queries | AbandonedQueriesOneMinuteRate | count/min | Average number of terminated queries per minute |
Canceled Queries | CanceledQueriesOneMinuteRate | count/min | Average number of canceled queries per minute |
Completed Queries | CompletedQueriesOneMinuteRate | count/min | Average number of completed queries per minute |
Consumed CPU Time | ConsumedCpuTimeOneMinuteRate | Secs/min | Average CPU time for query processing per minute |
Consumed Input | ConsumedInputOneMinuteRate | Bytes/min | Average size of Consumed Input per minute |
Consumed Input Rows | ConsumedInputRowsOneMinuteRate | Rows/min | Average number of rows in Consumed Input per minute |
External Failures | ExternalFailuresOneMinuteRate | count/min | Average number of External Failures per minute |
Failed Queries | FailedQueriesOneMinuteRate | count/min | Average number of Failed Queries per minute |
Insufficient Resources Failures | InsufficientResourcesFailuresOneMinuteRate | count/min | Average number of Insufficient Resources Failures per minute |
Internal Failures | InternalFailuresOneMinuteRate | count/min | Average number of Internal Failures per minute |
Started Queries | StartedQueriesOneMinuteRate | count/min | Average number of Started Queries per minute |
Submitted Queries | SubmittedQueriesOneMinuteRate | count/min | Average number of Submitted Queries per minute |
User Error Failures | UserErrorFailuresOneMinuteRate | count/min | Average number of User Error Failures per minute |
Wall Input | WallInputRateOneMinuteAvg | count/min | Average WallInput size per minute |
High Memory Split Source | HighMemorySplitSourceOneMinuteCount | count | Average count of HighMemory Splits per minute |
Queued Queries-${groupName} | NumQueuedQueries | count | Number of queries waiting in queue |
SubGroups-${groupName} | NumRunningQueries | count | Number of ongoing queries in the resource group |
CPU Usage-${groupName} | CpuUsageMs | ms | CPU time used by the resource group |
Memory Usage-${groupName} | MemoryUsageB | bytes | Memory usage used by the resource group |
Running Queries-${groupName} | NumEligibleSubGroups | count | Number of subgroups in the resource group that meet the conditions for parallel execution |
Audit log writing to ES failed count | WriteEsFailed | Count | Number of audit log writing failures to ES |
Audit log writing to ES success count | WriteEsSuccess | Count | Audit log writing to ES success count |
Feedback