Feature Introduction
The cluster health inspection feature automatically generates an inspection report every hour by default, giving a comprehensive view of cluster health. Based on the health score model, it performs a percentage-weighted assessment across five dimensions: basic diagnosis, computing insights, storage insights, resource insights, and event diagnosis. The resulting score quantifies cluster stability and efficiency. Each inspection report lists the dimensions where points were deducted, along with root cause information, to help you locate issues quickly. A one-time inspection is also available for ad hoc inspection needs.
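To illustrate how a percentage-weighted score across the five dimensions could be computed, here is a minimal sketch. The weights, the 0-100 per-dimension scale, and the names `WEIGHTS` and `health_score` are assumptions made for illustration; the actual health score model and its weights are not described in this document.

```python
# Hypothetical weights (in percent) for the five inspection dimensions.
# These values are illustrative only, not the product's real weights.
WEIGHTS = {
    "basic_diagnosis": 30,
    "computing_insights": 20,
    "storage_insights": 20,
    "resource_insights": 20,
    "event_diagnosis": 10,
}


def health_score(dimension_scores):
    """Return (overall_score, deductions) for per-dimension scores in 0..100.

    deductions maps each dimension that lost points to the number of
    points it removed from the overall score.
    """
    if sum(WEIGHTS.values()) != 100:
        raise ValueError("weights must sum to 100 percent")

    total = 0.0
    deductions = {}
    for name, weight in WEIGHTS.items():
        score = dimension_scores[name]
        if not 0 <= score <= 100:
            raise ValueError(f"score for {name} must be between 0 and 100")
        total += score * weight / 100
        lost = (100 - score) * weight / 100
        if lost > 0:
            deductions[name] = lost
    return total, deductions


if __name__ == "__main__":
    scores = {name: 100 for name in WEIGHTS}
    scores["storage_insights"] = 50  # simulate a storage problem
    print(health_score(scores))
```

In this sketch, a dimension that scores below 100 appears in `deductions`, which mirrors how the inspection report annotates the dimensions where points were deducted.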
Operation Steps
2. The health inspection page shows the health score trend of the cluster. Adjust the observation time range to view the cluster's historical health trend.
3. Click One-Time Inspection, then select the inspection dimensions and time range as needed to customize the inspection.
4. Hover over the line in the trend chart to view the details of the cluster inspection report.
Note:
1. By default, the automatic inspection runs every hour on the data from the previous hour, producing a continuous series of cluster health scores. For a one-time inspection, you can set a custom time range and inspection coverage. Generating an inspection report takes 3-30 minutes.
2. Currently, the correlation analysis report is an allowlist feature. To use it, submit a ticket to request activation. For correlation analysis, you must manually select the time range, the main analysis item, and the related information; the feature then performs root cause analysis and tracking of exceptions. Generating the real-time analysis report takes 5-30 minutes.
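The hourly inspection window described in note 1 can be sketched as follows. This is a minimal illustration that assumes windows are aligned to clock-hour boundaries, which this document does not confirm; the function names `previous_hour_window` and `hourly_windows` are hypothetical and are not part of any product API.

```python
from datetime import datetime, timedelta


def previous_hour_window(now):
    """Return (start, end) of the last complete clock hour before `now`.

    Assumption: inspections cover whole clock hours, e.g. 09:00-10:00.
    """
    end = now.replace(minute=0, second=0, microsecond=0)
    start = end - timedelta(hours=1)
    return start, end


def hourly_windows(range_start, range_end):
    """List the consecutive one-hour windows that fit inside a custom range."""
    windows = []
    cursor = range_start
    while cursor + timedelta(hours=1) <= range_end:
        windows.append((cursor, cursor + timedelta(hours=1)))
        cursor += timedelta(hours=1)
    return windows


if __name__ == "__main__":
    print(previous_hour_window(datetime(2024, 1, 1, 10, 25)))
    print(len(hourly_windows(datetime(2024, 1, 1, 0, 0),
                             datetime(2024, 1, 1, 6, 0))))
```

Because a report takes several minutes to generate, the score for a given window would appear in the trend chart some time after that window ends.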