Overview
Concepts
Cluster Architecture
Strengths
Scenarios
show tablet tablet_id statement and then execute the following show proc statement to view the replica information of this tablet and check whether the replica is complete. You can also use show proc "/cluster_balance" Information to query the progress in replica scheduling and repair within the cluster.
For commands related to data replica management, see Data Replica Management.show tablet 27306172 statement and then execute the show proc in the result statement to view the status of each tablet replica. The versionCount in the result indicates the number of versions. If you find that a replica has too many versions, you need to reduce the import frequency or stop import and observe whether the number of versions decreases. If the version number still does not decrease after the import is stopped, you need to view the be.INFO log of the corresponding BE node, search for the tablet ID and the Compaction keyword, and check whether the Compaction is running normally.show tablet 110309738 statement and then execute the show proc in the result statement to view the status of each tablet replica for further troubleshooting.show backends command, where MaxDiskUsedPct shows the disk with the highest utilization on the corresponding BE. If it exceeds 95%, this error will be reported. You can choose to manually delete some data to free up the space or scale out the cloud disk to solve this problem. If the disk utilization increases exceptionally, you can submit a ticket Contact Us for troubleshooting.failed to initialize storage reader. tablet=63416.1050661139.aa4d304e7a7aff9c-f0fa7579928c85a0, res=-214, backend=192.168.100.10
show tablet 63416 statement and executing the result show proc xxx statement, you can view the status of each replica of the corresponding tablet. Usually, we need to care about the data in the Version column.
Normally, the versions of multiple replicas of a tablet should be the same, and it is the same as the VisibleVersion version of the corresponding partition.show partitions from tblx to view the corresponding partition version (the partition corresponding to the tablet can be found in the show tablet statement.)show proc statement to view the concrete version information and find out which versions are lost.
If there has been no automatic repair for a long time, you need to use the show proc "/cluster_balance" statement to view the tablet repair and scheduling tasks currently being executed by the system. This may be because there are a large number of tablets waiting to be scheduled, resulting in a lengthy repair time. You can follow the records in pending_tablets and running_tablets.admin repair statement to specify a table or partition to be repaired first. For details, see help admin repair.admin set replica status command to force the problematic replica to go offline. For details, see help admin set replica status for the example of setting the replica status to bad. (After being set to bad, the replica will no longer be visited. But before the operation, ensure that other replicas are normal).brpc_socket_max_unwritten_bytes: The default value is 1 GB. If the unsent data exceeds this value, an error will be reported. This value can be modified appropriately to avoid OVERCROWDED errors. (But this is only a temporary solution and congestion still occurs).tablet_writer_ignore_eovercrowded: The default value is false. If it is set to true, OVERCROWDED errors during import will be ignored. This parameter is mainly used to avoid import failure and improve import stability. brpc_max_body_size: The default value is 3 GB.フィードバック