Failure Simulation
Last updated:2025-05-07 11:13:58

Operation Scenarios

TencentDB for Redis® provides automatic failover to ensure service availability. Automatic failover covers both Proxy node failover and Redis server (Redis data storage node) failover.
TencentDB for Redis® also provides a failure simulation feature, available in the console, that lets you test how your services behave during a failure. To simulate a failure, the system sends the shutdown command to all master Redis nodes, which triggers the automatic high availability (HA) logic.

Proxy Node Failover

TencentDB for Redis® uses Proxy nodes in both the standard and cluster architectures. The standard architecture has 3 Proxy nodes. In the cluster architecture, the number of Proxy nodes increases linearly with the number of shards. The HA logic of Proxy nodes is as follows:
Multiple Proxy nodes ensure high availability and load balancing of Proxy services.
Proxy nodes are deployed on 3 physical devices to ensure high availability.
When a Proxy node fails, the system detects that the node is unavailable and automatically adds a new node.

Redis Server Failover

In both the standard and cluster architectures, TencentDB for Redis® uses the native cluster management mechanism of Redis Cluster. Node status is determined through the Gossip protocol between the nodes in the cluster. How quickly a node failure is determined depends on the cluster-node-timeout parameter, which defaults to 15000 ms. We recommend that you do not modify this parameter. For details on how node failure is determined, see Scale with Redis Cluster.
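
To illustrate why cluster-node-timeout bounds the speed of failure determination, here is a minimal Python sketch of the majority rule described in the open-source Redis Cluster specification: a node is marked as failed once a majority of masters have reported it unreachable within a validity window of cluster-node-timeout multiplied by 2. This is a simplified model, not the TencentDB for Redis® implementation. The names FailureReport and should_mark_fail are hypothetical.

```python
from dataclasses import dataclass

CLUSTER_NODE_TIMEOUT_MS = 15000  # default value stated in this document
FAIL_REPORT_VALIDITY_MULT = 2    # validity factor from the Redis Cluster specification


@dataclass(frozen=True)
class FailureReport:
    """Hypothetical record of one master reporting a node as unreachable."""
    reporter: str        # ID of the master that sent the report
    received_at_ms: int  # local time at which the report arrived via gossip


def should_mark_fail(reports, num_masters, now_ms,
                     node_timeout_ms=CLUSTER_NODE_TIMEOUT_MS):
    """Return True if a majority of masters reported the node recently enough."""
    window_ms = node_timeout_ms * FAIL_REPORT_VALIDITY_MULT
    # Count each reporting master once, ignoring reports older than the window.
    fresh = {r.reporter for r in reports if now_ms - r.received_at_ms <= window_ms}
    quorum = num_masters // 2 + 1
    return len(fresh) >= quorum
```

With 3 masters, two fresh reports reach the quorum and one does not. Reports older than 30000 ms (2 x 15000 ms) are ignored in this model.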

Usage Instructions

Only instances in the Running status support failure simulation.
Only instances in multi-AZ deployment mode support failure simulation. Instances deployed in the same AZ do not support this operation.

Note

Failure simulation makes the Redis service unavailable for a period of time, usually less than 1 minute. Data that is being written during this period may be lost. Operate with caution.
Service unavailability caused by failure simulation is not covered by the Redis service SLA.
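
Because the service can be unavailable for up to about a minute, client code under test usually needs to retry failed operations. The following Python sketch shows one possible approach: retry with exponential backoff within a fixed time budget. The function name call_with_retry, the 60-second budget, and the delay values are assumptions for illustration. The built-in ConnectionError and TimeoutError are stand-ins, since a real Redis client library raises its own exception types.

```python
import time


def call_with_retry(operation, budget_s=60.0, base_delay_s=0.5, max_delay_s=5.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Call operation() until it succeeds or the time budget is used up."""
    deadline = clock() + budget_s
    delay = base_delay_s
    while True:
        try:
            return operation()
        except (ConnectionError, TimeoutError):
            # Give up and re-raise if the next wait would exceed the budget.
            if clock() + delay > deadline:
                raise
            sleep(delay)
            # Double the delay after each failure, up to max_delay_s.
            delay = min(delay * 2, max_delay_s)
```

Retrying a write is only safe if the write is idempotent. A command that failed on the client side may still have been applied on the server before the failover.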

Prerequisites

The instance has been deployed across multiple AZs.
The database version is 4.0 or later.
The instance status is Running.
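
The three prerequisites above can be expressed as a simple pre-check before a test run. This Python sketch is illustrative only. InstanceInfo and its fields are hypothetical and do not reflect the actual API response schema.

```python
from dataclasses import dataclass


@dataclass
class InstanceInfo:
    """Hypothetical summary of an instance; not the real API schema."""
    status: str     # for example "Running" or "Processing"
    version: str    # database version, for example "4.0" or "5.0"
    multi_az: bool  # True if the instance is deployed across multiple AZs


def unmet_prerequisites(inst):
    """Return the list of prerequisites that the instance does not meet."""
    problems = []
    if not inst.multi_az:
        problems.append("instance is not deployed across multiple AZs")
    parts = inst.version.split(".")
    major = int(parts[0])
    minor = int(parts[1]) if len(parts) > 1 else 0
    if (major, minor) < (4, 0):
        problems.append("database version is earlier than 4.0")
    if inst.status != "Running":
        problems.append("instance status is not Running")
    return problems
```

An empty list means that all three prerequisites are met.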

Directions

1. Log in to the TencentDB for Redis console.
2. At the top of the Instance List page on the right, select a region.
3. Find the target multi-AZ instance that requires failure simulation in the instance list.
4. Click the instance ID to enter the Instance Details page.
5. On the Instance Details page, click the Node Management tab, and then select Simulate Failure from the More drop-down list.

6. In the Simulate Failure dialog box, confirm the instance name and ID, read the principle and notes of failure simulation, and click OK. The instance status changes to Processing.
7. Click Task Management in the left sidebar and wait for the task to complete. When the instance status returns to Running, the simulation has completed successfully.
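
If you automate the last step in a test script, waiting for the instance to return to Running can be sketched as a polling loop. The following Python example is a minimal sketch. The get_status callable is a hypothetical stand-in for whatever method you use to read the instance status, and the timeout and interval values are assumptions.

```python
import time


def wait_until_running(get_status, timeout_s=300.0, interval_s=5.0,
                       sleep=time.sleep, clock=time.monotonic):
    """Poll get_status() until it returns "Running" or the timeout elapses.

    Returns True if the status became "Running", otherwise False.
    """
    deadline = clock() + timeout_s
    while True:
        if get_status() == "Running":
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)
```

The sleep and clock parameters are injectable so that the loop can be exercised in a unit test without real waiting.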


APIs

API
Description
Simulates failures.
Simulates failures on a Proxy node.