When the compute and storage resources of your EMR cluster are insufficient, you can add more core and task nodes in the console. When the master node is overloaded or runs out of resources, you can scale out the cluster or add router nodes to share the load of the master node or to serve as task submitters in the cluster. You can scale the cluster at any time.
- By default, the current instance specification for scale-out is the instance specification selected when the cluster was created. If the current default specification is unavailable, or if you want to adjust the scale-out configuration, you need to set it in Node spec as instructed in Node Specification Management.
- By default, the selected component will inherit the cluster-level configuration and fall into the default configuration group for that node type. You can also set the configuration of the target component through the Specify configuration parameter.
- You cannot specify a configuration group for a ClickHouse cluster to be scaled out.
- Add router nodes: A router node can be used as a submitter, through which you can submit YARN, Hive, and Spark computing tasks to the cluster. We recommend you select a model with larger memory, preferably not lower than the master node specification.
- For a pay-as-you-go cluster, all newly added nodes support the pay-as-you-go billing mode.
- Log in to the EMR console and click the ID/Name of the target cluster in the Cluster list to enter the cluster details page.
- On the cluster details page, select Cluster Resource > Resource Management > Scale Out and select the type of nodes to be added (core, task, or router), billing mode, optional services, and scale-out quantity as needed.
- Specify configuration: Find the target component and select the level from which to inherit the configuration.
- If you choose to inherit the configuration of the cluster, an added node will inherit the cluster-level configuration and fall into the default configuration group for that node type.
- If you choose to inherit the configuration of the configuration group, an added node will inherit the configuration group-level configuration and fall into the selected configuration group.
- Deployment Process: Displays the information of service deployment processes after components are selected for the nodes to be added. You can also select Show deployment process to view the deployment process.
- Do not start services after scaling: If this option is selected during scaling, added nodes will not start the service. When needed, you can start the service as instructed in Service Start/Stop.
- Tag: Used to identify the added node resources.
- Current Specification: The default specification.
- You can set the default specification for scale-out as instructed in Node Specification Management.
- The default node specification is used for scale-out. You can adjust it by clicking Set Node Specification.
- After selecting the desired components and number of nodes to be added, click Confirm and make the payment. Then, the cluster will start scaling out, which usually takes 10 to 20 minutes.
- For ClickHouse cluster scale-out, you can add an even number of nodes in a high-availability (HA) instance, and unlimited nodes in a non-HA instance. You can choose an existing cluster or a new one for the nodes to be added.
There will be no data in the newly added nodes after the virtual ClickHouse cluster is scaled out. You need to migrate the data manually as the system won't automatically do it for you. In order to achieve data balance and improve resource utilization, migrate your data in time after the cluster is scaled out successfully.