tencent cloud

GooseFS Cluster Overview
Last updated: 2025-07-17 17:42:50
GooseFS Cluster Overview
Last updated: 2025-07-17 17:42:50

Overview

GooseFS is a multi-protocol, high-performance, high-throughput data caching acceleration service introduced by Tencent Cloud. The GooseFS cluster is a form of deployment and ops for this service on the public cloud. With a GooseFS cluster, users can easily manage and operate local disks on computing nodes as cache nodes, providing a unified namespace and access protocol for upper-layer computing applications, making it convenient to manage and transfer data across different storage systems.

The GooseFS cluster can not only accelerate the performance of business accessing storage for massive data analysis, machine learning, and AI, but also help business implementation achieve automatic tiering of hot and cold data, balancing performance and cost expenditure in the business architecture.

Basic Concepts

Using a GooseFS cluster involves the following basic concepts:
Cluster: A collection of Tencent Cloud resources required for GooseFS lake storage to function, including CVMs and BMS.
Node: A cloud server resource instance that makes up a GooseFS cluster, mainly including the following node types:
Master nodes: The master node is the cluster master node, requiring a minimum of 1 instance deployment. In high availability mode, at least 3 instances are required.
Worker node: The worker node is responsible for caching hot data and requires at least one instance as a worker node. The number of worker nodes can linearly scale out as needed based on the storage capacity to be cached.
Client node: A Client node is normally deployed on the same machine as a compute or Worker node for read-write access to data in the Worker node.
Client: A terminal that accesses a GooseFS cluster for integration with computing applications. Currently only the GooseFS-FUSE client is supported.
Namespace: A space for GooseFS to organize and manage cached data, serving as the unified access API for a GooseFS cluster.
Deployment form: Refers to the deployment mode of data lake storage GooseFS, mainly divided into managed control plane and managed Master mode.
Management Plane Hosting: Deploy GooseFS using the customer's purchased CVM resources and have it managed by the GooseFS cloud platform.
Master-managed: The Master node of the GooseFS cluster is hosted in the GooseFS cloud platform, while Worker nodes require customers to self-purchase CVM instances for deployment.
Fully managed: The Master node and Worker nodes of the GooseFS cluster are hosted in the GooseFS platform, with no need for customer maintenance.
Configuration: Refers to various configuration files for running the GooseFS cluster, mainly including:
goosefs-site.properties: The GooseFS service configuration file, used to manage cluster data load, cache, and elimination policy configurations.
goosefs-env.sh: A GooseFS service dependency environment variable script file used to manage environment variables in the GooseFS cluster.
log4j.properties: The GooseFS service log output configuration, used to manage access logs, system logs, and various other logs and monitoring information during operation.
core-site.xml: The underlying storage system configuration for GooseFS service coordination, used to manage the underlying storage systems connected to the GooseFS cluster, such as Cloud Object Storage (COS) and Cloud HDFS (CHDFS).

Usage Process

The GooseFS cluster use flow is mainly divided into the following steps:
1. Role authorization: Register and log in to the Data Accelerator GooseFS Console, complete service authorization to obtain related resource operation permissions, and you can start using the Data Accelerator GooseFS product.
2. Create a cluster: Customize a new cluster as needed.
3. Manage cluster: After cluster creation, you can scale cluster, manage nodes, update configuration, perform namespace management, and operate client management.
4. Manage tasks: Cluster management operations are registered as cluster tasks. Users can view task progress to learn about cluster operation status.

Product Pricing

Tencent Cloud Data Lake Storage GooseFS offers different billing systems for different deployment forms. To learn about the product's detailed pricing model and specific price, contact us for consultation.

Related Services

GooseFS only provides data caching capability. For persistent data storage, use Tencent Cloud Object Storage (COS). For instructions, see COS product document.
GooseFS supports managing built-in local disks or cloud disks on CVM to cache data to local nodes. For instructions, see CVM product document.
GooseFS cluster can be created under a VPC, with hosts in the cluster allocated to subnets of different AZs. For the VPC product operation guide, please refer to VPC product document.
GooseFS cluster supports reporting monitoring data to Tencent Cloud Observability Platform (TCOP) or Cloud Log Service (CLS). For the TCOP product usage guide, please refer to Tencent Cloud Observability Platform product document. For the CLS user guide, please see Cloud Log Service product document.

Was this page helpful?
You can also Contact Sales or Submit a Ticket for help.
Yes
No

Feedback