tencent cloud

TDSQL Boundless

Release Notes
Product Introduction
Overview
Scenarios
Product Architecture
Instance Types
Compatibility Notes
Kernel Features
Kernel Overview
Kernel Version Release Notes
Functionality Features
Performance Features
Billing
Billing Overview
Purchase Method
Pricing Details
Renewal
Overdue Payments
Refund
Getting Started
Creating an Instance
Connect to Instances
User Guide
Data Migration
Data Subscription
Instance Management
Configuration Change
Parameter Configuration
Account Management
Security Group
Backup and Restoration
Database Auditing
Tag Management
Use Cases
Technical Evolution and Usage Practices of Online DDL
Lock Mechanism Analysis and Troubleshooting Practices
Data Intelligent Scheduling and Related Practices for Performance Optimization
TDSQL Boundless Selection Guide and Practical Tutorial
Developer Guide
Developer Guide (MySQL Compatibility Mode)
Developer Guide (HBase Compatibility Mode)
Performance Tuning
Performance Tuning Overview
SQL Tuning
DDL Tuning
Performance White Paper
Performance Overview
TPC-C Test
Sysbench Test
API Documentation
History
Introduction
API Category
Making API Requests
Instance APIs
Security Group APIs
Task APIs
Backup APIs
Rollback APIs
Parameter APIs
Database APIs
Data Types
Error Codes
General Reference
System Architecture
SQL Reference
Database Parameter Description
TPC-H benchmark data model reference
Error Code Information
Security and Compliance
FAQs
Agreements
Service Level Agreement
Terms of Service
Privacy Policy
Data Processing And Security Agreement
Contact Us
Glossary

Data Model

PDF
포커스 모드
폰트 크기
마지막 업데이트 시간: 2026-03-06 18:50:08

HBase Model

HBase Data Model

HBase's data model is a multidimensional mapping that can be represented as:
(row_key, column family, column, version) → value
row_key: the unique identifier for a row, sorted lexicographically.
column family: a collection of columns, which is the basic unit of physical storage. Each column family contains an unlimited number of columns and supports dynamic addition. The same column can contain multiple data versions, and the version number is usually represented by a timestamp.
column: a specific field within a column family, consisting of the column family name and the column qualifier (Qualifier).
version: the version number of data, usually represented by a timestamp.
value: the actual stored data.
cell: the smallest unit of data storage, uniquely identified by (row_key, column family, column, version).

HBase Storage Structure

Namespace: a logical container for tables, used to organize and manage tables. Tables belong to a specific Namespace.
Table: The basic unit of data storage in HBase. Tables are divided into multiple Regions based on the range of row key (Row Key). Each Region is responsible for storing a portion of the data.
Region: Each Region contains multiple Stores. Each Store corresponds to a column family (Column Family).
Store (storage unit): Store is the basic unit for storing data in a Region and corresponds one-to-one with a column family. Each Store maintains an independent LSM structure, including MemStore and StoreFile. The number of column families is typically limited, and it is recommended not to exceed 3-5.
MemStore: MemStore is an in-memory write cache that provides high write performance.
StoreFile: StoreFile is a storage file on disk, based on the HFile format, supporting efficient reading and compression.
Block: Block is the basic storage unit within a StoreFile, supporting efficient random reads.
HBase's storage structure can be represented by the following hierarchical relationships:
Table
├── Region (divided by row key range)
│ ├── Store (each column family corresponds to one Store)
│ │ ├── MemStore (in-memory write cache)
│ │ └── StoreFile (on-disk storage file)
│ │ └── Block (data block in the file)
│ └── ...
└── ...

TDSQL Boundless Data Model

Role Mapping

HBase Master is analogous to the MC (Metadata Cluster) in TDSQL Boundless.
HBase Region Server is analogous to the TDStore node in TDSQL Boundless.

Table Mapping Rules

One-to-many mapping: One HBase Table corresponds to multiple tables in TDSQL Boundless.
Column Family mapping: Each Column Family corresponds to one TDSQL Boundless table, and the table name follows the format of HBase table name_Column Family name.
Column mapping: A specific version of each column in HBase corresponds to a row of data in the TDSQL Boundless table.
Primary key design: The primary key of the TDSQL Boundless table is HBase Row Key + Column Qualifier + Version.

Mapping Example

Assuming the HBase table ht1 contains two column families cf1 and cf2, TDSQL Boundless will create two tables internally: ht1_cf1 and ht1_cf2.
HBase data example
row key
column family
column
version
value
row1
cf1
a
100
v1
row1
cf1
b
100
v2
row1
cf1
b
110
v3
row1
cf2
c
120
v4
row2
cf1
d
120
v5
row2
cf2
d
130
v6
TDSQL Boundless table data
create table ht1_cf1 (
K varbinary(1024),
Q varbinary(256),
T bigint,
V MediumBlob NOT NULL,
primary key(K, Q, T)) HBase;
create table ht1_cf2 (
K varbinary(1024),
Q varbinary(256),
T bigint,
V MediumBlob NOT NULL,
primary key(K, Q, T)) HBase;
table ht1_cf1
Primary Key (K + Q + T)
Value (V)
row1 + a + 100
v1
row1 + b + 100
v2
row1 + b + 110
v3
row2 + d + 120
v5
table ht1_cf2
Primary Key (K + Q + T)
Value (V)
row1 + c + 120
v4
row2 + d + 130
v6

도움말 및 지원

문제 해결에 도움이 되었나요?

피드백