Tencent Cloud WeData

Kudu Data Source

Last updated: 2024-11-01 17:52:37

Use Limits

1. The Kudu reader's concurrency settings take effect only when both upperBound and lowerBound are configured.
2. upperBound and lowerBound in the Kudu reader are of type long, so only integer or time-type fields can be used as bounds.
3. The reader connects directly to the Kudu server through kudu-client to read data, so WHERE conditions do not support Impala SQL syntax.
4. WHERE condition syntax for incremental synchronization: create_time>='${yyyy-MM-dd-1d HH:mm:ss}' and create_time<'${yyyy-MM-dd HH:mm:ss}'
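To illustrate the incremental WHERE condition in item 4, the following minimal Python sketch renders the one-day window for a given scheduled time. It assumes that `${yyyy-MM-dd-1d HH:mm:ss}` resolves to the scheduled time minus one day and `${yyyy-MM-dd HH:mm:ss}` to the scheduled time itself; the function name `incremental_where` is hypothetical and is not part of WeData.

```python
from datetime import datetime, timedelta

# Assumed rendering of the scheduling parameters as 'yyyy-MM-dd HH:mm:ss'.
FMT = "%Y-%m-%d %H:%M:%S"


def incremental_where(run_time: datetime, column: str = "create_time") -> str:
    """Build a half-open one-day window [run_time - 1 day, run_time)."""
    start = (run_time - timedelta(days=1)).strftime(FMT)
    end = run_time.strftime(FMT)
    return f"{column}>='{start}' and {column}<'{end}'"


print(incremental_where(datetime(2023, 7, 10)))
# create_time>='2023-07-09 00:00:00' and create_time<'2023-07-10 00:00:00'
```

Because the lower limit is inclusive and the upper limit is exclusive, consecutive daily runs neither overlap nor skip rows.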
The bounds currently accept integers and date functions. Date functions are used as follows:
// Converts to a 13-digit timestamp (milliseconds)
TimestampMillis('yyyy-MM-ddTHH:mm:00+0800')
TimestampMillis('2023-07-10T00:00:00+0800')
TimestampMillis('2023-07-10 00:00:00')
TimestampMillis('2023-07-10')

// Converts to a 10-digit timestamp (seconds)
TimestampSeconds('yyyy-MM-ddTHH:mm:00+0800')
TimestampSeconds('2023-07-10T00:00:00+0800')
TimestampSeconds('2023-07-10 00:00:00')
TimestampSeconds('2023-07-10')
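
As a rough check of the values these date functions should produce, the Python sketch below computes the epoch timestamp for the form that carries an explicit UTC offset. The helpers `timestamp_seconds` and `timestamp_millis` are hypothetical Python stand-ins, not the WeData implementation. The forms without an offset are left out because the source does not state which time zone they are interpreted in.

```python
from datetime import datetime


def timestamp_seconds(text: str) -> int:
    """Parse 'yyyy-MM-ddTHH:mm:ss+0800' and return epoch seconds (10 digits)."""
    parsed = datetime.strptime(text, "%Y-%m-%dT%H:%M:%S%z")
    return int(parsed.timestamp())


def timestamp_millis(text: str) -> int:
    """Same instant expressed in epoch milliseconds (13 digits)."""
    return timestamp_seconds(text) * 1000


print(timestamp_seconds("2023-07-10T00:00:00+0800"))  # 1688918400
print(timestamp_millis("2023-07-10T00:00:00+0800"))   # 1688918400000
```

Both results fit in a long, which is the type the bounds require.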

Kudu Offline Single Table Read Node Configuration




| Parameter | Description |
| --- | --- |
| Data Source | An available Kudu data source. |
| Database | Select the database to read from, or enter its name manually. By default, the database bound to the data source is used; other databases must be entered manually. If the data source network is not connected and the database information cannot be fetched directly, you can enter the database name manually. Data synchronization can still be performed when the Data Integration network is connected. |
| Table | Select the table to read from, or enter its name manually. |
| Split Key | The field used to shard the data. Once it is specified, concurrent tasks are launched for data synchronization. You can use a column of the source table as the split key; the primary key or an indexed column is recommended. |
| Filter Condition (Optional) | In practice, a common pattern is to synchronize only the current day's data by setting the WHERE condition to gmt_create>$bizdate. A WHERE condition is an effective way to perform incremental synchronization. If the WHERE condition is left empty, including when its key or value is missing, the task synchronizes the full data. |
| upperBound | Partition upper limit. If the CREATE TABLE statement defines the partition as "5" <= values <= "10", lowerBound is "5" and upperBound is "10". If it defines the partition as value = "x", lowerBound is "x" and upperBound is "x\\000". |
| lowerBound | Partition lower limit. If the CREATE TABLE statement defines the partition as "5" <= values <= "10", lowerBound is "5" and upperBound is "10". If it defines the partition as value = "x", lowerBound is "x" and upperBound is "x\\000". |
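
The role of the two bounds in concurrent reads can be pictured with a short Python sketch. It is an illustration under stated assumptions, not the reader's actual algorithm: `split_range` and `single_value_bounds` are hypothetical helpers, the sub-ranges are treated as half-open intervals, and the `\\000` suffix in the table above is read as a trailing NUL character.

```python
from typing import List, Tuple


def split_range(lower: int, upper: int, parts: int) -> List[Tuple[int, int]]:
    """Split [lower, upper) into up to `parts` contiguous half-open sub-ranges."""
    if parts < 1 or upper <= lower:
        raise ValueError("need parts >= 1 and upper > lower")
    parts = min(parts, upper - lower)  # never create empty sub-ranges
    edges = [lower + (upper - lower) * i // parts for i in range(parts + 1)]
    return list(zip(edges[:-1], edges[1:]))


def single_value_bounds(value: str) -> Tuple[str, str]:
    """Bounds that cover exactly one string value: the upper bound is the
    smallest string that sorts after `value` (assumed NUL suffix)."""
    return value, value + "\x00"


print(split_range(0, 10, 3))           # [(0, 3), (3, 6), (6, 10)]
print(single_value_bounds("x"))
```

Without both bounds there is no finite range to divide, which is consistent with the limit that concurrency only takes effect when upperBound and lowerBound are both set.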

Kudu Offline Single Table Write Node Configuration




| Parameter | Description |
| --- | --- |
| Data Destination | The Kudu data source to write to. |
| Database | Select the database to write to, or enter its name manually. By default, the database bound to the data source is used; other databases must be entered manually. If the data source network is not connected and the database information cannot be fetched directly, you can enter the database name manually. Data synchronization can still be performed when the Data Integration network is connected. |
| Table | Select the table to write to, or enter its name manually. If the data source network is not connected and the table information cannot be fetched directly, you can enter the table name manually. Data synchronization can still be performed when the Data Integration network is connected. |
| Whether to Clear Table | Choose whether to clear the Kudu table before writing to it. |
| Write Mode | Kudu writes support three modes. Append: when a row conflicts with the primary key/unique index, the conflicting row is not written. Overwrite: when a row conflicts with the primary key/unique index, the original row is deleted before the new row is inserted. On Duplicate Key: when a row conflicts with the primary key/unique index, the new row replaces the specified fields. |
| Batch Submission Size | Number of records submitted in one batch. Batching greatly reduces the number of network interactions between the data synchronization system and Kudu and improves overall throughput. If the value is set too high, it can cause an out-of-memory (OOM) exception in the data synchronization process. |
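
The difference between the three write modes can be shown with a small in-memory Python simulation. It only models the conflict behavior described above on a dictionary keyed by primary key; `apply_write`, the mode strings, and the `update_fields` argument are hypothetical names chosen for the example and do not correspond to any WeData or Kudu API.

```python
from typing import Dict, Iterable, List, Optional

Row = Dict[str, object]
MODES = ("append", "overwrite", "on_duplicate_key")


def apply_write(table: Dict[object, Row], rows: Iterable[Row], mode: str,
                key: str = "id",
                update_fields: Optional[List[str]] = None) -> Dict[object, Row]:
    """Return a copy of `table` after writing `rows` under the given mode."""
    if mode not in MODES:
        raise ValueError(f"unknown mode: {mode}")
    result = {pk: dict(row) for pk, row in table.items()}
    for row in rows:
        pk = row[key]
        if pk not in result:
            result[pk] = dict(row)           # no conflict: plain insert
        elif mode == "append":
            continue                         # conflicting row is not written
        elif mode == "overwrite":
            result[pk] = dict(row)           # old row removed, new row inserted
        else:                                # on_duplicate_key
            for field in update_fields or []:
                result[pk][field] = row[field]  # only the listed fields change
    return result


existing = {1: {"id": 1, "name": "a", "score": 10}}
incoming = [{"id": 1, "name": "b", "score": 20}, {"id": 2, "name": "c", "score": 30}]
print(apply_write(existing, incoming, "on_duplicate_key", update_fields=["score"]))
```

In the usage above, row 1 keeps its original name and takes only the new score, while row 2 is inserted unchanged because it does not conflict.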

Data type conversion support

Read

| Kudu Data Type | Internal Type |
| --- | --- |
| int8, int16, int32, int64 | Long |
| float, double, decimal | Double |
| string, varchar | String |
| unixtime_micros, date | Date |
| binary | Bytes |
| bool | Boolean |
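
For quick reference when validating a column list, the read mapping can be written as a lookup table. The Python sketch below simply restates the table above; `READ_TYPE_GROUPS` and `internal_type_for` are hypothetical names used for illustration only.

```python
# Internal type -> Kudu types that are read into it (restating the read table).
READ_TYPE_GROUPS = {
    "Long": ["int8", "int16", "int32", "int64"],
    "Double": ["float", "double", "decimal"],
    "String": ["string", "varchar"],
    "Date": ["unixtime_micros", "date"],
    "Bytes": ["binary"],
    "Boolean": ["bool"],
}

# Flatten into a direct Kudu type -> internal type lookup.
READ_TYPE_MAP = {
    kudu_type: internal
    for internal, kudu_types in READ_TYPE_GROUPS.items()
    for kudu_type in kudu_types
}


def internal_type_for(kudu_type: str) -> str:
    """Return the internal type for a Kudu column type, or raise if unsupported."""
    try:
        return READ_TYPE_MAP[kudu_type.strip().lower()]
    except KeyError:
        raise ValueError(f"unsupported Kudu type: {kudu_type}") from None


print(internal_type_for("int32"))            # Long
print(internal_type_for("unixtime_micros"))  # Date
```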

Write

| Internal Type | Kudu Data Type |
| --- | --- |
| Long | int8, int16, int32, int64 |
| Double | float, double, decimal |
| String | string, date |
| Date | unixtime_micros, varchar |
| Bytes | binary |
| Boolean | bool |

