DataHub is a data access and processing platform in Tencent Cloud for one-stop data access, processing, and distribution. It can continuously receive and collect data from applications, web, cloud product logs, and other sources, and process the data in real time. This helps build a data flow linkage at low costs to connect data sources and data processing systems.
DataHub features high availability, scalability, and security as well as real-time processing.
DataHub is based on the distributed deployment of CKafka to deliver a high stability.
DataHub efficiently collects and processes data in real time.
DataHub clusters are horizontally scalable, where instances can be seamlessly upgraded. The underlying system automatically scales elastically based on the business scope, and the scaling process is imperceptible to the upper-layer businesses.
Tenants are isolated at the network level, so the network access to instances is naturally isolated among different accounts. CAM authentication of the management streams and SASL permission control of the data streams are supported for strict access control.
DataHub can interconnect with more than 13 Tencent Cloud services for fast deployment, including TKE, COS, ES, CLS, and CDWCH.
DataHub provides a complete set of Ops services empowered by the Tencent Cloud platform, including multidimensional monitoring of and alarming for tenant isolation, access control, message retention query, consumer details viewing, etc.
CKafka is a high-throughput scalable distributed messaging system. Based on the publish/subscribe pattern, it enables async interactions between producers and consumers through message decoupling. It has many strengths, such as data compression and offline/real-time data processing.
As a feature module of CKafka, DataHub allows you to perform GUI-enabled configuration in CKafka to connect common data sources and receivers immediately. It connects data sources to data processing systems so as to decouple such systems from business data sources.
DataHub supports accessing different types of data generated by various data sources for unified management and distribution to downstream offline/online processing systems, forming a clear data flow channel.
DataHub can access different types of data from various data sources to simply cleanse, filter, and convert the data to generate unified structured data, making subsequent output, analysis, and archiving much easier.