DataHub is a CKafka service module that connects data sources to data processing systems. It preprocesses data obtained from business data sources and then distributes the output data to offline/online processing systems. In this way, it decouples these systems from business data sources.
DataHub supports accessing different types of data generated by various data sources for unified management and distribution to downstream offline/online processing systems, forming a clear data flow channel.
DataHub supports the following types of data sources: actively reported data, services, and logs.
You can configure log data source access in the shipper's console instead of the CKafka console.
Data access consists of two parts:
An SDK is provided. The process is as shown below:
Complete productized configuration GUIs are offered for service, log, and API data sources, eliminating your need to care about the underlying implementation.