distributedLoad [-A] [--replication <num>] [--active-jobs <num>] [--expire-time <arg>] <path>
-A: Enables the atomic distributed load capability.
--active-jobs <active job count>: Maximum number of data loading tasks that can be active at the same time. The default upper limit is 3000. Once this limit is reached, new tasks wait for running tasks to complete before they execute.
--expire-time <arg>: Expiration time after which the temporary directory used for data loading is cleared. The default is 24 hours. The unit defaults to ms; s, min, and hour are also supported (for example, 100s).
--replication <replicas>: Number of block replicas loaded by each loading task. The default is 1.
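The unit rule for --expire-time can be illustrated with a small sketch. It assumes only what the option description states: a bare number is read as milliseconds, and the suffixes s, min, and hour are accepted. The function name `expire_time_to_ms` is hypothetical and is not part of GooseFS; the real command-line parser may accept other spellings.

```python
import re

# Multipliers to milliseconds for the unit suffixes named in the option text.
_UNIT_TO_MS = {"ms": 1, "s": 1_000, "min": 60_000, "hour": 3_600_000}


def expire_time_to_ms(value: str) -> int:
    """Convert an --expire-time style string such as '100s' to milliseconds.

    Hypothetical helper for illustration; not a GooseFS API.
    """
    match = re.fullmatch(r"\s*(\d+)\s*([a-z]*)\s*", value.lower())
    if match is None:
        raise ValueError(f"not a duration: {value!r}")
    number, unit = match.groups()
    unit = unit or "ms"  # a bare number is treated as milliseconds
    if unit not in _UNIT_TO_MS:
        raise ValueError(f"unsupported unit: {unit!r}")
    return int(number) * _UNIT_TO_MS[unit]


if __name__ == "__main__":
    for text in ("100s", "24hour", "500"):
        print(text, "->", expire_time_to_ms(text), "ms")
```

Under these assumptions, the 24-hour default corresponds to 86,400,000 ms, and the example value 100s corresponds to 100,000 ms.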
goosefs.user.block.worker.client.pool.max: Maximum number of block worker clients in the job worker's client pool. Read and write streams obtain their clients from this pool when they are established, so a value that is too small can block client retrieval.
goosefs.job.worker.max.active.task.num: Maximum number of tasks allowed to execute simultaneously.
goosefs.job.worker.threadpool.size: Number of threads for processing tasks.
goosefs.worker.network.reader.buffer.size: Affects worker memory usage.
goosefs.worker.network.block.reader.threads.max: Maximum number of threads the worker uses for reads.
fs.cosn.block.size: Size of the loaded block.
fs.cosn.upload_thread_pool: Size of the read thread pool, which is shared by each worker.
fs.cosn.read.ahead.block.size: Granularity at which COSN requests data from COS.
fs.cosn.read.ahead.queue.size: COSN has a read-ahead feature aimed mainly at sequential reads of large files. A load is also a sequential read, but the files are not necessarily large.
fs.cosn.block.size: The recommended setting is 128 MB.
fs.cosn.upload_thread_pool: The recommended setting is 2 to 3 times the number of CPUs. Increase or decrease the thread pool size based on CPU usage.
fs.cosn.read.ahead.block.size: Adjust according to the block size.
fs.cosn.read.ahead.queue.size: The recommended setting is 8 to 32, chosen according to available memory. Under normal circumstances, the memory used by one file input stream equals fs.cosn.read.ahead.block.size x fs.cosn.read.ahead.queue.size. One block contains fs.cosn.block.size / fs.cosn.read.ahead.block.size read-ahead blocks, which is 32 with the recommended settings, so a queue size above 32 only wastes resources.
goosefs.worker.network.reader.buffer.size: Estimate this value based on memory as well. The total memory occupied by read operations equals worker read concurrency limit x (buffer.size + memory usage of one input stream + length of a single readRequest (default is 1 MB)).
goosefs.job.worker.max.active.task.num: This value can be slightly larger than the value of fs.cosn.upload_thread_pool to fully leverage the capability of COSN. In principle, the value of --active-jobs should be greater than the value of this parameter.
goosefs.user.block.worker.client.pool.max: The default value is 1024. In principle, this value should be twice the value of goosefs.job.worker.max.active.task.num.
goosefs.job.worker.threadpool.size: The default value is 10. This value must be less than or equal to goosefs.job.worker.max.active.task.num.
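The sizing rules above can be worked through as simple arithmetic. The following is a minimal sketch, not a GooseFS or COSN API: all function names are hypothetical, the per-stream memory is read here as read-ahead block size times queue size, and the 4 MB read-ahead block size is an assumption inferred from the stated ratio of 32 against a 128 MB block. The read concurrency (64) and buffer size (4 MB) in the example are placeholder values only.

```python
MB = 1024 * 1024


def max_useful_queue_size(block_size: int, read_ahead_block_size: int) -> int:
    """Number of read-ahead blocks in one block; a larger queue is wasted."""
    return block_size // read_ahead_block_size


def input_stream_memory(read_ahead_block_size: int, queue_size: int) -> int:
    """Memory held by one file input stream, in bytes (assumed reading)."""
    return read_ahead_block_size * queue_size


def worker_read_memory(read_concurrency: int, buffer_size: int,
                       stream_memory: int, read_request_len: int = 1 * MB) -> int:
    """Total read memory: concurrency x (buffer + stream + one readRequest)."""
    return read_concurrency * (buffer_size + stream_memory + read_request_len)


def check_job_worker_settings(max_active_tasks: int, threadpool_size: int,
                              client_pool_max: int, active_jobs: int) -> list:
    """Return the sizing rules from the text that the given values break."""
    problems = []
    if threadpool_size > max_active_tasks:
        problems.append("threadpool.size exceeds max.active.task.num")
    if client_pool_max != 2 * max_active_tasks:
        problems.append("client.pool.max is not twice max.active.task.num")
    if active_jobs <= max_active_tasks:
        problems.append("active-jobs is not greater than max.active.task.num")
    return problems


if __name__ == "__main__":
    block = 128 * MB          # recommended fs.cosn.block.size
    read_ahead = 4 * MB       # assumption: 128 MB / 32
    queue = 8                 # low end of the recommended 8 to 32 range

    print("queue size cap:", max_useful_queue_size(block, read_ahead))
    stream = input_stream_memory(read_ahead, queue)
    print("per-stream memory (MB):", stream // MB)
    total = worker_read_memory(64, 4 * MB, stream)  # placeholder inputs
    print("total read memory (MB):", total // MB)
    print("rule violations:", check_job_worker_settings(512, 10, 1024, 3000))
```

With these placeholder inputs, one stream holds 32 MB and 64 concurrent reads need 64 x (4 + 32 + 1) MB = 2368 MB, which shows how quickly the queue size and read concurrency multiply into worker memory.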