Parameters | Type | Mandatory | Description |
AnchorName | string | Yes | Anchor name: 1. This name is mainly used to identify the custom avatar/sound and can be customized depending on the actual situation. 2. Naming reference: If the number of customizations for the anchor is only one, it can be named directly after the anchor, such as "Zhang San"; you can also add the name of the clothing for better identification, such as "Zhang San blue suit". 3. No more than 50 characters, at least 2 characters, only allow Chinese characters, letters, numbers, underscores, and hyphens. 4. Must be unique. |
MakeType | string | Yes | Customization categories: 1. IMAGE: 2D small sample - standard version (Studio Lip-Sync) avatar customization. 2. IMAGE_GENERAL: 2D small sample - standard version (general lip movement) avatar customization. 3. IMAGE_4K: 2D small sample - high-precision version (Studio Lip-Sync) avatar customization. 4. IMAGE_PHOTO: 2D small sample (photo avatar) avatar customization. 5. VOICE: Timbre Clone (Basic Version). 6. ZERO_SHOT_VOICE: Timbre Clone (Ultra-fast Version). |
IdentityCosUrl | string | No | Except for the IMAGE_PHOTO and ZERO_SHOT_VOICE customization types, fill in either the IdentityWrittenCosUrl or another customization type, or both. Requirements for the URL address of the video format authorization letter: 1. The URL address is the resource URL uploaded to the specified path via upload material to Tencent Cloud COS, with the idcard path added, such as: domain name/customer-pipeline/{number}/{uuid}/idcard/a.mp4. 2. This format is mainly suitable for oral authorization letters, or you can submit written authorization letters as clear and complete videos. |
IdentityWrittenCosUrl | string | No | Except for the IMAGE_PHOTO and ZERO_SHOT_VOICE customization types, fill in either the IdentityCosUrl or another customization type, or both. Requirements for the URL address of the PDF format authorization letter: 1. The URL address is the resource URL uploaded to the specified path via upload material to Tencent Cloud COS, with the idcard path added, such as: domain name/customer-pipeline/{number}/{uuid}/idcard/b.pdf 2. This format is mainly suitable for "written authorization letter" and should be submitted as a clear and complete scanned copy. |
MaterialCosUrl | string | No | Except for the ZERO_SHOT_VOICE customization type, all other customization types are required. Avatar customization material URL address requirements: 1. The URL address is the resource URL uploaded to the specified path by uploading material to Tencent Cloud COS, with the video path added, such as /customer-pipline/{number}/{uuid}/video/c.mp4. 2. Video size, no more than 5GB, 4K no more than 10GB. 3. Video duration: Exclusive lip sync version: custom duration of 2-10 minutes General lip sync version: custom duration of 1-10 minutes High-precision version customization: duration of 2-10 minutes 4. Video resolution: 1080P, 4K (3840x2160). For high-precision version customization, it must be 4K. 5. Video aspect ratio: 16:9 (or 9:16) 6. Video frame rate: No less than 25 fps and no more than 60 fps. 7. Video format: MP4 and MOV Customized Text To Speech material URL address requirements: 1. The URL address is the resource URL uploaded to the specified path by uploading material to Tencent Cloud COS, with the audio path added, such as /customer-pipline/{number}/{uuid}/audio/c.zip. 2. Compressed Archive Format: .zip format, each zip package for customizing one voice type; do not create new folders when compressing, select all wav files directly for compression. 3. Notes for audio files within a single zip file: Audio quantity: A zip file can contain one or more wav formatted audio files, with a total amount no more than 10. Audio size: The total size of the audio files in each zip file is no more than 1G. Audio format: Each audio file must be in wav format. For other audio formats, convert to wav format before recompressing into zip. Audio sample rate: A sampling rate equal to or greater than 24k is sufficient, recommend 24K or 36K. Audio naming: Name should not contain spaces or special characters, extension must be lowercase ".wav". Photo Digital Human customization material URL address requirements: 1. The URL address is the resource URL uploaded to the specified path by uploading material to Tencent Cloud COS, with the photo path added, such as /customer-pipline/{number}/{uuid}/photo/example.png. 2. Image name: no less than 2 characters, only allow Chinese characters, letters, digits, underscores, hyphens. Image format: support jpg, jpeg, png, webp. Image size: no more than 16M. Image aspect ratio: support 1:1, 9:16, 16:9, 4:3. 3. Clear front view of the person, face-centered, natural expression, mouth closed. |
IsHaveBackground | bool | No | Avatar customization type: Whether the trained avatar retains the original background. The default is "No", meaning without retaining the original background, and the background can be changed as needed during application. |
SexType | string | Yes | Gender: MALE: Male FEMALE: Female |
Notes | string | No | Customized remarks, within 100 characters. |
TextDriver | string | No | Text content used to generate the driving demo, with a character length between 4 and 1000 (including SSML tags, each Chinese character counts as one character). |
VoiceDriverCosFile | string | No | Requirements for the audio file path for generating the driving demo: 1. The URL address is the resource URL uploaded to the specified path by uploading material to Tencent Cloud COS, with the audio path added, such as /customer-pipline/{number}/{uuid}/audio/example.wav. 2. Audio file size cannot exceed 10M. Format support: wav, mp3, wma, m4a, aac. |
AudioId | string | No | For the ZERO_SHOT_VOICE customization type, it is required to fill in the AudioId returned after passing the Query Audio Quality Inspection Task Progress. |
Parameters | Type | Mandatory | Description |
TaskId | string | Yes | The created task ID. Access the Progress Query API with the TaskId to obtain the production progress and result. |
{"Header": {},"Payload": {"AnchorName": "Jingxuan in a green dress, sitting pose","MakeType": "IMAGE","IdentityCosUrl": "XXXX","MaterialCosUrl": "YYYY","IsRemoveBackground": true}}
{"Header": {"Code": 0,"DialogID": "","Message": "","RequestID": "123"},"Payload": {"TaskId": "666"}}
Feedback