tencent cloud

Video on Demand

Media Processing Template Configuration

PDF
Focus Mode
Font Size
Last updated: 2026-04-22 17:45:52
Media Processing Templates serve as a fundamental feature within Tencent Cloud's Video on Demand (VOD) service, offering users a standardized and reusable configuration solution for media processing. These predefined templates enable users to efficiently execute various media processing operations—including video transcoding, DRM encryption, video enhancement, subtitle recognition, and watermark addition—without the need for manual parameter configuration for each task.

Media Processing Page

1. Log in to the VOD console and select Application Management on the left sidebar.
2. Select the target application.
3. You will be directed to the Service Overview page.
4. Select Media Processing > Media Processing Template on the left sidebar.


Introduction to Media Processing Templates

Level-1 Template
Level-2 Template
Included Functions
Audio/Video Transcoding Template
Video Transcoding Template
Scenario-based Transcoding Template
Audio Transcoding Template
Transmuxing Template
The Video Transcoding Template supports TSC transcoding.
Adaptive Bitrate Streaming Template
Video Adaptive Bitrate Streaming
Audio Adaptive Bitrate Streaming
The Video Adaptive Bitrate Streaming Template supports TSC transcoding and video encryption (including SimpleAES encryption and Multi-DRM encryption).
Media AI Template
AI Subtitles
AI Analysis
New Intelligent Subtitle feature: supports speech recognition from video audio and translation into multilingual subtitles.
Intelligent Analysis: supports intelligent tag recognition, intelligent classification, intelligent cover recognition, frame-level tags, video clipping, highlight editing, and intro/outro analysis.
Audio/Video Enhancement Template
Transcoding + enhancement
Watermark Template
Image watermark, text watermark, SVG watermark
Screenshot Template
Snapshot at specified time points, sampled snapshots, sprite sheet snapshots
GIF Template
Convert a specified segment of a video into a GIF or WebP animated image
Content Audit Template
Mainly used to review video visuals, on-screen text, and speech text
AI Recognition Template
Legacy Intelligent Subtitle feature:
ASR capability (speech recognition): speaker speech recognition in video;
Subtitle generation and translation: generate subtitles from recognized content or translate them into other subtitle languages.
Other Template
Instant Image Processing, Audio/Video Quality Restoration
Audio/Video Quality Restoration only supports preset templates:
Audio denoising;
Audio/Video quality restoration (such as AI frame interpolation, etc.)

System-Preset Templates

The system provides a comprehensive collection of preset templates for rapid deployment. Refer to the Preset Media Processing Templates for details.

Audio/Video Transcoding Template

Video Transcoding Template

Click Create a Transcoding Template under the Video Transcoding tab to create a custom video transcoding template.
Template name: Up to 64 characters; supports Chinese characters, letters, digits, spaces, underscores (_), hyphens (-), and periods (.).
Encapsulation format: MP4, FLV, or HLS.
Configuration items: Video and audio parameters.
Video parameters:
Encoding standard: H.264, H.265, or AV1.
Video bitrate: 128-35,000 Kbps.
Resolution: Set the long and short sides or width and height of the video. Value range: 128-4096 px.
Frame rate: 0-100 fps.
TSC: When Ultra HD is enabled, transcoding will be charged as Ultra HD, improving transcoding quality at a higher price.
Encoding strategy: When Ultra HD is enabled, you can select from the following options: Default Strategy, Quality Priority, Bitrate Priority, or Custom.
Audio parameters:
Encoding standard: AAC or MP3.
Sample rate: 32,000 Hz, 44,100 Hz, or 48,000 Hz.
Audio bitrate: 26-256 Kbps.
Sound channel: Mono-channel or dual-channel.
Common template: Whether to set the template as a common template
The created template will be displayed in the template list. You can view, edit, or delete the template, or set it as a common template.


Audio Transcoding Template

Select the Audio Transcoding tab and click Create Audio Template to create a custom audio transcoding template.
Template name: Up to 64 characters; supports Chinese characters, letters, digits, spaces, underscores (_), hyphens (-), and periods (.).
Encapsulation format: MP3, FLAC, OGG, or M4A.
Audio parameters:
Encoding standard: MP3 is supported if the encapsulation format is MP3; FLAC is supported if the encapsulation format is FLAC or OGG; MP3, AAC; and AC3 are supported if the encapsulation format is M4A.
Sample rate: 32,000 Hz, 44,100 Hz, or 48,000 Hz.
Audio bitrate: 0 or 26-256 Kbps.
Sound channel: Mono-channel or dual-channel.
Common template: Whether to set the template as a common template.
The created template will be displayed in the template list. You can view, edit, or delete the template, or set it as a common template.

Adaptive Bitrate Streaming Template

You can use the preset adaptive bitrate streaming template or create custom templates.
Basic Information
Template name: Up to 64 characters; supports Chinese characters, letters, digits, spaces, underscores (_), hyphens (-), and periods (.).
Encapsulation format: HLS, MPEG-DASH
Encryption type: Unencryption, SimpleAES, DRM. The DRM encryption supports single Widevine encryption, single FairPlay encryption, and MultiDRM encryption (simultaneously supporting both FairPlay and Widevine with a single encryption process). For the complete workflow of content transcoding, encryption, and playback of encrypted videos, please refer to the documentation: Comprehensive Guide to Playing DRM-Encrypted Videos. Additionally, for DRM encryption, you can choose between CBCS or CENC encryption methods.
Switch from Low Resolution to High Resolution: Enable or disable.
Transcoding method: General, TSC
Note:
Regarding the DRM encryption section, please adhere to the following table rules.
Packaging Type
Segment Type
Supported Encryption Methods
HLS
fmp4
Unencrypted, SimpleAES, FairPlay (cbcs), Widevine (cbcs/cenc), FairPlay & Widevine (cbcs)
HLS
ts
Unencrypted, SimpleAES, FairPlay (cbcs)
MPEG-DASH
fmp4
Unencrypted, Widevine (cbcs/cenc)
MPEG-DASH
ts
Selection not supported
Substream info
Video encoding standard: H.264 or H.265.
Video bitrate: empty or 128-35,000 Kbps.
Video resolution: Set the long and short sides or width and height of the video. Value range: 0 or 128-4096 px.
Video frame rate: 0-60 fps.
Audio encoding standard: AAC or MP3.
Sample rate: 32,000 Hz, 44,100 Hz, or 48,000 Hz.
Audio bitrate: 0 or 26-256 Kbps.
Sound channel: Mono-channel or dual-channel.
The templates created are displayed in the template list, where you can view, edit, or delete a template.
Note:
You need to add at least one substream to create an adaptive bitrate streaming template.

Watermark Template

You can create a custom watermark template to add the watermark you upload to a specific position of the video.
Template name: Up to 64 characters; supports Chinese characters, letters, digits, spaces, underscores (_), hyphens (-), and periods (.).
Watermark type: Image watermark, text watermark, or SVG watermark.
Watermark image: PNG and APNG images are supported. For better visual experience, transparent images in PNG format are recommended. The image cannot exceed 200 KB in size or 200 x 200 px in dimensions.
Watermark position: Upper left (default), upper right, lower left, or lower right.
Horizontal offset: The percentage indicates the ratio of the horizontal distance between the watermark and the origin (top-left corner by default) to the video width.
Vertical offset: The percentage indicates the ratio of the vertical distance between the watermark and the origin (top-left corner by default) to the video height.
Image size: You can resize the watermark by specifying the width and height in pixels or as a percentage of the original dimensions.
The watermark template list shows information including template name, format, type, position, and size. You can also preview watermarks or click the buttons in the operation column to view, edit, delete, or set a template as the default.
Note:
If the horizontal offset and vertical offset are both 0%, the watermark will be in the top-left corner of the video. If the horizontal offset and vertical offset are both 99%, the watermark will be in the bottom-right corner of the video.

Screenshot Template

You can create screenshot templates to take different types of screenshots (time point, sampled, or image sprite) of uploaded videos.
The screenshot template list shows information including template name, screenshot type, and image size. You can click the buttons in the operation column to view details or edit or delete a template.

Time point screenshot

Select time point screenshot as the screenshot type and specify the time points to take screenshots in task flow settings. For detailed directions, see Task Flow Settings.
Template name: Up to 64 characters; supports Chinese characters, letters, digits, spaces, underscores (_), hyphens (-), and periods (.).
Image format: JPG
Image dimension: 0 or 128-4096 px

Sampled screenshot

Select sampled screenshot as the screenshot type.
Template name: Up to 64 characters; supports Chinese characters, letters, digits, spaces, underscores (_), hyphens (-), and periods (.).
Image format: JPG
Image dimension: 0 or 128-4096 px
Sampling interval: Specify the interval as a percentage (up to 100%) of the total video duration, or specify the number of seconds between screenshots.

Image sprite screenshot

Select image sprite screenshot as the screenshot type.
Template name: Up to 64 characters; supports Chinese characters, letters, digits, spaces, underscores (_), hyphens (-), and periods (.).
Image format: JPG
Image dimension: 0 or 128-4096 px
Sampling interval: Specify the interval as a percentage (up to 100%) of the total video duration, or specify the number of seconds between screenshots.
Rows: Enter a positive integer. The number of subimage rows multiplied by subimage columns must not exceed 100.
Columns: Enter a positive integer. The number of subimage rows multiplied by subimage columns must not exceed 100.

Animated Image Template

You can create an animated image template to take an animated screenshot with a specific duration. You need to specify the time period for taking the screenshot in task flow settings. For detailed directions, see Task Flow Settings.
Image type: WEBP or GIF
Frame rate: 1-30 fps
Image quality: 1-100
Image dimension: 0-1920 px

Content Moderation Template

Select the Content moderation tab and click Create template to create a custom moderation template.
Template name: Up to 64 characters; supports Chinese characters, letters, digits, spaces, underscores (_), hyphens (-), and periods (.).
Template details:
Moderation Items: The moderation labels to use, which may include porn, terrorism, and moaning.
Label value: The values of the labels to use, which correspond to the Label parameter in ReviewAudioVideoTaskOutput.
Moderated content type: The type of content on which moderation will be performed. The following content types are supported:
Images (Form in ReviewAudioVideoTaskOutput is Image)
Text in images (Form in ReviewAudioVideoTaskOutput is OCR)
Speech (Form in ReviewAudioVideoTaskOutput is ASR)
Sounds (Form in ReviewAudioVideoTaskOutput is Voice)
The supported review items are as illustrated below:

The templates created are displayed in the template list, where you can view, edit, or delete a template.
Note:
A content moderation template can be used for both audio/video moderation and image moderation.
For image moderation, the moderated content types speech and sounds are not supported. For details, see Image Processing - Template Settings - Content Moderation Template.

Intelligent Recognition Template

You can directly use the system's preset Intelligent Recognition Template or create your own Definition Template according to business needs.
Template name: Up to 64 characters; supports Chinese characters, letters, digits, spaces, underscores (_), hyphens (-), and periods (.).
Content Recognition Configuration: Speech Text Recognition, Speech Translation, other recognition items (e.g., Face Recognition) temporarily support control display only, editing is not supported.
Speech Full Text Recognition enabled:
Speech Source Language: Simplified Chinese, English, Japanese, Cantonese.
Subtitle File Format: .vtt, .srt.
Speech Translation enabled:
Speech Source Language: Simplified Chinese, English, Japanese, Korean, Vietnamese, Malay, Thai, Portuguese, Turkish, Arabic, Spanish, Hindi, French.
Translation Target Language: Simplified Chinese, English, Japanese, Korean, French, Spanish, Italian, German, Turkish, Russian, Portuguese, Vietnamese, Bahasa Indonesia, Thai, Malay, Arabic, Hindi.
Subtitle File Format: .vtt, .srt.

Help and Support

Was this page helpful?

Help us improve! Rate your documentation experience in 5 mins.

Feedback