tencent cloud

Media Processing Service

업데이트 및 공지 사항
Release Notes
공지 사항
제품 소개
Overview
제품 기능
제품 장점
Use Cases
구매 가이드
과금 안내
구매 가이드
연장 안내
연체 안내
환불 설명
시작하기
콘솔 가이드
개요
Creating Tasks
Tasks
Orchestrations
템플릿 관리
리소스 팩 관리
Video Evaluation
AIGC Content Generation
Terminal SDK
Subtitle Editing Tool
사용량 통계
Cloud Access Management Sample
FAQs
서비스 기본 개념 관련
Account Authorization
업로드 콜백 관련
트랜스코딩 다운로드 관련
기능 구현 관련
Related Agreement
Service Level Agreement
개인 정보 보호 정책
데이터 처리 및 보안 계약

Smart Subtitle Template

PDF
포커스 모드
폰트 크기
마지막 업데이트 시간: 2026-03-10 17:03:31

Overview

The smart subtitle feature can perform Automatic Speech Recognition (ASR) on the voice information in on-demand video files or live streams, convert it into subtitles, and translate it into multiple languages. It is suitable for scenarios such as live subtitles for streaming and video translation for overseas distribution. You can create custom smart subtitle templates and preset different process parameters for different use cases, to facilitate subsequent reuse.




Prerequisites

1. You have signed up for a Tencent Cloud account.
2. You have activated Media Processing Service (MPS) and logged in to the MPS Console.

Template Configuration Guide

1. Go to Templates > Media AI Template > Smart Subtitle. The system provides several preset templates that you can use directly, or you can click Create Smart Subtitle Template to create a custom template.

2. Enter the smart subtitle template creation page. The following configuration parameters are supported.

Processing Type of "Generate Subtitles based on ASR"


The following configuration items are supported:
Configuration Item
Description
Template Name
Only Chinese characters, English letters, digits, underscores (_), hyphens (-), and periods (.) are supported, and the length should not exceed 64 characters.
Processing Type
Automatic speech recognition (ASR)-based subtitle generation: Input an audio file or a video file, generate a subtitle file based on ASR, and translate subtitles.
Translation of subtitle files: Input a subtitle file, translate source subtitles to subtitles in different languages through LLM, and generate a new subtitle file.
Processing Type of "ASR-based Subtitle Generation"
Source Language of the Audio Content of the Video
Select the source language of the audio content of the video. The following is a list of the supported source languages.
Associate the ASR Hotword Lexicon
Commonly used words in speech are generally accurately recognized. However, for specific names (such as personal names, product names, and company names) and industry-specific terms (such as the brand name Zhiling, the building name Binhai Dasha, Hebao in the insurance field, or Cunchutong in cloud storage), the recognition accuracy may decrease. To address this issue, we provide a custom hotword lexicon feature. You can add specialized terms through manual input or file import, significantly improving the accuracy of ASR. For detailed configuration guidance, see Custom Hotword Lexicons.
Note:
Currently, the hotword lexicon only supports Mandarin and English. Therefore, the hotword lexicon can only be associated when the source language of the audio content of the video is Simplified Chinese or English.
Enable Translation
After it is enabled, the subtitles in the source language can be translated into the specified language, making it suitable for scenarios such as video translation for global distribution.
Note:
Certain languages are not available for selection currently. If you need to use them, contact us for support.
Target Language for Translation
Subtitle Type
If the translation feature is not enabled, the output subtitles will only contain content in the source language.
When the translation feature is enabled, the following subtitle output types are supported:
Monolingual (target language)
Bilingual (source language + target language)
Subtitle File Format
Currently, the WebVTT and SRT formats are supported. If you only need a subtitle content callback and do not need the output of a subtitle file, you can select Do Not Generate Subtitle Files.

Supported Languages

Processing Type as "OCR-based Subtitle Generation"


The following configuration items are supported:
Configuration Item
Description
Template Name
Only Chinese characters, English letters, digits, underscores (_), hyphens (-), and periods (.) are supported. The name cannot exceed 64 characters.
When the Processing Type Is "OCR-based Subtitle Generation"
Subtitle region
Select the area where subtitles are to be extracted. The preset region is the lower center of the video by default. To avoid missing subtitles, we recommend switching to Custom, uploading a preview video or image, and manually selecting the subtitle region.
Source language
Chinese/English: The source language is Chinese, English, or a mix of both.
Other: languages other than Chinese and English.
Enable translation
When enabled, source language subtitles can be translated into a specified target language, making it suitable for scenarios such as video translation for global distribution.
Note:
Some languages are not yet available for selection. If you need to use them, please contact us for support.
Target language
Subtitle file format
WebVTT and SRT are supported.
Subtitle type
If translation is disabled, the subtitle content is monolingual (source language).
If translation is enabled, the subtitle content supports monolingual (target language) or bilingual (two lines: source language + target language).

Supported Languages

Processing Type of "Translate Subtitle Files"


Configuration Item
Description
Template Name
Only Chinese characters, English letters, digits, underscores (_), hyphens (-), and periods (.) are supported, and the length should not exceed 64 characters.
Processing Type of "Translation of Subtitle Files"
Source Languages
The default value is "automatic identification". You can also specify a language. The following is a list of the supported source languages.
Target Language for Translation
After it is enabled, the subtitles in the source language can be translated into the specified language, making it suitable for scenarios such as video translation for global distribution. The following is a list of the supported target languages for translation.
Note:
Certain languages are not available for selection currently. If you need to use them, contact us for support.
Subtitle File Format
It is supported to select WebVTT, SRT, or consistent with the source file.
Subtitle Type
The following options are supported:
Monolingual (target language): The generated target language subtitle file contains single-line monolingual subtitles.
Bilingual (source language + target language): The generated target language subtitle file contains dual-line bilingual subtitles.

Supported Languages

Billing Instructions

ASR-based Subtitle Generation

Identifying the Source Language Only and "ASR" Fees Are Charged

If you set the processing type to "ASR-based subtitle generation" without enabling translation, ASR fees will be charged. For pricing details, see Billing Instructions.


Translating into One Target Language: "Speech Translation" Fee Is Charged

If you set the processing type to "ASR-based subtitle generation" with translation enabled, and select a target language for translation, "speech translation" fees will be charged. For pricing details, see Billing Instructions.


If There Are More Than 1 Target Language for Translation, "Subtitle Translation (Additional Languages)" Fees Will Be Charged for Each Extra Language

If you set the processing type to "ASR-based subtitle generation" with translation enabled, and select n target languages for translation, the fees to be charged are calculated as follows: Speech translation fees × 1 + Subtitle translation (additional languages) fees × (n-1).
Take the following configuration as an example. The fees are calculated as follows: Speech translation fees × 1 + Subtitle translation (additional languages) fees × 3. For pricing details, see Billing Instructions.


OCR-based Subtitle Generation

Extracting the Source Language Only: "OCR-based Subtitle Extraction" Fee Is Charged

If "OCR-based subtitle generation" is selected as the processing type and translation is disabled, "OCR-based subtitle extraction" fee will be charged. For pricing details, see relevant Billing Overview documentation.


Translating into One Target Language: "OCR-based Subtitle Extraction and Translation" Fee Is Charged

If "OCR-based subtitle generation" is selected as the processing type, translation is enabled, and one target language is selected, "OCR-based subtitle extraction and translation" fee will be charged. For pricing details, see relevant Billing Overview documentation.


Translating into More than One Target Language: "Subtitle Translation (Additional Languages)" Fee Is Charged for Each Additional Language

If "OCR-based subtitle generation" is selected as the processing type, translation is enabled, and n target languages are selected, the fees will be calculated as follows: "OCR-based subtitle extraction and translation" fee x 1 + "subtitle translation (additional languages)" fee x (n-1).
For example, with the configuration below, the fees will be calculated as follows: "OCR-based subtitle extraction and translation" fee x 1 + "subtitle translation (additional languages)" fee x 3. For pricing details, see relevant Billing Overview documentation.


Subtitle File Translation

Translating into One Target Language: "Subtitle Translation" Fee Is Charged

If "Subtitle file translation" is selected as the processing type and one target language is selected, "subtitle translation" fee will be charged. For pricing details, see relevant Billing Overview documentation.


Translating into More than One Target Language: "Subtitle Translation (Additional Languages)" Fee Is Charged for Each Additional Language

If "Subtitle file translation" is selected as the processing type and n target languages are selected, the fees will be calculated as follows: "subtitle translation" fee x 1 + "subtitle translation (additional languages)" fee x (n-1).
For example, with the configuration below, the fees will be calculated as follows: "subtitle translation" fee x 1 + "subtitle translation (additional languages)" fee x 3. For pricing details, see relevant Billing Overview documentation.


도움말 및 지원

문제 해결에 도움이 되었나요?

피드백