动态与公告
- 产品动态
- 产品公告
产品简介
购买指南
- 计费说明
- 购买指引
- 续费说明
- 欠费说明
- 退费说明
快速入门
控制台指南
- 概览
- 创建任务
- 任务管理
- 编排管理
- 模板管理
- 资源包管理
- 视频评测
- AIGC 内容生成
- 终端 SDK
- 字幕编辑工具
- 用量统计
- 访问管理示例
接入教程
- 音视频转码接入
- 音视频增强接入
- 音频分离接入
- 数字水印及明水印接入
- 媒体 AI 接入教程
- 媒体质检接入
- 终端 SDK 接入
- 直播流录制接入
- DRM 接入
- 其他接入教程
场景实践教程
API 文档
- History
- Introduction
- API Category
- 调用方式
- 发起处理任务相关接口
- 任务管理相关接口
- 转码增强模板相关接口
- 水印模板相关接口
- 截图模板相关接口
- 媒体AI模板相关接口
- 媒体AI-热词库相关接口
- 媒体AI-样本管理相关接口
- 媒体质检模板相关接口
- 直播录制模板相关接口
- 编排管理相关接口
- 数据统计相关接口
- 媒体传输-安全组管理相关接口
- 解析事件通知相关接口
- AI创作相关接口
- 图片处理模板相关接口
- Media AI – Episode Management APIs
- Media AI – Dubbing APIs
- 其他接口
- Data Types
- Error Codes
其他说明文档
- WebSocket 识别协议
常见问题
相关协议
联系我们
词汇表

RecognizeAudio

Download

聚焦模式

字号

最后更新时间： 2026-05-11 20:07:35

1. API Description

Domain name for API request: mps.intl.tencentcloudapi.com.

This API is used to return the speech recognition results synchronously.

A maximum of 5 requests can be initiated per second for this API.

We recommend you to use API Explorer

Try it

API Explorer provides a range of capabilities, including online call, signature authentication, SDK code generation, and API quick search. It enables you to view the request, response, and auto-generated examples.

2. Input Parameters

The following request parameter list only provides API request parameters and some common parameters. For the complete common parameter list, see Common Request Parameters.

Parameter Name	Required	Type	Description
Action	Yes	String	Common Params. The value used for this API: RecognizeAudio.
Version	Yes	String	Common Params. The value used for this API: 2019-06-12.
Region	No	String	Common Params. This parameter is not required for this API.
AudioData	Yes	String	Base64-encoded audio data.
Source	No	String	Target language for recognition. If this is not specified, the language is automatically identified (auto).Note: If the automatic identification provides unsatisfactory results, you can specify the language to improve the accuracy.Supported languages:auto: automatic identification.zh: Simplified Chinese.en: English.ja: Japanese.ko: Korean.vi: Vietnamese.ms: Malay.id: Indonesian.fil: Filipino.th: Thai.pt: Portuguese.tr: Turkish.ar: Arabic.es: Spanish.hi: Hindi.fr: French.de: German.it: Italian.yue: Cantonese.ru: Russian.af: Afrikaans.sq: Albanian.am: Amharic.hy: Armenian.az: Azerbaijani.eu: Basque.bn: Bengali.bs: Bosnian.bg: Bulgarian.my: Burmese.ca: Catalan.hr: Croatian.cs: Czech.da: Danish.nl: Dutch.et: Estonian.fi: Finnish.gl: Galician.ka: Georgian.el: Greek.gu: Gujarati.iw: Hebrew.hu: Hungarian.is: Icelandic.jv: Javanese.kn: Kannada.kk: Kazakh.km: Khmer.rw: Kinyarwanda.lo: Lao.lv: Latvian.lt: Lithuanian.mk: Macedonian.ml: Malayalam.mr: Marathi.mn: Mongolian.ne: Nepali.no: Norwegian Bokmal.fa: Persian.pl: Polish.ro: Romanian.sr: Serbian.si: Sinhala.sk: Slovak.sl: Slovenian.st: Southern Sotho.su: Sundanese.sw: Swahili.sv: Swedish.ta: Tamil.te: Telugu.ts: Tsonga.uk: Ukrainian.ur: Urdu.uz: Uzbek.ve: Vendaxh: Xhosa.zu: Zulu.
AudioFormat	No	String	Audio data format. Default value: pcm.Supported formats:pcm (mono 16-bit PCM data with a sample rate of 16000).ogg-opus (mono Opus-encoded Ogg data with sample rates of 16000, 24000, or 48000).
SampleRate	No	Integer	Audio sample rate.Supported sample rates:pcm 16000 ogg-opus 16000 / 24000 / 48000
UserExtPara	No	String	Extended parameter. This is left empty by default. Use this parameter for special requirements.

3. Output Parameters

Parameter Name	Type	Description
Text	String	Recognition result of the entire audio.
AudioLength	Float	Audio duration, in seconds.
Sentence	Array of RecognizeAudioSentence	Recognition results of individual sentences.
RequestId	String	The unique request ID, generated by the server, will be returned for every request (if the request fails to reach the server for other reasons, the request will not obtain a RequestId). RequestId is required for locating a problem.

4. Example

Example1 RecognizeAudio

Input Example

POST / HTTP/1.1
Host: mps.intl.tencentcloudapi.com
Content-Type: application/json
X-TC-Action: RecognizeAudio
<Common request parameters>

{
    "Source": "zh",
    "AudioFormat": "pcm",
    "AudioData": "KwDn/zIA5v///wUA0v8D"
}

Output Example

{
    "RequestId": "f27f3866-3882-4c18-a4ac-3b3d83fd2f5a",
    "Response": {
        "AudioLength": 4.2,
        "RequestId": "f27f3866-3882-4c18-a4ac-3b3d83fd2f5a",
        "Sentence": [
            {
                "End": 3.59,
                "Start": 0.03,
                "Text": "The third and fourth meetings were held at the Great Hall of the People.",
                "WordsInfo": [
                    {
                        "End": 0.27,
                        "Start": 0.03,
                        "Word": "The"
                    },
                    {
                        "End": 0.43,
                        "Start": 0.27,
                        "Word": "third"
                    },
                    {
                        "End": 0.51,
                        "Start": 0.43,
                        "Word": "and"
                    },
                    {
                        "End": 0.71,
                        "Start": 0.51,
                        "Word": "fourth"
                    },
                    {
                        "End": 0.91,
                        "Start": 0.71,
                        "Word": "meetings"
                    },
                    {
                        "End": 1.07,
                        "Start": 0.91,
                        "Word": "were"
                    },
                    {
                        "End": 1.55,
                        "Start": 1.39,
                        "Word": "held"
                    },
                    {
                        "End": 1.71,
                        "Start": 1.55,
                        "Word": "at"
                    },
                    {
                        "End": 1.95,
                        "Start": 1.75,
                        "Word": "the"
                    },
                    {
                        "End": 2.15,
                        "Start": 1.95,
                        "Word": "Great"
                    },
                    {
                        "End": 2.39,
                        "Start": 2.15,
                        "Word": "Hall"
                    },
                    {
                        "End": 2.75,
                        "Start": 2.47,
                        "Word": "of"
                    },
                    {
                        "End": 2.91,
                        "Start": 2.75,
                        "Word": "the"
                    },
                    {
                        "End": 3.11,
                        "Start": 2.91,
                        "Word": "People."
                    }
                ]
            }
        ],
        "Text": "The third and fourth meetings were held at the Great Hall of the People."
    }
}

5. Developer Resources

SDK

TencentCloud API 3.0 integrates SDKs that support various programming languages to make it easier for you to call APIs.

Command Line Interface

Tencent Cloud CLI 3.0

6. Error Code

The following only lists the error codes related to the API business logic. For other error codes, see Common Error Codes.

Error Code	Description
InternalError.RecognitionError	Recognition error.
InvalidParameterValue.AudioData	Invalid audio data.
InvalidParameterValue.AudioDataTooLong	The audio data is too long.
InvalidParameterValue.AudioFormat	Unsupported audio data format.
InvalidParameterValue.SampleRate	Invalid audio sample rate.
InvalidParameterValue.SourceLanguage	SourceLanguage parameter error.
ResourceNotFound.UserUnregister	The user is not registered.

帮助和支持

本页内容是否解决了您的问题？

您也可以联系销售或提交工单以寻求帮助。

填写满意度调查问卷，共创更好文档体验。

文档反馈

tencent cloud

媒体处理