tencent cloud

媒体处理

动态与公告
产品动态
产品公告
产品简介
产品概述
产品功能
产品优势
应用场景
购买指南
计费说明
购买指引
续费说明
欠费说明
退费说明
快速入门
控制台指南
概览
创建任务
任务管理
编排管理
模板管理
资源包管理
视频评测
AIGC 内容生成
终端 SDK
字幕编辑工具
用量统计
访问管理示例
接入教程
音视频转码接入
音视频增强接入
音频分离接入
数字水印及明水印接入
媒体 AI 接入教程
媒体质检接入
终端 SDK 接入
直播流录制接入
DRM 接入
其他接入教程
场景实践教程
画质提升场景
音视频成本优化场景
短剧出海场景
生成式场景
在线教育场景
API 文档
History
Introduction
API Category
调用方式
发起处理任务相关接口
任务管理相关接口
转码增强模板相关接口
水印模板相关接口
截图模板相关接口
媒体AI模板相关接口
媒体AI-热词库相关接口
媒体AI-样本管理相关接口
媒体质检模板相关接口
直播录制模板相关接口
编排管理相关接口
数据统计相关接口
媒体传输-安全组管理相关接口
解析事件通知相关接口
图片处理模板相关接口
AI创作相关接口
其他接口
Data Types
Error Codes
其他说明文档
WebSocket 识别协议
常见问题
产品基础相关
账号授权相关
任务配置相关
发起任务相关
任务结果查看相关
相关协议
Service Level Agreement
隐私协议
数据处理和安全协议
联系我们
词汇表

RecognizeAudio

PDF
聚焦模式
字号
最后更新时间: 2026-03-10 11:15:00

1. API Description

Domain name for API request: mps.intl.tencentcloudapi.com.

This API is used to return the speech recognition results synchronously.

A maximum of 5 requests can be initiated per second for this API.

We recommend you to use API Explorer
Try it
API Explorer provides a range of capabilities, including online call, signature authentication, SDK code generation, and API quick search. It enables you to view the request, response, and auto-generated examples.

2. Input Parameters

The following request parameter list only provides API request parameters and some common parameters. For the complete common parameter list, see Common Request Parameters.

Parameter Name Required Type Description
Action Yes String Common Params. The value used for this API: RecognizeAudio.
Version Yes String Common Params. The value used for this API: 2019-06-12.
Region No String Common Params. This parameter is not required for this API.
AudioData Yes String Base64-encoded audio data.
Source No String Target language for recognition. If this is not specified, the language is automatically identified (auto).Note: If the automatic identification provides unsatisfactory results, you can specify the language to improve the accuracy.Supported languages:auto: automatic identification.zh: Simplified Chinese.en: English.ja: Japanese.ko: Korean.vi: Vietnamese.ms: Malay.id: Indonesian.fil: Filipino.th: Thai.pt: Portuguese.tr: Turkish.ar: Arabic.es: Spanish.hi: Hindi.fr: French.de: German.it: Italian.yue: Cantonese.ru: Russian.af: Afrikaans.sq: Albanian.am: Amharic.hy: Armenian.az: Azerbaijani.eu: Basque.bn: Bengali.bs: Bosnian.bg: Bulgarian.my: Burmese.ca: Catalan.hr: Croatian.cs: Czech.da: Danish.nl: Dutch.et: Estonian.fi: Finnish.gl: Galician.ka: Georgian.el: Greek.gu: Gujarati.iw: Hebrew.hu: Hungarian.is: Icelandic.jv: Javanese.kn: Kannada.kk: Kazakh.km: Khmer.rw: Kinyarwanda.lo: Lao.lv: Latvian.lt: Lithuanian.mk: Macedonian.ml: Malayalam.mr: Marathi.mn: Mongolian.ne: Nepali.no: Norwegian Bokmal.fa: Persian.pl: Polish.ro: Romanian.sr: Serbian.si: Sinhala.sk: Slovak.sl: Slovenian.st: Southern Sotho.su: Sundanese.sw: Swahili.sv: Swedish.ta: Tamil.te: Telugu.ts: Tsonga.uk: Ukrainian.ur: Urdu.uz: Uzbek.ve: Vendaxh: Xhosa.zu: Zulu.
AudioFormat No String Audio data format. Default value: pcm.Supported formats:pcm (mono 16-bit PCM data with a sample rate of 16000).ogg-opus (mono Opus-encoded Ogg data with sample rates of 16000, 24000, or 48000).
SampleRate No Integer Audio sample rate.Supported sample rates:pcm 16000
ogg-opus 16000 / 24000 / 48000
UserExtPara No String Extended parameter. This is left empty by default. Use this parameter for special requirements.

3. Output Parameters

Parameter Name Type Description
Text String Recognition result of the entire audio.
AudioLength Float Audio duration, in seconds.
Sentence Array of RecognizeAudioSentence Recognition results of individual sentences.
RequestId String The unique request ID, generated by the server, will be returned for every request (if the request fails to reach the server for other reasons, the request will not obtain a RequestId). RequestId is required for locating a problem.

4. Example

Example1 RecognizeAudio

Input Example

POST / HTTP/1.1
Host: mps.intl.tencentcloudapi.com
Content-Type: application/json
X-TC-Action: RecognizeAudio
<Common request parameters>

{
    "Source": "zh",
    "AudioFormat": "pcm",
    "AudioData": "KwDn/zIA5v///wUA0v8D"
}

Output Example

{
    "RequestId": "f27f3866-3882-4c18-a4ac-3b3d83fd2f5a",
    "Response": {
        "AudioLength": 4.2,
        "RequestId": "f27f3866-3882-4c18-a4ac-3b3d83fd2f5a",
        "Sentence": [
            {
                "End": 3.59,
                "Start": 0.03,
                "Text": "The third and fourth meetings were held at the Great Hall of the People.",
                "WordsInfo": [
                    {
                        "End": 0.27,
                        "Start": 0.03,
                        "Word": "The"
                    },
                    {
                        "End": 0.43,
                        "Start": 0.27,
                        "Word": "third"
                    },
                    {
                        "End": 0.51,
                        "Start": 0.43,
                        "Word": "and"
                    },
                    {
                        "End": 0.71,
                        "Start": 0.51,
                        "Word": "fourth"
                    },
                    {
                        "End": 0.91,
                        "Start": 0.71,
                        "Word": "meetings"
                    },
                    {
                        "End": 1.07,
                        "Start": 0.91,
                        "Word": "were"
                    },
                    {
                        "End": 1.55,
                        "Start": 1.39,
                        "Word": "held"
                    },
                    {
                        "End": 1.71,
                        "Start": 1.55,
                        "Word": "at"
                    },
                    {
                        "End": 1.95,
                        "Start": 1.75,
                        "Word": "the"
                    },
                    {
                        "End": 2.15,
                        "Start": 1.95,
                        "Word": "Great"
                    },
                    {
                        "End": 2.39,
                        "Start": 2.15,
                        "Word": "Hall"
                    },
                    {
                        "End": 2.75,
                        "Start": 2.47,
                        "Word": "of"
                    },
                    {
                        "End": 2.91,
                        "Start": 2.75,
                        "Word": "the"
                    },
                    {
                        "End": 3.11,
                        "Start": 2.91,
                        "Word": "People."
                    }
                ]
            }
        ],
        "Text": "The third and fourth meetings were held at the Great Hall of the People."
    }
}

5. Developer Resources

SDK

TencentCloud API 3.0 integrates SDKs that support various programming languages to make it easier for you to call APIs.

Command Line Interface

6. Error Code

The following only lists the error codes related to the API business logic. For other error codes, see Common Error Codes.

Error Code Description
InternalError.RecognitionError Recognition error.
InvalidParameterValue.AudioData Invalid audio data.
InvalidParameterValue.AudioDataTooLong The audio data is too long.
InvalidParameterValue.AudioFormat Unsupported audio data format.
InvalidParameterValue.SampleRate Invalid audio sample rate.
InvalidParameterValue.SourceLanguage SourceLanguage parameter error.
ResourceNotFound.UserUnregister The user is not registered.

帮助和支持

本页内容是否解决了您的问题?

填写满意度调查问卷,共创更好文档体验。

文档反馈