tencent cloud

Cloud Infinite

Release Notes and Announcements
Release Notes
Announcements
Product Introduction
Product Overview
Product Strengths
Use Cases
Feature Overview
Regions and Domains
Specifications and Limits
Billing
Billing Overview
Billing Mode
Billable Items
Free Tier
Payment Overdue
Viewing Bill Details
FAQs
Getting Started
Registering and Logging In
Bind Bucket
Uploading and Processing File
Downloading and Deleting Images
Unbinding Buckets
Using CI via COS
Features
Image Processing
Media Processing
Content Moderation
AI Content Recognition
File Processing
Smart Voice
File processing
User Guide
Overview
Bucket Management
Smart Toolbox
Job and Workflow
Data Monitoring
Usage statistics
Use Cases
Copyright Protection Solutions
Image Processing Practices
Working with API Authorization Policies
Workflow Practices
API Documentation
API Overview
Structure
Common Request Headers
Common Response Headers
Activate Vast Service
Image Processing
AI-Based Content Recognition
Smart Audio
Media Processing
Content Moderation
Document Processing
File Processing
Job and Workflow
Cloud Virus Detection
Error Codes
Request Signature
SDK Documentation
SDK Overview
Android SDK
iOS SDK
COS Android SDK
C SDK
C++ SDK
.NET(C#) SDK
Go SDK
COS iOS SDK
Java SDK
JavaScript SDK
Node.js SDK
PHP SDK
Python SDK
Mini Program SDK
Personal Information Protection Policy for SDK
Security and Compliance
Permission ‍Management
FAQs
Basic Settings
Document Processing
Media Processing
Content Recognition
Smart Audio
Agreements
Service Level Agreement
Contact Us
Glossary

Text To Speech

PDF
포커스 모드
폰트 크기
마지막 업데이트 시간: 2024-01-31 16:33:35

Overview

The text to speech feature converts text to natural-sounding and smooth speeches in a variety of voices in PCM, WAV, or MP3 format through advanced deep learning technology. It comes with various features such as speech speed, voice, and volume adjustment. It is suitable for diverse scenarios, including smart customer service, voice interaction, audiobook, and accessible broadcasting.

Use Cases

Smart customer service

The text to speech feature works with speech recognition and natural language processing modules to close the loop of human-machine interaction in customer service bot and task service robot use cases. The highly natural bot voices make human-machine interaction more natural.

Audiobook

Electronic courseware, novels, and other types of text can be converted to audios of different voices to create audiobooks that can be listened to at any time.

Directions

You can use the text to speech feature through jobs or workflows. In order to improve the operational efficiency and reduce repetitive operations, CI offers the template feature, which is a configuration item in jobs and workflows. You can save common parameter combinations as templates and reuse them directly in subsequent operations, with no need to set the parameters every time you start a job. You can customize text-to-speech templates as follows:
Custom templates: You can create a template in the console as instructed in Template. You can also create, modify, find, and delete a template through API as instructed in Creating Text-to-Speech Template, Updating Text-to-Speech Template, DescribeMediaTemplates, and DeleteMediaTemplate respectively.

Voice description

Name
Voice Parameter Value
Type
Use Case
Supported Languages
Voice Quality
Ruxue
ruxue
Standard female voice
General
Chinese, Chinese-English mix
Standard
Aixiaonan
aixiaonan
Sweet female voice
General, social
Chinese, Chinese-English mix
Premium
Aixiaoxing
aixiaoxing
Commentary male voice
General, commentary
Chinese, Chinese-English mix
Premium
Alice
alice
Standard female voice
General
English
Premium

Multi-sentiment voice description

Name
Voice Parameter Value
Sentiment Category
Aixiaoxing
aixiaoxing
Neutral, broadcasting, calm, excited
Note:
Text to speech supports async and sync modes. If the input text is short, such as in the one-sentence scenario, the sync mode is recommended.

Through job

You can create a text-to-speech job through the console or API for existing data stored in COS.
Console: You can create a text-to-speech job visually in the CI console as instructed in Job.
API: You can create a text-to-speech job through API as instructed in Submitting Text-to-Speech Job.

Through workflow

CI provides the workflow service. You can enable a workflow for a bucket or a specific path. Then, text to speech will be automatically performed on files uploaded to the bucket or path, and the generated audio files will be saved in the specified location.

Creating workflow

You can create a workflow in the CI console as instructed in Workflow.

Creating, deleting, querying, and updating workflow through API

You can create, delete, query, and update a workflow through API as instructed in Creating Workflow, Deleting Workflow, Querying Workflow, and Updating Workflow respectively.

도움말 및 지원

문제 해결에 도움이 되었나요?

피드백