tencent cloud

Feedback

Text To Speech

Last updated: 2022-09-26 15:57:37

    Overview

    The text to speech feature converts text to natural-sounding and smooth speeches in a variety of voices in PCM, WAV, or MP3 format through advanced deep learning technology. It comes with various features such as speech speed, voice, and volume adjustment. It is suitable for diverse scenarios, including smart customer service, voice interaction, audiobook, and accessible broadcasting.

    Use Cases

    Smart customer service

    The text to speech feature works with speech recognition and natural language processing modules to close the loop of human-machine interaction in customer service bot and task service robot use cases. The highly natural bot voices make human-machine interaction more natural.

    Audiobook

    Electronic courseware, novels, and other types of text can be converted to audios of different voices to create audiobooks that can be listened to at any time.

    Directions

    You can use the text to speech feature through jobs or workflows. In order to improve the operational efficiency and reduce repetitive operations, CI offers the template feature, which is a configuration item in jobs and workflows. You can save common parameter combinations as templates and reuse them directly in subsequent operations, with no need to set the parameters every time you start a job. You can customize text-to-speech templates as follows:

    Custom templates: You can create a template in the console as instructed in Template. You can also create, modify, find, and delete a template through API as instructed in Creating Text-to-Speech Template, Updating Text-to-Speech Template, DescribeMediaTemplates, and DeleteMediaTemplate respectively.

    Voice description

    Name Voice Parameter Value Type Use Case Supported Languages Voice Quality
    Ruxue ruxue Standard female voice General Chinese, Chinese-English mix Standard
    Aixiaonan aixiaonan Sweet female voice General, social Chinese, Chinese-English mix Premium
    Aixiaoxing aixiaoxing Commentary male voice General, commentary Chinese, Chinese-English mix Premium
    Alice alice Standard female voice General English Premium

    Multi-sentiment voice description

    Name Voice Parameter Value Sentiment Category
    Aixiaoxing aixiaoxing Neutral, broadcasting, calm, excited
    Note:

    Text to speech supports async and sync modes. If the input text is short, such as in the one-sentence scenario, the sync mode is recommended.

    Through job

    You can create a text-to-speech job through the console or API for existing data stored in COS.

    • Console: You can create a text-to-speech job visually in the CI console as instructed in Job.
    • API: You can create a text-to-speech job through API as instructed in Submitting Text-to-Speech Job.

    Through workflow

    CI provides the workflow service. You can enable a workflow for a bucket or a specific path. Then, text to speech will be automatically performed on files uploaded to the bucket or path, and the generated audio files will be saved in the specified location.

    Creating workflow

    You can create a workflow in the CI console as instructed in Workflow.

    Creating, deleting, querying, and updating workflow through API

    You can create, delete, query, and update a workflow through API as instructed in Creating Workflow, Deleting Workflow, Querying Workflow, and Updating Workflow respectively.

    Contact Us

    Contact our sales team or business advisors to help your business.

    Technical Support

    Open a ticket if you're looking for further assistance. Our Ticket is 7x24 avaliable.

    7x24 Phone Support