System voice catalog
These tables list all system voices for CosyVoice.
Requirements:
- Each `model` supports a specific set of voices; you cannot mix them across models.
- The `text` value must be in a language the voice supports. Unsupported languages cause pronunciation errors or unnatural output.
- For SSML-enabled voices, pass SSML content in the `text` parameter. See SSML guide.
- For Instruct-enabled voices, pass Instruct-formatted text in the `instruction` parameter.
- For timestamp-enabled voices, set `word_timestamp_enabled` to `true` (`enableWordTimestamp` in Java SDK). Timestamp data is delivered as word-level timing events through WebSocket callbacks, not embedded in the audio data.
  - Python SDK: pass `word_timestamp_enabled` through `additional_params`: `SpeechSynthesizer(model=model, voice=voice, callback=callback, additional_params={"word_timestamp_enabled": True})`
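The Python SDK note above can be sketched as a small helper that assembles the constructor arguments. This is only an illustration: `build_synthesizer_kwargs` is a hypothetical name, the block does not import the SDK, and the final call is shown as a comment because it assumes the SDK is installed and a callback object exists.

```python
from typing import Any, Dict, Optional


def build_synthesizer_kwargs(
    model: str,
    voice: str,
    callback: Optional[Any] = None,
    word_timestamps: bool = False,
) -> Dict[str, Any]:
    """Assemble keyword arguments for SpeechSynthesizer (hypothetical helper)."""
    kwargs: Dict[str, Any] = {"model": model, "voice": voice, "callback": callback}
    if word_timestamps:
        # Per the requirement above, the flag travels inside additional_params.
        kwargs["additional_params"] = {"word_timestamp_enabled": True}
    return kwargs


kwargs = build_synthesizer_kwargs(
    "cosyvoice-v3-flash", "longanyang", word_timestamps=True
)
# With the SDK installed and a callback defined, this would be passed as:
#   synthesizer = SpeechSynthesizer(**kwargs)
print(kwargs["additional_params"])
```

Keeping the flag out of the dictionary when timestamps are not requested leaves the call identical to a plain synthesis request.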
cosyvoice-v3-flash
| Scenario | Voice name | Voice parameter | Attribute | Age | Language | SSML | Instruct | Timestamp | Sample |
|---|---|---|---|---|---|---|---|---|---|
| Social companionship (Benchmark voice) | Long Anyang | longanyang | Sunny young man | 20-30 | Chinese (Mandarin), English | Yes | Yes | Yes | Sample |
| Social companionship (Benchmark voice) | Long Anhuan | longanhuan | Energetic and cheerful female | 20-30 | Chinese (Mandarin), English | Yes | Yes | Yes | Sample |
| Child's voice (Benchmark voice) | Long Huhu | longhuhu_v3 | Innocent and lively girl | 6-10 | Chinese (Mandarin), English | Yes | Yes | Yes | Sample |
| Dialects | Long Anyue | longanyue_v3 | Energetic Cantonese male | 25-35 | Chinese (Cantonese), English | Yes | No | Yes | Sample |
| Dialects | Long Shange | longshange_v3 | Authentic Northern Shaanxi male | 25-35 | Chinese (Shaanxi dialect), English | Yes | No | Yes | Sample |
| Dialects | Long Anmin | longanmin_v3 | Innocent young girl | 18-25 | Chinese (Minnan dialect), English | Yes | No | Yes | Sample |
| Telesales | Long Yingxiao | longyingxiao_v3 | Sweet-voiced saleswoman | 20-25 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Customer service | Long Yingxun | longyingxun_v3 | Young and inexperienced male | 20-25 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Customer service | Long Yingtao | longyingtao_v3 | Gentle and composed female | 25-30 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Voice assistant | Long Anyun | longanyun_v3 | Homey and warm-hearted male | 30-35 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Voice assistant | Long Anwen | longanwen_v3 | Elegant and intellectual female | 25-35 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Voice assistant | Long Anli | longanli_v3 | Crisp and composed female | 25-35 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Voice assistant | Long Anlang | longanlang_v3 | Fresh and crisp male | 20-25 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Voice assistant | Long Yingmu | longyingmu_v3 | Elegant and intellectual female | 25-30 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Social companionship | Long Anzhi | longanzhi_v3 | Wise and mature young male | 25-35 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Social companionship | Long Anya | longanya_v3 | Elegant and classy female | 25-35 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Social companionship | Long Anqin | longanqin_v3 | Approachable and lively female | 20-25 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Audiobooks | Long Wanjun | longwanjun_v3 | Delicate and soft-spoken female | 20-30 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Audiobooks | Long Yichen | longyichen_v3 | Free-spirited and energetic male | 20-30 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Audiobooks | Long Laobo | longlaobo_v3 | World-weary old man | 60+ | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Audiobooks | Long Laoyi | longlaoyi_v3 | Worldly and calm auntie | 60+ | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Short video voice-over | Long Jiqi | longjiqi_v3 | Dorky and cute robot | 20-30 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Short video voice-over | Long Houge | longhouge_v3 | Classic Monkey King | 20-25 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Short video voice-over | Long Daiyu | longdaiyu_v3 | Delicate and talented female voice | 15-25 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
| Livestreaming e-commerce | Long Anxuan | longanxuan_v3 | Classic female livestreamer | 30-40 | Chinese (Mandarin), English | Yes | No | Yes | Sample |
Instruct settings
The following voices support Instruct. Pass these formats in the `instruction` parameter:
Long Anyang (longanyang)
- Set emotion
  - Format: "你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<情感值>` with an emotion value.)
  - Example: "你说话的情感是neutral。"
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set scenario and emotion
  - Format: "你正在进行<场景>,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<场景>` with a scenario and `<情感值>` with an emotion value.)
  - Example: "你正在进行闲聊互动,你说话的情感是neutral。"
  - Scenarios: `闲聊互动` (casual conversation), `新闻播报` (news broadcast), `广告促销` (ad promotion), `比赛解说` (sports commentary), `一些儿童内容解说` (commentary for children's content), `语音导航` (voice navigation), `脱口秀表演` (stand-up comedy).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set role and emotion
  - Format: "你现在说话的角色是<角色>,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<角色>` with a role and `<情感值>` with an emotion value.)
  - Example: "你现在说话的角色是一个旁白,你说话的情感是neutral。"
  - Roles: `一个旁白` (narrator).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set identity and emotion
  - Format: "你正在以一个<身份>的身份说话,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<身份>` with an identity and `<情感值>` with an emotion value.)
  - Example: "你正在以一个故事机的身份说话,你说话的情感是neutral。"
  - Identities: `故事机` (storytelling machine).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
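The four Long Anyang formats above can be filled in programmatically. The following is a minimal sketch: the function names are hypothetical, and the templates are copied from the format strings listed for this voice, including the closing "。".

```python
FULL_STOP = "。"  # the formats above require the Chinese full stop


def emotion_instruction(emotion: str) -> str:
    """Fill the 'Set emotion' format."""
    return f"你说话的情感是{emotion}{FULL_STOP}"


def scenario_instruction(scenario: str, emotion: str) -> str:
    """Fill the 'Set scenario and emotion' format."""
    return f"你正在进行{scenario},你说话的情感是{emotion}{FULL_STOP}"


def role_instruction(role: str, emotion: str) -> str:
    """Fill the 'Set role and emotion' format."""
    return f"你现在说话的角色是{role},你说话的情感是{emotion}{FULL_STOP}"


def identity_instruction(identity: str, emotion: str) -> str:
    """Fill the 'Set identity and emotion' format."""
    return f"你正在以一个{identity}的身份说话,你说话的情感是{emotion}{FULL_STOP}"


# The result is the string to pass in the instruction parameter.
instruction = scenario_instruction("新闻播报", "happy")
print(instruction)
```

These helpers only format strings; they do not check that the scenario, role, or identity is one the voice supports.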
Long Anhuan (longanhuan)
- Set emotion
  - Format: "你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<情感值>` with an emotion value.)
  - Example: "你说话的情感是neutral。"
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set scenario and emotion
  - Format: "你正在进行<场景>,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<场景>` with a scenario and `<情感值>` with an emotion value.)
  - Example: "你正在进行闲聊对话,你说话的情感是neutral。"
  - Scenarios: `闲聊对话` (casual conversation), `比赛解说` (sports commentary), `深夜电台广播` (late-night radio broadcast), `剧情解说` (plot summary), `诗歌朗诵` (poetry reading), `科普知识推广` (science popularization), `产品推广` (product promotion), `脱口秀表演` (stand-up comedy).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set role and emotion
  - Format: "你现在说话的角色是<角色>,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<角色>` with a role and `<情感值>` with an emotion value.)
  - Example: "你说话的角色是温和客服,你说话的情感是neutral。"
  - Roles: `温和客服` (gentle customer service).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
Long Huhu (longhuhu_v3)
- Set emotion
  - Format: "你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<情感值>` with an emotion value.)
  - Example: "你说话的情感是neutral。"
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set scenario and emotion
  - Format: "你正在进行<场景>,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<场景>` with a scenario and `<情感值>` with an emotion value.)
  - Example: "你正在进行自由对话,你说话的情感是neutral。"
  - Scenarios: `自由对话` (free conversation), `广告促销` (ad promotion).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set role and emotion
  - Format: "你现在说话的角色是<角色>,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<角色>` with a role and `<情感值>` with an emotion value.)
  - Example: "你说话的角色是傲娇公主,你说话的情感是neutral。"
  - Roles: `傲娇公主` (tsundere princess), `元气少女` (energetic girl), `可爱孩童` (cute child), `机器人` (robot), `小猪佩奇` (Peppa Pig).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set identity and emotion
  - Format: "你正在以一个<身份>的身份说话,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<身份>` with an identity and `<情感值>` with an emotion value.)
  - Example: "你正在以一个故事机的身份说话,你说话的情感是neutral。"
  - Identities: `故事机` (storytelling machine), `儿童玩具` (children's toy).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
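Because the scenario lists differ from voice to voice, it can help to validate the values before building the instruction. The sketch below covers only the scenario format for the three cosyvoice-v3-flash Instruct voices; the names `SCENARIOS`, `EMOTIONS`, and `checked_scenario_instruction` are hypothetical, and the values are copied from the lists above.

```python
EMOTIONS = {"neutral", "fearful", "angry", "sad", "surprised", "happy", "disgusted"}

# Scenario values per voice, copied from the cosyvoice-v3-flash lists above.
SCENARIOS = {
    "longanyang": {
        "闲聊互动", "新闻播报", "广告促销", "比赛解说",
        "一些儿童内容解说", "语音导航", "脱口秀表演",
    },
    "longanhuan": {
        "闲聊对话", "比赛解说", "深夜电台广播", "剧情解说",
        "诗歌朗诵", "科普知识推广", "产品推广", "脱口秀表演",
    },
    "longhuhu_v3": {"自由对话", "广告促销"},
}


def checked_scenario_instruction(voice: str, scenario: str, emotion: str) -> str:
    """Build a scenario instruction, rejecting values the voice does not list."""
    if voice not in SCENARIOS:
        raise ValueError(f"no scenario list recorded for voice {voice!r}")
    if scenario not in SCENARIOS[voice]:
        raise ValueError(f"{scenario!r} is not a listed scenario for {voice!r}")
    if emotion not in EMOTIONS:
        raise ValueError(f"{emotion!r} is not a listed emotion")
    return f"你正在进行{scenario},你说话的情感是{emotion}。"


print(checked_scenario_instruction("longhuhu_v3", "自由对话", "happy"))
```

The same pattern would extend to roles and identities with one more lookup table per format.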
cosyvoice-v3-plus
| Scenario | Voice name | Voice parameter | Attribute | Age | Language | SSML | Instruct | Timestamp | Sample |
|---|---|---|---|---|---|---|---|---|---|
| Social companionship (Benchmark voice) | Long Anyang | longanyang | Sunny young man | 20-30 | Chinese (Mandarin), English | Yes | Yes | Yes | Sample |
| Social companionship (Benchmark voice) | Long Anhuan | longanhuan | Energetic and cheerful female | 20-30 | Chinese (Mandarin), English | Yes | Yes | Yes | Sample |
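Since voices cannot be mixed across models, a request can be checked against the two tables before it is sent. The following is a rough sketch with hypothetical names (`MODEL_VOICES`, `INSTRUCT_VOICES`, `is_supported`); the voice parameters and Instruct flags are copied from the cosyvoice-v3-flash and cosyvoice-v3-plus tables.

```python
# Voice parameters per model, copied from the two tables above.
MODEL_VOICES = {
    "cosyvoice-v3-flash": frozenset({
        "longanyang", "longanhuan", "longhuhu_v3", "longanyue_v3",
        "longshange_v3", "longanmin_v3", "longyingxiao_v3", "longyingxun_v3",
        "longyingtao_v3", "longanyun_v3", "longanwen_v3", "longanli_v3",
        "longanlang_v3", "longyingmu_v3", "longanzhi_v3", "longanya_v3",
        "longanqin_v3", "longwanjun_v3", "longyichen_v3", "longlaobo_v3",
        "longlaoyi_v3", "longjiqi_v3", "longhouge_v3", "longdaiyu_v3",
        "longanxuan_v3",
    }),
    "cosyvoice-v3-plus": frozenset({"longanyang", "longanhuan"}),
}

# Voices whose Instruct column is "Yes" in the tables.
INSTRUCT_VOICES = frozenset({"longanyang", "longanhuan", "longhuhu_v3"})


def is_supported(model: str, voice: str, needs_instruct: bool = False) -> bool:
    """Return True if the voice belongs to the model and meets the Instruct need."""
    if voice not in MODEL_VOICES.get(model, frozenset()):
        return False
    if needs_instruct and voice not in INSTRUCT_VOICES:
        return False
    return True


print(is_supported("cosyvoice-v3-plus", "longhuhu_v3"))  # not in the plus table
print(is_supported("cosyvoice-v3-flash", "longhuhu_v3", needs_instruct=True))
```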
Instruct settings
The following voices support Instruct. Pass these formats in the `instruction` parameter:
Long Anyang (longanyang)
- Set emotion
  - Format: "你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<情感值>` with an emotion value.)
  - Example: "你说话的情感是neutral。"
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set scenario and emotion
  - Format: "你正在进行<场景>,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<场景>` with a scenario and `<情感值>` with an emotion value.)
  - Example: "你正在进行闲聊互动,你说话的情感是neutral。"
  - Scenarios: `闲聊互动` (casual conversation), `新闻播报` (news broadcast), `广告促销` (ad promotion), `比赛解说` (sports commentary), `一些儿童内容解说` (commentary for children's content), `语音导航` (voice navigation), `脱口秀表演` (stand-up comedy).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set role and emotion
  - Format: "你现在说话的角色是<角色>,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<角色>` with a role and `<情感值>` with an emotion value.)
  - Example: "你现在说话的角色是一个旁白,你说话的情感是neutral。"
  - Roles: `一个旁白` (narrator).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set identity and emotion
  - Format: "你正在以一个<身份>的身份说话,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<身份>` with an identity and `<情感值>` with an emotion value.)
  - Example: "你正在以一个故事机的身份说话,你说话的情感是neutral。"
  - Identities: `故事机` (storytelling machine).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
Long Anhuan (longanhuan)
- Set emotion
  - Format: "你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<情感值>` with an emotion value.)
  - Example: "你说话的情感是neutral。"
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set scenario and emotion
  - Format: "你正在进行<场景>,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<场景>` with a scenario and `<情感值>` with an emotion value.)
  - Example: "你正在进行闲聊对话,你说话的情感是neutral。"
  - Scenarios: `闲聊对话` (casual conversation), `比赛解说` (sports commentary), `深夜电台广播` (late-night radio broadcast), `诗歌朗诵` (poetry reading), `科普知识推广` (science popularization), `产品推广` (product promotion), `脱口秀表演` (stand-up comedy).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.
- Set role and emotion
  - Format: "你现在说话的角色是<角色>,你说话的情感是<情感值>。" (Use Chinese. Include "。". Replace `<角色>` with a role and `<情感值>` with an emotion value.)
  - Example: "你说话的角色是温和客服,你说话的情感是neutral。"
  - Roles: `温和客服` (gentle customer service).
  - Emotions: `neutral`, `fearful`, `angry`, `sad`, `surprised`, `happy`, `disgusted`.