Text to Speech Generation
Convert Text into Speech
The text-to-speech feature enables you to convert written text into spoken audio with a selection of voices and languages. You can seamlessly integrate this audio into a clip by specifying the desired voice and the text to be spoken.
{
"asset": {
"type": "text-to-speech",
"text": "Good evening, in Sydney tonight we’re tracking a developing story as unexpected storms roll in across the city, bringing with them flash flooding warnings and major disruptions to the evening commute.",
"voice": "Amy"
},
"start": 0,
"length": "auto"
}
This request will embed the generated speech at the beginning of your video, with the duration automatically matching the length of the generated audio. For more details on optimizing timing, refer to smart clips.
Translating text
To create an audio file in a different language, use the language option. Ensure that you select a voice compatible with the desired language.
{
"asset": {
"type": "text-to-speech",
"text": "Good evening, in Sydney tonight we’re tracking a developing story as unexpected storms roll in across the city, bringing with them flash flooding warnings and major disruptions to the evening commute.",
"voice": "Seoyeon",
"language": "ko-KR"
},
"start": 0,
"length": "auto"
}
The above example creates an audio file in Korean. The English text is translated to and spoken in Korean.
Supported translations
| Language | Value |
|---|---|
| Chinese (Mandarin) | cmn-CN |
| Danish | da-DK |
| German | de-DE |
| English (Australian) | en-AU |
| English (British) | en-GB |
| English (Indian) | en-IN |
| English (US) | en-US |
| Spanish (European) | es-ES |
| Spanish (Mexican) | es-MX |
| Spanish (US) | es-US |
| French (Canadian) | fr-CA |
| French | fr-FR |
| Italian | it-IT |
| Japanese | ja-JP |
| Hindi | hi-IN |
| Korean | ko-KR |
| Norwegian Bokmål | nb-NO |
| Dutch | nl-NL |
| Polish | pl-PL |
| Portuguese (Brazilian) | pt-BR |
| Portuguese (European) | pt-PT |
| Swedish | sv-SE |
| English (New Zealand) | en-NZ |
| English (South African) | en-ZA |
| Catalan | ca-ES |
| German (Austrian) | de-AT |
| Chinese (Cantonese) | yue-CN |
| Arabic (Gulf) | ar-AE |
| Finnish | fi-FI |
Newscaster mode
Shotstack’s text-to-speech service includes a newscaster mode, which produces audio that emulates a newsreader’s delivery. To enable this mode, set the newscaster option to true.
{
"asset": {
"type": "text-to-speech",
"text": "Good evening, in Sydney tonight we’re tracking a developing story as unexpected storms roll in across the city, bringing with them flash flooding warnings and major disruptions to the evening commute.",
"voice": "Joanna",
"newscaster": true
},
"start": 0,
"length": "auto"
}
The newscaster style is available with the Matthew and Joanna voices in US English, the Lupe voice in US Spanish, and the Amy voice in British English.
Voices
The Shotstack text-to-speech service offers a variety of voices in different languages and genders:
| Voice Name | Language | Gender |
|---|---|---|
| Hala | Arabic (Gulf) | Female |
| Lisa | Dutch (Belgian) | Female |
| Arlet | Catalan | Female |
| Hiujin | Chinese (Cantonese) | Female |
| Zhiyu | Chinese (Mandarin) | Female |
| Sofie | Danish | Female |
| Laura | Dutch | Female |
| Olivia | English (Australian) | Female |
| Amy | English (British) | Female |
| Emma | English (British) | Female |
| Brian | English (British) | Male |
| Arthur | English (British) | Male |
| Kajal | English (Indian) | Female |
| Niamh | English (Ireland) | Female |
| Aria | English (New Zealand) | Female |
| Ayanda | English (South African) | Female |
| Ivy | English (US) | Female (child) |
| Joanna | English (US) | Female |
| Kendra | English (US) | Female |
| Kimberly | English (US) | Female |
| Salli | English (US) | Female |
| Joey | English (US) | Male |
| Justin | English (US) | Male (child) |
| Kevin | English (US) | Male (child) |
| Matthew | English (US) | Male |
| Ruth | English (US) | Female |
| Stephen | English (US) | Male |
| Suvi | Finnish | Female |
| Léa | French | Female |
| Rémi | French | Male |
| Gabrielle | French (Canadian) | Female |
| Liam | French (Canadian) | Male |
| Vicki | German | Female |
| Daniel | German | Male |
| Hannah | German (Austrian) | Female |
| Bianca | Italian | Female |
| Adriano | Italian | Male |
| Takumi | Japanese | Male |
| Kazuha | Japanese | Female |
| Tomoko | Japanese | Female |
| Seoyeon | Korean | Female |
| Ida | Norwegian | Female |
| Ola | Polish | Female |
| Camila | Portuguese (Brazilian) | Female |
| Vitória/Vitoria | Portuguese (Brazilian) | Female |
| Thiago | Portuguese (Brazilian) | Male |
| Inês/Ines | Portuguese (European) | Female |
| Lucia | Spanish (European) | Female |
| Sergio | Spanish (European) | Male |
| Mia | Spanish (Mexican) | Female |
| Andrés | Spanish (Mexican) | Male |
| Lupe | Spanish (US) | Female |
| Pedro | Spanish (US) | Male |
| Elin | Swedish | Female |
ElevenLabs Integration
Our ElevenLabs integration is currently unavailable.
Generated AI assets in the sandbox environment will incur credits.