Language and Voice Support for Azure Speech - Foundry Tools (original) (raw)

You can also see the list of locales and voices supported for each specific region or endpoint:

Language support varies by functionality in Azure Speech.

The table in this section summarizes the locales supported for real-time transcription, fast transcription, and batch transcription.

In this section

Speech to text locales

Locale (BCP-47) Language Fast transcription support
af-ZA Afrikaans (South Africa)
am-ET Amharic (Ethiopia)
ar-AE Arabic (United Arab Emirates)
ar-BH Arabic (Bahrain)
ar-DZ Arabic (Algeria)
ar-EG Arabic (Egypt)
ar-IL Arabic (Israel)
ar-IQ Arabic (Iraq)
ar-JO Arabic (Jordan)
ar-KW Arabic (Kuwait)
ar-LB Arabic (Lebanon)
ar-LY Arabic (Libya)
ar-MA Arabic (Morocco)
ar-OM Arabic (Oman)
ar-PS Arabic (Palestinian Authority)
ar-QA Arabic (Qatar)
ar-SA Arabic (Saudi Arabia)
ar-SY Arabic (Syria)
ar-TN Arabic (Tunisia)
ar-YE Arabic (Yemen)
as-IN Assamese (India)
az-AZ Azerbaijani (Latin, Azerbaijan)
bg-BG Bulgarian (Bulgaria)
bn-IN Bengali (India)
bs-BA Bosnian (Bosnia and Herzegovina)
ca-ES Catalan
cs-CZ Czech (Czechia)
cy-GB Welsh (United Kingdom)
da-DK Danish (Denmark)
de-AT German (Austria)
de-CH German (Switzerland)
de-DE German (Germany)
el-GR Greek (Greece)
en-AU English (Australia)
en-CA English (Canada)
en-GB English (United Kingdom)
en-GH English (Ghana)
en-HK English (Hong Kong SAR)
en-IE English (Ireland)
en-IN English (India)
en-KE English (Kenya)
en-NG English (Nigeria)
en-NZ English (New Zealand)
en-PH English (Philippines)
en-SG English (Singapore)
en-TZ English (Tanzania)
en-US English (United States)
en-ZA English (South Africa)
es-AR Spanish (Argentina)
es-BO Spanish (Bolivia)
es-CL Spanish (Chile)
es-CO Spanish (Colombia)
es-CR Spanish (Costa Rica)
es-CU Spanish (Cuba)
es-DO Spanish (Dominican Republic)
es-EC Spanish (Ecuador)
es-ES Spanish (Spain)
es-GQ Spanish (Equatorial Guinea)
es-GT Spanish (Guatemala)
es-HN Spanish (Honduras)
es-MX Spanish (Mexico)
es-NI Spanish (Nicaragua)
es-PA Spanish (Panama)
es-PE Spanish (Peru)
es-PR Spanish (Puerto Rico)
es-PY Spanish (Paraguay)
es-SV Spanish (El Salvador)
es-US Spanish (United States)1
es-UY Spanish (Uruguay)
es-VE Spanish (Venezuela)
et-EE Estonian (Estonia)
eu-ES Basque
fa-IR Persian (Iran)
fi-FI Finnish (Finland)
fil-PH Filipino (Philippines)
fr-BE French (Belgium)
fr-CA French (Canada)
fr-CH French (Switzerland)
fr-FR French (France)
ga-IE Irish (Ireland)
gl-ES Galician
gu-IN Gujarati (India)
he-IL Hebrew (Israel)
hi-IN Hindi (India)
hr-HR Croatian (Croatia)
hu-HU Hungarian (Hungary)
hy-AM Armenian (Armenia)
id-ID Indonesian (Indonesia)
is-IS Icelandic (Iceland)
it-CH Italian (Switzerland)
it-IT Italian (Italy)
ja-JP Japanese (Japan)
jv-ID Javanese (Latin, Indonesia)
ka-GE Georgian (Georgia)
kk-KZ Kazakh (Kazakhstan)
km-KH Khmer (Cambodia)
kn-IN Kannada (India)
ko-KR Korean (Korea)
lo-LA Lao (Laos)
lt-LT Lithuanian (Lithuania)
lv-LV Latvian (Latvia)
mk-MK Macedonian (North Macedonia)
ml-IN Malayalam (India)
mn-MN Mongolian (Mongolia)
mr-IN Marathi (India)
ms-MY Malay (Malaysia)
mt-MT Maltese (Malta)
my-MM Burmese (Myanmar)
nb-NO Norwegian Bokmål (Norway)
ne-NP Nepali (Nepal)
nl-BE Dutch (Belgium)
nl-NL Dutch (Netherlands)
or-IN Odia (India)
pa-IN Punjabi (India)
pl-PL Polish (Poland)
ps-AF Pashto (Afghanistan)
pt-BR Portuguese (Brazil)
pt-PT Portuguese (Portugal)
ro-RO Romanian (Romania)
ru-RU Russian (Russia)
si-LK Sinhala (Sri Lanka)
sk-SK Slovak (Slovakia)
sl-SI Slovenian (Slovenia)
so-SO Somali (Somalia)
sq-AL Albanian (Albania)
sr-RS Serbian (Cyrillic, Serbia)
sv-SE Swedish (Sweden)
sw-KE Kiswahili (Kenya)
sw-TZ Kiswahili (Tanzania)
ta-IN Tamil (India)
te-IN Telugu (India)
th-TH Thai (Thailand)
tr-TR Turkish (Türkiye)
uk-UA Ukrainian (Ukraine)
ur-IN Urdu (India)
uz-UZ Uzbek (Latin, Uzbekistan)
vi-VN Vietnamese (Vietnam)
wuu-CN Chinese (Wu, Simplified)
yue-CN Chinese (Cantonese, Simplified)
zh-CN Chinese (Mandarin, Simplified)
zh-CN-shandong Chinese (Jilu Mandarin, Simplified)
zh-CN-sichuan Chinese (Southwestern Mandarin, Simplified)
zh-HK Chinese (Cantonese, Traditional)
zh-TW Chinese (Taiwanese Mandarin, Traditional)
zu-ZA Zulu (South Africa)

Custom speech

To improve the accuracy of speech-to-text recognition, you can customize some languages and base models. Depending on the locale, you can upload audio with human-labeled transcripts, plain text, structured text, and pronunciation data. By default, all available base models support plain-text customization. For more information about customization, see What is custom speech?.

Locale (BCP-47) Language Custom speech support
af-ZA Afrikaans (South Africa) Plain text
am-ET Amharic (Ethiopia) Plain text
ar-AE Arabic (United Arab Emirates) Audio + human-labeled transcriptPlain text
ar-BH Arabic (Bahrain) Audio + human-labeled transcriptPlain text
ar-DZ Arabic (Algeria) Audio + human-labeled transcriptPlain text
ar-EG Arabic (Egypt) Audio + human-labeled transcriptPlain textStructured text
ar-IL Arabic (Israel) Audio + human-labeled transcriptPlain text
ar-IQ Arabic (Iraq) Audio + human-labeled transcriptPlain text
ar-JO Arabic (Jordan) Audio + human-labeled transcriptPlain text
ar-KW Arabic (Kuwait) Audio + human-labeled transcriptPlain text
ar-LB Arabic (Lebanon) Audio + human-labeled transcriptPlain text
ar-LY Arabic (Libya) Audio + human-labeled transcriptPlain text
ar-MA Arabic (Morocco) Audio + human-labeled transcriptPlain text
ar-OM Arabic (Oman) Audio + human-labeled transcriptPlain text
ar-PS Arabic (Palestinian Authority) Audio + human-labeled transcriptPlain text
ar-QA Arabic (Qatar) Audio + human-labeled transcriptPlain text
ar-SA Arabic (Saudi Arabia) Audio + human-labeled transcriptPlain textStructured text
ar-SY Arabic (Syria) Audio + human-labeled transcriptPlain text
ar-TN Arabic (Tunisia) Audio + human-labeled transcriptPlain text
ar-YE Arabic (Yemen) Audio + human-labeled transcriptPlain text
as-IN Assamese (India) Audio + human-labeled transcript
az-AZ Azerbaijani (Latin, Azerbaijan) Plain text
bg-BG Bulgarian (Bulgaria) Plain text
bn-IN Bengali (India) Audio + human-labeled transcriptPlain text
bs-BA Bosnian (Bosnia and Herzegovina) Plain text
ca-ES Catalan Plain textPronunciation
cs-CZ Czech (Czechia) Audio + human-labeled transcriptPlain textStructured textPronunciation
cy-GB Welsh (United Kingdom) Plain text
da-DK Danish (Denmark) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
de-AT German (Austria) Audio + human-labeled transcriptPlain textStructured textPronunciation
de-CH German (Switzerland) Audio + human-labeled transcriptPlain textPronunciation
de-DE German (Germany) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
el-GR Greek (Greece) Audio + human-labeled transcriptPlain textStructured text
en-AU English (Australia) Audio + human-labeled transcriptAudioPlain textStructured textOutput formatPronunciation
en-CA English (Canada) Audio + human-labeled transcriptAudioPlain textStructured textOutput formatPronunciation
en-GB English (United Kingdom) Audio + human-labeled transcriptAudioPlain textStructured textOutput formatPronunciation
en-GH English (Ghana) Audio + human-labeled transcriptAudioPlain textStructured textPronunciation
en-HK English (Hong Kong SAR) Audio + human-labeled transcriptAudioPlain textStructured textOutput formatPronunciation
en-IE English (Ireland) Audio + human-labeled transcriptAudioPlain textStructured textOutput formatPronunciation
en-IN English (India) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
en-KE English (Kenya) Audio + human-labeled transcriptAudioPlain textStructured textPronunciation
en-NG English (Nigeria) Audio + human-labeled transcriptAudioPlain textStructured textOutput formatPronunciation
en-NZ English (New Zealand) Audio + human-labeled transcriptAudioPlain textStructured textOutput formatPronunciation
en-PH English (Philippines) Audio + human-labeled transcriptAudioPlain textStructured textOutput formatPronunciation
en-SG English (Singapore) Audio + human-labeled transcriptAudioPlain textStructured textOutput formatPronunciation
en-TZ English (Tanzania) Audio + human-labeled transcriptAudioPlain textStructured textPronunciation
en-US English (United States) Audio + human-labeled transcriptAudioPlain textStructured textOutput formatPronunciation
en-ZA English (South Africa) Audio + human-labeled transcriptAudioPlain textStructured textPronunciation
es-AR Spanish (Argentina) Plain textStructured textPronunciation
es-BO Spanish (Bolivia) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-CL Spanish (Chile) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-CO Spanish (Colombia) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-CR Spanish (Costa Rica) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-CU Spanish (Cuba) Plain textStructured textPronunciation
es-DO Spanish (Dominican Republic) Plain textStructured textPronunciation
es-EC Spanish (Ecuador) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-ES Spanish (Spain) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
es-GQ Spanish (Equatorial Guinea) Audio + human-labeled transcriptPlain textStructured text
es-GT Spanish (Guatemala) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-HN Spanish (Honduras) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-MX Spanish (Mexico) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
es-NI Spanish (Nicaragua) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-PA Spanish (Panama) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-PE Spanish (Peru) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-PR Spanish (Puerto Rico) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-PY Spanish (Paraguay) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-SV Spanish (El Salvador) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-US Spanish (United States) Plain textStructured textPronunciation
es-UY Spanish (Uruguay) Audio + human-labeled transcriptPlain textStructured textPronunciation
es-VE Spanish (Venezuela) Audio + human-labeled transcriptPlain textStructured textPronunciation
et-EE Estonian (Estonia) Plain textPronunciation
eu-ES Basque Plain text
fa-IR Persian (Iran) Plain text
fi-FI Finnish (Finland) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
fil-PH Filipino (Philippines) Plain textPronunciation
fr-BE French (Belgium) Plain text
fr-CA French (Canada) Plain textStructured textOutput formatPronunciation
fr-CH French (Switzerland) Plain textPronunciation
fr-FR French (France) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
ga-IE Irish (Ireland) Plain textPronunciation
gl-ES Galician Plain text
gu-IN Gujarati (India) Audio + human-labeled transcriptPlain text
he-IL Hebrew (Israel) Audio + human-labeled transcriptPlain text
hi-IN Hindi (India) Audio + human-labeled transcriptPlain textStructured textOutput format
hr-HR Croatian (Croatia) Plain textPronunciation
hu-HU Hungarian (Hungary) Audio + human-labeled transcriptPlain textStructured textPronunciation
hy-AM Armenian (Armenia) Plain text
id-ID Indonesian (Indonesia) Audio + human-labeled transcriptPlain textStructured textPronunciation
is-IS Icelandic (Iceland) Plain text
it-CH Italian (Switzerland) Plain text
it-IT Italian (Italy) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
ja-JP Japanese (Japan) Audio + human-labeled transcriptPlain textStructured textOutput format
jv-ID Javanese (Latin, Indonesia) Plain text
ka-GE Georgian (Georgia) Plain text
kk-KZ Kazakh (Kazakhstan) Plain text
km-KH Khmer (Cambodia) Plain text
kn-IN Kannada (India) Audio + human-labeled transcriptPlain text
ko-KR Korean (Korea) Audio + human-labeled transcriptPlain textStructured textOutput format
lo-LA Lao (Laos) Plain text
lt-LT Lithuanian (Lithuania) Plain textPronunciation
lv-LV Latvian (Latvia) Plain textPronunciation
mk-MK Macedonian (North Macedonia) Plain text
ml-IN Malayalam (India) Audio + human-labeled transcriptPlain text
mn-MN Mongolian (Mongolia) Plain text
mr-IN Marathi (India) Audio + human-labeled transcriptPlain text
ms-MY Malay (Malaysia) Plain text
mt-MT Maltese (Malta) Plain text
my-MM Burmese (Myanmar) Plain text
nb-NO Norwegian Bokmål (Norway) Plain textOutput format
ne-NP Nepali (Nepal) Plain text
nl-BE Dutch (Belgium) Plain text
nl-NL Dutch (Netherlands) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
or-IN Odia (India) Audio + human-labeled transcript
pa-IN Punjabi (India) Audio + human-labeled transcript
pl-PL Polish (Poland) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
ps-AF Pashto (Afghanistan) Plain text
pt-BR Portuguese (Brazil) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
pt-PT Portuguese (Portugal) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
ro-RO Romanian (Romania) Plain textPronunciation
ru-RU Russian (Russia) Audio + human-labeled transcriptPlain textStructured text
si-LK Sinhala (Sri Lanka) Plain text
sk-SK Slovak (Slovakia) Plain textPronunciation
sl-SI Slovenian (Slovenia) Plain textPronunciation
so-SO Somali (Somalia) Plain text
sq-AL Albanian (Albania) Plain text
sr-ME Serbian (Montenegro) Plain text
sr-RS Serbian (Cyrillic, Serbia) Plain text
sr-XK Serbian (Kosovo) Plain text
sv-SE Swedish (Sweden) Audio + human-labeled transcriptPlain textStructured textOutput formatPronunciation
sw-KE Kiswahili (Kenya) Plain text
sw-TZ Kiswahili (Tanzania) Plain text
ta-IN Tamil (India) Audio + human-labeled transcriptPlain textStructured text
te-IN Telugu (India) Audio + human-labeled transcriptPlain textStructured text
th-TH Thai (Thailand) Audio + human-labeled transcriptPlain textStructured textOutput format
tr-TR Turkish (Türkiye) Audio + human-labeled transcriptPlain textStructured textOutput format
uk-UA Ukrainian (Ukraine) Audio + human-labeled transcriptPlain text
ur-IN Urdu (India) Audio + human-labeled transcript
uz-UZ Uzbek (Latin, Uzbekistan) Plain text
vi-VN Vietnamese (Vietnam) Plain text
wuu-CN Chinese (Wu, Simplified) Plain text
yue-CN Chinese (Cantonese, Simplified) Plain text
zh-CN Chinese (Mandarin, Simplified) Audio + human-labeled transcriptPlain textStructured textOutput format
zh-CN-shandong Chinese (Jilu Mandarin, Simplified) Plain text
zh-CN-sichuan Chinese (Southwestern Mandarin, Simplified) Plain text
zh-HK Chinese (Cantonese, Traditional) Audio + human-labeled transcriptPlain textStructured textOutput format
zh-TW Chinese (Taiwanese Mandarin, Traditional) Audio + human-labeled transcriptPlain textStructured textOutput format
zu-ZA isiZulu (South Africa) Plain text

Note

Phrase list is a runtime recognition feature, not a model customization capability. You can use phrase lists with real-time transcription and fast transcription on both base and custom speech endpoints, for locales where the feature is enabled. Phrase list doesn't require model training. For supported locales and usage details, see Improve recognition accuracy with phrase list.

Custom speech display support

These locales support the display text format feature: da-DK, de-DE, en-AU, en-CA, en-GB, en-HK, en-IE, en-IN, en-NG, en-NZ, en-PH, en-SG, en-US, es-ES, es-MX, fi-FI, fr-CA, fr-FR, hi-IN, it-IT, ja-JP, ko-KR, nb-NO, nl-NL, pl-PL, pt-BR, pt-PT, sv-SE, tr-TR, zh-CN, zh-HK.

LLM speech translation

LLM speech supports the following languages for both transcription and translation.

Locale Language LLM speech translation support
de German
en English
es Spanish
fr French
it Italian
ja Japanese
ko Korean
pt Portuguese
zh Chinese

MAI Transcribe-1 supported languages

The following table summarizes the languages supported by MAI Transcribe-1 for speech recognition.

Language code Language MAI-Transcribe-1.5 support MAI-transcribe-1 support
ar Arabic
as Assamese
bg Bulgarian
bn Bengali
ca Catalan
cs Czech
da Danish
de German
el Greek
en English
es Spanish
et Estonian
fi Finnish
fr French
gu Gujarati
hi Hindi
hu Hungarian
id Indonesian
it Italian
ja Japanese
kn Kannada
ko Korean
lt Lithuanian
ml Malayalam
mr Marathi
nb Norwegian Bokmål
nl Dutch
or Odia
pa Punjabi (Gurmukhi script)
pl Polish
pt Portuguese
ro Romanian
ru Russian
sk Slovak
sl Slovenian
sv Swedish
ta Tamil
te Telugu
th Thai
tr Turkish
uk Ukrainian
vi Vietnamese