Speech-To-Text | TTS Voice Wizard (original) (raw)

tip

DeepGram is for VoiceWizardPro Only! Subscribe to the Patreon or Kofi to unlock it.

image

STT Methods List

Speech-to-Text Method Description Free Pricing Continuous
System Speech This method is the default and has the worst recognition quality. Although it can improved with training and editing the speech dictionary Unlimited yes
Azure Great recognition quality without needing to sacrifice computational resources. Built in Translations 5 speech recognition hours + 5 speech translation hours. This is actually much more than it seems when not using continuous recognition. (yes you can for example translate from English to English after your recognition hours run out for 10 total hours.) both
Vosk Ok recognition quality at the cost of computational resources (CPU and RAM). Can have higher recognition quality than Web Captioner depending on model used. (does not work on x86 version) Unlimited yes
Web Captioner Ok recognition quality using "Web Speech API" through Web Captioner. Only available on Google Chrome. Multi-Language support. Unlimited yes
Whisper AMAZING recognition quality at the cost of computational resources (GPU and RAM). Can have higher recognition accuracy than Azure depending on model used. (Experimental implementation) (does not work on x86 version) Unlimited yes
DeepGram Similar quality to Azure Recognition Only available with Voice Wizard Pro, limits vary with selected tier both