
VoiceWizardPro

VRCWizard edited this page Oct 19, 2023 · 30 revisions


🔑 Unlock VoiceWizardPro Benefits!

Subscribe to Ko-Fi or Patreon and experience a world of powerful features that will transform your TTS and translation experience:

  • Instant Access to Premium Voices: Enjoy hundreds of voices from leading cloud services, including:

    • Microsoft Azure
    • Amazon Polly
    • Google Cloud
    • IBM Watson
  • 🌍 Multilingual Magic: Translate your voice into 70+ supported languages and talk to your friends from all over the world.

  • 🎤 Crystal-Clear Transcriptions: Gain access to speech recognition through DeepGram's Nova-2 model, the fastest and most accurate speech-to-text API.

Your subscription not only enhances your capabilities but also supports future development:

  • 💪 Empower Ongoing Development: Your contribution assists in server upkeep, covers character costs from premium APIs, and fuels future software innovations.

Ready to elevate your TTS game? Dive into VoiceWizardPro now! For detailed insights, explore our VoiceWizardPro GitHub Wiki page.

Unlock the power of VoiceWizardPro today! 🚀

Patreon

Buy Me a Coffee at ko-fi.com

Tiers

Here is the breakdown of the tiers available via Ko-fi or Patreon:

| Tier | Price Per Month | TTS Characters Per Month | Translation Characters Per Month | Speech Recognition Hours (DeepGram) | Rate Limiting |
| --- | --- | --- | --- | --- | --- |
| Acolyte | $3 | 100,000 | 50,000 | 1 | moderate |
| Magician | $5 | 250,000 | 50,000 | 3 | moderate |
| Enchanter | $6 | 0 | 500,000 | 3 | moderate |
| Witch | $10 | 500,000 | 100,000 | 5 | moderate |
| Sorcerer | $15 | 500,000 | 500,000 | 10 | moderate |
| Warlock | $18 | 1,000,000 | 100,000 | 10 | low |
| Wizard | $20 | 750,000 | 500,000 | 15 | low |
| Archmage | $50 | 2,000,000 | 1,000,000 | 25 | low |
| Deity | $100 | 4,000,000 | 2,000,000 | 50 | low |
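
If you are unsure which tier fits, the table can be treated as a simple lookup. The sketch below is only an illustration, not part of TTS Voice Wizard: the `Tier` type and the `cheapest_tier` helper are hypothetical names, and the numbers are copied from the table above. It ignores rate limiting and any per-provider discounts.

```python
from typing import NamedTuple, Optional


class Tier(NamedTuple):
    name: str
    price_usd: int            # price per month
    tts_chars: int            # TTS characters per month
    translation_chars: int    # translation characters per month
    recognition_hours: int    # DeepGram speech recognition hours per month


# Values copied from the tier table.
TIERS = [
    Tier("Acolyte", 3, 100_000, 50_000, 1),
    Tier("Magician", 5, 250_000, 50_000, 3),
    Tier("Enchanter", 6, 0, 500_000, 3),
    Tier("Witch", 10, 500_000, 100_000, 5),
    Tier("Sorcerer", 15, 500_000, 500_000, 10),
    Tier("Warlock", 18, 1_000_000, 100_000, 10),
    Tier("Wizard", 20, 750_000, 500_000, 15),
    Tier("Archmage", 50, 2_000_000, 1_000_000, 25),
    Tier("Deity", 100, 4_000_000, 2_000_000, 50),
]


def cheapest_tier(tts_chars: int, translation_chars: int,
                  recognition_hours: int) -> Optional[Tier]:
    """Return the lowest-priced tier that covers all three monthly needs."""
    candidates = [
        t for t in TIERS
        if t.tts_chars >= tts_chars
        and t.translation_chars >= translation_chars
        and t.recognition_hours >= recognition_hours
    ]
    # None means no single tier in the table covers the request.
    return min(candidates, key=lambda t: t.price_usd, default=None)
```

For example, someone who mostly translates (400,000 translation characters, no TTS) would land on Enchanter, while a heavy TTS user needing 600,000 characters would land on Warlock.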

Pro Changelog

5/16/2023 Announcement

  • I have permanently "discounted" Amazon Polly and Google voices with Pro: these voices only count about 1/3 of the characters you type against your TTS Characters. So if you typed a 3-letter word, you would only increase your usage by 1. (Usage is rounded, so a 4-letter word = 1 usage and a 5-letter word = 2 usage.)

  • Azure (and IBM Watson) voices are still 1 to 1.

  • So you could use about 3 times as many characters if you are using Amazon Polly or Google for TTS.
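
The arithmetic above can be sketched as follows. This is an estimate based only on the examples in the announcement (3 letters = 1, 4 letters = 1, 5 letters = 2), which match rounding one third of the character count to the nearest whole number. The function name `billed_characters` is hypothetical, and whether the server rounds per word or per request is not stated, so treat the result as approximate.

```python
# Providers the announcement lists as discounted (about 1/3 usage).
DISCOUNTED_PROVIDERS = {"amazon polly", "google"}


def billed_characters(typed_characters: int, provider: str) -> int:
    """Estimate how many TTS characters a request counts against the quota."""
    if provider.strip().lower() in DISCOUNTED_PROVIDERS:
        # Round typed_characters / 3 to the nearest integer.
        # Dividing by 3 never lands exactly on .5, so there are no ties.
        return (typed_characters + 1) // 3
    # Azure and IBM Watson are counted 1 to 1.
    return typed_characters
```

Under this assumption, a 300-character message costs 100 characters with Google or Amazon Polly and 300 characters with Azure.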

5/23/2023 Announcement

  • All Tiers now have some translation characters

6/2/2023 Announcement

  • DeepGram Speech Recognition added to Pro (pre-release)

7/7/2023 Announcement

  • Support added for TTS Voices from IBM Watson

9/8/2023 Announcement

  • Support added for 40 new languages
  • Tiers above Acolyte now have 1000 max characters per request (Acolyte still has 300 per request)
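
To illustrate what the per-request limit means for long messages, here is a rough sketch that splits text into pieces that each fit the limit. It is an assumption-laden example, not how TTS Voice Wizard handles long input (the wiki does not say); `max_chars_per_request` and `split_for_requests` are hypothetical helpers.

```python
def max_chars_per_request(tier: str) -> int:
    """Per-request limit from the 9/8/2023 announcement."""
    return 300 if tier.strip().lower() == "acolyte" else 1000


def split_for_requests(text: str, tier: str) -> list[str]:
    """Split text on whitespace into chunks no longer than the tier's limit."""
    limit = max_chars_per_request(tier)
    chunks: list[str] = []
    current = ""
    for word in text.split():
        # A single word longer than the limit has to be cut mid-word.
        while len(word) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(word[:limit])
            word = word[limit:]
        candidate = word if not current else current + " " + word
        if len(candidate) <= limit:
            current = candidate
        else:
            # The word does not fit; close this chunk and start a new one.
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks
```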

10/6/2023 Announcement

How to get API Key

  1. Become a member on Ko-fi: https://ko-fi.com/ttsvoicewizard/tiers
  2. Link your Discord to Ko-fi and join the Discord server
  3. Navigate to the #get-api-key-beta channel in Discord
    • To get a key for the first time type: /create-key
    • To refresh your key type: /refresh-key
  4. Your key will be DM'ed to you by the Official TTS Voice Wizard Bot in Discord

Where do I put the API Key?

  1. In TTS Voice Wizard navigate to Speech Provider > VoiceWizard Pro


  2. Make sure "Use Voice Wizard Pro Key" is enabled, then copy and paste your key from the Discord DM into the "Voice Wizard Pro Key" text field

  3. You can now choose whether to use the Pro Key for Azure, Amazon Polly and Translations. (It will automatically be used for Google Cloud voices since those are VoiceWizardPro only.)


Using Voice Wizard Pro

  • You can select the voice you wish to use from the "Text to Speech" tab under "Voice Customization Options".


DeepGram Recognition

  • Select Deepgram (Pro Only) from Settings > Audio


  • Go to Speech Provider > VoiceWizard Pro and scroll down to DeepGram Recognition.


Adjusting Settings

Silence Threshold

  • Press your speech-to-text hotkey (Ctrl + G by default) to activate speech recognition while in this tab.
  • Monitor the dial.


  • If the needle seems to ignore your voice, your environment is very quiet and you need to move the slider to the left, towards silent.


  • If the needle seems to think you're talking when you aren't, you have a loud environment and you need to move the slider to the right, towards loud.

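
As a mental model for the slider, a silence threshold is commonly implemented by comparing the loudness of incoming audio to a cutoff. The sketch below shows that general idea only; it is an assumption, not TTS Voice Wizard's actual code, and `rms_level` and `sounds_like_speech` are hypothetical names. Moving the slider towards silent corresponds to a lower `silence_threshold`, and moving it towards loud corresponds to a higher one.

```python
import math
from typing import Sequence


def rms_level(samples: Sequence[float]) -> float:
    """Root-mean-square loudness of audio samples in the range -1.0 to 1.0."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def sounds_like_speech(samples: Sequence[float], silence_threshold: float) -> bool:
    """Treat audio as speech only when it is louder than the threshold."""
    return rms_level(samples) > silence_threshold
```

With this model, a soft voice in a quiet room is ignored when the threshold is too high, and background noise in a loud room triggers recognition when the threshold is too low, which matches the two cases described above.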

Audio Duration

  • Minimum Audio Duration is the shortest duration an audio clip can have, in seconds.
  • Maximum Audio Duration is the longest duration an audio clip can have, in seconds, with a soft cap of 25 seconds and a hard cap in the API of 30 seconds.
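
The two caps can be summarized with a small check. This is a sketch under assumptions: the wiki does not define what happens between the soft cap and the hard cap, so the function below only labels a pair of settings and does not claim to reproduce the app's behavior. `check_durations` is a hypothetical helper.

```python
SOFT_CAP_SECONDS = 25  # soft cap stated in the wiki
HARD_CAP_SECONDS = 30  # hard cap in the API stated in the wiki


def check_durations(minimum: float, maximum: float) -> str:
    """Classify a minimum/maximum audio duration pair, both in seconds."""
    if minimum < 0 or maximum <= 0 or minimum > maximum:
        return "invalid"
    if maximum > HARD_CAP_SECONDS:
        return "exceeds hard cap"
    if maximum > SOFT_CAP_SECONDS:
        return "exceeds soft cap"
    return "ok"
```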