Text-to-Speech Overview
Voice SDK’s Text-to-Speech (TTS) feature uses a Wit.ai based service to provide audio files for text strings. It’s handled by a single TTSService prefab for all TTS settings and you can use a simple TTSSpeaker script for each scene location in which a TTS clip can be played.
To keep TTS working smoothly, Voice SDK handles the caching of TTS files during runtime (or when otherwise needed). If streaming TTS audio files is not an option for your application, however, you can set it to use preloaded static TTS files prior to building your app.
Use the following steps to set up TTS for your app once the plugin has been imported:
- Open the scene you want to use TTS within.
- Generate a new Wit Configuration using the Wit Configurations menu within the Meta > Voice SDK > Voice HUB. Ensure it refreshes successfully and displays all voices available for your configuration.
- Navigate to Assets > Create > Voice SDK > TTS > Add Default TTS Setup
- In your scene heirarchy navigate inside the newly generated TTS Game Object to select the TTSWitService Game Object and adjust the inspector to fit your needs:
- Use TTSWit > Request Settings > Configuration to select the Wit Configuration asset generated in step 2.
- Once your configuration is setup, go to Preset Voice Settings and setup any voices that might be shared by multiple TTSSpeakers. For more information, see TTS Voice Customization.

- Under TTS Runtime Cache (Script), adjust the settings to indicate how often clips will be automatically uploaded from memory. For more information, see TTS Cache Options.
- If needed, adjust your disk cache directory location and name in the tree under TTS Disk Cache (Script).

- Move & duplicate the TTSSpeaker to all the locations in your app where you would like TTS to be played.
- Modify each TTSSpeaker via the Inspector to fit your needs:
- Under Voice Settings, select the Voice Preset for the specific speaker or select Custom to apply speaker specific settings.
- Adjust the AudioSource in order to add the TTSSpeaker to a custom audio group or set audio from 2D to 3D.
- Via a script use the following TTSSpeaker methods to load and play text.
- Use the TTSSpeaker script’s
Speak(textToSpeak : string)
method to request and say specified text on load. - Use the TTSSpeaker script’s
SpeakQueued(textToSpeak : string)
method to request and say specified text. - Send a custom TTSSpeaker into any
Speak
/SpeakQueued
method for request specific text load & playback event callbacks. - Use the TTSSpeaker script’s
Stop()
method to immediately stop all loading & playing TTS clips. - Use TTSSpeaker’s
Stop(textToSpeak : string
)` to immediately stop loading & playing of a specific text string.
Note Check out Samples/TTSVoices/TTSVoices.unity
for an example of TTS implementation.