Using Text to Speech

Learn how to use Text to speech.

Text to speech (TTS) transforms written text into spoken words, bridging the gap between the written word and the spoken voice.

TTS tools offer several valuable use cases for businesses, enhancing productivity and user experience:

Audiobook Production
TTS technology can automate the conversion of written content into audiobooks, saving time and resources while catering to a broader audience's preferences for audio content.
Accessibility Compliance
Businesses can ensure their digital content is accessible to individuals with visual impairments by using TTS to convert text into spoken words, making websites and documents compliant with accessibility regulations.
Interactive Voice Response (IVR) Systems
TTS is vital for creating natural-sounding voice prompts in IVR systems enhancing customer service by providing automated but human-like interactions, such as call routing and information retrieval.
Virtual Assistants and Chatbots
Integrating TTS into virtual assistants and chatbots allows businesses to provide personalized and engaging interactions with users, whether on websites or through messaging apps, enhancing customer engagement and support.
Enhanced Product Demonstrations
Sales teams can use TTS to create audio-enhanced product demonstrations or tutorials. This makes it easier for potential customers to understand product features and benefits, leading to more informed purchase decisions.

Capabilities

  • Synchronous API: Text to Speech supports synchronous API over HTTPS protocols. You can send text input and get audio as a response.
  • Multiple Output Formats: Text to Speech can generate PCM, MP3, OGG, and JSON format.
  • Standard and Natural Voices: Text to Speech offers male and female standard and natural (human-like) voices.
  • Chunk Streaming Support: Text to Speech service supports chunk transfer encoding over HTTPS protocol. You can send a request with input text and get audio output in chunks. This helps in reducing latency at client side.
  • Speech Synthesis Markup Language (SSML): You can send Speech Synthesis Markup Language (SSML) in your Text to Speech request to for more customization in your audio response by providing details on pauses, and audio formatting for acronyms, dates, times, and abbreviations.

SSML Tags

Was this article helpful?