OpenAI
OpenAI's voices, accessible via Azure, offer an enhanced audio experience. We are using OpenAI TTS through Azure in your geography for compliant usage. It is important to acknowledge existing constraints, notably the absence of Speech Synthesis Markup Language (SSML) support. Here are methods to effectively employ OpenAI's TTS features within these boundaries:
To activate OpenAI as your Text-to-Speech (TTS) provider, please get in touch with your Customer Success Manager.
Integrating OpenAI TTS
To integrate OpenAI's TTS capabilities:
From the available options, choose an OpenAI voice that suits your needs.
Date and Time Formatting
To achieve the most natural and accurate voice output from OpenAI TTS, it's crucial to format dates and times in a way that the system can easily recognize and correctly articulate. Below are recommended formats and examples to avoid:
Dates
Times
Properly formatting times is equally essential to ensure that OpenAI TTS can interpret and vocalize them accurately. Here are the effective formats alongside examples of what to avoid:
Prices and Currencies
Ensuring that prices and currencies are expressed in a format that OpenAI TTS can accurately interpret is crucial for clear communication. The following table outlines the recommended practices for formatting prices and currencies, as well as common pitfalls to avoid:
Price Formatting
Numbers and Alphanumerics
For numbers and alphanumeric sequences, transforming them into a format that OpenAI TTS processes without errors ensures accurate and complete voice output. Below are effective inputs alongside formats that may result in less accurate articulation:
Enhancing Intonation
Emotional Tone
OpenAI's voices may sound monotonous in certain contexts. Incorporating emotive language and enthusiastic expressions, such as 'Great!', 'Fantastic!', 'Klasse!', and 'Super!', can significantly improve the listener's engagement and the overall appeal of the speech output, infusing the bot’s vibe with a more enthusiastic energy.
Pronunciation and Pauses
Although SSML tags for structured pronunciations aren't supported, experimenting with punctuation or separators such as "--" may offer a workaround for inserting pauses. The effectiveness of these techniques varies, emphasizing the importance of testing in your specific use case.
Last updated