LogoLogo
  • 👋START HERE
    • Welcome!
  • ℹ️General
    • Release Notes
      • Full Feature Base Template
      • Services
      • Rule-based Automation
        • May 2025
        • February 2025
        • January 2025
        • December 2024
        • November 2024
        • October 2024
        • September 2024
        • August 2024
        • July 2024
        • June 2024
        • May 2024
        • April 2024
        • March 2024
        • February 2024
        • January 2024
        • 2023
        • 2022
        • 2021
        • Dialog Design Update
    • Glossary of Terms
    • Authentication Methods
      • SSO (Single Sign-On)
      • Built-In User Management
    • Acceptable Use Policy
  • ⚙️Rule-based Automation
    • Overview
      • Account Settings
        • Profile
        • Team
        • Roles and Permissions
          • User Management
          • Project Permissions
      • Basic Concepts
        • Project Management
        • Version Management
        • Multi-Lingual Bots
          • Supported Languages
        • Managing User Interactions
          • Unexpected User Input
          • No User Input
    • Dialog Interface
      • Blocks
        • Conversation Logic
          • Start Conversation
          • Global
          • State
          • Intermediate Response
          • To Previous State
          • End Conversation
        • Subdialog
          • Reusable Subdialogs
        • Phone
          • Continue Listening
          • Call Control
        • Technical Logic
          • Service
          • Condition
          • Storage
        • Other
          • Note
      • Speech Assets
        • Intents
          • Utterances
          • Descriptions
        • Slots
          • Custom Slots
            • List Slots
            • Machine Learning Slots
            • Regex Slots
            • LLM Slots
          • Prebuilt Slots
            • DTMF Slot
        • Text Snippets
        • Dictionary
      • Variables
        • Intents
        • Slots
        • Storage
        • Text Snippets
        • Environments
        • Platform
        • Context
      • Services
        • Service Integration Guide
        • Service Development
        • Service Branches and Error Handling
        • Public Services
          • Date and Birthdate Recognition
          • Spelling Post-Processing for Phone
          • IBAN Validation
          • License Plate Validation
          • Address Search
          • Street Names per Postal Code
          • Email Service
          • SMS Service
          • API Adapter
          • Salesforce-Flow Connector
          • Opening Hours
          • Speech-to-Text Hints
          • Fuzzy Match Names
          • Delay Service
      • Debugger
        • Phone 2
        • WhatsApp
        • Textchat 2
    • Environments Interface
      • Service Keys
    • Deployments Interface
      • Creating a Release
      • Editing a Release
    • Text-to-Speech
      • Azure
      • ElevenLabs
      • OpenAI via Azure (Preview)
      • SSML
        • Audio
        • Break
        • Emphasis
        • Prosody
        • Say-as
        • Substitute
        • Paragraph and Sentence
        • Voice
    • Autocomplete
    • Parloa APIs
      • CallData Service and API
      • Conversation History API
      • Textchat V2 API
    • Phone Integrations
      • Genesys Cloud
        • Setting up the SIP Trunk
        • Sending/Receiving UUI Data
        • Creating a Script to Display UUI
      • SIP
      • Tenios
        • Setting Up an Inbound Connection
        • Setting Up an Outbound Connection
        • Transferring a Call
      • Twilio
      • Public IPs and Port Information
    • AI Integration Overview
      • Dual Intent Recognizer (DIR)
      • Dual Tone Multifrequency (DTMF) Intent
    • Analytics and Debugging
      • Understanding Conversations and Transactions
      • Managing Caller ID Data
      • Hangup Events and Triggered Analytics
      • Analytics Transactions: Data Structure and Insights
      • Dialog Analytics
      • Audit Logs
      • Parloa-hosted Analytics
    • Data Privacy
      • Anonymizing Personally Identifiable Information
    • NLU Training
      • NLU Training Best Practices
    • How To
      • Create a Scalable and Maintainable Bot Architecture
      • Implement OnError Loop Handling
      • Resolve the 'Service Unavailable' Error
    • Reference
      • Parloa Keyboard Shortcuts
      • Frequently Asked Questions (FAQ)
      • JavaScript Cheat Sheet
        • Using Regular Expressions (Regex)
  • 🧠Knowledge Skill
    • Introduction
    • Knowledge Collections
    • Knowledge Sources
    • Knowledge Skill Setup
      • Step 1 – Create a Knowledge Skill Agent
      • Step 2 – Configure a Knowledge Skill Agent
      • Step 3 – Configure a Knowledge Skill Agent
Powered by GitBook
On this page
  • Overview
  • Technical Configuration
  • Workflow Example

Was this helpful?

Export as PDF
  1. Rule-based Automation
  2. Dialog Interface
  3. Services
  4. Public Services

Speech-to-Text Hints

PreviousOpening HoursNextFuzzy Match Names

Last updated 4 months ago

Was this helpful?

This functionality is available for the supported language locales where the underlying Speech-to-Text (STT) model facilitates phrase lists. For a comprehensive list of supported languages, you may refer .

Overview

Speech-to-Text Hints, sometimes also referred to as Phrase Lists, are instrumental in guiding the Automatic Speech Recognition (ASR) system to better comprehend words or phrases that are more likely to occur in your dialogues.

Implementing STT Hints significantly bolsters the accuracy of the ASR system, particularly when discerning phonetically similar utterances. This is invaluable for accurately capturing proper nouns such as names and locations, as well as industry-specific terminology that often poses spelling challenges. For an in-depth understanding of STT Hints and their implementation, you may refer .

Technical Configuration

Set up STT hints

To take advantage of Speech-to-Text Hints, make a service call to our dedicated Speech-to-Text Hints Service. Use the following technical specifications:

  • continue: Default.

  • empty_stt: Triggered when no hints are provided in the list.

  • speechToTextHints: Input a comma-separated list or a stringified JSON array encompassing the words or phrases you aim to emphasize for the ASR system.

No output is returned.

  • URL: https://parloaservices.azurewebsites.net/api/STTConvert?clientId=<CLIENT_ID>

  • Header: x-functions-key: <AUTHCODE>

For authentication, replace <AUTHCODE> and <CLIENTID> with the values provided by your Parloa representative.

The hints you provide will remain active until they are replaced by an empty list or input. Therefore, it's recommended to make a service call with an empty input when exiting the relevant dialog section.

Leverage the specialized STT model (German only)

If your project language is German (de-DE), take advantage of the specialized STT model called Given name (hints) that enhances further the recognition results when using STT hints. Refer to Conversation Logic, section Intents for further details on how to activate it.

Workflow Example

The STT hints you submit will remain active within the system until they are supplanted by an empty list or new input. To prevent carryover effects, it is advisable to dispatch a service call with an empty payload upon concluding the dialog segment that utilized the hints.

Imagine a scenario in which STT hints are employed at the beginning of a sub-dialogue and are cleared as you exit that dialogue sequence. Illustrated below is a straightforward workflow exemplifying this process:

⚙️
here
here
STT Hints Example Workflow