
How to use Transcription

The Transcription integration processes an uploaded session recording (audio file): it transcribes the audio into text, then runs dialogue analysis to extract key parameters such as a conversation summary, contact info, or the user intent captured by the AI agent during the session.

Transcription is a system-level integration of type Speech-to-Text (STT). It is not used for live conversations; it only processes uploaded audio files (e.g., via the Import Session button in the Sessions screen).

Where Transcription Is Used

  • In the Sessions section, via the Import Session button
  • When a user uploads an audio file from an external source

The platform will automatically:

  1. Transcribe the audio file
  2. Identify speaker roles (e.g., user / agent)
  3. Create a text-based session
  4. Pass the session to the AI agent for analysis and extraction of Conversation Results
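
The four steps above can be pictured as a simple pipeline. The sketch below is only an illustration of that flow, not the platform's actual implementation or API: every name in it (`Turn`, `TextSession`, `import_session`, and the stand-in functions) is hypothetical, and the stand-ins return canned data instead of calling a real STT model or AI agent.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Turn:
    speaker: str  # hypothetical role label: "user" or "agent"
    text: str


@dataclass
class TextSession:
    turns: List[Turn]
    conversation_results: Dict[str, str] = field(default_factory=dict)


def import_session(
    audio: bytes,
    transcribe: Callable[[bytes], List[str]],
    assign_roles: Callable[[List[str]], List[Turn]],
    analyze: Callable[[TextSession], Dict[str, str]],
) -> TextSession:
    """Run the four stages in order; each stage is passed in as a function."""
    utterances = transcribe(audio)                   # 1. transcribe the audio file
    turns = assign_roles(utterances)                 # 2. identify speaker roles
    session = TextSession(turns=turns)               # 3. create a text-based session
    session.conversation_results = analyze(session)  # 4. agent analysis
    return session


# Stand-ins for illustration only (assumptions, not real platform components).
def fake_transcribe(audio: bytes) -> List[str]:
    return ["Hello, how can I help?", "I'd like a callback tomorrow."]


def alternate_roles(utterances: List[str]) -> List[Turn]:
    # Naive assumption: the agent speaks first and speakers strictly alternate.
    roles = ("agent", "user")
    return [Turn(roles[i % 2], text) for i, text in enumerate(utterances)]


def fake_analyze(session: TextSession) -> Dict[str, str]:
    user_lines = [t.text for t in session.turns if t.speaker == "user"]
    return {"summary": " ".join(user_lines)}


session = import_session(b"RIFF", fake_transcribe, alternate_roles, fake_analyze)
for turn in session.turns:
    print(f"{turn.speaker}: {turn.text}")
print(session.conversation_results)
```

Passing each stage in as a function keeps the order of operations visible while leaving the real transcription and analysis components, which the platform configures at the system level, out of the sketch.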

Important

  • Transcription is configured at the system level and cannot be set up manually in the Integrations interface.
  • The platform uses a pre-configured STT model (e.g., Whisper or Google Speech) to convert audio into text.
  • It does not appear among the regular user-configurable integrations and requires no setup in the UI.

How to Use

  1. Go to the Sessions section
  2. Click Import Session
  3. Upload the audio file
  4. Select the AI agent to process the conversation
  5. The platform will:
    • Transcribe the audio
    • Convert it into a text session
    • Automatically extract parameters (summary, contact info, interest, payment promise, etc.)
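
As a rough illustration of what the extracted parameters might look like once the session has been processed, the sketch below models them as a small record. The field names mirror the examples listed above (summary, contact info, interest, payment promise), but the class name, the field types, and the helper function are assumptions made for this example; the platform's actual Conversation Results format may differ.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class ConversationResults:
    # Hypothetical fields based on the parameter examples in this article.
    summary: Optional[str] = None
    contact_info: Optional[str] = None
    interest: Optional[str] = None
    payment_promise: Optional[str] = None


def missing_parameters(results: ConversationResults) -> List[str]:
    """Return the names of parameters that were not captured (still None)."""
    return [f.name for f in fields(results) if getattr(results, f.name) is None]


example = ConversationResults(
    summary="Customer asked for a callback.",
    contact_info="jane@example.com",
)
print(missing_parameters(example))  # ['interest', 'payment_promise']
```

A check like this can help when reviewing imported sessions, since not every recording will contain all of the parameters the agent is set up to extract.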