How to use Transcription

The Transcription integration is used to upload a session recording (audio file) and automatically transcribe it into text, followed by dialogue analysis and extraction of key parameters — such as conversation summary, contact info, or user intent captured by the AI agent during the session.

Transcription is a system-level integration of type Speech-to-Text (STT). It is not used for live conversations, but for processing uploaded audio files (e.g., via the Import Session button in the Sessions screen).

Where Transcription Is Used

In the Sessions section, via the Import Session button
When a user uploads an audio file from an external source

The platform will automatically:

Transcribe the audio file
Identify speaker roles (e.g., user / agent)
Create a text-based session
Pass the session to the AI agent for analysis and extraction of Conversation Results

Important

Transcription is configured at the system level — it cannot be set manually via the Integrations interface.
The platform uses a pre-configured STT model (e.g., Whisper or Google Speech) to convert audio into text.
This integration does not appear as a regular user-configurable integration and does not require setup in the UI.

How to Use

Go to the Sessions section
Click Import Session
Upload the audio file
Select the AI agent to process the conversation
The platform will:
- Transcribe the audio
- Convert it into a text session
- Automatically extract parameters (summary, contact info, interest, payment promise, etc.)

Where Transcription Is Used​

Important​

How to Use​

Where Transcription Is Used

Important

How to Use