How to use Transcription
The Transcription integration is used to upload a session recording (audio file) and automatically transcribe it into text, followed by dialogue analysis and extraction of key parameters — such as conversation summary, contact info, or user intent captured by the AI agent during the session.
Transcription is a system-level integration of type Speech-to-Text (STT). It is not used for live conversations, but for processing uploaded audio files (e.g., via the Import Session button in the Sessions screen).
Where Transcription Is Used
- In the Sessions section, via the Import Session button
- When a user uploads an audio file from an external source
The platform will automatically:
- Transcribe the audio file
- Identify speaker roles (e.g., user / agent)
- Create a text-based session
- Pass the session to the AI agent for analysis and extraction of Conversation Results
Important
- Transcription is configured at the system level — it cannot be set manually via the Integrations interface.
- The platform uses a pre-configured STT model (e.g., Whisper or Google Speech) to convert audio into text.
- This integration does not appear as a regular user-configurable integration and does not require setup in the UI.
How to Use
- Go to the Sessions section
- Click Import Session
- Upload the audio file
- Select the AI agent to process the conversation
- The platform will:
- Transcribe the audio
- Convert it into a text session
- Automatically extract parameters (summary, contact info, interest, payment promise, etc.)