Real-time Transcription
Real-time transcription specifications using the Vatis Streams API
Stream configuration template ids
- production: 670ba9e0efa59fe6aecd56f1
Input data
- multimedia content: wav, mp3, mp4, webm, flv, etc.
Egress responses
Export response
Parameters
The language to transcribe the audio in, given as an ISO country code. When not specified, the language is detected automatically.
Enable or disable multichannel audio processing. Possible values: true or false. When enabled, each channel is processed separately.
The minimum waiting time, in milliseconds, after the last speech is detected before emitting an utterance transcription message.
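The effect of this waiting time can be illustrated with a small sketch. It is not the Vatis implementation: the function name `group_utterances` and the `(start_ms, end_ms)` segment representation are assumptions made for illustration only. Speech segments separated by a silence shorter than the waiting time stay in the same utterance; a longer silence closes the utterance.

```python
from typing import List, Tuple


def group_utterances(
    segments: List[Tuple[int, int]], min_wait_ms: int
) -> List[Tuple[int, int]]:
    """Merge (start_ms, end_ms) speech segments into utterances.

    Hypothetical helper: a silence shorter than min_wait_ms does not
    end the current utterance; a silence of at least min_wait_ms does.
    """
    utterances: List[Tuple[int, int]] = []
    for start, end in sorted(segments):
        if utterances and start - utterances[-1][1] < min_wait_ms:
            # Silence was shorter than the waiting time: extend the utterance.
            prev_start, prev_end = utterances[-1]
            utterances[-1] = (prev_start, max(prev_end, end))
        else:
            # First segment, or the silence was long enough: new utterance.
            utterances.append((start, end))
    return utterances


speech = [(0, 1000), (1200, 2000), (3000, 3500)]
print(group_utterances(speech, min_wait_ms=500))
print(group_utterances(speech, min_wait_ms=1500))
```

A lower value would emit utterances sooner but may split sentences at short pauses; a higher value keeps them together at the cost of latency.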
The encoding of the audio data. Accepted values:
- linear16: 16-bit signed little-endian samples (Linear PCM)
- flac: Free Lossless Audio Codec
- mulaw: 8-bit samples (G.711 mu-law)
- opus: OGG Opus
The sample rate of the audio data in Hertz (Hz).
The number of audio channels in the audio data.
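As a rough illustration of how encoding, sample rate, and channel count relate for linear16 audio, the sketch below packs floating-point samples as 16-bit signed little-endian PCM and computes the resulting byte rate. The helper names `to_linear16` and `bytes_per_second` are hypothetical and are not part of the Vatis API.

```python
import struct
from typing import Sequence


def to_linear16(samples: Sequence[float]) -> bytes:
    """Pack floats in [-1.0, 1.0] as 16-bit signed little-endian PCM."""
    ints = [max(-32768, min(32767, int(round(s * 32767)))) for s in samples]
    # "<" selects little-endian byte order, "h" a signed 16-bit integer.
    return struct.pack("<%dh" % len(ints), *ints)


def bytes_per_second(sample_rate_hz: int, channels: int) -> int:
    """Byte rate of linear16 audio: 2 bytes per sample per channel."""
    return sample_rate_hz * channels * 2


payload = to_linear16([0.0, 1.0, -1.0])
print(payload.hex())
print(bytes_per_second(16000, 1))
```

For multichannel linear16 audio, samples are conventionally interleaved channel by channel, so the declared channel count has to match the data for the byte stream to be interpreted correctly.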
Create a summary based on the upstream content.
Summary length. Options are: brief or detailed.
Summary tone. Options are: conversational or informative.
Summary structure. Options are: paragraphs or bullet_points.
Perform sentiment analysis on the upstream content.
Specifies the custom prompt content for one of the ask0, ask1, …, askN ask anything slots.
When the content is specified, the prompt is considered activated. Otherwise, it is deactivated.
Specifies the custom system prompt for one of the ask0System, ask1System, …, askNSystem ask anything slots.
Specifies the prompt id for one of the ask0Id, ask1Id, …, askNId ask anything slots.
The role of the prompt id is to identify the prompt in the responses.
When unspecified, it falls back to the index of the slot (e.g. "0", "1").
Specifies the prompt response JSON Schema, encoded as a string, for one of the ask0Format, ask1Format, …, askNFormat ask anything slots.
When unspecified, the response will not be structured in any particular way. When specified, the response will be a JSON object encoded as a string.
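The four per-slot parameters described above can be assembled as in the following sketch. The key names (ask0, ask0System, ask0Id, ask0Format) follow the naming pattern given in this section; the helper `build_ask_slot`, the example prompt, and the example schema are assumptions for illustration. How the resulting parameters are sent to the Streams API is not covered here.

```python
import json
from typing import Any, Dict, Optional


def build_ask_slot(
    index: int,
    prompt: str,
    system: Optional[str] = None,
    prompt_id: Optional[str] = None,
    schema: Optional[Dict[str, Any]] = None,
) -> Dict[str, str]:
    """Build the parameters for one ask anything slot (hypothetical helper)."""
    params = {f"ask{index}": prompt}  # specifying content activates the slot
    if system is not None:
        params[f"ask{index}System"] = system
    if prompt_id is not None:
        params[f"ask{index}Id"] = prompt_id
    if schema is not None:
        # The JSON Schema is passed encoded as a string.
        params[f"ask{index}Format"] = json.dumps(schema)
    return params


# Illustrative schema; the field names are made up for this example.
schema = {
    "type": "object",
    "properties": {"action_items": {"type": "array", "items": {"type": "string"}}},
    "required": ["action_items"],
}
params = build_ask_slot(0, "List the action items.", prompt_id="actions", schema=schema)
print(params)

# With a format specified, the response is a JSON object encoded as a string,
# so it needs one more decoding step (the value below is a stand-in).
response_text = '{"action_items": ["Send the report"]}'
print(json.loads(response_text)["action_items"])
```

Setting an explicit id such as "actions" makes the response easier to match to its prompt than relying on the slot index.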