Stream configuration template ids
- production:
670ba9e0efa59fe6aecd56f1
Input data
- multimedia content:
wav
,mp3
,mp4
,webm
,flv
, etc.
Egress responses
Export response
Parameters
Transcription
Transcription
The language as country ISO code to transcribe the audio to. When not specified, it is automatically detected.
Enable or disable multichannel audio processing. Possible values:
true
or false
.
When enabled, each channel is processed separately.The minimum waiting time in milliseconds after the last speech is detected before the emitting an
utterance
transcription message.Raw data format parameters
Raw data format parameters
Use these parameters only when sending raw data.
The encoding of the audio data. Accepted values:
linear16
: 16-bit signed little-endian samples (Linear PCM)flac
: Free Lossless Audio Codecmulaw
: 8-bit samples (G.711 mu-law)opus
: OGG Opus
The sample rate of the audio data in Hertz (Hz).
The number of audio channels in the audio data.
Audio Intelligence
Audio Intelligence
Create a summary based on the upstream content.
Summary length. Options are:
brief
or detailed
.Summary tone. Options are:
conversational
or informative
.Summary structure. Options are:
paragraphs
or bullet_points
.Perform sentiment analysis on the upstream content.
Specifies the custom prompt content for one of the
ask0
, ask1
, …, askN
ask anything slots.
When the content is specified, the prompt is considered activated. Otherwise, it is deactivated.Specifies the custom system prompt for one of the
ask0System
, ask1System
, …, askNSystem
ask anything slots.Specifies the prompt id for one of the
ask0Id
, ask1Id
, …, askNId
ask anything slots.
The role of the prompt id is to identify the prompt in the responses.
When unspecified, it will fallback on the index of the slot (e.g. "0"
, "1"
).Specifies the prompt response JSON Schema encoded as a string for one of the
ask0Format
, ask1Format
, …, askNFormat
ask anything slots.When unspecified, the response not be structured in any particular way.
When specified, the response will be a JSON object encoded as a string.Enable transcription enhancement.
Balance the speed-accuracy ration of the transcription enhancing model.
Provide a list of words to enhance the transcription. Those can be domain-specific terms that the model should pay special attention to.
Provide a system prompt to guide the transcription enhancement model. This can help the model understand the context or specific requirements for the transcription.