Stream configuration template IDs

  • production: 678e26e695f8f5742e8718f4

Input data

  • multimedia content: wav, mp3, mp4, webm, flv, etc.
  • public HTTP or HTTPS link to a multimedia file, with the link encoded in UTF-8

Egress responses

Export response

Parameters

disfluencies
bool
default:"false"
A disfluency is a stretch of speech that is not smooth and continuous, such as a filler word or a false start. When disfluencies are disabled (default), all such discontinuous parts are removed from the transcript.
punctuationCapitalization
bool
default:"true"
Add punctuation and capitalization to the transcription.
numeralsConversion
bool
default:"true"
Convert the numerals from their word form to their numeric form (e.g. twenty two -> 22).
splitStereo
bool
default:"false"
Enable or disable splitting stereo audio into two mono audio streams. Possible values: true or false.
findReplace
bool
default:"true"
Enable the find-replace functionality performed using the user-defined findReplaceExpressions.
findReplaceExpressions
list[object]
default:"null"
Find-replace expressions.
Specifying find-replace expressions is not available using URL query parameters.
replacement
string
required
The replacement string.
regex
list[string]
required
Regex expressions to find.
merge
string
default:"STANDALONE"
Replacement merge strategy relative to the left and right neighbours.
  • STANDALONE: keep the replacement as a standalone word
  • MERGE_LEFT: concatenate the replacement with the left neighbour
  • MERGE_RIGHT: concatenate the replacement with the right neighbour
  • MERGE_LEFT_RIGHT: concatenate the replacement with both the left and the right neighbours
  • MERGE_LEFT_CAPITALIZE_NEXT: concatenate the replacement with the left neighbour and capitalize the right neighbour
enabledOnPrerecordedFiles
bool
default:"true"
Flag that enables this expression in the find-replace operation.
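The merge strategies above can be illustrated with a small local sketch. It is not the service's implementation: the helper apply_merge is hypothetical, it assumes the transcript is a list of words, and it assumes that "concatenate" means joining without a separator. The expression object at the end uses the field names from the list above; the surrounding request body shape and the regex flavour are assumptions.

```python
import json

MERGE_STRATEGIES = {
    "STANDALONE",
    "MERGE_LEFT",
    "MERGE_RIGHT",
    "MERGE_LEFT_RIGHT",
    "MERGE_LEFT_CAPITALIZE_NEXT",
}


def apply_merge(words, index, replacement, merge="STANDALONE"):
    """Replace words[index] with `replacement` and merge it with its neighbours.

    Hypothetical local illustration of the documented strategies; the join
    without a separator is an assumption.
    """
    if merge not in MERGE_STRATEGIES:
        raise ValueError(f"unknown merge strategy: {merge}")
    left, right = list(words[:index]), list(words[index + 1:])
    token = replacement
    # MERGE_LEFT, MERGE_LEFT_RIGHT and MERGE_LEFT_CAPITALIZE_NEXT all absorb the left neighbour.
    if merge.startswith("MERGE_LEFT") and left:
        token = left.pop() + token
    # MERGE_RIGHT and MERGE_LEFT_RIGHT absorb the right neighbour.
    if merge in ("MERGE_RIGHT", "MERGE_LEFT_RIGHT") and right:
        token = token + right.pop(0)
    # Capitalize the word that now follows the replacement.
    if merge == "MERGE_LEFT_CAPITALIZE_NEXT" and right:
        right[0] = right[0][:1].upper() + right[0][1:]
    return left + [token] + right


# One find-replace expression as it might appear in a JSON request body (assumed shape).
expression = {
    "regex": ["dotcom"],
    "replacement": ".com",
    "merge": "MERGE_LEFT",
    "enabledOnPrerecordedFiles": True,
}
print(json.dumps({"findReplace": True, "findReplaceExpressions": [expression]}, indent=2))
```

For example, apply_merge(["visit", "example", "dotcom", "today"], 2, ".com", "MERGE_LEFT") joins the replacement onto "example", while STANDALONE leaves ".com" as its own word.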
summary
bool
default:"false"
Create a summary based on the upstream content.
summaryLength
string
default:"brief"
Summary length. Options are: brief or detailed.
summaryTone
string
default:"conversational"
Summary tone. Options are: conversational or informative.
summaryStructure
string
default:"paragraphs"
Summary structure. Options are: paragraphs or bullet_points.
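As a minimal sketch of how the summary options might be checked and passed as URL query parameters, the snippet below validates each value against the options listed above before encoding. The helper summary_query is hypothetical, and serializing the boolean as the string "true" is an assumption based on how the defaults are written in this list.

```python
from urllib.parse import urlencode

# Allowed values, copied from the parameter descriptions above.
SUMMARY_OPTIONS = {
    "summaryLength": {"brief", "detailed"},
    "summaryTone": {"conversational", "informative"},
    "summaryStructure": {"paragraphs", "bullet_points"},
}


def summary_query(**options):
    """Return a query string that enables the summary with the given options."""
    for name, value in options.items():
        allowed = SUMMARY_OPTIONS.get(name)
        if allowed is None:
            raise KeyError(f"unknown summary option: {name}")
        if value not in allowed:
            raise ValueError(f"{name} must be one of {sorted(allowed)}, got {value!r}")
    # Assumption: booleans are sent as the lowercase strings "true"/"false".
    return urlencode({"summary": "true", **options})


print(summary_query(summaryLength="detailed", summaryStructure="bullet_points"))
```

Options that are left out keep their documented defaults (brief, conversational, paragraphs).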
sentimentAnalysis
bool
default:"false"
Perform sentiment analysis on the upstream content.
ask[0-N]
string
default:"none"
Specifies the custom prompt content for one of the ask0, ask1, …, askN ask anything slots. When the content is specified, the prompt is considered activated. Otherwise, it is deactivated.
ask[0-N]System
string
default:"none"
Specifies the custom system prompt for one of the ask0System, ask1System, …, askNSystem ask anything slots.
ask[0-N]Id
string
default:"none"
Specifies the prompt id for one of the ask0Id, ask1Id, …, askNId ask anything slots. The role of the prompt id is to identify the prompt in the responses. When unspecified, it falls back to the index of the slot (e.g. "0", "1").
ask[0-N]Format
string
default:"none"
Specifies the prompt response JSON Schema, encoded as a string, for one of the ask0Format, ask1Format, …, askNFormat ask anything slots. When unspecified, the response will not be structured in any particular way. When specified, the response will be a JSON object encoded as a string.
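The four ask[0-N] parameters belong together per slot, which the following sketch makes explicit. The helper build_ask_params and its input keys (prompt, system, id, schema) are hypothetical; only the resulting parameter names come from the list above. The JSON Schema is serialized with json.dumps because the Format parameter expects the schema encoded as a string.

```python
import json


def build_ask_params(prompts):
    """Map a list of prompt descriptions onto ask0, ask1, ... parameters.

    Each item is a dict with a required 'prompt' key and optional
    'system', 'id' and 'schema' keys (hypothetical input shape).
    """
    params = {}
    for slot, item in enumerate(prompts):
        params[f"ask{slot}"] = item["prompt"]
        if "system" in item:
            params[f"ask{slot}System"] = item["system"]
        if "id" in item:
            params[f"ask{slot}Id"] = item["id"]
        if "schema" in item:
            # The schema is sent as a string, not as a nested object.
            params[f"ask{slot}Format"] = json.dumps(item["schema"])
    return params


params = build_ask_params([
    {
        "prompt": "List the action items mentioned in the call.",
        "id": "actions",
        "schema": {
            "type": "object",
            "properties": {"items": {"type": "array", "items": {"type": "string"}}},
            "required": ["items"],
        },
    },
    {"prompt": "Who were the speakers?"},
])
print(json.dumps(params, indent=2))
```

The second slot omits the id, so its responses would be identified by the slot index "1", and it omits the format, so its response would be unstructured.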
enhancedTranscription
bool
default:"false"
Enable transcription enhancement.
etModel
[standard|enhanced]
default:"standard"
Balance the speed-accuracy ratio of the transcription enhancement model.
etVocabulary
string[]
default:"null"
Provide a list of words to enhance the transcription. These can be domain-specific terms that the model should pay special attention to.
etSystemPrompt
string
default:"null"
Provide a system prompt to guide the transcription enhancement model. This can help the model understand the context or specific requirements for the transcription.
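The transcription enhancement parameters could be assembled as shown below. This is a sketch under assumptions: the helper enhanced_transcription_config is hypothetical, and sending the result as a JSON body is assumed rather than confirmed by this page. Only the parameter names and the two etModel values are taken from the list above.

```python
import json


def enhanced_transcription_config(model="standard", vocabulary=None, system_prompt=None):
    """Build the enhancement-related parameters as a plain dict."""
    if model not in ("standard", "enhanced"):
        raise ValueError(f"etModel must be 'standard' or 'enhanced', got {model!r}")
    config = {"enhancedTranscription": True, "etModel": model}
    # Optional parameters are left out so that their documented default (null) applies.
    if vocabulary:
        config["etVocabulary"] = list(vocabulary)
    if system_prompt:
        config["etSystemPrompt"] = system_prompt
    return config


config = enhanced_transcription_config(
    model="enhanced",
    vocabulary=["tachycardia", "metoprolol"],
    system_prompt="The recording is a cardiology consultation.",
)
print(json.dumps(config, indent=2))
```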