Pre-recorded Transcription (Romanian)

Stream configuration template ids

production: 678e26e695f8f5742e8718f4

Input data

multimedia content: wav, mp3, mp4, webm, flv, etc.
public http or https link to multimedia file encoded in UTF-8

Egress responses

Egress Response
- Transcription Response
- LLM Response

Export response

Audio Intelligence Response

Parameters

Transcription

disfluencies

bool

default:"false"

A disfluency is a part of speech not being smooth and continuous. When disfluencies are disabled (default), all this discontinuous parts from the transcript are removed.

punctuationCapitalization

bool

default:"true"

Add punctuation and capitalization to the transcription.

numeralsConversion

bool

default:"true"

Convert the numerals from their word form to their numeric form (e.g. twenty two -> 22).

splitStereo

bool

default:"false"

Enable or disable splitting stereo audio into two mono audio streams. Possible values: true or false.

findReplace

bool

default:"true"

Enable the find-replace functionality performed using the user-defined findReplaceExpressions.

findReplaceExpressions

list[object]

default:"null"

Find-replace expressions.

Specifying find-replace expressions is not available using URL query parameters.

Attributes

replacement

string

required

The replace string.

regex

list[string]

required

Regex expressions to find.

merge

string

default:"STANDALONE"

Replacement merge strategy relative to the left and right neighbours.

STANDALONE: keep the replacement as a standalone word
MERGE_LEFT: concatenate the replacement with the left neighbour
MERGE_RIGHT: concatenate the replacement with the right neighbour
MERGE_LEFT_RIGHT: concatenate the replacement with both the left and the right neighbours
MERGE_LEFT_CAPITALIZE_NEXT: concatenate the replacement with the left neighbour and capitalize the right neighbour

enabledOnPrerecordedFiles

bool

default:"true"

Flag to enable this current expression in the find-replace operation.

Audio Intelligence

summary

bool

default:"false"

Create a summary based on the upstream content.

summaryLength

string

default:"brief"

Summary length. Options are: brief or detailed.

summaryTone

string

default:"conversational"

Summary tone. Options are: conversational or informative.

summaryStructure

string

default:"paragraphs"

Summary structure. Options are: paragraphs or bullet_points.

sentimentAnalysis

bool

default:"false"

Perform sentiment analysis on the upstream content.

ask[0-N]

string

default:"none"

Specifies the custom prompt content for one of the ask0, ask1, …, askN ask anything slots. When the content is specified, the prompt is considered activated. Otherwise, it is deactivated.

ask[0-N]System

string

default:"none"

Specifies the custom system prompt for one of the ask0System, ask1System, …, askNSystem ask anything slots.

ask[0-N]Id

string

default:"none"

Specifies the prompt id for one of the ask0Id, ask1Id, …, askNId ask anything slots. The role of the prompt id is to identify the prompt in the responses. When unspecified, it will fallback on the index of the slot (e.g. "0", "1").

ask[0-N]Format

string

default:"none"

Specifies the prompt response JSON Schema encoded as a string for one of the ask0Format, ask1Format, …, askNFormat ask anything slots.When unspecified, the response not be structured in any particular way. When specified, the response will be a JSON object encoded as a string.

Vatis Docs

Speech to Text

Audio Intelligence

Infrastructure

Integration

Stream configuration template ids

Input data

Egress responses

Export response

Parameters

Vatis Docs

Speech to Text

Audio Intelligence

Infrastructure

Integration

​ Stream configuration template ids

​ Input data

​ Egress responses

​ Export response

​ Parameters

Stream configuration template ids

Input data

Egress responses

Export response

Parameters