> ## Documentation Index
> Fetch the complete documentation index at: https://docs.vatis.tech/llms.txt
> Use this file to discover all available pages before exploring further.

# Pre-recorded Transcription

> Pre-recorded transcription specifications using the Vatis Streams API

### <Icon icon="sliders" iconType="light" size="24" />  Stream configuration template ids

* production all languages: `668115d123bca7e3509723d4`
* [production Romanian-only](/speech-to-text/pre-recorded/romanian-pre-recorded-transcription)

### <Icon icon="upload" iconType="light" size="24" />  Input data

* multimedia content: `wav`, `mp3`, `mp4`, `webm`, `flv`, etc.
* public `http` or `https` link to multimedia file encoded in `UTF-8`

### <Icon icon="download" iconType="light" size="24" />  Egress responses

* [Egress Response](/api-reference/schemas/egress/stream-sink-message)
  * [Transcription Response](/api-reference/schemas/egress/transcription-response)
  * [LLM Response](/api-reference/schemas/egress/llm-response)

### <Icon icon="file-code" iconType="light" size="24" />  Export response

* [Audio Intelligence Response](/api-reference/schemas/export/audio-intelligence-response)

### <Icon icon="wrench" iconType="light" size="24" />  Parameters

<AccordionGroup>
  <Accordion title="Transcription">
    <ParamField query="language" type="string | list" default="none">
      The language as country [ISO code](/speech-to-text/pre-recorded/transcription-languages) to transcribe the audio to. When not specified, it is automatically detected. When multiple languages are specified, the model will auto-detect one of the specified languages.

      If multiple languages are specified, the language will perform multilingual code switching between the specified languages. If none is specified, the model will switch between all supported languages.

      To provide multiple languages in the URL, simply add multiple `language` parameters. For example: `language=en&language=es`.
    </ParamField>

    <ParamField query="noSpeechThreshold" type="float" default="0.5">
      The threshold value to consider a segment as a speech segment. The value ranges from 0 to 1.
    </ParamField>

    <ParamField query="vad" type="bool" default="true">
      Enable or disable voice activity detection. Possible values: `true` or `false`.
    </ParamField>

    <ParamField query="splitStereo" type="bool" default="false">
      Enable or disable splitting stereo audio into two mono audio streams. Possible values: `true` or `false`.
    </ParamField>
  </Accordion>

  <Accordion title="Audio Intelligence">
    <ParamField query="summary" type="bool" default="false">
      Create a summary based on the upstream content.
    </ParamField>

    <ParamField query="summaryLength" type="string" default="brief">
      Summary length. Options are: `brief` or `detailed`.
    </ParamField>

    <ParamField query="summaryTone" type="string" default="conversational">
      Summary tone. Options are: `conversational` or `informative`.
    </ParamField>

    <ParamField query="summaryStructure" type="string" default="paragraphs">
      Summary structure. Options are: `paragraphs` or `bullet_points`.
    </ParamField>

    <ParamField query="sentimentAnalysis" type="bool" default="false">
      Perform sentiment analysis on the upstream content.
    </ParamField>

    <ParamField query="ask[0-N]" type="string" default="none">
      Specifies the custom prompt content for one of the `ask0`, `ask1`, ..., `askN` ask anything slots.
      When the content is specified, the prompt is considered activated. Otherwise, it is deactivated.
    </ParamField>

    <ParamField query="ask[0-N]System" type="string" default="none">
      Specifies the custom system prompt for one of the `ask0System`, `ask1System`, ..., `askNSystem` ask anything slots.
    </ParamField>

    <ParamField query="ask[0-N]Id" type="string" default="none">
      Specifies the prompt id for one of the `ask0Id`, `ask1Id`, ..., `askNId` ask anything slots.
      The role of the prompt id is to identify the prompt in the responses.
      When unspecified, it will fallback on the index of the slot (e.g. `"0"`, `"1"`).
    </ParamField>

    <ParamField query="ask[0-N]Format" type="string" default="none">
      Specifies the prompt response [JSON Schema](https://json-schema.org/overview/what-is-jsonschema) encoded as a string for one of the `ask0Format`, `ask1Format`, ..., `askNFormat` ask anything slots.

      When unspecified, the response not be structured in any particular way.
      When specified, the response will be a JSON object encoded as a string.
    </ParamField>

    <ParamField query="enhancedTranscription" type="bool" default="false">
      Enable transcription enhancement.
    </ParamField>

    <ParamField query="etModel" type="[standard|enhanced]" default="standard">
      Balance the speed-accuracy ration of the transcription enhancing model.
    </ParamField>

    <ParamField query="etVocabulary" type="string[]" default="null">
      Provide a list of words to enhance the transcription. Those can be domain-specific terms that the model should pay special attention to.
    </ParamField>

    <ParamField query="etSystemPrompt" type="string" default="null">
      Provide a system prompt to guide the transcription enhancement model. This can help the model understand the context or specific requirements for the transcription.
    </ParamField>
  </Accordion>
</AccordionGroup>
