Response from the transcription service

transcription
string

The transcribed text of the audio

words
object[]

The words of the transcription. May be null if the request specified that words should not be included

start
integer

The start time of the segment in milliseconds

end
integer

The end time of the segment in milliseconds

metadata
object

The metadata of the segment

channel
integer

The audio channel of the segment

utterance
boolean

Whether the segment marks the end of an utterance
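
As a rough illustration of how a client might consume these fields, the sketch below models the response as a Python `TypedDict` and parses it from JSON. It assumes that all fields sit at the top level of a single JSON object and that the response arrives as a JSON string; the listing above does not confirm either point. The names `TranscriptionSegment`, `parse_segment`, `segment_duration_ms`, and `word_count` are hypothetical helpers, not part of the service, and the shape of each entry in `words` is left untyped because it is not documented here.

```python
import json
from typing import Any, Optional, TypedDict


class TranscriptionSegment(TypedDict):
    """Assumed flat layout of the response fields listed above."""
    transcription: str
    words: Optional[list[dict[str, Any]]]  # may be null; entry shape not documented
    start: int       # milliseconds
    end: int         # milliseconds
    metadata: dict[str, Any]
    channel: int
    utterance: bool


# Fields this sketch treats as always present (an assumption, not a guarantee).
REQUIRED_FIELDS = ("transcription", "start", "end", "metadata", "channel", "utterance")


def parse_segment(raw: str) -> TranscriptionSegment:
    """Decode a JSON string and check that the expected fields are present."""
    data = json.loads(raw)
    missing = [name for name in REQUIRED_FIELDS if name not in data]
    if missing:
        raise ValueError(f"response is missing fields: {missing}")
    return data


def segment_duration_ms(segment: TranscriptionSegment) -> int:
    """Length of the segment in milliseconds, from its start and end times."""
    return segment["end"] - segment["start"]


def word_count(segment: TranscriptionSegment) -> int:
    """Number of word entries, treating a null or absent `words` as empty."""
    return len(segment.get("words") or [])
```

Treating a null `words` value as an empty list keeps callers from having to branch on whether word-level output was requested.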