Top-level Elements
The following table describes the top-level elements included in a JSON transcript.
Refer to Transcription Parameters for more information on the stream tags used to generate the elements that appear in these sections.
Element | Availability | Type | Definition |
---|---|---|---|
emotion | All | value | Linguistic sentiment value. Possible values include:
|
confidence | All | value | A measure of how confident the speech recognition system is in its transcription results. Values range from 0 to 1, with 1 being the most confident. |
rawemotion | All | value | Acoustic emotion values. Possible values in version 7.1+ include:
Acoustic emotion values prior to version 7.1 include:
|
donedate | All | value | Date and time the file transcription was completed by the speech-to-text engine, meaning the last utterance finished. |
utterances | All | array | Each audio file is broken up into segments of speech called utterances. The utterances array contains the word transcripts and corresponding metadata organized by utterance. |
license | All | value | Identification information for the license used. |
nchannels | All | value | Number of channels in the audio file, unless diarization is set to true, in which case a single-channel (1) file is split into 2 channels based on speaker separation. |
scrubbed | All | value | If true, the audio was scrubbed so that all numbers are redacted. If false, this element does not appear in the JSON output. |
sentiment | All | value | Linguistic sentiment value:
|
sentiment_scores | All | array | Array of length 2, where [0] is the count of positive phrases and [1] is the count of negative phrases in the file. |
source | All | value | The audio file name. |
gender | All | value | The gender identified for the audio. |
model | All | string containing model name if one model was specified; array of model names if multiple models were specified | Language model(s) specified for transcription. For example: "model": "eng1:callcenter" |
recvdate | All | value | Date and time the audio file was received by the ASR engine and placed in the queue. |
requestid | All | value | The unique identifier for the request. |
nsubs | V‑Blaze version 7.1+ | value | The number of substitutions applied. This tag will not appear if no substitutions were applied. This value does not include |
V‑Spark only | object | A JSON object that stores call metadata associated with the audio file. | |
V‑Spark only | object | A JSON object that stores metadata generated by V‑Spark. | |
warning | V‑Blaze version 5.6.0-3+ | string | Describes a problem or issue that was encountered during transcription. A common example is substitution errors. |
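
The following Python sketch shows one way a client might read several of the top-level elements described in the table. It is illustrative only: the function name `summarize_transcript` and every value in the sample transcript are hypothetical, and the sample contains only a handful of elements. The sketch assumes that `model` may be either a string or an array, and that `scrubbed`, `nsubs`, and `warning` may be absent from the output.

```python
import json


def summarize_transcript(transcript: dict) -> dict:
    """Collect a few top-level elements from a parsed JSON transcript."""
    # "model" is a string when one model was specified and an array
    # when several were specified, so normalize it to a list.
    model = transcript.get("model", [])
    models = [model] if isinstance(model, str) else list(model)

    # sentiment_scores is [positive phrase count, negative phrase count].
    scores = transcript.get("sentiment_scores") or [None, None]

    return {
        "models": models,
        "confidence": transcript.get("confidence"),
        "positive_phrases": scores[0],
        "negative_phrases": scores[1],
        # These elements may be omitted from the output, so fall back
        # to defaults instead of indexing directly.
        "scrubbed": transcript.get("scrubbed", False),
        "nsubs": transcript.get("nsubs", 0),
        "warning": transcript.get("warning"),
        "utterance_count": len(transcript.get("utterances", [])),
    }


# Hypothetical sample; all values are made up for illustration.
sample = json.loads("""
{
  "model": "eng1:callcenter",
  "confidence": 0.87,
  "nchannels": 2,
  "sentiment_scores": [3, 1],
  "utterances": [{}, {}]
}
""")

summary = summarize_transcript(sample)
print(summary["models"], summary["utterance_count"])
```

Using `dict.get` with a default keeps the code from raising `KeyError` on transcripts where an optional element was not emitted.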