V-Blaze Online Help

Top-level Elements

The following table describes the top-level elements included in a JSON transcript.

Table 1. Top-level Elements

Element

Availability

Type

Definition

asr

V‑Blaze v6.1+

value

Version number of the automatic speech recognition server being used

audiosecs

V‑Blaze v6.1+

value

Duration of audio, in seconds, in the stream.

confidence

All

value

A measure of how confident the speech recognition system is in its transcription results

  • Range between 0 and 1

  • 1 is most confident

donedate

All

value

Date and time the file transcription was completed by the speech-to-text engine, meaning the last utterance finished.

utterances

All

array

Each audio file is broken up into segments of speech called utterances. The utterances array contains the word transcripts and corresponding metadata organized by utterances.

sentiment

All

value

Linguistic sentiment value:

  • Positive

  • Mostly Positive

  • Neutral

  • Mostly Negative

  • Negative

  • Mixed (contains both Positive and Negative in the file)

nchannels

All

value

Number of channels in the audio file unless diarization is set to true, in which a single (1) channel file is broken up into 2 based on speaker separation

sentiment_scores

All

array

Array of length 2. [0]=Positive phrase counts and [1]=Negative phrase counts in the file

streamtags

V‑Blaze v6.1+

A list of tags or values specified by the user. This is useful for debugging and verification. It is also useful for tagging the output with user-level metadata (for example, tags that have meaning to the user for filtering or association).

model

All

string containing model name if one model was specified;

array of model names if multiple models were specified

Language model(s) specified for transcription

recvdate

All

value

Date and time the audio file was received by the ASR engine and placed in queue.

recvtz

V‑Blaze, V‑Cloud

array

An array containing two values:

  • time zone abbreviation of the timezone in which the ASR engine is running

  • offset in seconds from UTC for the time on the ASR engine

requestid

All

value

The unique identifier for the request.

scrubbed

All

value

If true then audio is purified so numbers are all redacted. If false, the data name does not appear in the JSON output.

source

All

value

Audio file name

started

V‑Blaze v6.1+

value

Date and time the stream started. This is most useful for measuring real-time transcription.

ended

V‑Blaze v6.1+

value

Date and time the stream ended. This is most useful for measuring real-time transcription.