V-Blaze and V-Cloud Online Help (May 2020)

output

Values: json (default), jsontop, text

Compatibility Values: jsonlist, jsontext, jsontextlist

Description:

Specifies transcript delivery format. The following outputs are supported:

output=json

Returns a JSON output that includes utterance and event details in a hierarchical structure.

output=jsontop

Returns a JSON output in which all utterance details are collapsed into a single, top-level text field.

output=text

Returns a plain text version of the transcript.
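As a minimal sketch of choosing one of the values above, the following Python helper validates an output value and encodes it as a key/value pair. The helper name output_query is hypothetical, and how the pair is attached to a transcription request (for example, as a query string or a form field) is an assumption that this section does not cover.

```python
from urllib.parse import urlencode

# Values listed in this section; "json" is the documented default.
CURRENT_OUTPUTS = {"json", "jsontop", "text"}
# Compatibility values kept for older applications.
LEGACY_OUTPUTS = {"jsonlist", "jsontext", "jsontextlist"}


def output_query(output="json"):
    """Return an encoded output=<value> pair (hypothetical helper)."""
    if output not in CURRENT_OUTPUTS | LEGACY_OUTPUTS:
        raise ValueError(f"unsupported output value: {output!r}")
    return urlencode({"output": output})


print(output_query("jsontop"))
```

Rejecting unknown values on the client side is a design choice of this sketch, not a documented requirement of the API.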

Note

Earlier releases included an outer list (enclosing square brackets) in the JSON output, which has since been removed. The structure of the inner JSON dictionary has not changed, and is now returned directly without the outer list. To produce JSON output in the list format that was previously used by the API, refer to the output parameters below.
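An application that must accept both the current format and the earlier list-wrapped format could normalize the response before processing it. The sketch below is illustrative only: the function name unwrap_transcript is hypothetical, the sample payloads are placeholders rather than real transcripts, and it assumes the outer list wraps exactly one dictionary.

```python
import json


def unwrap_transcript(payload):
    """Parse a JSON transcript and strip a legacy outer list if present."""
    data = json.loads(payload)
    if isinstance(data, list):
        # Assumption: the legacy outer list wraps a single dictionary.
        if len(data) != 1:
            raise ValueError("expected exactly one dictionary in the outer list")
        return data[0]
    return data


# Placeholder payloads for illustration, not real transcript content.
legacy = '[{"utterances": []}]'
current = '{"utterances": []}'
print(unwrap_transcript(legacy) == unwrap_transcript(current))
```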

Legacy Parameters for V‑Blaze

The following output values are legacy. To use one, set output explicitly to the desired value when making a transcription request.

output=jsonlist

Returns the JSON dictionary wrapped in an outer list (enclosing square brackets). Use this value to produce the same JSON structure that previous releases returned. Voci recommends updating any applications that depend on the old output formats so that they work with the new output formats.

output=jsontext

Returns text output in the text or list formats that were previously used. A JSON representation of the transcript is provided and stored in the source and utterances fields. The jsontext and jsontextlist output formats are provided so that pre-existing applications that depend on the old output format can continue to be used. Voci recommends updating any applications that depend on the old output formats so that they work with the new output formats.

Note

Text output was formerly a JSON representation of the transcript, stored in source and utterances fields. To produce text output in the format that was previously used by Voci APIs, you can specify the output as output=jsontextlist.

json is the recommended output format because it contains all of the information extracted from the speech signal in addition to the text of the transcript. json includes the following:

  • Confidence scores for each word and utterance

  • Start and stop time of each word and utterance

  • Gender, emotion, sentiment, and other valuable data

In most use cases, a transcript complete with all forms of metadata is preferable. However, the text and jsontop outputs can be useful during integration testing because they are easier to read.