V-Blaze and V-Cloud Online Help

Adjusting for Different Types of Input

Table 1. Adjusting for different types of input





false (default), true, noise

Diarization is the process of recognizing distinct speakers on a single (i.e. mono) audio channel and segmenting transcribed speech into separate channels, which are identified in JSON output. Voci’s diarization capability is designed to do this for two speakers, typically a call agent engaged in a conversation with a client over the phone.

You should only set diarize to true under the following conditions:

  • You know that your audio only contains a single audio channel

  • You know that 2 people are talking on the channel

  • Segregation of 2 speakers in the transcripts is important for your use case

The noise setting is typically not needed. However, if you are experiencing excessive diarization errors due to interference from music or other non-speech sources, you can apply noise reduction by setting diarize=noise.

Diarization is a licensed optional feature.


If diarize and any redaction options are used together, redaction accuracy is somewhat reduced. For maximum redaction accuracy, do not activate diarization when using any of the redaction options.


false (default), true

Determines whether V‑Cloud should use its built-in decoders to try to convert incoming audio into a supported format, if necessary. This option cannot be used with the truncate option.

Transcode functionality supports an extensive set of open audio formats. Submit your audio in a request to determine if the audio format is supported.