V-Blaze and V-Cloud Online Help (May 2020)

uttmaxsilence

Values: integer

Description:

Specifies the maximum amount of silence in milliseconds that can occur between speech sounds without terminating the current utterance. Once a silence occurs that exceeds uttmaxsilence milliseconds, an utterance “cut” is made within the detected silent region.

The default value for uttmaxsilence is 800 milliseconds. This setting will not need to be modified except in unusually aggressive real-time deployments. In most cases, shortening uttmaxsilence to be less than 650 milliseconds will compromise accuracy. This decrease in accuracy worsens as uttmaxsilence is reduced towards its minimum setting of 100 milliseconds.

Note

When lowering the uttmaxsilence value, accuracy is reduced because the shorter threshold for splitting audio regions into utterances will result in shorter utterances on average. Shorter utterances mean that less contextual information available for error reduction.