uttmaxsilence
Values: integer
Description:
Specifies the maximum amount of silence, in milliseconds, that can occur between speech sounds without terminating the current utterance. Once a silence exceeds uttmaxsilence milliseconds, an utterance “cut” is made within the detected silent region.
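The cut rule described above can be illustrated with a minimal sketch. This is not the product's implementation. The function name split_into_utterances and the (start_ms, end_ms) representation of detected speech regions are assumptions made for this example only.

```python
from typing import List, Tuple

# Hypothetical representation: (start_ms, end_ms) of one detected speech region.
Segment = Tuple[int, int]


def split_into_utterances(
    speech: List[Segment], uttmaxsilence: int = 800
) -> List[List[Segment]]:
    """Group time-ordered speech regions into utterances.

    Illustrative only: a new utterance starts whenever the silence
    between two consecutive speech regions exceeds uttmaxsilence ms.
    """
    utterances: List[List[Segment]] = []
    for seg in speech:
        if utterances:
            previous_end = utterances[-1][-1][1]
            silence = seg[0] - previous_end
            if silence <= uttmaxsilence:
                # Silence is within the threshold: same utterance continues.
                utterances[-1].append(seg)
                continue
        # First region, or silence exceeded the threshold: make a cut.
        utterances.append([seg])
    return utterances


# Silences between regions are 500 ms and 900 ms.
regions = [(0, 1000), (1500, 2500), (3400, 4000)]
default_split = split_into_utterances(regions)        # cut only at the 900 ms gap
aggressive_split = split_into_utterances(regions, 400)  # cut at both gaps
```

With the default of 800 milliseconds, only the 900 ms silence produces a cut, so the first two regions stay in one utterance. Lowering the threshold to 400 milliseconds cuts at both silences and yields three shorter utterances.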
The default value for uttmaxsilence is 800 milliseconds. This setting should not need to be modified except in unusually aggressive real-time deployments. In most cases, shortening uttmaxsilence to less than 650 milliseconds will compromise accuracy. The loss of accuracy worsens as uttmaxsilence is reduced toward its minimum setting of 100 milliseconds.
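As a rough sketch, a client could check a candidate value against these limits before submitting it. The helper name check_uttmaxsilence and the decision to raise an error or emit a warning are assumptions for illustration; how the product itself handles out-of-range values is not stated here.

```python
import warnings

# Values taken from the description above.
DEFAULT_UTTMAXSILENCE_MS = 800
MIN_UTTMAXSILENCE_MS = 100
ACCURACY_WARNING_MS = 650


def check_uttmaxsilence(value: int = DEFAULT_UTTMAXSILENCE_MS) -> int:
    """Validate a candidate uttmaxsilence value (hypothetical client-side helper)."""
    if isinstance(value, bool) or not isinstance(value, int):
        raise TypeError("uttmaxsilence must be an integer number of milliseconds")
    if value < MIN_UTTMAXSILENCE_MS:
        raise ValueError(
            f"uttmaxsilence must be at least {MIN_UTTMAXSILENCE_MS} ms, got {value}"
        )
    if value < ACCURACY_WARNING_MS:
        # Allowed, but the documentation says accuracy will usually suffer.
        warnings.warn(
            f"uttmaxsilence={value} ms is below {ACCURACY_WARNING_MS} ms; "
            "expect reduced accuracy",
            stacklevel=2,
        )
    return value
```

The warning threshold is kept separate from the hard minimum because values between 100 and 650 milliseconds are permitted, only discouraged.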
Note
Lowering the uttmaxsilence value reduces accuracy because the shorter threshold for splitting audio regions into utterances results in shorter utterances on average. Shorter utterances mean that less contextual information is available for error reduction.