V-Blaze and V-Cloud Online Help

V‑Cloud version 1.6-2020.10.07 (October 2020)

V‑Cloud 1.6 includes several new features and parameters, including:

  1. Language identification (LID) has been enhanced with new option parameters, JSON output elements, and other functionality improvements.

    1. Added new optional parameters for lid

      Table 1. New optional parameters for lid

      Name

      Values

      Description

      lidoffset=N

      integer

      Delay start of LID until specified (N) seconds into audio. If there is not enough audio left after offset, this will process preceding utterances in reverse.



    2. Improved logic for LID decisions with low scores.

      When LID scoring is below the decision threshold, the ASR engine will transcribe the audio with the language model specified by the model tag (or the default model for the ASR configuration if model is not explicitly provided). The results are indicated by a lidinfo.langfinal or lidinfo["n"].langfinal element in the JSON output.

    3. Made additions to JSON output.

      • langinfo - breakdown of language information that is added when there was more than one language detected.

      • langfinal - added when the language specified in LID is below threshold and not the default language.

    For more information on LID and using these parameters, refer to Receiving Language Identification Information.

  2. Added new debugging parameters and JSON elements to assist with improved warnings and logging when using substitutions.

    1. New debugging parameters

      Warning

      These parameters are intended for debugging purposes only and should not be used in production.

      Table 2. New substitution debugging parameters

      Name

      Values

      Description

      subst

      true, false (default), none

      The subst parameter can be used to enable or disable automatic system- and model-level substitutions.

      subst=true

      Enables system- and model-level substitutions

      subst=false

      Disables system-level substitutions; model-level substitutions still apply

      subst=none

      Disables both system- and model-level substitutions

      substinfo

      true, false (default)

      Provides substitution details in JSON transcripts.

      Set substinfo to true to include a top-level JSON object that indicates the applied substitution rules and a number count for each rule.

      In addition to the top-level JSON object, substinfo includes another JSON object in the metadata that details each substitution's location, the substitution rule applied, and the substitution rule source.



      For more details on these and other parameters, refer to the Substitutions section of the V-Cloud REST API.

    2. Added a new JSON output element: nsubs shows a count of substitutions applied at both top-level and utterance levels. When substinfo=true, nsubs will also include numtrans counts within the substinfo array. Top-level nsubs does not include numtrans counts. The nsubs element will not appear if no substitutions were applied.

    For more information, refer to the JSON Output Reference.

  3. Made quality improvements to eng2:largevocab and spa1:voicemail language models.

  4. Hinting is now supported for eng1 version 7 models. Hinting for version 5 models is no longer supported.

    For more information on hinting support, refer to the English page of the Language Models Reference.

  5. Made minor enhancements and fixes to speech-to-text output processing (textproc), including:

    • The system now preserves timestamps on backrefs in substitutions instead of interpolating.

    • Fixed inadvertent uppercase of English cased backrefs (for example, /\1/).

    • Eliminated unexpected behavior of pattern{min,max} when min=0.

  6. Made minor improvements to Spanish time formatting.

  7. Corrected the scope of emotion scoring to always score individual utterances.

  8. Eliminated rare edge case decode failures.