V-Blaze and V-Cloud Online Help

Single-Channel (Mono) Versus Channel-Separated Audio

It is important to distinguish between single-channel (mono) and channel-separated audio.

  • In mono audio, all speakers are recorded on a single channel.

  • In channel-separated audio, each speaker is isolated to a distinct channel.

Channel-separated audio makes it possible to transcribe each channel independently and maintain a perfect correspondence between the person speaking and the words spoken. For analytic purposes, it is important to have each speaker on a separate channel.

For example, channel-separated audio not only decouples overtalk from overall accuracy, but it also allows for an objective measurement of the overtalk in calls. However, in single-channel (mono) audio, the greater the overtalk, the lower overall accuracy will be.

Voci employs a process called diarization of mono audio to separate speakers into separate channels. The effectiveness of diarization is decreased when source audio includes hold music, voice recordings, or more than two speakers. Overtalk may also negatively impact diarization accuracy. However, for typical agent and caller situations with only two speakers, diarization is very effective for separating speakers to their own channels for enhanced analytics.


Recording channel-separated source audio instead of mono will typically generate a 10% accuracy increase. Voci highly recommends using channel-separated audio for transcription.