V-Blaze and V-Cloud Online Help

Receiving Transcription Results

A successful call to the V‑Cloud API returns the transcript in the default (JSON) format or whatever other format you specified with the output  parameter in your call to V‑Cloud's /transcribe endpoint.

A callback server is generally used to collect output and forward it to some other application, process the transcript itself, or perhaps simply to preserve the output for subsequent use. Using the sample callback server that was introduced earlier, transcripts are written to the standard output for the shell in which you executed the netcat command.

curl -F "file=@sample7.wav;type=audio/x-wav" -F output=text \
     -F token=0123456789ABCDEFGHIJ0123456789ABS \
     -F callback=http://196.168.6.64:5555 \
     https://vcloud.vocitec.com/transcribe

This call would return a message like the following:

{"requestid":"3b1c30e0-e62e-4da0-9487-c2f2c76310c7"}

The following example shows the output that the netcat callback server displays after a call to that server when text output was requested:

POST / HTTP/1.1
Content-Type: multipart/form-data;boundary=x7UiTsbnoupKk6ndj9DxpOvyt6NtDFjnn3K0OC
User-Agent: Java/1.7.0_161
Host: 73.174.3.131:5555
Accept: text/html, image/gif, image/jpeg, *; q=.2, */*; q=.2
Connection: keep-alive
Content-Length: 1100

--x7UiTsbnoupKk6ndj9DxpOvyt6NtDFjnn3K0OC
Content-Disposition: form-data; name="requestid"
Content-Type: text/plain;charset=UTF-8
Content-Length: 36

3b1c30e0-e62e-4da0-9487-c2f2c76310c7
--x7UiTsbnoupKk6ndj9DxpOvyt6NtDFjnn3K0OC
Content-Disposition: form-data; name="file"; filename="sample7.txt"
Content-Type: text/plain
Content-Length: 702

Thank you for calling Center point energy technical support. I understand you need to report a gas leak and I have your name please
my name is Joe and I thank you Mr. Know what is your address or account number
my address and then one Martin Houston, Texas is there. Anyone inside the house? I know everyone is out of the house. I notice the strange smell when I got home and I called you I am sending and gas technician to your home to fix the problem. Could you give me a good number to reach you at
you can call 28195345 zero's.
Thank you, please be safe and wait for the technician to arrive call us back if anything changes.
Thank you, bye. Good bye and thank you for calling Center point energy.

--x7UiTsbnoupKk6ndj9DxpOvyt6NtDFjnn3K0OC--

As discussed earlier, the goal of a callback server is to enable another application to receive and directly interact with the transcripts produced by V‑Cloud. However, a simple callback server such as the one used in this section can also be convenient when testing the effects of trying different options with calls to V‑Cloud's /transcribe endpoint.

For example, the following is the callback server's output after transcribing the same sample audio file using the output=text  option and adding the diarize=true  option:

POST / HTTP/1.1
Content-Type: multipart/form-data;boundary=PX0PI61Stzs4xNw-G7SyqnxXPcstL3PEmbF
User-Agent: Java/1.7.0_161
Host: 73.174.3.131:5555
Accept: text/html, image/gif, image/jpeg, *; q=.2, */*; q=.2
Connection: keep-alive
Content-Length: 1096

--PX0PI61Stzs4xNw-G7SyqnxXPcstL3PEmbF
Content-Disposition: form-data; name="requestid"
Content-Type: text/plain;charset=UTF-8
Content-Length: 36

9ca9674c-65b7-46a7-aa06-1ceaeae9d8af
--PX0PI61Stzs4xNw-G7SyqnxXPcstL3PEmbF
Content-Disposition: form-data; name="file"; filename="sample7.txt"
Content-Type: text/plain
Content-Length: 707

Thank you for calling Center point energy to technical support.
I understand you need to report a gas leak and I have your name please.
My name is John.
Thank you, Mr. Darrow.
What is your address or account number?
My address is and and Walmart in Houston Texas.
Is there anyone inside the house?
Know everyone is out of the house so I noticed the strange now when I
got home and I called you.
Hi, I'm sending a gas technician to your home to fix the problem.
Could you give me a good number to reach you at.
You can call 281-953-4507.
Thank you, please be safe and wait for the technician to arrive call us back if anything changes.
Thank you bye.
Good, bye and thank you for calling Center point energy.

--PX0PI61Stzs4xNw-G7SyqnxXPcstL3PEmbF--

In the example output, you can see that enabling V‑Cloud's diarize option has improved the identification of the different speakers on the call, even though the audio file is still in mono.