Skip to content

Install Prerequisites

Handling audio content in Botium Box is based on Botium Speech Processing, a unified, developer-friendly API to the best available free and Open-Source Speech-To-Text and Text-To-Speech services.

Launch Botium Speech Processing Service

Botium Speech Processing comes with a reasonable default configuration.

Note

Both of them are free and Open Source and a good match to get started with voice technologies, on the other hand they are without a doubt among the best free voice tools available.

Launching it can be done with a few command line calls.

$ git clone https://github.com/codeforequity-at/botium-speech-processing.git
$ cd botium-speech-processing
$ docker-compose up -d

Note

Depending on network speed and hardware this step can take a while.

Pointing your browser to http://localhost will show the API explorer for Botium Speech Processing.

Connect Botium Speech Processing Service to Botium Box

In order to enable audio capabilities in Botium Box, you have to add two environment variables to Botium Box Server (see here for details):

  • SPEECH_PROCESSING_ENDPOINT - the URL to your Botium Speech Processing installation

  • SPEECH_PROCESSING_APIKEY - in case you configured Botium Speech Processing API Key protection