| |
This page describes how to download and install a recognition server based on
our prototype system that integrates with the DARPA Communicator client/server
architecture. This server decodes audio data and displays the resulting
hypothesis in a text window. A demo program is included that makes it easy to
get started.
There are five easy steps to Communicator bliss:
- Step 1: Install the DARPA Communicator system.
- Step 2: Install the ISIP Prototype System.
- Step 3: Install the recognition demo server.
- Step 4: Run the demo. "
- Step 5: Modify the hub program file.
Step 1: Install the DARPA Communicator system
In order to run this server, one must have previously installed the
DARPA Communicator system and be somewhat familiar with its operation.
Step 2: Install the prototype system
Download the latest version of the
ISIP prototype system
in which we provide an integrated API for the DARPA Communicator system.
The system can be easily installed following the steps below:
- tar xzvf isip_proto_v5.9.tar.gz
- cd isip_proto
- ./configure --prefix=.
- source ISIP_ENV.sh
- make
- make install
In addition to installing the recognition binaries, this step builds a library
containing the API that will be linked into the applications described below.
Step 3: Install the recognition server demo
Once the DARPA Communicator and ISIP prototype systems have been successfully
installed, we can download the
recognition server demo (Solaris x-86 and linux) or
recognition server demo (Solaris sparc)
and proceed with building the server program:
- tar xzvf rec_demo_v2_0.tar.gz
- ln -s rec_demo_v2_0 rec_demo
- cd rec_demo
- ./configure [--prefix=Communicator install directory]
- source GC_ENV.sh
- make
Note that the --prefix option is used to specify the directory in which you
installed Communicator. If you have set the Communicator environment variable,
GC_ROOT, you do not need to use the --prefix option. The default
directory for the Communicator installation is /usr/local/communicator.
Before compiling the recognition server, make sure you have sourced the ISIP
environment (source ISIP_ENV.sh) to establish the proper run-time
environment for the prototype system.
Step 4: Run the application demo
Steps 1 to 3 above must have been successfully completed for the next step to
work. The recognition demo program can now be invoked as follows:
Before running the recognition server, make sure you have sourced the
Communicator environment in step 3 (source GC_ENV.sh) to establish the
necessary environment.
Now you are finally ready to run an audio server that will send data to the
recognizer in batch mode:
The audio demo server reads data from a file and sends it to the recognition
server. The recognition server decodes the audio data and displays the
hypothesis at the end of the utterance in a text window.
Step 5: Modify the hub program file
The demo_pgm.text can be easily modified to operate in your local environment:
;; -------------------------------------------------
;; SERVERS
;; -------------------------------------------------
SERVER: recognize_serv
;; HOST: 130.18.6.111
HOST: localhost
PORT: 12346
OPERATIONS: reinitialize recognize_speech
;; -------------------------------------------------
;; CONDITIONS (default order)
;; -------------------------------------------------
RULE: :binary_port & :binary_host & :call_id --> recognizer_speech
IN: :binary_port :binary_host :call_id
OUT: none!
The tag recognize_speech handles incoming audio broker requests,
decodes the received audio data and outputs the hypothesis. Currently, the
recognizer expects the broker to send GAL_INT_16 or GAL_BINARY data and
expects the audio server to send the following control messages of type
GAL_STRING:
- "new_utt": audio is about to be sent
- "done": no more samples will be sent (end of utterance)
Note: A demonstration system based on a real-time Resource Management system
is also included in the final_model/ directory of the demo release. The
acoustic models for this system were trained on 16kHz sampled data. Hence, it
is important that the Audio server sends compatible sampled data.
Success!
Kick back, smoke a victory cigar, and recite George Peppard s famous line from
the A-Team: "I love it when a plan comes together."
|
|
|
|