watch me try to do speech recognition on Linux: pocketsphinx

The two programs, _batch and _continuous, take mostly the same commandline parameters. However, I've read that _continuous is more suitable for, well, continuous speech; I guess _batch is for collections of short utterances, such as voice-control commands. No, thanks, I hate voice interfaces!

However, here is an example of a suggested commandline for _batch:

pocketsphinx_batch \
-argfile argfile.txt \
-cepdir <path>/data \
-ctl ctlfile.txt \
-cepext .wav \
-adcin true \
-hyp out.txt

Where ctlfile.txt contains the name(s) of the audio input file(s), located in "cepdir", one filename per line; the names must *not* include the file extension, such as .wav, this is instead specified by -cepext.
("cepdir" can simply be the current directory: "-cepdir ." works, in Linux.)

argfile.txt contains three parameters with their values, each being a filename with directory path. These specify the libraries and acoustic models and stuff, by which the speech translation is made. If your results end up having poor accuracy, these will be the files to replace with your own, hopefully better, versions. In Debian, I found that the default install of the pocketsphinx package put all the files where they were expected: I just used the filenames and paths as given in many of the Linux examples, and everything worked. My parameters are as follows:

    -hmm /usr/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k \
    -dict /usr/share/pocketsphinx/model/lm/en_US/cmu07a.dic \
    -lm /usr/share/pocketsphinx/model/lm/en_US/hub4.5000.DMP

So a lot of this should work with _continuous, though I guess not -ctl... I'll be trying it soon, in any case, because I think _continuous is the one I need.

(...later) Yes, indeed, when I try to run _batch on my file, I get:

FATAL_ERROR: "acmod.c", line 532: Batch processing can not process more than 32767 frames at once, requested 259043

(My audio is about 30 minutes of conversation.)

watch me try to do speech recognition on Linux

Tuesday, November 5, 2019

pocketsphinx_batch

No comments:

Post a Comment