Title :
Analysis of different acoustic front-ends for automatic voice over IP recognition
Author :
Falavigna, Daniele ; Matassoni, Marco
Author_Institution :
ITC-irst, Trento, Italy
fDate :
30 Nov.-3 Dec. 2003
Abstract :
We investigated the usage for automatic speech recognition of different acoustic features, obtained from the output bitstream of a voice over IP codec. In particular, we analyzed the influence, on recognition performance, of both analysis rate and vector quantization of acoustic parameters introduced by the codec. Particular care has to be taken to train acoustic models at the reduced analysis rate employed by the codec: some related issues are discussed in the paper. We also used a model for simulating packet loss and we measured the corresponding performance degradation. Experiments were carried out on both clean and noisy speech databases.
Keywords :
Internet telephony; feature extraction; parameter estimation; speech codecs; speech recognition; vector quantisation; acoustic features; acoustic front-ends; acoustic model training; acoustic parameters; analysis rate; automatic speech recognition; packet loss simulation; recognition performance; vector quantization; voice over IP codec; Acoustic measurements; Automatic speech recognition; Codecs; Degradation; Internet telephony; Loss measurement; Performance analysis; Performance loss; Speech analysis; Vector quantization;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
DOI :
10.1109/ASRU.2003.1318468