DocumentCode
328048
Title
Feature sets in continuous speech recognition for the Portuguese language
Author
Santos, Sidney Cerqueira Bispo dos ; Alcaim, Abraham
Author_Institution
CETUC, Univ. Catolica de Rio de Janeiro, Brazil
fYear
1998
fDate
9-13 Aug 1998
Firstpage
126
Abstract
We evaluate the performance of different feature sets in continuous speech recognition systems for the Portuguese language. Results were obtained for the task of recognizing sequences of digits spoken in a fluent manner. We have investigated five parametric descriptions of speech, selected among the most-used ones in present continuous speech recognition systems. We show that the feature set providing the best results for the Portuguese language comprises 18 parameters, 15 derived from the PLP-cepstrum and 3 from the energy. In the speaker-independent mode, a word accuracy of 99.5% was obtained. The performance of a Mel-cepstrum-based set with 39 parameters was 99.3% word-accurate
Keywords
cepstral analysis; feature extraction; sequences; speech recognition; Mel-cepstrum-based set; PLP-cepstrum; Portuguese language; continuous speech recognition systems; energy parameters; feature sets; performance evaluation; sequences; Band pass filters; Discrete cosine transforms; Laboratories; Loudspeakers; Microphones; Natural languages; Signal processing; Signal to noise ratio; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Telecommunications Symposium, 1998. ITS '98 Proceedings. SBT/IEEE International
Conference_Location
Sao Paulo
Print_ISBN
0-7803-5030-8
Type
conf
DOI
10.1109/ITS.1998.713103
Filename
713103
Link To Document