DocumentCode
2023371
Title
In search for the relevant parameters for speaker independent speech recognition
Author
Smolders, Johan ; Van Compernolle, Dirk
Author_Institution
K. Univ., Leuven, Heverlee, Belgium
Volume
2
fYear
1993
fDate
27-30 April 1993
Firstpage
684
Abstract
One of the problems with speaker-independent speech recognition is the huge amount of training data required, which implies a high cost. The performance of a discrete density hidden-Markov-model speaker-independent speech recognition system when using a small set of examples for training is investigated. By using LPC (linear prediction coding)-based analysis, an approximately 12% error rate was obtained on a highly confusable telephone-quality vocabulary. Using RASTA PLP analysis, a 4% error rate can be achieved. The reason for this improvement is that the RASTA filter filters out the convolutional noise of the different telephone lines and PLP analysis suppresses speaker-dependent details. RASTA filtering was also tried out on the LPC cepstra and gives, with a higher model order, the same results as RASTA PLP.<>
Keywords
discrete systems; hidden Markov models; learning (artificial intelligence); linear predictive coding; speech recognition; vocabulary; LPC cepstra; RASTA PLP analysis; convolutional noise; discrete density hidden-Markov-model; linear prediction coding; performance; speaker-independent speech recognition; telephone-quality vocabulary; training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location
Minneapolis, MN, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.1993.319403
Filename
319403
Link To Document