Title :
HMM adaptation and microphone array processing for distant speech recognition
Author :
Kleban, Jim ; Gong, Yifan
Author_Institution :
CAIP Center, Rutgers Univ., Piscataway, NJ, USA
Abstract :
Connected strings of seven digits from the TIDIGITS database were recorded in a reverberant office room for evaluation using microphone array processing and HMM, hidden Markov model, adaptation. A sixteen-channel linear microphone array records a distance speech database useful for further experimentation. The adaptation techniques of parallel model combination (PMC) and maximum likelihood linear regression (MLLR) are evaluated and compared. The effect of the number of adaptation utterances and number of vectors per class for the regression tree in order to optimize MLLR results are studied. Results show, compared to no adaptation, 40% word error reduction (improvement to 4.2%) for PMC and 60% word error reduction (improvement to 3.0%) for MLLR
Keywords :
acoustic signal processing; acoustic transducer arrays; architectural acoustics; array signal processing; hidden Markov models; maximum likelihood estimation; microphones; speech recognition; HMM adaptation; MLLR; PMC; TIDIGITS database; adaptation utterances; distance speech database; distant speech recognition; hidden Markov model; maximum likelihood linear regression; microphone array processing; parallel model combination; reverberant office room; sixteen-channel linear microphone array; Array signal processing; Databases; Hidden Markov models; Maximum likelihood linear regression; Microphone arrays; Signal processing; Speech enhancement; Speech processing; Speech recognition; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.861853