DocumentCode :
2862514
Title :
The Meta-Pi network: connectionist rapid adaptation for high-performance multi-speaker phoneme recognition
Author :
Hampshire, John B., II ; Waibel, Alex H.
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
1990
fDate :
3-6 Apr 1990
Firstpage :
165
Abstract :
A multinetwork time-delay-neural-network (TDNN)-based connectionist architecture that allows multispeaker phoneme discrimination (/b,d,g/) to be performed at the speaker-dependent recognition rate of 98.4% is presented. The overall network gates the phonemic decisions of modules trained on individual speakers to form its overall classification decision. By dynamically adapting to the input speech and focusing on a combination of speaker-specific modules, the network outperforms a single TDNN trained on the speech of all six speakers (95.9%). To train this network a form of multiplicative connection called the Meta-Pi connection is developed. It is illustrated how the Mega-Pi paradigm implements a dynamically adaptive Bayesian MAP classifier. It learns-without supervision-to recognize the speech of one particular speaker (99.8%) using a dynamic combination of internal models of other speakers exclusively. The Meta-Pi model is a viable basis for a connectionist speech recognition system that can rapidly adapt to new speakers and varying speaker dialects
Keywords :
Bayes methods; adaptive systems; computer architecture; computerised signal processing; learning systems; neural nets; speech recognition; Meta-Pi network; classification decision; connectionist rapid adaptation; dynamically adaptive Bayesian MAP classifier; multi-speaker phoneme recognition; multispeaker phoneme discrimination; speaker-dependent recognition rate; speaker-specific modules; time-delay-neural-network; Adaptive systems; Bayesian methods; Character recognition; Computer architecture; Computer science; Databases; Neural networks; Robustness; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.1990.115564
Filename :
115564
Link To Document :
بازگشت