Title :
Phone-context specific gender-dependent acoustic-models for continuous speech recognition
Author :
Neti, C. ; Roukos, S.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Gender dependent systems are usually created by splitting the training data into each gender and building two separate acoustic models for each gender. This method assumes that every state of a subphonetic model is uniformly dependent on the gender. We use the premise that the acoustic realizations of various sub phonetic units are dependent on gender in varying degrees across phones and more particularly context dependent. We show that this is indeed the case by using gender as a question in addition to phone context questions in the context decision trees. Using these trees we build phone specific gender dependent acoustic models and demonstrate a novel method to pick between genders during decoding based on a measure of confidence of the decoded hypothesis. An improvement of 6.3% in word error is achieved relative to a gender independent system
Keywords :
acoustic signal processing; decision theory; speech coding; speech recognition; trees (mathematics); acoustic models; acoustic realizations; context decision trees; context dependent; continuous speech recognition; decoded hypothesis; decoding; gender dependent systems; gender independent system; phone context questions; phone context specific gender dependent acoustic models; sub phonetic units; subphonetic model; training data; word error; Acoustic measurements; Context modeling; Decision trees; Decoding; Hidden Markov models; Speech analysis; Speech recognition; Training data;
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
DOI :
10.1109/ASRU.1997.659005