Title :
Composite background models and score standardization for language identification systems
Author :
Gleason, Terry P. ; Zissman, M.A.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Abstract :
Describes two enhancements to our language identification system. Composite background (CBG) modeling allows us to identify target language speech in an environment where labeled background training data is unavailable or limited. Instead of separate models for each of the background languages, a single composite background model is created from all the non-target training speech. Generally, the CBG system performed about as well as a baseline system containing a separate model per background language. The average equal error rate for 12 CBG tests was 13.6% versus 13.4% for the baseline. We have also developed and tested a standardized confidence scoring function based on a single-layer perceptron which has proven to be capable of robust modeling of score distributions
Keywords :
Gaussian distribution; natural languages; perceptrons; speech recognition; background language; composite background models; language identification systems; robust modeling; score distributions; score standardization; single-layer perceptron; speech recognition; standardized confidence scoring function; Electronic mail; Error analysis; Iron; Laboratories; Natural languages; Robustness; Speech recognition; Standardization; Testing; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940884