Title :
Phonotactic spoken language recognition: Using diversely adapted acoustic models in parallel phone recognizers
Author :
Cheung-Chi Leung ; Bin Ma ; Haizhou Li
Author_Institution :
Inst. for Infocomm Res., A*STAR, Singapore, Singapore
Abstract :
In phonotactic spoken language recognition systems, acoustic model adaptation prior to phone lattice decoding has been adopted to deal with the mismatch between training and test conditions. Moreover, combining diversified phonotactic features is commonly used. These motivate us to have an in-depth investigation of combining diversified phonotactic features from diversely adapted acoustic models. Our experiment shows that our approach achieves an equal error rate (EER) of 1.94% in the 30-second closed-set trials of the 2007 NIST Language Recognition Evaluation (LRE). It represents a 14.9% relative improvement in EER over a sophisticated system, in which parallel phone recognizers, speaker adaptive training (SAT) in acoustic models and CMLLR adaptation are used. Moreover, it is shown that our approach provides consistent and substantial improvements in three different phonotactic systems, in each of which a single phone recognizer is used.
Keywords :
acoustic signal processing; decoding; speech coding; speech recognition; CMLLR adaptation; EER; LRE; NIST language recognition evaluation; acoustic model adaptation; diversely adapted acoustic model; diversified phonotactic feature; equal error rate; parallel phone recognizer; phone lattice decoding; phonotactic spoken language recognition system; speaker adaptive training; Acoustics; Adaptation models; Hidden Markov models; Lattices; Speech; Speech recognition; Training; MLLR adaptation; phone lattice; phone recognizer; spoken language recognition;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
DOI :
10.1109/ISCSLP.2012.6423509