Title :
Score fusion and calibration in multiple language detectors with large performance variation
Author :
Ng, Raymond W M ; Leung, Cheung-Chi ; Lee, Tan ; Ma, Bin ; Li, Haizhou
Author_Institution :
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
Abstract :
In a large-scale language detection task, performance variation found between different component systems and different target languages has an adverse effect to the pooled error statistics. Special care has to be taken in score fusion and calibration. In this paper, we use a prosodic LID system to fuse with a phonotactic LID system using NIST Language Recognition Evaluation 2009 experimental data. Among four logistic regression models, the one which gives the lowest Cavg is chosen. We further explore our previously proposed calibration algorithm based on the minimum erroneous deviation criterion. The algorithm is made more robust by removing the predetermined list of target languages to be calibrated, as well as by adding an optimization constraint which enforces calibration in the data portion with a large performance variation. The fusion and calibration operations together bring a 33.9% relative Cavg reduction compared with the original result from a phonotactic LID system.
Keywords :
calibration; speech recognition; LID system; NIST language recognition evaluation; calibration; language detection task; multiple language detectors; optimization constraint; phonotactic LID system; score fusion; Calibration; Indexes; Language recognition; calibration; erroneous deviation; fusion; performance variation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947330