Title :
Telephone speech recognition using simulated data from clean database
Author :
Zuo, Guoyu ; Liu, Wenju ; Ruan, Xiaogang
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
Abstract :
Speech recognition over telephone lines forms an integral part of many applications of large vocabulary continuous speech recognition (LVCSR). This paper describes a system, implemented entirely in software, that produces simulated telephone data from clean databases. The filters adopted in this system are designed to simulate the frequency properties of the analogue transmission equipment in a telephone connection. A speech recognizer was trained on speech data taken from a clean corpus and passed through the simulator. Recognition performance is evaluated on a real telephone speech set and on several test sets simulated from the clean database. The experiments verify the effectiveness and feasibility of software simulation from a recognition-testing point of view, and the results show that simulated data derived from a clean corpus can achieve the same recognition performance as real telephone speech.
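The idea of filtering clean speech so that it resembles telephone-channel speech can be sketched as follows. This is a minimal illustration and not the filter design used in the paper, which the abstract does not specify. It assumes a conventional narrowband telephone passband of roughly 300 to 3400 Hz, a 16 kHz clean recording, and a cascade of two second-order Butterworth sections (high-pass, then low-pass). The function names `biquad_coeffs`, `apply_biquad` and `simulate_telephone_band`, and all parameter values, are hypothetical.

```python
import math


def biquad_coeffs(kind, cutoff_hz, sample_rate_hz, q=1 / math.sqrt(2)):
    """Return normalized (b0, b1, b2, a1, a2) for a second-order section.

    Uses the standard bilinear-transform biquad formulas; q = 1/sqrt(2)
    gives a Butterworth response. `kind` is "lowpass" or "highpass".
    """
    w0 = 2 * math.pi * cutoff_hz / sample_rate_hz
    cos_w0 = math.cos(w0)
    alpha = math.sin(w0) / (2 * q)
    if kind == "lowpass":
        b0, b1, b2 = (1 - cos_w0) / 2, 1 - cos_w0, (1 - cos_w0) / 2
    elif kind == "highpass":
        b0, b1, b2 = (1 + cos_w0) / 2, -(1 + cos_w0), (1 + cos_w0) / 2
    else:
        raise ValueError("kind must be 'lowpass' or 'highpass'")
    a0, a1, a2 = 1 + alpha, -2 * cos_w0, 1 - alpha
    return (b0 / a0, b1 / a0, b2 / a0, a1 / a0, a2 / a0)


def apply_biquad(samples, coeffs):
    """Filter a sequence of floats with one biquad (direct form I)."""
    b0, b1, b2, a1, a2 = coeffs
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x in samples:
        y = b0 * x + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2
        out.append(y)
        x2, x1 = x1, x
        y2, y1 = y1, y
    return out


def simulate_telephone_band(samples, sample_rate_hz=16000,
                            low_hz=300.0, high_hz=3400.0):
    """Band-limit clean speech to an assumed telephone passband."""
    highpass = biquad_coeffs("highpass", low_hz, sample_rate_hz)
    lowpass = biquad_coeffs("lowpass", high_hz, sample_rate_hz)
    return apply_biquad(apply_biquad(samples, highpass), lowpass)


def tone(freq_hz, sample_rate_hz, n_samples):
    """Generate a unit-amplitude sine wave for quick checks."""
    return [math.sin(2 * math.pi * freq_hz * n / sample_rate_hz)
            for n in range(n_samples)]


def rms(samples):
    """Root-mean-square level of a sequence."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


if __name__ == "__main__":
    sr = 16000
    for f in (50, 1000, 7000):
        x = tone(f, sr, 8000)
        y = simulate_telephone_band(x, sr)
        # Skip the first half so the filter transient has died out.
        print(f, "Hz gain:", round(rms(y[4000:]) / rms(x[4000:]), 3))
```

A fuller simulation would probably also resample to 8 kHz and apply a telephony codec, but those steps are omitted here because the abstract mentions only filtering.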
Keywords :
adaptive filters; computational linguistics; natural languages; speech processing; speech recognition; telephone sets; analogue transmission equipments; clean databases; large vocabulary continuous speech recognition; simulated telephone data; software simulation; speech data extraction; Application software; Databases; Filters; Frequency; Performance evaluation; Software systems; Speech recognition; Telephony; Testing; Vocabulary;
Conference_Title :
Robotics, Intelligent Systems and Signal Processing, 2003. Proceedings. 2003 IEEE International Conference on
Print_ISBN :
0-7803-7925-X
DOI :
10.1109/RISSP.2003.1285547