DocumentCode :
128230
Title :
Automatic multi-speaker speech recognition system based on time-frequency blind source separation under ubiquitous environment
Author :
Zhe Wang ; Haijian Zhang ; Guoan Bi ; Xiumei Li
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2014
fDate :
9-11 June 2014
Firstpage :
101
Lastpage :
106
Abstract :
In this paper, an automatic speech recognition (ASR) system under ubiquitous environment is proposed, which is successfully implemented in a personalized voice command system under vehicle and living room environment. The proposed ASR system describes a novel scheme of separating speech sources from multi-speakers, detecting speech presence/absence by tracking the higher portion of speech power spectrum and adaptively suppressing noises. An automatic recognition algorithm to adapt with the multi-speaker task is designed and conducted. Evaluation tests are carried out using noise database NOISEX-92 and speech database YOHO Corpus. Experimental results show that the proposed algorithm manages to achieve very impressive improvements.
Keywords :
blind source separation; speech recognition; time-frequency analysis; ASR system; automatic recognition algorithm; automatic speech recognition system; living room environment; multispeakers; noise database NOISEX-92; personalized voice command system; speech database YOHO Corpus; speech power spectrum; speech presence-absence detection; time-frequency blind source separation; ubiquitous environment; vehicle environment; Algorithm design and analysis; Educational institutions; Equations; Noise; Noise measurement; Speech; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Industrial Electronics and Applications (ICIEA), 2014 IEEE 9th Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4799-4316-6
Type :
conf
DOI :
10.1109/ICIEA.2014.6931139
Filename :
6931139
Link To Document :
بازگشت