DocumentCode :
1799627
Title :
Developing “voice care”: Real-time methods for event recognition and localization based on acoustic cues
Author :
Yi-Wen Liu ; Hang-Ming Liang ; Shung-You Lao ; Chen-Wei Wu ; Hung-Kuang Hao ; Fan-Jie Kung ; Yu-Tse Ho ; Pei-Yi Lee ; Shih-Chung Kang
Author_Institution :
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu, Taiwan
fYear :
2014
fDate :
14-18 July 2014
Firstpage :
1
Lastpage :
6
Abstract :
This paper presents methods for sound recognition in a living space and ways to track the location of the sound sources. Algorithms were developed so sound recognition and localization can both be performed in real time. The sound recognition method is based on Gaussian mixture modeling with outlier rejection. The sound source localization method is based on multiple signal classification (MUSIC) and it borrows the idea of particle filtering to confine the estimation error. Estimates of the sound source location can be successively refined by Kalman filtering. The recognition method was tested with real recordings and achieved > 90% of accuracy in distinguishing 8 classes of sounds while keeping both the false-acceptance and the false-rejection rates below 20%. The localization method was tested in real time and demonstrated the capabilities to track a sound source moving at about 0.3 m/s. These results indicate that the methods, when integrated, can be deployed to the home for acoustic event detection purposes.
Keywords :
Gaussian processes; Kalman filters; geriatrics; mixture models; particle filtering (numerical methods); signal classification; speech recognition; Gaussian mixture modeling; Kalman filtering; MUSIC; Voice Care; acoustic cues; acoustic event detection purposes; event localization; event recognition; multiple signal classification; outlier rejection; particle filtering; sound recognition method; sound source localization method; sound source tracking; Kalman filters; Microphones; Monitoring; Multiple signal classification; Real-time systems; Speech recognition; Gaussianmixture models; Kalman filtering; Voice activity detection; particle methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo Workshops (ICMEW), 2014 IEEE International Conference on
Conference_Location :
Chengdu
ISSN :
1945-7871
Type :
conf
DOI :
10.1109/ICMEW.2014.6890676
Filename :
6890676
Link To Document :
بازگشت