Title :
RESONATE: Reverberation environment simulation for improved classification of speech models
Author :
Dickerson, Robert F. ; Hoque, Endadul ; Asare, Philip ; Nirjon, Shahriar ; Stankovic, John A.
Author_Institution :
Coll. of William & Mary, Williamsburg, VA, USA
Abstract :
Home monitoring systems currently gather information about people's activities of daily living and about emergencies, but they lack the ability to track speech. Practical speech analysis solutions are needed to help monitor ongoing conditions such as depression, because the amount of social interaction and vocal affect are important for assessing mood and well-being. Although existing solutions classify the identity and the mood of a speaker, they perform poorly when the acoustic signals are captured in reverberant environments. In this paper, we present a practical reverberation compensation method called RESONATE, which uses simulated room impulse responses to adapt a training corpus for use in multiple real reverberant rooms. We demonstrate that the system creates robust classifiers that perform within 5-10% of the baseline accuracy of non-reverberant environments. We evaluate the performance of this matched condition strategy using a public dataset, in controlled experiments with six rooms, and in two long-term, uncontrolled real deployments. We offer a practical implementation that performs collection, feature extraction, and classification on-node, and performs training and simulation of training sets on a base station or cloud service.
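The matched condition idea in the abstract (adapting a clean training corpus with simulated room impulse responses) can be illustrated with a minimal sketch. The abstract does not say how RESONATE simulates its impulse responses, so this example assumes a crude stand-in: Gaussian noise with an exponential envelope that decays by 60 dB over a chosen RT60. The function names `synthetic_rir`, `convolve`, and `augment_corpus`, the sample rate, and the RT60 values are all hypothetical and are not taken from the paper.

```python
import math
import random


def synthetic_rir(rt60_s, sample_rate, seed=0):
    """Assumed stand-in for a simulated room impulse response (RIR).

    Gaussian noise shaped by an exponential envelope that falls by
    60 dB after rt60_s seconds. Not the simulator used in the paper.
    """
    rng = random.Random(seed)
    n = max(1, round(rt60_s * sample_rate))
    # A 60 dB amplitude drop is a factor of 10 ** (-60 / 20) = 1e-3.
    decay = math.log(1000.0) / rt60_s
    rir = [rng.gauss(0.0, 1.0) * math.exp(-decay * i / sample_rate)
           for i in range(n)]
    rir[0] = 1.0  # direct-path impulse
    return rir


def convolve(signal, rir):
    """Direct-form linear convolution.

    The output has len(signal) + len(rir) - 1 samples.
    """
    out = [0.0] * (len(signal) + len(rir) - 1)
    for i, s in enumerate(signal):
        if s == 0.0:
            continue  # zero samples contribute nothing
        for j, h in enumerate(rir):
            out[i + j] += s * h
    return out


def augment_corpus(clean_utterances, rt60_values, sample_rate):
    """Return (rt60, reverberant_signal) pairs for every room/utterance pair."""
    augmented = []
    for rt60 in rt60_values:
        rir = synthetic_rir(rt60, sample_rate)
        for utterance in clean_utterances:
            augmented.append((rt60, convolve(utterance, rir)))
    return augmented


# Tiny usage example: one 10 ms tone at 8 kHz and two hypothetical rooms.
SAMPLE_RATE = 8000
tone = [math.sin(2 * math.pi * 440 * t / SAMPLE_RATE) for t in range(80)]
corpus = augment_corpus([tone], rt60_values=[0.01, 0.02],
                        sample_rate=SAMPLE_RATE)
```

In a matched condition setup, features would then be extracted from each reverberant copy and a classifier trained per simulated room, so that the training conditions resemble the room in which the node is deployed. For real corpora, an FFT-based convolution would be far faster than the direct-form loop shown here.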
Keywords :
feature extraction; reverberation; speech synthesis; RESONATE; acoustic signals; base station; classification on-node; cloud service; home monitoring systems; matched condition strategy; public dataset; reverberant rooms; reverberation compensation method; reverberation environment simulation; robust classifiers; social interaction; speech analysis solutions; speech model classification; training sets simulation; vocal affect; Accuracy; Microphones; Mood; Speech; Training; Reverberation Compensation; Speaker Identification
Conference_Title :
IPSN-14 Proceedings of the 13th International Symposium on Information Processing in Sensor Networks
Conference_Location :
Berlin
Print_ISBN :
978-1-4799-3146-0
DOI :
10.1109/IPSN.2014.6846745