Title :
Overlapped-speech detection with applications to driver assessment for in-vehicle active safety systems
Author :
Shokouhi, Navid ; Sathyanarayana, Aarti ; Sadjadi, Seyed Omid ; Hansen, John H. L.
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
In this study we propose a system for overlapped-speech detection. Spectral harmonicity and envelope features are extracted to represent overlapped and single-speaker speech using Gaussian mixture models (GMM). The system is shown to effectively discriminate the single and overlapped speech classes. We further increase the discrimination by proposing a phoneme selection scheme to generate more reliable artificial overlapped data for model training. Evaluations on artificially generated co-channel data show that the novelty in feature selection and phoneme omission results in a relative improvement of 10% in the detection accuracy compared to baseline. As an example application, we evaluate the effectiveness of overlapped-speech detection for vehicular environments and its potential in assessing driver alertness. Results indicate a good correlation between driver performance and the amount and location of overlapped-speech segments.
Keywords :
feature extraction; safety systems; speaker recognition; GMM; Gaussian mixture model; cochannel data; driver assessment; envelope feature extraction; feature selection; in-vehicle active safety system; overlapped-speech detection; phoneme selection scheme; single-speaker speech; spectral harmonicity; Correlation; Feature extraction; Mel frequency cepstral coefficient; Performance evaluation; Speech; Training; Vehicles; Active safety; co-channel speech; overlapped speech detection;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6638174