DocumentCode
2799674
Title
Cepstral mean based speech source discrimination
Author
Greenhall, Adam ; Atlas, Les
Author_Institution
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
fYear
2010
fDate
14-19 March 2010
Firstpage
4490
Lastpage
4493
Abstract
This paper presents and compares methods for discrimination between speech from a broadcast audio device - like a television, radio, or GPS receiver - and live speech in the same acoustic environment. A solution to this discrimination problem has direct application wherever the audio from such a device interferes with voice recognition, verification, or transcription tasks. The methods and theory applied also have potential applications in multimedia and speaker segmentation, as well as in speaker verification. This paper presents a new use of the cepstral mean as an estimator of the linear time-invariant response of a “speaker” - either broadcast or live - over a relatively long time window. The problem is framed in terms of traditional speaker verification, but with two classes of speakers. This method is tested on five different data sets and the results compared for different feature sets, training methods, and window lengths.
Keywords
cepstral analysis; speaker recognition; broadcast audio device; cepstral mean based speech source discrimination; linear time-invariant response; speaker segmentation; speaker verification; speech detection; voice recognition task; voice transcription tasks; voice verification task; Acoustic devices; Cepstral analysis; Global Positioning System; Loudspeakers; Multimedia communication; Radio broadcasting; Speech recognition; TV broadcasting; TV receivers; Testing; Speech detection; cepstral mean; rich transcription; segmentation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5495600
Filename
5495600
Link To Document