• DocumentCode
    2799674
  • Title

    Cepstral mean based speech source discrimination

  • Author

    Greenhall, Adam ; Atlas, Les

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4490
  • Lastpage
    4493
  • Abstract
    This paper presents and compares methods for discrimination between speech from a broadcast audio device - like a television, radio, or GPS receiver - and live speech in the same acoustic environment. A solution to this discrimination problem has direct application wherever the audio from such a device interferes with voice recognition, verification, or transcription tasks. The methods and theory applied also have potential applications in multimedia and speaker segmentation, as well as in speaker verification. This paper presents a new use of the cepstral mean as an estimator of the linear time-invariant response of a “speaker” - either broadcast or live - over a relatively long time window. The problem is framed in terms of traditional speaker verification, but with two classes of speakers. This method is tested on five different data sets and the results compared for different feature sets, training methods, and window lengths.
  • Keywords
    cepstral analysis; speaker recognition; broadcast audio device; cepstral mean based speech source discrimination; linear time-invariant response; speaker segmentation; speaker verification; speech detection; voice recognition task; voice transcription tasks; voice verification task; Acoustic devices; Cepstral analysis; Global Positioning System; Loudspeakers; Multimedia communication; Radio broadcasting; Speech recognition; TV broadcasting; TV receivers; Testing; Speech detection; cepstral mean; rich transcription; segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495600
  • Filename
    5495600