• DocumentCode
    1686110
  • Title

    Voice activity detection using a sliding-window, maximum margin clustering approach

  • Author

    De Leon, Phillip ; Sanchez, Santiago

  • Author_Institution
    Klipsch Sch. of Electr. & Comput. Eng., New Mexico State Univ., Las Cruces, NM, USA
  • fYear
    2013
  • Firstpage
    6674
  • Lastpage
    6678
  • Abstract
    Recently, an unsupervised, data clustering algorithm based on maximum margin, i.e. support vector machine (SVM) was reported. The maximum margin clustering (MMC) algorithm was later applied to the problem of voice activity detection, however, the application did not allow for real-time detection which is important in speech processing applications. In this paper, we propose a voice activity detector (VAD) based on a sliding window, MMC algorithm which allows for real-time detection. Our system requires a separate initialization stage which imposes an initial detection delay, however, once initialized the system can operate in real-time. Using TIMIT speech under several NOISEX-92 noise backgrounds at various SNRs, we show that our average speech and non-speech hit rates are better than state-of-the-art VADs.
  • Keywords
    pattern clustering; speech recognition; support vector machines; MMC algorithm; NOISEX-92 noise; SNR; SVM; TIMIT speech; VAD; data clustering algorithm; maximum margin clustering approach; real-time detection; sliding-window; speech hit rates; speech processing; support vector machine; voice activity detection; Detectors; Feature extraction; Noise; Real-time systems; Speech; Support vector machines; Vectors; Speech analysis; classification algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6638953
  • Filename
    6638953