• DocumentCode
    2970813
  • Title

    Extended Minimum Classification Error Training in Voice Activity Detection

  • Author

    Arakawa, Takayuki ; Al-Hassanieh, Haitham ; Tsujikawa, Masanori ; Isotani, Ryosuke

  • Author_Institution
    Common Platform Software Res. Labs., NEC Corp., Kawasaki, Japan
  • fYear
    2009
  • fDate
    Nov. 13 2009-Dec. 17 2009
  • Firstpage
    232
  • Lastpage
    236
  • Abstract
    Voice activity detection (VAD) is a fundamental part of speech processing. Combination of multiple acoustic features is an effective approach to make VAD more robust against various noise conditions. There have been proposed several feature combination methods, in which weights for feature values are optimized based on minimum classification error (MCE) training. We improve these MCE-based methods by introducing a novel discriminative function for whole frames. The proposed method optimizes combination weights taking into account the ratio between false acceptance and false rejection rates as well as the effect of the use of shaping procedures such as hangover.
  • Keywords
    pattern classification; signal detection; speech processing; speech recognition; MCE-based methods; extended minimum classification error training; false acceptance rate; false rejection rates; speech processing; speech recognition; voice activity detection; Acoustic signal detection; Communications technology; Computer errors; Laboratories; National electric code; Noise robustness; Optimization methods; Speech processing; Training data; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
  • Conference_Location
    Merano
  • Print_ISBN
    978-1-4244-5478-5
  • Electronic_ISBN
    978-1-4244-5479-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2009.5373251
  • Filename
    5373251