• DocumentCode
    178000
  • Title

    Fast segment search for corpus-based speech enhancement based on speech recognition technology

  • Author

    Ogawa, Anna ; Kinoshita, Keizo ; Hori, Toshikazu ; Nakatani, Takeshi ; Nakamura, A.

  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    1557
  • Lastpage
    1561
  • Abstract
    Corpus-based speech enhancement has received increasing attention recently since it shows high enhancement performance in highly non-stationary noisy environments by precisely modeling the long-term temporal dynamics of speech. However, it has a disadvantage in that the cost is very high for searching the longest matching clean speech segments from a multi-condition parallel speech corpus. This paper proposes a fast segment search method for corpus-based speech enhancement. It is mainly based on two techniques derived from speech recognition technology. The first is an A* search like segment evaluation function for accurately finding the longest matching segments. The second is a tree and linear connected search space for efficiently sharing the segment likelihood calculations. In the experiments for non-stationary noisy observations using the 26 multi-condition TIMIT parallel speech corpus, the proposed search method found the segments almost in real-time without degrading the quality of the enhanced speech. Our method was about 7 to 13 times faster than the conventional segment search method.
  • Keywords
    search problems; speech enhancement; speech recognition; A* search like segment evaluation function; TIMIT parallel speech corpus; corpus-based speech enhancement; fast segment search; multicondition parallel speech corpus; speech recognition technology; speech segments; speech temporal dynamics; Noise measurement; Search methods; Signal to noise ratio; Speech; Speech enhancement; Speech recognition; Corpus-based speech enhancement; fast segment search; longest matching segments; speech recognition technology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853859
  • Filename
    6853859