Title :
Fast segment search for corpus-based speech enhancement based on speech recognition technology
Author :
Ogawa, Anna ; Kinoshita, Keizo ; Hori, Toshikazu ; Nakatani, Takeshi ; Nakamura, A.
Abstract :
Corpus-based speech enhancement has received increasing attention recently since it shows high enhancement performance in highly non-stationary noisy environments by precisely modeling the long-term temporal dynamics of speech. However, it has a disadvantage in that the cost is very high for searching the longest matching clean speech segments from a multi-condition parallel speech corpus. This paper proposes a fast segment search method for corpus-based speech enhancement. It is mainly based on two techniques derived from speech recognition technology. The first is an A* search like segment evaluation function for accurately finding the longest matching segments. The second is a tree and linear connected search space for efficiently sharing the segment likelihood calculations. In the experiments for non-stationary noisy observations using the 26 multi-condition TIMIT parallel speech corpus, the proposed search method found the segments almost in real-time without degrading the quality of the enhanced speech. Our method was about 7 to 13 times faster than the conventional segment search method.
Keywords :
search problems; speech enhancement; speech recognition; A* search like segment evaluation function; TIMIT parallel speech corpus; corpus-based speech enhancement; fast segment search; multicondition parallel speech corpus; speech recognition technology; speech segments; speech temporal dynamics; Noise measurement; Search methods; Signal to noise ratio; Speech; Speech enhancement; Speech recognition; Corpus-based speech enhancement; fast segment search; longest matching segments; speech recognition technology;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6853859