Title :
A maximum a Posterior-based reconstruction approach to speech bandwidth expansion in noise
Author :
Hyunson Seo ; Hong-Goo Kang ; Soong, Frank
Author_Institution :
Dept. of E.E., Yonsei Univ., Seoul, South Korea
Abstract :
We propose a novel bandwidth expansion algorithm that extends a narrowband speech signal to wideband by exploiting segment examples pre-stored in a speaker-independent database. Both narrowband and wideband representations of the speech signals are pre-stored in the corpus and are dynamically chopped into variable-length segments. Narrowband segments are selected dynamically to explain a given narrowband input sentence, while the wideband-expanded version of the input sentence is constructed from the corresponding wideband segments. Under the chosen maximum a posteriori (MAP) criterion, the matching process in the narrowband domain favors longer segment patches. As a result, the number of competing choices in the matching process is significantly reduced during decoding. The approach is further generalized to handle noise-corrupted narrowband input signals by incorporating the well-known Vector Taylor Series (VTS) noise adaptation algorithm into the matching and bandwidth expansion process. A series of experiments validates the approach on both clean and noise-corrupted narrowband speech, with samples corrupted by car noise and by babble noise.
Keywords :
maximum likelihood estimation; signal reconstruction; signal representation; speech processing; MAP criterion; VTS noise adaptation algorithm; babble noise; car noise; clean narrowband speech; matching process; maximum a posterior-based reconstruction approach; narrowband input sentence; narrowband input signals; narrowband representation; narrowband segments; narrowband speech signal; noise corrupted narrowband speech; speaker independent database; speech bandwidth expansion algorithm; speech signal representation; vector taylor series; wideband representation; Hidden Markov models; Narrowband; Noise; Speech; Vectors; Wideband; corpus-model; maximum a posterior; noise reduction; speech bandwidth expansion; vector Taylor series;
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854773