DocumentCode
2173251
Title
Language informed bandwidth expansion
Author
Han, Jinyu ; Mysore, Gautham J. ; Pardo, Bryan
fYear
2012
fDate
23-26 Sept. 2012
Firstpage
1
Lastpage
6
Abstract
High-level knowledge of language helps the human auditory system understand speech with missing information such as missing frequency bands. The automatic speech recognition community has shown that the use of this knowledge in the form of language models is crucial to obtaining high quality recognition results. In this paper, we apply this idea to the bandwidth expansion problem to automatically estimate missing frequency bands of speech. Specifically, we use language models to constrain the recently proposed non-negative hidden Markov model for this application. We compare the proposed method to a bandwidth expansion algorithm based on non-negative spectrogram factorization and show improved results on two standard signal quality metrics.
Keywords
hidden Markov models; natural language processing; speech recognition; automatic speech recognition; human auditory system; language informed bandwidth expansion; missing frequency band estimation; nonnegative hidden Markov model; nonnegative spectrogram factorization; speech understanding; Dictionaries; Hidden Markov models; Narrowband; Spectrogram; Speech; Wideband; Bandwidth Expansion; Language Model; Non-negative Hidden Markov Model;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning for Signal Processing (MLSP), 2012 IEEE International Workshop on
Conference_Location
Santander
ISSN
1551-2541
Print_ISBN
978-1-4673-1024-6
Electronic_ISBN
1551-2541
Type
conf
DOI
10.1109/MLSP.2012.6349783
Filename
6349783
Link To Document