Title of article
Unsupervised learning of phonemes of whispered speech in a noisy environment based on convolutive non-negative matrix factorization
Author/Authors
Jian Zhou، نويسنده , , Ruiyu Liang، نويسنده , , Li Zhao، نويسنده , , Liang Tao، نويسنده , , Cairong Zou، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2014
Pages
12
From page
115
To page
126
Abstract
This paper focuses on the development of an algorithm that can be optimized for a specific acoustic environment to improve the intelligibility of whispered speech. A new convolutive non-negative matrix factorization (NMF) algorithm is proposed to extract phoneme bases from noisy whispered speech with the noise bases from prior learning; these noise bases are obtained from training using the conventional non-negative matrix factorization. The divergence function with a sparseness constraint term is selected as the objective function in the developed algorithm to obtain multiplicative update rules of the phoneme base matrix and the corresponding weight matrix. The weights of the noise bases from prior learning are also updated in the phoneme learning stage. Listening experiments were conducted to assess the intelligibility performance of speech synthesized using the proposed algorithm. The experimental results indicate that the proposed algorithm is very effective for improving the intelligibility of whispers in various noise contexts, and it outperforms conventional algorithms.
Keywords
Unsupervised phoneme base learning , Non-negative matrix factorization , Whisper intelligibility enhancement
Journal title
Information Sciences
Serial Year
2014
Journal title
Information Sciences
Record number
1215916
Link To Document