DocumentCode :
178422
Title :
Fundamental frequency generation for whisper-to-audible speech conversion
Author :
Janke, Matthias ; Wand, Michael ; Heistermann, T. ; Schultz, Tanja ; Prahallad, K.
Author_Institution :
Cognitive Syst. Lab., Karlsruhe Inst. of Technol., Karlsruhe, Germany
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
2579
Lastpage :
2583
Abstract :
In this work, we address the issues involved in whisper-to-audible speech conversion. Spectral mapping techniques using Gaussian mixture models or Artificial Neural Networks borrowed from voice conversion have been applied to transform whisper spectral features to normally phonated audible speech. However, the modeling and generation of fundamental frequency (F0) and its contour in the converted speech is a major issue. Whispered speech does not contain explicit voicing characteristics and hence it is hard to derive a suitable F0, making it difficult to generate a natural prosody after conversion. Our work addresses the F0 modeling in whisper-to-speech conversion. We show that F0 contours can be derived from the mapped spectral vectors, which can be used for the synthesis of a speech signal. We also present a hybrid unit selection approach for whisper-to-speech conversion. Unit selection is performed on the spectral vectors, where F0 and its contour can be obtained as a byproduct without any additional modeling.
Keywords :
Gaussian processes; neural nets; speech processing; speech synthesis; vectors; F0 modeling; Gaussian mixture models; artificial neural networks; fundamental frequency generation; hybrid unit selection approach; mapped spectral vectors; natural prosody generation; normally phonated audible speech; spectral mapping techniques; speech signal. synthesis; voicing characteristics; whisper spectral feature transform; whisper-to-audible speech conversion; Acoustics; Artificial neural networks; Conferences; Speech; Speech processing; Speech recognition; Vectors; F0 generation; Silent speech interface; voice conversion; whisper-to-speech conversion;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854066
Filename :
6854066
Link To Document :
بازگشت