Title :
Towards video realistic synthetic visual speech
Author :
Theobald, Barry J. ; Bangham, J. Andrew ; Matthews, Iain A. ; Cawley, Gavin C.
Author_Institution :
School of Information Systems, University of East Anglia, Norwich, NR4 7TJ, UK
Abstract :
In this paper we present initial work towards a video-realistic visual speech synthesiser based on statistical models of shape and appearance. A synthesised image sequence corresponding to an utterance is formed by concatenation of synthesis units (in this case phonemes) from a pre-recorded corpus of training data. A smoothing spline is applied to the concatenated parameters to ensure smooth transitions between frames and the resultant parameters applied to the model—early results look promising.
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.2002.5745507