DocumentCode
2151699
Title
Dynamics of tongue gestures extracted automatically from ultrasound
Author
Berry, Jeff ; Fasel, Ian
Author_Institution
Univ. of Arizona, Tucson, AZ, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
557
Lastpage
560
Abstract
We describe a system for automatically extracting the dynamics of tongue gestures from ultrasound images of the tongue using translational deep belief networks (tDBNs). In tDBNs, a joint model of the input and output vectors is learned during a generative pretraining stage, and a translation step is then used to transform input-only vectors into this joint representation. A final fine-tuning stage is used to reconstruct the desired outputs given input vectors. We show that this technique dramatically improves performance on segmenting ultrasound image sequences of continuous speech into individual consonant gestures compared with the original DBN method as well as alternative methods using PCA and support vector machines.
Keywords
belief networks; gesture recognition; image sequences; medical image processing; principal component analysis; support vector machines; DBN method; PCA; support vector machine; tDBN; tongue gesture extraction; translational deep belief network; ultrasound image sequence; Feature extraction; Shape; Speech; Support vector machines; Tongue; Training; Ultrasonic imaging; Deep Belief Networks; Ultrasound
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5946464
Filename
5946464