DocumentCode :
2504663
Title :
Deep Belief Networks for Real-Time Extraction of Tongue Contours from Ultrasound During Speech
Author :
Fasel, Ian ; Berry, Jeff
Author_Institution :
Univ. of Arizona, Tucson, AZ, USA
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
1493
Lastpage :
1496
Abstract :
Ultrasound has become a useful tool for speech scientists studying mechanisms of language sound production. State-of-the-art methods for extracting tongue contours from ultrasound images of the mouth, typically based on active contour snakes, require considerable manual interaction by an expert linguist. In this paper we describe a novel method for fully automatic extraction of tongue contours based on a hierarchy of restricted Boltzmann machines (RBMs), i.e. deep belief networks (DBNs). Usually, DBNs are first trained generatively on sensor data, then discriminatively to predict human-provided labels of the data. In this paper we introduce the translational RBM (tRBM), which allows the DBN to make use of both human labels and raw sensor data at all stages of learning. This method yields performance in contour extraction comparable to human labelers, without any temporal smoothing or human intervention, and runs in real-time.
Keywords :
Boltzmann machines; belief networks; feature extraction; linguistics; ultrasonic imaging; contour extraction; deep belief networks; language sound production; real time extraction; restricted Boltzmann machines; speech scientists; tongue contours; translational RBM; ultrasound images; Decoding; Humans; Image reconstruction; Speech; Tongue; Training; Ultrasonic imaging; Computer aided detection and diagnosis; Pattern recognition systems and applications; Signal/image representation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.369
Filename :
5597284
Link To Document :
بازگشت