• DocumentCode
    2151699
  • Title
    Dynamics of tongue gestures extracted automatically from ultrasound
  • Author
    Berry, Jeff; Fasel, Ian
  • Author_Institution
    Univ. of Arizona, Tucson, AZ, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    557
  • Lastpage
    560
  • Abstract
    We describe a system for automatically extracting the dynamics of tongue gestures from ultrasound images of the tongue using translational deep belief networks (tDBNs). In tDBNs, a joint model of the input and output vectors is learned during a generative pretraining stage, and a translation step is then used to transform input-only vectors into this joint representation. A final fine-tuning stage is then used to reconstruct the desired outputs given input vectors. We show that this technique dramatically improves performance on segmenting ultrasound image sequences of continuous speech into individual consonant gestures compared with the original DBN method as well as alternative methods using PCA and support vector machines.
  • Keywords
    belief networks; gesture recognition; image sequences; medical image processing; principal component analysis; support vector machines; DBN method; PCA; support vector machine; tDBN; tongue gesture extraction; translational deep belief network; ultrasound image sequence; Feature extraction; Shape; Speech; Support vector machines; Tongue; Training; Ultrasonic imaging; Deep Belief Networks; Ultrasound;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2011.5946464
  • Filename
    5946464