Title :
Speech/gesture interface to a visual-computing environment
Author :
Sharma, Rajeev ; Zeller, Michael ; Pavlovic, Vladimir I. ; Huang, Thomas S. ; Lo, Zion ; Chu, Stephen ; Zhao, Yunxin ; Phillips, James C. ; Schulten, Klaus
Author_Institution :
Beckman Inst. for Adv. Sci. & Technol., Illinois Univ., Urbana, IL, USA
Abstract :
We developed a speech/gesture interface that uses visual hand-gesture analysis and speech recognition to control a 3D display in VMD, a virtual environment for structural biology. We chose a specific virtual-environment context to set the constraints needed to make our analysis robust and to develop a command language that optimally combines speech and gesture inputs. Our interface uses automatic speech recognition (ASR), aided by a microphone, to recognize voice commands; two strategically positioned cameras to detect hand gestures; and automatic gesture recognition (AGR), a set of computer vision techniques, to interpret those hand gestures. The computer vision algorithms can extract the user's hand from the background, detect different finger positions, and distinguish meaningful gestures from unintentional hand movements. Our main goal was to simplify model manipulation and rendering to make biomolecular modeling more playful. Researchers can explore variations of their model and concentrate on the biomolecular aspects of their task without undue distraction by computational aspects. They can view simulations of molecular dynamics, play with different combinations of molecular structures, and better understand the molecules' important properties. One potential benefit might be reducing the time needed to discover new compounds for new drugs.
Keywords :
biology computing; computer vision; gesture recognition; molecular configurations; speech recognition; speech-based user interfaces; virtual reality; 3D display control; VMD; automatic gesture recognition; automatic speech recognition; biomolecular aspects; biomolecular modeling; command language; computer vision algorithms; finger positions; gesture inputs; hand gestures; meaningful gestures; microphone; model manipulation; molecular dynamics; molecular structures; rendering; speech/gesture interface; strategically positioned cameras; structural biology; unintentional hand movements; virtual environment; visual computing environment; visual hand-gesture analysis; voice commands; Automatic control; Automatic speech recognition; Biological control systems; Command languages; Computer vision; Robustness; Speech analysis; Speech recognition; Three dimensional displays; Virtual environment
Journal_Title :
Computer Graphics and Applications, IEEE