Title :
Speech activity detection and face orientation estimation using multiple microphone arrays and human position information
Author :
Carlos T. Ishi;Jani Even;Norihiro Hagita
Author_Institution :
Intelligent Robotics and Communication Labs, ATR, Japan
fDate :
9/1/2015 12:00:00 AM
Abstract :
We developed a system for detecting the speech activity intervals of multiple speakers by combining multiple microphone arrays and human tracking technologies. We also proposed a method for estimating the face orientation of the detected speakers. The developed system was evaluated in two steps: individual utterances in different positions and orientations; and simultaneous dialogues by multiple speakers. Evaluation results revealed that the proposed system could detect speech activity intervals with more than 90% of accuracy, and face orientations with standard deviations within 30 degrees, in situations excluding the cases where all arrays are in the opposite direction to the speaker´s face orientation.
Keywords :
"Estimation","Microphone arrays","Face","Speech","Three-dimensional displays","Arrays"
Conference_Titel :
Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on
DOI :
10.1109/IROS.2015.7354167