• DocumentCode
    3709868
  • Title
    Speech activity detection and face orientation estimation using multiple microphone arrays and human position information
  • Author
    Carlos T. Ishi;Jani Even;Norihiro Hagita
  • Author_Institution
    Intelligent Robotics and Communication Labs, ATR, Japan
  • fYear
    2015
  • fDate
    9/1/2015 12:00:00 AM
  • Firstpage
    5574
  • Lastpage
    5579
  • Abstract
    We developed a system for detecting the speech activity intervals of multiple speakers by combining multiple microphone arrays and human tracking technologies. We also proposed a method for estimating the face orientation of the detected speakers. The developed system was evaluated in two steps: individual utterances in different positions and orientations, and simultaneous dialogues by multiple speakers. Evaluation results revealed that the proposed system could detect speech activity intervals with more than 90% accuracy, and face orientations with standard deviations within 30 degrees, except in cases where all arrays are in the opposite direction to the speaker's face orientation.
  • Keywords
    "Estimation","Microphone arrays","Face","Speech","Three-dimensional displays","Arrays"
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on
  • Type
    conf
  • DOI
    10.1109/IROS.2015.7354167
  • Filename
    7354167