• DocumentCode
    2596170
  • Title

    Live assessment of beat tracking for robot audition

  • Author

    Oliveira, João Lobato ; Ince, Gökhan ; Nakamura, Keisuke ; Nakadai, Kazuhiro ; Okuno, Hiroshi G. ; Reis, Luis Paulo ; Gouyon, Fabien

  • fYear
    2012
  • fDate
    7-12 Oct. 2012
  • Firstpage
    992
  • Lastpage
    997
  • Abstract
    In this paper we propose the integration of an online audio beat tracking system into the general framework of robot audition, to enable its application in musically-interactive robotic scenarios. To this purpose, we introduced a staterecovery mechanism into our beat tracking algorithm, for handling continuous musical stimuli, and applied different multi-channel preprocessing algorithms (e.g., beamforming, ego noise suppression) to enhance noisy auditory signals lively captured in a real environment. We assessed and compared the robustness of our audio beat tracker through a set of experimental setups, under different live acoustic conditions of incremental complexity. These included the presence of continuous musical stimuli, built of a set of concatenated musical pieces; the presence of noises of different natures (e.g., robot motion, speech); and the simultaneous processing of different audio sources on-the-fly, for music and speech. We successfully tackled all these challenging acoustic conditions and improved the beat tracking accuracy and reaction time to music transitions while simultaneously achieving robust automatic speech recognition.
  • Keywords
    audio signal processing; hearing; interactive systems; robots; speech recognition; automatic speech recognition; live assessment; multichannel preprocessing; musically-interactive robotic scenarios; online audio beat tracking system; robot audition; Accuracy; Acoustics; Noise; Robots; Speech; Speech processing; Tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems (IROS), 2012 IEEE/RSJ International Conference on
  • Conference_Location
    Vilamoura
  • ISSN
    2153-0858
  • Print_ISBN
    978-1-4673-1737-5
  • Type

    conf

  • DOI
    10.1109/IROS.2012.6386100
  • Filename
    6386100