• DocumentCode
    2680875
  • Title

    Missing-feature-theory-based robust simultaneous speech recognition system with non-clean speech acoustic model

  • Author

    Takahashi, Toru ; Nakadai, Kazuhiro ; Komatani, Kazunori ; Ogata, Tetsuya ; Okuno, Hiroshi G.

  • Author_Institution
    Dept. of Intell. & Sci. & Technol. Grad. Sch. of Inf., Kyoto Univ., Kyoto, Japan
  • fYear
    2009
  • fDate
    10-15 Oct. 2009
  • Firstpage
    2730
  • Lastpage
    2735
  • Abstract
    A humanoid robot must recognize a target speech signal while people around the robot chat with them in real-world. To recognize the target speech signal, robot has to separate the target speech signal among other speech signals and recognize the separated speech signal. As separated signal includes distortion, automatic speech recognition (ASR) performance degrades. To avoid the degradation, we trained an acoustic model from non-clean speech signals to adapt acoustic feature of distorted signal and adding white noise to separated speech signal before extracting acoustic feature. The issues are (1) To determine optimal noise level to add the training speech signals, and (2) To determine optimal noise level to add the separated signal. In this paper, we investigate how much noises should be added to clean speech data for training and how speech recognition performance improves for different positions of three talkers with soft masking. Experimental results show that the best performance is obtained by adding white noises of 30 dB. The ASR with the acoustic model outperforms with ASR with the clean acoustic model by 4 points.
  • Keywords
    humanoid robots; speech recognition; automatic speech recognition; humanoid robot; missing-feature-theory; nonclean speech acoustic model; robust simultaneous speech recognition system; target speech signal; Acoustic distortion; Automatic speech recognition; Degradation; Humanoid robots; Robotics and automation; Robustness; Speech enhancement; Speech recognition; Target recognition; White noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems, 2009. IROS 2009. IEEE/RSJ International Conference on
  • Conference_Location
    St. Louis, MO
  • Print_ISBN
    978-1-4244-3803-7
  • Electronic_ISBN
    978-1-4244-3804-4
  • Type

    conf

  • DOI
    10.1109/IROS.2009.5354201
  • Filename
    5354201