• DocumentCode
    2019565
  • Title

    Distant speech recognition for home automation: Preliminary experimental results in a smart home

  • Author

    Lecouteux, Benjamin ; Vacher, Michel ; Portet, François

  • Author_Institution
    Lab. d´´Inf. de Grenoble, UJF-Grenoble 1, Grenoble, France
  • fYear
    2011
  • fDate
    18-21 May 2011
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    This paper presents a study that is part of the Sweet-Home project which aims at developing a new home automation system based on voice command. The study focused on two tasks: distant speech recognition and sentence spotting (e.g., recognition of domotic orders). Regarding the first task, different combinations of ASR systems, language and acoustic models were tested. Fusion of ASR outputs by consensus and with a triggered language model (using a priori knowledge) were investigated. For the sentence spotting task, an algorithm based on distance evaluation between the current ASR hypotheses and the predefine set of keyword patterns was introduced in order to retrieve the correct sentences in spite of the ASR errors. The techniques were assessed on real daily living data collected in a 4-room smart home that was fully equipped with standard tactile commands and with 7 wireless microphones set in the ceiling. Thanks to Driven Decoding Algorithm techniques, a classical ASR system reached 7.9% WER against 35% WER in standard configuration and 15% with MLLR adaptation only. The best keyword pattern classification result obtained in distant speech conditions was 7.5% CER.
  • Keywords
    home automation; speech recognition; ASR; Sweet-Home project; distant speech recognition; driven decoding algorithm techniques; home automation system; keyword pattern classification; sentence spotting task; smart home; voice command; Acoustics; Adaptation model; Hidden Markov models; Microphones; Smart homes; Speech; Speech recognition; distant speech recognition; home automation; keyword detection; smart home; triggered language models;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech Technology and Human-Computer Dialogue (SpeD), 2011 6th Conference on
  • Conference_Location
    Brasov
  • Print_ISBN
    978-1-4577-0440-6
  • Type

    conf

  • DOI
    10.1109/SPED.2011.5940728
  • Filename
    5940728