• DocumentCode
    2522100
  • Title

    Experimental evaluation of structure of garbage model generated from in-vocabulary words

  • Author

    Hirota, Shuichiro ; Hayasaka, Noboru ; Iiguni, Youji

  • Author_Institution
    Grad. Sch. of Eng. Sci., Osaka Univ., Toyonaka, Japan
  • fYear
    2012
  • fDate
    2-5 Oct. 2012
  • Firstpage
    87
  • Lastpage
    92
  • Abstract
    This paper proposes an effective garbage model for rejection of out-of-vocabulary words (OOV) in a word recognition system. Many methods for rejecting OOV and generating garbage models based on registering OOV have been proposed. However, they could not have sufficient rejection capability. To solve them, we propose a new garbage model generated from only in-vocabulary words (IV) without information about OOV. The proposed garbage model employs 2 parameters, i.e., the number of states (NS) and the number of mixtures (NM). From a large amount of word recognition simulations, we derived the best values of the parameters. We found that the equal error rate of the proposed garbage model was saturated around NM = 120 at any NS, and no significant change was observed with increasing NM more. In addition, the best value of NS was 18 at NM = 120. The results of simulations showed that the best parameters of the proposed garbage model (i.e., NS and NM) were 18 and 120, respectively.
  • Keywords
    speech recognition; vocabulary; word processing; NM; NS; OOV rejection; equal error rate; garbage model; in-vocabulary word; number of mixture; number of state; out-of-vocabulary word; word recognition; Complexity theory; Databases; Educational institutions; Hidden Markov models; Speech; Speech recognition; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications and Information Technologies (ISCIT), 2012 International Symposium on
  • Conference_Location
    Gold Coast, QLD
  • Print_ISBN
    978-1-4673-1156-4
  • Electronic_ISBN
    978-1-4673-1155-7
  • Type

    conf

  • DOI
    10.1109/ISCIT.2012.6381027
  • Filename
    6381027