• DocumentCode
    2770199
  • Title

    Random discriminant structure analysis for automatic recognition of connected vowels

  • Author

    Qiao, Yu ; Asakawa, Satoshi ; Minematsu, Nobuaki

  • Author_Institution
    Univ. of Tokyo, Tokyo
  • fYear
    2007
  • fDate
    9-13 Dec. 2007
  • Firstpage
    576
  • Lastpage
    581
  • Abstract
    The universal structure of speech [1, 2], proves to be invariant to transformations in feature space, and thus provides a robust representation for speech recognition. One of the difficulties of using structure representation is due to its high dimensionality. This not only increases computational cost but also easily suffers from the curse of dimensionality [3, 4]. In this paper, we introduce random discriminant structure analysis (RDSA) to deal with this problem. Based on the observation that structural features are highly correlated and include large redundancy, the RDSA combines random feature selection and discriminative analysis to calculate several low dimensional and discriminative representations from an input structure. Then an individual classifier is trained for each representation and the outputs of each classifier are integrated for the final classification decision. Experimental results on connected Japanese vowel utterances show that our approach achieves a recognition rate of 98.3% based on the training data of 8 speakers, which is higher than that (97.4%) of HMMs trained with the utterances of 4,130 speakers.
  • Keywords
    feature extraction; random processes; signal classification; signal representation; speech recognition; automatic speech recognition; connected vowels representation; random discriminant structure analysis; random feature selection; speech classification; Automatic speech recognition; Communication channels; Computational efficiency; Hidden Markov models; Redundancy; Robustness; Speech analysis; Speech recognition; Statistical analysis; Training data; Classifier ensemble; Discriminative analysis; Feature selection; Invariant structure; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4244-1746-9
  • Electronic_ISBN
    978-1-4244-1746-9
  • Type

    conf

  • DOI
    10.1109/ASRU.2007.4430176
  • Filename
    4430176