• DocumentCode
    1903670
  • Title

    Notice of Violation of IEEE Publication Principles
    Acoustic and Phoneme Modeling Based on Confusion Matrix for Ubiquitous Mixed-Language Speech Recognition

  • Author

    Shih, Po-Yi ; Wang, Jhing-Fa ; Lee, Hsiao-Ping ; Kai, Hung-Jen ; Kao, Hung-Tzu ; Lin, Yuan-Ning

  • Author_Institution
    Dept. of Electr. Eng., Nat. Cheng Rung Univ., Tainan
  • fYear
    2008
  • fDate
    11-13 June 2008
  • Firstpage
    500
  • Lastpage
    506
  • Abstract
    Notice of Violation of IEEE Publication Principles

    "Acoustic and Phoneme Modeling Based on Confusion Matrix for Ubiquitous Mixed-Language Speech Recognition,"
    by Po-Yi Shih; Jhing-Fa Wang; Hsiao-Ping Lee; Hung-Jen Kai; Hung-Tzu Kao; Yuan-Ning Lin,
    in the Proceedings of the IEEE International Conference on Sensor Networks, Ubiquitous and Trustworthy Computing, 2008. SUTC \´08, pp. 500-506, June 2008

    After careful and considered review of the content and authorship of this paper by a duly constituted expert committee, this paper has been found to be in violation of IEEE\´s Publication Principles.

    This paper contains significant portions of original text from the paper cited below. The original text was copied without attribution (including appropriate references to the original author(s) and/or paper title) and without permission.

    Due to the nature of this violation, reasonable effort should be made to remove all past references to this paper, and future references should be made to the following article:

    "Dissimilarity Measures for Hidden Markov Models and Their Application in Multilingual Speech Recognition"
    by Matti Vihola
    in his Masters Thesis, Tampere University of Technology, November 2001

    This work presents a novel approach to acoustic and phoneme modeling in order to recognize ubiquitous mixed-language speech. The conventional approaches to perform multilingual speech recognition are the usage of a multilingual phone set. A confusion matrix combining acoustic between every two phonetic is built for phonetic unit clustering. In this work, we are interested in speaker independent voice command recognition. The IPA representation is adapted for phonetic unit modeling. The EAT is applied to construct speaker independent acoustic models. The experimental results show that the proposed method can perform 70-80% lexicon recognition accuracy.
  • Keywords
    acoustic signal processing; natural languages; speech recognition; acoustic modeling; confusion matrix; phoneme modeling; speaker independent voice command recognition; ubiquitous mixed-language speech recognition; acoustic modeling; confusion matrix; ubiquitous mixed-language speech recognition; voice command;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Sensor Networks, Ubiquitous and Trustworthy Computing, 2008. SUTC '08. IEEE International Conference on
  • Conference_Location
    Taichung
  • Print_ISBN
    978-0-7695-3158-8
  • Electronic_ISBN
    978-0-7695-3158-8
  • Type

    conf

  • DOI
    10.1109/SUTC.2008.78
  • Filename
    4545809