Notice of Violation of IEEE Publication Principles
Acoustic and Phoneme Modeling Based on Confusion Matrix for Ubiquitous Mixed-Language Speech Recognition

Author

Shih, Po-Yi ; Wang, Jhing-Fa ; Lee, Hsiao-Ping ; Kai, Hung-Jen ; Kao, Hung-Tzu ; Lin, Yuan-Ning

Author_Institution

Dept. of Electr. Eng., Nat. Cheng Rung Univ., Tainan

fYear

2008

fDate

11-13 June 2008

Firstpage

500

Lastpage

506

Abstract

Notice of Violation of IEEE Publication Principles

"Acoustic and Phoneme Modeling Based on Confusion Matrix for Ubiquitous Mixed-Language Speech Recognition,"
by Po-Yi Shih; Jhing-Fa Wang; Hsiao-Ping Lee; Hung-Jen Kai; Hung-Tzu Kao; Yuan-Ning Lin,
in the Proceedings of the IEEE International Conference on Sensor Networks, Ubiquitous and Trustworthy Computing, 2008. SUTC \´08, pp. 500-506, June 2008

After careful and considered review of the content and authorship of this paper by a duly constituted expert committee, this paper has been found to be in violation of IEEE\´s Publication Principles.

This paper contains significant portions of original text from the paper cited below. The original text was copied without attribution (including appropriate references to the original author(s) and/or paper title) and without permission.

Due to the nature of this violation, reasonable effort should be made to remove all past references to this paper, and future references should be made to the following article:

"Dissimilarity Measures for Hidden Markov Models and Their Application in Multilingual Speech Recognition"
by Matti Vihola
in his Masters Thesis, Tampere University of Technology, November 2001

This work presents a novel approach to acoustic and phoneme modeling in order to recognize ubiquitous mixed-language speech. The conventional approaches to perform multilingual speech recognition are the usage of a multilingual phone set. A confusion matrix combining acoustic between every two phonetic is built for phonetic unit clustering. In this work, we are interested in speaker independent voice command recognition. The IPA representation is adapted for phonetic unit modeling. The EAT is applied to construct speaker independent acoustic models. The experimental results show that the proposed method can perform 70-80% lexicon recognition accuracy.

Keywords

acoustic signal processing; natural languages; speech recognition; acoustic modeling; confusion matrix; phoneme modeling; speaker independent voice command recognition; ubiquitous mixed-language speech recognition; acoustic modeling; confusion matrix; ubiquitous mixed-language speech recognition; voice command;

fLanguage

English

Publisher

ieee

Conference_Titel

Sensor Networks, Ubiquitous and Trustworthy Computing, 2008. SUTC '08. IEEE International Conference on

Conference_Location

Taichung

Print_ISBN

978-0-7695-3158-8

Electronic_ISBN

978-0-7695-3158-8

Type

conf

DOI

10.1109/SUTC.2008.78

Filename

4545809

Notice of Violation of IEEE Publication PrinciplesAcoustic and Phoneme Modeling Based on Confusion Matrix for Ubiquitous Mixed-Language Speech Recognition

Shih, Po-Yi ; Wang, Jhing-Fa ; Lee, Hsiao-Ping ; Kai, Hung-Jen ; Kao, Hung-Tzu ; Lin, Yuan-Ning

conf

Notice of Violation of IEEE Publication Principles
Acoustic and Phoneme Modeling Based on Confusion Matrix for Ubiquitous Mixed-Language Speech Recognition