Regional Information Center for Science and Technology - Speech recognition using sub-word neural tree network models and multiple classifier fusion

DocumentCode :

2933734

Title :

Speech recognition using sub-word neural tree network models and multiple classifier fusion

Author :

Sharma, Manish ; Mammone, Richard

Author_Institution :

CAIP Center, Rutgers Univ., Piscataway, NJ, USA

Volume :

fYear :

1995

fDate :

9-12 May 1995

Firstpage :

3323

Abstract :

A new neural tree network (NTN)-based speech recognition system is presented. The NTN is a hierarchical classifier that combines the properties of decision trees and feed-forward neural networks. In the sub-word unit-based system, the NTNs model the sub-word speech segments, while the Viterbi algorithm is used for temporal alignment. A durational probability is associated with each sub-word NTN. An iterative algorithm is proposed for training the sub-word NTNs, in which both the sub-word NTN models and the sub-word segment boundaries within a vocabulary word are re-estimated. The proposed system is thus a homogeneous, neural network-based, sub-word unit-based speech recognition system. Furthermore, embedded within this word model paradigm, multiple NTNs are trained for each sub-word segment and their output decisions are combined, or fused, to yield improved performance. The proposed discriminatory training-based system did not perform favourably compared with a hidden Markov model-based system. The paradigm presented in this paper can be argued to represent a class of discriminatory training-based, homogeneous (versus hybrid), sub-word unit-based speech recognition systems. Hence, the results reported here can be generalized to other similar systems.
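
The abstract names two mechanisms: fusing the outputs of several classifiers per sub-word segment, and Viterbi alignment of speech frames to sub-word models. The following is a minimal sketch of both ideas, not the authors' implementation. It assumes that each sub-word model yields a per-frame log score, that fusion is a plain average of those scores (the abstract does not state the fusion rule), and that a word is a strict left-to-right sequence of sub-word units. The durational probability and the iterative re-estimation loop are omitted. The names fuse_scores and viterbi_align are hypothetical.

```python
from typing import List, Tuple


def fuse_scores(scores_per_classifier: List[List[float]]) -> List[float]:
    """Average per-frame log scores across classifiers (assumed fusion rule)."""
    n = len(scores_per_classifier)
    return [sum(frame) / n for frame in zip(*scores_per_classifier)]


def viterbi_align(log_scores: List[List[float]]) -> Tuple[float, List[int]]:
    """Align T frames to K sub-word units in left-to-right order.

    log_scores[t][k] is the (hypothetical) log score of frame t under
    sub-word model k. Each unit must cover at least one frame.
    Returns the best total score and the unit index assigned to each frame.
    """
    T, K = len(log_scores), len(log_scores[0])
    if T < K:
        raise ValueError("need at least one frame per sub-word unit")
    neg_inf = float("-inf")
    best = [[neg_inf] * K for _ in range(T)]
    came_from = [[0] * K for _ in range(T)]
    best[0][0] = log_scores[0][0]  # the path must start in the first unit
    for t in range(1, T):
        for k in range(K):
            stay = best[t - 1][k]
            advance = best[t - 1][k - 1] if k > 0 else neg_inf
            if advance > stay:
                best[t][k] = advance + log_scores[t][k]
                came_from[t][k] = k - 1
            else:
                best[t][k] = stay + log_scores[t][k]
                came_from[t][k] = k
    # Backtrace from the last unit at the last frame.
    k = K - 1
    path = [k]
    for t in range(T - 1, 0, -1):
        k = came_from[t][k]
        path.append(k)
    path.reverse()
    return best[T - 1][K - 1], path
```

In this sketch the segment boundaries are the frames where the returned unit index changes. A training loop in the spirit of the abstract would alternate between re-estimating the models from the current boundaries and re-running the alignment.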

Keywords :

feedforward neural nets; iterative methods; speech recognition; Viterbi algorithm; decision trees; durational probability; feed-forward neural networks; hierarchical classifier; iterative algorithm; multiple classifier fusion; output decisions; performance; speech recognition; sub-word neural tree network models; sub-word speech segments; temporal alignment; training-based system; Classification tree analysis; Decision trees; Feedforward neural networks; Feedforward systems; Hidden Markov models; Iterative algorithms; Neural networks; Speech recognition; Viterbi algorithm; Vocabulary

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location :

Detroit, MI

ISSN :

1520-6149

Print_ISBN :

0-7803-2431-5

Type :

conf

DOI :

10.1109/ICASSP.1995.479696

Filename :

479696

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2933734