Title :
Speech recognition using sub-word neural tree network models and multiple classifier fusion
Author :
Sharma, Manish ; Mammone, Richard
Author_Institution :
CAIP Center, Rutgers Univ., Piscataway, NJ, USA
Abstract :
A new neural tree network (NTN)-based speech recognition system is presented. The NTN is a hierarchical classifier that combines the properties of decision trees and feed-forward neural networks. In the sub-word unit-based system, the NTNs model the sub-word speech segments, while the Viterbi algorithm is used for temporal alignment. A durational probability is associated with each sub-word NTN. An iterative algorithm is proposed for training the sub-word NTNs. The sub-word NTN models, as well as the sub-word segment boundaries within a vocabulary word, are re-estimated. Thus, the proposed system is a homogeneous neural network-based, sub-word unit-based speech recognition system. Furthermore, embedded within this word model paradigm, multiple NTNs are trained for each sub-word segment, and their output decisions are combined, or fused, to yield improved performance. The proposed discriminatory training-based system did not compare favourably with a hidden Markov model-based system. The paradigm presented in this paper can be argued to represent a class of discriminatory training-based, homogeneous (versus hybrid), sub-word unit-based speech recognition systems. Hence, the results reported here can be generalized to other similar systems.
Keywords :
feedforward neural nets; iterative methods; speech recognition; Viterbi algorithm; decision trees; durational probability; feed-forward neural networks; hierarchical classifier; iterative algorithm; multiple classifier fusion; output decisions; performance; sub-word neural tree network models; sub-word speech segments; temporal alignment; training-based system; Classification tree analysis; Feedforward neural networks; Feedforward systems; Hidden Markov models; Iterative algorithms; Neural networks; Vocabulary
Conference_Title :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479696