DocumentCode :
2280596
Title :
High performance telephone bandwidth speaker independent continuous digit recognition
Author :
Cosi, Piero ; Hosom, John-Paul ; Valente, Alberto
Author_Institution :
Ist. di Fonetica e Dialettologia, CNR, Padova, Italy
fYear :
2001
fDate :
2001
Firstpage :
405
Lastpage :
408
Abstract :
The development of a high-performance, telephone-bandwidth, speaker-independent connected digit recognizer for Italian is described. The CSLU Speech Toolkit was used to develop and implement the hybrid ANN/HMM system, which is trained on context-dependent categories to account for coarticulatory variation. Various front-end processing methods and system architectures were compared. With the best features (MFCC with CMS + Δ) and the best network (a 4-layer fully connected feed-forward network), the system reached 98.92% word recognition accuracy and 92.62% sentence recognition accuracy on a test set of the FIELD continuous digits recognition task.
Keywords :
hidden Markov models; neural nets; speech recognition; 4-layer fully connected feed-forward network; Italian; coarticulatory variation; context dependent categories; front-end processing; high-performance telephone bandwidth speaker independent connected digit recognizer; hybrid ANN/HMM system; system architecture; Automatic speech recognition; Bandwidth; Collision mitigation; Feedforward systems; Hidden Markov models; Mel frequency cepstral coefficient; Natural languages; Speech recognition; System testing; Telephony;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
Type :
conf
DOI :
10.1109/ASRU.2001.1034670
Filename :
1034670