Title :
Trigram duration modeling in speech recognition
Author :
Tang, Yun ; Liu, Wenju ; Xu, Bo
Abstract :
Rate of speech (ROS) is a very important factor in speech recognition. We present a new speech rate measurement method which first normalizes the duration of different acoustic units to a standard duration and then builds a trigram duration model to measure the speech rate of a sentence. We propose two methods based on the standard duration to compensate the influence introduced by speech rate variation in a data corpus and get 11% error rate reduction in Mandarin digit string recognition.
Keywords :
acoustic signal processing; error statistics; natural languages; speech recognition; Mandarin digit string recognition; acoustic unit duration; error rate reduction; rate of speech; speech rate measurement; speech recognition; standard duration; trigram duration model; Acoustic measurements; Context modeling; Databases; Error analysis; Hidden Markov models; Measurement standards; Measurement units; Natural languages; Parameter estimation; Speech recognition;
Conference_Titel :
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN :
0-7803-8678-7
DOI :
10.1109/CHINSL.2004.1409627