DocumentCode :
3527309
Title :
Maximizing the continuity in segmentation - A new approach to model, segment and recognize speech
Author :
Ming, Ji
Author_Institution :
Inst. of Electron., Commun. & Inf. Technol., Queen´´s Univ. Belfast, Belfast
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
3849
Lastpage :
3852
Abstract :
This paper presents a new approach to speech modeling and recognition. The new approach consists of a statistical model to represent up to sentence-long temporal dynamics in the training data, and an algorithm to identify the matching segments with maximum continuities between the training and testing sentences. Recognition is performed by combining the longest matching segments found from the training sentences. Because of their richer and more distinct temporal dynamics, longer speech segments as whole units can be recognized with lower error rates than shorter speech segments. Therefore basing recognition on the longest matching segments optimizes the discrimination and hence recognition of speech. The new approach has been evaluated on the TIMIT database for identifying matching speech segments. The results obtained are encouraging given the very low parametric complexity of the new model.
Keywords :
speech recognition; statistical analysis; matching segments; parametric complexity; segmentation continuity; sentence long temporal dynamics; speech recognition; speech segmentation; statistical model; Acoustic noise; Error analysis; Hidden Markov models; Loudspeakers; Nonlinear dynamical systems; Spatial databases; Speech enhancement; Speech recognition; Testing; Training data; speech modeling; speech recognition; speech segmentation; temporal dynamics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960467
Filename :
4960467
Link To Document :
بازگشت