Title of article :
THE REFINED IDENTIFICATION OF BEGINNING-END OF SPEECH; THE RECOGNITION OF THE VOICELESS SOUNDS AT THE BEGINNING-END OF SPEECH. ON THE RECOGNITION OF THE EXTRA-LARGE VOCABULARIES.
Author/Authors :
Shelepov, V.Ju. Institute of Artificial Intelligence, Donetsk, Ukraine , Nitsenko, A.V. Institute of Artificial Intelligence, Donetsk, Ukraine
Abstract :
The present paper forms part of the diphone DTW-recognition strategy developed
by the authors. Voiceless plosives, as well as the energetically weak hard and soft [f], pose
a problem for recognition when they occur at the beginning or end of speech, owing to their
similarity to the neighboring stretches of silence. The article opens with a description of some refined
methods for locating the beginning and the end of a spoken word or phrase. These methods are the
basis for the proposed methods of recognizing the mentioned sounds at the beginning or end of
a spoken word or phrase. We introduce the concept of the final quasifricative fragment as well as
the algorithms for selecting it and using it to classify voiceless plosives in the final position. The
results obtained, together with only a slight increase in the number of basic speech units,
make it possible to advance in solving the difficult problems of recognizing short speech
segments as well as extra-large vocabularies.
Keywords :
continuous-speech recognition , speech segmentation , large vocabulary speech recognition , voiceless fragment , diphone , dynamic time warping (DTW)
Journal title :
Eurasian Journal of Mathematical and Computer Applications