مرکز منطقه ای اطلاع رساني علوم و فناوري - THE REFINED IDENTIFICATION OF BEGINNING-END OF SPEECH; THE RECOGNITION OF THE VOICELESS SOUNDS AT THE BEGINNING-END OF SPEECH. ON THE RECOGNITION OF THE EXTRA-LARGE VOCABULARIES.

Title of article :

THE REFINED IDENTIFICATION OF BEGINNING-END OF SPEECH; THE RECOGNITION OF THE VOICELESS SOUNDS AT THE BEGINNING-END OF SPEECH. ON THE RECOGNITION OF THE EXTRA-LARGE VOCABULARIES.

Author/Authors :

Shelepov, V.Ju. Institute of Artifical Intelligence, Donetsk, Ukraine , Nitsenko, A.V. Institute of Artifical Intelligence, Donetsk, Ukraine

Pages :

From page :

To page :

Abstract :

The present paper belongs to the diphone DTW-recognition strategy developed by the authors. Voiceless plosives, as well as energetically weak hard and soft [f] constitute a problem for recognition when they occur at the beginning or end of speech, owing to their similarity to neighboring silence stretches. The article opens up a description of some refined methods for specifying the beginning and the end of a spoken word or phrase. This is the basis for the proposed methods of recognizing the mentioned sounds beginning or concluding a spoken word or phrase. We introduce a concept of the final quasifricative fragment as well as the algorithms for its selection and use to classify voiceless plosives in the final position. The results obtained together with an insignificant increase in the number of basic speech units, makes it possible to advance in solving the difficult problems of recognizing short speech segments as well as extra-large vocabularies.

Keywords :

continuous-speech recognition , speech segmentation , large vocabulary speech recognition , voiceless fragment , diphone , dynamic time warping (DTW)

Journal title :

Eurasian Journal of Mathematical and Computer Applications

Serial Year :

2017

Full Text URL :

drive.google.com/file/d/0B2YBOGP6HM8vQ3RsSmh1RlM3WEo1S3hsUXVkeHdxMnBHTlNN/view

Record number :

2601668

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=2601668