پيش‌گويي قابليت فهم همخوان‌ها در افراد داراي شنوايي عادي با استفاده از مدل‌هاي ميكروسكوپي داراي معيار فاصله‌ متفاوت در بازشناساگر خودكار گفتار

عنوان فرعي

Prediction of consonants Intelligibility for Listeners with Normal Hearing Using Microscopic Models of Speech Perception Considering Different Distance Measures in Automatic Speech Recognizer

پديد آورندگان

گراوانچي‌زاده ، مسعود نويسنده دانشكده مهندسي برق و كامپيوتر- دانشگاه تبريز Geravanchizadeh, Masoud , فلاح ، علي نويسنده دانشكده مهندسي برق و كامپيوتر- دانشگاه تبريز Fallah, Ali , اعتراف اسكويي ، مير علي نويسنده دانشكده توانبخشي- دانشگاه علوم پزشكي تبريز Eteraf Oskouei, Mir Ali

اطلاعات موجودي

دوفصلنامه سال 1394 شماره 23

رتبه نشريه

علمي پژوهشي

تعداد صفحه

از صفحه

تا صفحه

كليدواژه

معيار فاصله , مدل ميكروسكوپي , بردار ويژگي , ادراك گفتار , نرخ تشخيص آوا , قابليت فهم , شناساگر خودكار گفتار

چكيده فارسي

در اين مطالعه، نرخ تشخيص همخوان‌هاي موجود در ساختار هجايي «واكه- همخوان- واكه»، در آزمون‌هاي شنوايي و دو مدل ميكروسكوپي ادراك گفتار مورد بررسي قرار مي‌گيرد. چنين ساختار هجايي در زبان فارسي و تركي آذري وجود ندارد؛ با وجود اين، نتايج آزمون‌هاي شنوايي نشان مي‌دهد كه شنونده آذري يا فارسي‌زبان در شرايط بدون نوفه، قادر به تشخيص صحيح همخوان‌ها هستند. براي اين پژوهش كه در آن هدف، تشخيص صحيح آواها و نه كلمات بامعني است، استفاده از اين دادگان صوتي فاقد معني مناسب است، چون با استفاده از اين دادگان، دانش زباني شنوندگان در پيش‌بيني كلمات ناديده گرفته ميشود. نتايج آزمون‌هاي شنوايي با نتايج دو مدل ميكروسكوپي كه بر پايه دستگاه شنوايي انسان است، مقايسه مي‌شود. تفاوت دو مدل در مرحله نهايي استخراج ويژگي به‌منظور استفاده در شناساگر خودكار گفتار DTW است. در مدل ميكروسكوپي اول، در مرحله پاياني استخراج ويژگي، از فيلتر 8 هرتز و در مدل دوم، از فيلتربانك مدولاسيون استفاده مي‌شود. در ادامه، نرخ تشخيص صحيح آواها در مقادير مختلف سيگنال به نوفه با استفاده از معيارهاي فاصله اقليدسي و لگاريتمي با يكديگر مقايسه مي‌شود. در اين تحقيق، نرخ تشخيص همخوان‌ها براي شنونده آذري‌زبان مورد بررسي قرار گرفته است. در كنار جنبه تجربي اين مطالعه، نو‌آوري اين مقاله در بررسي دو معيار فاصله مختلف براي مدل هلوب و نيز مقايسه مستقيم دو مدل ميكروسكوپي در پيش‌بيني ميانگين نرخ تشخيص و نيز نرخ تشخيص تك‌تك همخوان‌ها است.

چكيده لاتين

In this study, recognition rates of consonants available in vowel-consonant-vowel structure in hearing tests and two microscopic models will be investigated. Such a syllable structure doesn’t exist in Farsi and Azerbaijani languages, but since the goal is only recognition of middle phoneme, according to hearing tests, listeners are able to properly recognize phonemes in clean speech conditions. Inasmuch as these syllable structures are meaningless, it will be suitable for our purpose that is only determination of recognition rates of phonemes not meaningful words. Using this corpus, listeners’ linguistic knowledge in prediction of words is disregarded. Results of hearing tests are compared with two microscopic models based on human auditory system. Difference between two models is at the final stage of feature extraction that in first model, a 8 Hz filter and in the second model a modulation filterbank is used. Correct recognition rates of phonemes in different signal to noise ratios and two distance metrics for speech recognizer, will be compared. In this study recognition rates of consonants for listeners with Azerbaijani native language have been studied. Beside the empirical aspect of the paper, the innovations of this work lies in the study of using two different distance measures for Holube’s model and also direct comparison of two microscopic models in prediction of overall recognition rates and recognition rate of each consonant.

سال انتشار

1394

عنوان نشريه

پردازش علائم و داده ها

عنوان نشريه

پردازش علائم و داده ها

اطلاعات موجودي

دوفصلنامه با شماره پیاپی 23 سال 1394

كلمات كليدي

#تست#آزمون###امتحان

لينک به اين مدرک

https://search.isc.ac/dl/search/defaultta.aspx?DTC=8&DC=741797