مرکز منطقه ای اطلاع رساني علوم و فناوري - Audiovusual automatic speech segmentation

DocumentCode :

3641720

Title :

Audiovusual automatic speech segmentation

Author :

Eren Akdemir;Tolga Çiloğlu

Author_Institution :

Elektrik ve Elektronik Mü

fYear :

2011

fDate :

4/1/2011 12:00:00 AM

Firstpage :

896

Lastpage :

899

Abstract :

Audiovisual speech segmentation using visual information together with audio data is introduced. The collaboration of audio and visual data results in lower average absolute boundary error between the manual segmentation and automatic segmentation results that directly affects the quality of speech processing systems using the segmented database. The audio and visual feature vectors are fused at the feature level and used in a HMM based speech segmentation system. A Turkish audiovisual speech database has been prepared and used in the experiments. The average absolute boundary error decreases up to 20.82% by using different audiovisual feature vectors.

Keywords :

"Mel frequency cepstral coefficient","Hidden Markov models","Speech","Conferences","Visualization","Speech processing"

Publisher :

ieee

Conference_Titel :

Signal Processing and Communications Applications (SIU), 2011 IEEE 19th Conference on

ISSN :

2165-0608

Print_ISBN :

978-1-4577-0462-8

Type :

conf

DOI :

10.1109/SIU.2011.5929796

Filename :

5929796

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3641720