Title :
Lip features for speech and speaker recognition
Author :
Auckenthaler, Roland ; Brand, Jason ; Mason, John S.
Author_Institution :
Dept. of Electron., Tech. Univ. Graz, Graz, Austria
Abstract :
This paper implicitly differentiates between the quality of visual representation necessary for speech recognition and for speaker recognition, and assesses the performance of visual lip features against well-established audio features. Blue lip highlighted data is used to show how variations in lip measurements can influence speech and speaker recognition. From these experiments and other researchers' results [1], it is postulated that the fine detail of the lips is critical for speaker recognition, but that the same amount of detail does not noticeably improve visual speech recognition. Visual error rates of 26.3% and 70% are achieved for cross-digit speaker recognition and cross-speaker speech recognition, respectively.
Keywords :
audio signal processing; error analysis; speaker recognition; audio features; blue lip highlighted data; cross-digit speaker; lip measurements; speaker recognition; speech recognition; visual error rates; visual lip features; visual representation; Acoustics; Feature extraction; Lips; Speaker recognition; Speech; Speech recognition; Visualization;
Conference_Title :
9th European Signal Processing Conference (EUSIPCO 1998)
Conference_Location :
Rhodes
Print_ISBN :
978-960-7620-06-4