Multimodal person authentication system using features of utterance

Author

Nishino, Takanori ; Kajikawa, Y. ; Muneyasu, Mitsuji

Author_Institution

Fac. of Eng. Sceince, Kansai Univ., Suita, Japan

fYear

2012

fDate

4-7 Nov. 2012

Firstpage

43

Lastpage

47

Abstract

In this paper, we propose a multimodal biometrics authentication method using features of an utterance. The proposed authentication method authenticates persons using image and voice signals. Hence, the proposed method can be realized with only a camera and microphone to extract the lip area and voice without the special equipment used in other personal authentication methods and can easily change the registration data. Moreover, the proposed authentication method can provide a key function to the registered phrase of the utterance. In the proposed method, the edges and texture in the mouth are used as image features, and pitch and spectrum envelope are used as voice features. Authentication is realized by classifiers generated by AdaBoost, classifiers are generated for the voice- and image-processing parts. Moreover, each classifier is weighted according to the corresponding confidence and then the final authentication score is calculated. Hence, the proposed method can provide valid authentication results in various environments. Experimental results demonstrate that multimodal processing in the proposed method is more effective than monomodal (only image or voice) processing.

Keywords

feature extraction; image recognition; image registration; image texture; learning (artificial intelligence); speaker recognition; AdaBoost classifiers; camera; image features; image processing part; image signal; lip area extraction; microphone; monomodal processing; mouth edges; mouth texture; multimodal biometric authentication method; multimodal person authentication system; multimodal processing; pitch-spectrum envelope; registration data; utterance features; voice extraction; voice features; voice processing part; voice signal; Accuracy; Authentication; Biometrics (access control); Face; Feature extraction; Image edge detection; Vectors; AdaBoost; Dynamic Time Warping; Multimodal; authentication; features; utterance;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Signal Processing and Communications Systems (ISPACS), 2012 International Symposium on

Conference_Location

New Taipei

Print_ISBN

978-1-4673-5083-9

Electronic_ISBN

978-1-4673-5081-5

Type

conf

DOI

10.1109/ISPACS.2012.6473450

Filename

6473450