• DocumentCode
    3484555
  • Title
    Multimodal person authentication system using features of utterance
  • Author
    Nishino, Takanori ; Kajikawa, Y. ; Muneyasu, Mitsuji
  • Author_Institution
    Fac. of Eng. Science, Kansai Univ., Suita, Japan
  • fYear
    2012
  • fDate
    4-7 Nov. 2012
  • Firstpage
    43
  • Lastpage
    47
  • Abstract
    In this paper, we propose a multimodal biometrics authentication method using features of an utterance. The proposed authentication method authenticates persons using image and voice signals. Hence, the proposed method can be realized with only a camera and microphone to extract the lip area and voice without the special equipment used in other personal authentication methods and can easily change the registration data. Moreover, the proposed authentication method can provide a key function to the registered phrase of the utterance. In the proposed method, the edges and texture in the mouth are used as image features, and pitch and spectrum envelope are used as voice features. Authentication is realized by classifiers generated by AdaBoost; classifiers are generated for the voice- and image-processing parts. Moreover, each classifier is weighted according to the corresponding confidence and then the final authentication score is calculated. Hence, the proposed method can provide valid authentication results in various environments. Experimental results demonstrate that multimodal processing in the proposed method is more effective than monomodal (only image or voice) processing.
  • Keywords
    feature extraction; image recognition; image registration; image texture; learning (artificial intelligence); speaker recognition; AdaBoost classifiers; camera; image features; image processing part; image signal; lip area extraction; microphone; monomodal processing; mouth edges; mouth texture; multimodal biometric authentication method; multimodal person authentication system; multimodal processing; pitch-spectrum envelope; registration data; utterance features; voice extraction; voice features; voice processing part; voice signal; Accuracy; Authentication; Biometrics (access control); Face; Feature extraction; Image edge detection; Vectors; AdaBoost; Dynamic Time Warping; Multimodal; authentication; features; utterance;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Signal Processing and Communications Systems (ISPACS), 2012 International Symposium on
  • Conference_Location
    New Taipei
  • Print_ISBN
    978-1-4673-5083-9
  • Electronic_ISBN
    978-1-4673-5081-5
  • Type
    conf
  • DOI
    10.1109/ISPACS.2012.6473450
  • Filename
    6473450