• DocumentCode
    2718429
  • Title

    Audio -Visual Biometric Based Speaker Identification

  • Author

    Kar, Biswajit ; Bhatia, Sandeep ; Dutta, P.K.

  • Author_Institution
    Indian Inst. of Technol., Kharagpur
  • Volume
    4
  • fYear
    2007
  • fDate
    13-15 Dec. 2007
  • Firstpage
    94
  • Lastpage
    98
  • Abstract
    In this paper, we present a multimodal audio-visual speaker identification system. The proposed system decomposes the information existing in a video stream into two components: speech and lip motion. It has been studied that lip information not only presents speech information but also characteristic information about a person´s identity. Fusing this information with speech information will produce robust person identification under adverse condition. Gaussian mixture models (GMMs) and Hidden markov models (HMMs) are used throughout this work for the tasks of text dependent speaker recognition and mouth tracking. The performance is evaluated for dataset of 22 Indian of different ethnicity speakers each uttering a sentence. The results show that the performance of the biometric system is significantly better when both audio and video features are used
  • Keywords
    Gaussian processes; audio-visual systems; hidden Markov models; speaker recognition; Gaussian mixture models; Hidden markov models; audio-visual biometric; mouth tracking; multimodal audio-visual system; person identification; speaker identification; speaker recognition; speech information; Biometrics; Face detection; Face recognition; Hidden Markov models; Lips; Mouth; Robustness; Shape; Speaker recognition; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Conference on Computational Intelligence and Multimedia Applications, 2007. International Conference on
  • Conference_Location
    Sivakasi, Tamil Nadu
  • Print_ISBN
    0-7695-3050-8
  • Type

    conf

  • DOI
    10.1109/ICCIMA.2007.21
  • Filename
    4426456