• DocumentCode
    3723736
  • Title

    Impact of each camera on multiple camera visual speech recognizer using ANOVA: A brief study

  • Author

    Astik Biswas;P.K. Sahu;Mahesh Chandra

  • Author_Institution
    Dept of Electrical Engineering, National Institute of Technology, Rourkela, India-769008
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Multiple camera fusion technique is an imperative part of multi-camera computer vision applications. Visual modality plays a vital role in computer vision systems when the acoustic modality is corrupted by the background noise. Multiple camera protocol allows the user to move freely and can provide complementary information to each other. This study shows the influence of each camera on visual speech recognizer using the one-way analysis of variance (ANOVA). We choose a real world four cameras audio-visual corpus “AVICAR” for this study. ANOVA is applied to the each pair of the camera to explore the effect of different viewing angle. This ANOVA test shows the influence of side and central faced camera on AVICAR visual speech recognizer (VSR). Based on the ANOVA F-statistics test multiple camera streams are fused into one visual feature vector. Dynamic visual speech information is captured using Motion History Images (MHI). Zernike Moments (ZM) are used as the visual feature to carry out the study. Four camera visual features show ample improvement over single camera visual features across all driving condition.
  • Keywords
    "Cameras","Speech"
  • Publisher
    ieee
  • Conference_Titel
    TENCON 2015 - 2015 IEEE Region 10 Conference
  • ISSN
    2159-3442
  • Print_ISBN
    978-1-4799-8639-2
  • Electronic_ISBN
    2159-3450
  • Type

    conf

  • DOI
    10.1109/TENCON.2015.7372979
  • Filename
    7372979