• DocumentCode
    3286489
  • Title

    Binary tree-based precision-keeping clustering for very fast Japanese character recognition

  • Author

    Sobu, Yohei ; Goto, Hideaki ; Aso, Hirotomo

  • Author_Institution
    Grad. Sch. of Inf. Sci., Tohoku Univ., Sendai, Japan
  • fYear
    2010
  • fDate
    8-9 Nov. 2010
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Real-time character recognition in video frames has been attracting great attention from developers since scene text recognition was recognized as a new field of Optical Character Recognition (OCR) applications. Some oriental languages such as Japanese and Chinese have thousands of characters, and the character recognition takes much longer time in general compared with European languages. Speed-up of character recognition is crucial to develop software for mobile devices such as Smart Phones. This paper proposes a binary tree-based clustering technique that can keep the precision as quite high as possible. The experimental results show that the character recognition using the proposed clustering technique is 8.3 times faster than the full linear matching at mere 0.22% precision drop. When the proposed method is combined with the Sequential Similarity Detection Algorithm (SSDA) and a PCA-based dimensionality reduction, we can achieve 36.2 times faster character matching at 0.29% precision drop.
  • Keywords
    image matching; natural language processing; optical character recognition; pattern clustering; principal component analysis; smart phones; trees (mathematics); video signal processing; Chinese language; European language; Japanese language; PCA-based dimensionality reduction; binary tree-based precision-keeping clustering; full linear matching; mobile device software; optical character recognition; principal component analysis; realtime character recognition; scene text recognition; sequential similarity detection algorithm; smart phones; very fast Japanese character recognition; video frame; Character recognition; Clustering algorithms; Dictionaries; Mobile handsets; Optical character recognition software; Real time systems; Vectors; Japanese character recognition; character clustering; dimensionality reduction; fast matching algorithm; real-time character recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image and Vision Computing New Zealand (IVCNZ), 2010 25th International Conference of
  • Conference_Location
    Queenstown
  • ISSN
    2151-2191
  • Print_ISBN
    978-1-4244-9629-7
  • Type

    conf

  • DOI
    10.1109/IVCNZ.2010.6148843
  • Filename
    6148843