• DocumentCode
    3486822
  • Title
    Codebook for Writer Characterization: A Vocabulary of Patterns or a Mere Representation Space?
  • Author
    Djeddi, Chawki ; Siddiqi, Imran ; Souici-Meslati, Labiba ; Ennaji, Abdellatif
  • Author_Institution
    LAMIS Lab., Univ. of Tebessa, Tebessa, Algeria
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    423
  • Lastpage
    427
  • Abstract
    Codebook-based representations have been effectively employed for writer identification. Most of the codebook-based methods generate a codebook by clustering a set of patterns extracted from an independent data set. The probability of occurrence of the codebook patterns in a given writing is then used to characterize its author. This study investigates the hypothesis that the codebook is merely a representation space and the codebook patterns themselves do not affect the writer identification performance. The idea is validated by first using codebooks in different scripts from those of writings in question and then by using a synthetically generated codebook. A number of data sets with handwritten samples in Arabic, French, English, German, Urdu and Greek are considered in our series of evaluations. Experiments conducted with different codebooks report interesting results which validate the ideas put forward in this study.
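    The pipeline summarized in the abstract (cluster patterns into a codebook, describe a writing by the occurrence probabilities of the codebook patterns, compare writings) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not name the pattern features, the clustering algorithm, or the distance measure, so plain k-means, a chi-square distance, and every function name below are hypothetical choices. The random_codebook helper is likewise only a stand-in for the "synthetically generated codebook" mentioned in the abstract, whose actual construction is not described here.

```python
import numpy as np


def nearest_codeword(patterns, codebook):
    """Index of the closest codebook entry (squared Euclidean) for each pattern."""
    d = ((patterns[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)


def build_codebook(patterns, k, n_iter=20, seed=0):
    """Assumed clustering step: plain k-means over pattern feature vectors."""
    rng = np.random.default_rng(seed)
    start = rng.choice(len(patterns), size=k, replace=False)
    centroids = patterns[start].astype(float)
    for _ in range(n_iter):
        labels = nearest_codeword(patterns, centroids)
        for j in range(k):
            members = patterns[labels == j]
            if len(members):  # keep the old centroid if a cluster empties
                centroids[j] = members.mean(axis=0)
    return centroids


def random_codebook(k, dim, seed=0):
    """Hypothetical stand-in for a synthetic codebook: k random points in [0, 1)^dim."""
    return np.random.default_rng(seed).random((k, dim))


def occurrence_probabilities(patterns, codebook):
    """Fraction of a writing's patterns assigned to each codebook entry."""
    labels = nearest_codeword(patterns, codebook)
    counts = np.bincount(labels, minlength=len(codebook))
    return counts / counts.sum()


def chi_square_distance(p, q, eps=1e-12):
    """Assumed dissimilarity between two occurrence-probability vectors."""
    return 0.5 * np.sum((p - q) ** 2 / (p + q + eps))


def identify_writer(query_hist, reference_hists):
    """Return the reference writer whose histogram is closest to the query."""
    return min(
        reference_hists,
        key=lambda w: chi_square_distance(query_hist, reference_hists[w]),
    )
```

    Under these assumptions, the hypothesis in the abstract could be probed by computing occurrence_probabilities once with a codebook from build_codebook and once with one from random_codebook, then comparing the identification rates obtained with identify_writer.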
  • Keywords
    handwriting recognition; image representation; natural language processing; pattern clustering; probability; Arabic; English; French; German; Greek; Urdu; codebook patterns; codebook-based methods; codebook-based representations; occurrence probability; pattern clustering; pattern vocabulary; writer characterization; writer identification; Educational institutions; Handwriting recognition; Laboratories; Text analysis; Training; Writing; Codebook; Multi-script Handwritten Samples; Synthetic Patterns; Writer identification;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type
    conf
  • DOI
    10.1109/ICDAR.2013.92
  • Filename
    6628657