• DocumentCode
    1584173
  • Title
    Use of colour in form layout analysis
  • Author
    Wong, Wing Seong ; Sherkat, Nasser ; Allen, Tony
  • Author_Institution
    Dept. of Comput., Nottingham Trent Univ., UK
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    942
  • Lastpage
    946
  • Abstract
    Colour has long been regarded as an unnecessary feature in form processing systems, owing not only to the large storage requirement and computational cost that its inclusion imposes but also to the complexities of hue, chroma and brightness variation. However, as technology has advanced and computing costs have fallen, processing documents in colour has become practical. This paper describes a prototype form extraction system that uses colour information to facilitate data extraction from forms. Blank forms are first analysed automatically to obtain their layout, colour and statistical information. The filled-in data is then extracted from completed forms using techniques based on the changes in colour characteristics relative to the blank form. The improved performance of the proposed method has been verified by comparing its processing time, data extraction precision and recall rate with those of an archetypal black and white form extraction system.
  • Keywords
    business forms; document image processing; image colour analysis; optical character recognition; OCR; brightness; chroma; colour document processing; colour reduction; computational cost; data extraction; form layout analysis; form processing system; hue; image colour; performance; storage requirement; Brightness; Computational efficiency; Computer science education; Costs; Data mining; Finance; Image color analysis; Information analysis; Iris; Medical services
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
  • Conference_Location
    Seattle, WA
  • Print_ISBN
    0-7695-1263-1
  • Type
    conf
  • DOI
    10.1109/ICDAR.2001.953924
  • Filename
    953924