DocumentCode :
153419
Title :
Improving Classification of an Industrial Document Image Database by Combining Visual and Textual Features
Author :
Augereau, Olivier ; Journet, Nicholas ; Vialard, Anne ; Domenger, Jean-Philippe
Author_Institution :
Gestform, Merignac, France
fYear :
2014
fDate :
7-10 April 2014
Firstpage :
314
Lastpage :
318
Abstract :
The main contribution of this paper is a new method for classifying document images by combining textual features extracted with the Bag of Words (BoW) technique and visual features extracted with the Bag of Visual Words (BoVW) technique. The BoVW is widely used within the computer vision community for scene classification or object recognition but few applications for the classification of entire document images have been submitted. While previous attempts have been showing disappointing results by combining visual and textual features with the Borda-count technique, we´re proposing here a combination through learning approach. Experiments conducted on a 1925 document image industrial database reveal that this fusion scheme significantly improves the classification performances. Our concluding contribution deals with the choosing and tuning of the BoW and/or BoVW techniques in an industrial context.
Keywords :
feature extraction; image classification; visual databases; BoVW technique; BoW technique; Borda-count technique; bag of visual words technique; bag of words technique; computer vision community; document image classification; industrial document image database classification; learning approach; textual feature extraction; visual features extraction; Context; Databases; Feature extraction; Layout; Optical character recognition software; Support vector machines; Visualization; document image classification; industrial application; interest point; late fusion; visual and textual feature combination;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
Conference_Location :
Tours
Print_ISBN :
978-1-4799-3243-6
Type :
conf
DOI :
10.1109/DAS.2014.44
Filename :
6831020
Link To Document :
بازگشت