DocumentCode
465900
Title
Three and Four Phase Scenarios for Dynamic Document Organization
Author
Jo, Taeho ; Japkowicz, Nathalie
Volume
3
fYear
2006
fDate
8-11 Oct. 2006
Firstpage
2228
Lastpage
2233
Abstract
This research introduces a new paradigm: dynamic document organization, DDO, for managing documents. DDO consists of managing documents automatically under the assumption that the topic structure in a collection of documents is always variable and temporary. This paradigm is in contrast with static document organization, SDO, the currently used paradigm which assumes that the topic structure is fixed and permanent. In this work, we consider two scenarios, a three-phase-scenario and a four-phase-scenario, for managing documents based on DDO. In both scenarios, text clustering, cluster identification, and document classification are integrated into a cycle. In the four-phase-scenario, one more phase, classifier training, is added between the cluster identification and document classification phases. The goal of this research is to evaluate the two proposed scenarios and contrast the best one to its best SDO counterpart. We show that the four-phase DDO scenario is more reliable than the three-phase DDO scenario, and that it generally outperforms the best SDO scenario.
Keywords
document handling; cluster identification; document classification; documents collection; dynamic document organization; four phase scenarios; static document organization; text clustering; three phase scenarios; Clustering algorithms; Content management; Maintenance; Management training; Niobium; Prototypes; Support vector machines; Switches; Text categorization; Text mining;
fLanguage
English
Publisher
IEEE
Conference_Titel
Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
Conference_Location
Taipei
Print_ISBN
1-4244-0099-6
Electronic_ISBN
1-4244-0100-3
Type
conf
DOI
10.1109/ICSMC.2006.385192
Filename
4274198