Title :
Wavelet-based hierarchical document categorization
Author :
Xu, Chao ; Zhou, Yi-ming
Author_Institution :
Beihang Univ., Beijing
Abstract :
A multiresolution method based wavelet is purposed for hierarchical document categorization. Multi-level wavelet decomposition of document vectors is regarded as a feature selection process. The traditional vector space model in document representation is reorganized by hierarchical conceptual-semantic relation of the words. The multiresolution wavelet coefficients of reordered vectors reflect the characteristics of the document category in hierarchy. At different levels of category, corresponding resolution wavelet coefficients are used to classify. The computational complexity of classification is reduced and better performance is obtained.
Keywords :
computational complexity; data mining; feature extraction; text analysis; wavelet transforms; computational complexity; conceptual-semantic relation; feature selection; hierarchical document categorization; multiresolution wavelet coefficient; text mining; word similarity; Binary trees; Discrete wavelet transforms; Frequency; Pattern analysis; Pattern recognition; Signal resolution; Wavelet analysis; Wavelet coefficients; Wavelet domain; Wavelet transforms; hierarchical document categorization; multiresolution; wavelet transform; words similarity;
Conference_Titel :
Wavelet Analysis and Pattern Recognition, 2007. ICWAPR '07. International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-1065-1
Electronic_ISBN :
978-1-4244-1066-8
DOI :
10.1109/ICWAPR.2007.4421692