DocumentCode :
2647598
Title :
Wavelet-based hierarchical document categorization
Author :
Xu, Chao ; Zhou, Yi-ming
Author_Institution :
Beihang Univ., Beijing
Volume :
4
fYear :
2007
fDate :
2-4 Nov. 2007
Firstpage :
1524
Lastpage :
1527
Abstract :
A multiresolution method based wavelet is purposed for hierarchical document categorization. Multi-level wavelet decomposition of document vectors is regarded as a feature selection process. The traditional vector space model in document representation is reorganized by hierarchical conceptual-semantic relation of the words. The multiresolution wavelet coefficients of reordered vectors reflect the characteristics of the document category in hierarchy. At different levels of category, corresponding resolution wavelet coefficients are used to classify. The computational complexity of classification is reduced and better performance is obtained.
Keywords :
computational complexity; data mining; feature extraction; text analysis; wavelet transforms; computational complexity; conceptual-semantic relation; feature selection; hierarchical document categorization; multiresolution wavelet coefficient; text mining; word similarity; Binary trees; Discrete wavelet transforms; Frequency; Pattern analysis; Pattern recognition; Signal resolution; Wavelet analysis; Wavelet coefficients; Wavelet domain; Wavelet transforms; hierarchical document categorization; multiresolution; wavelet transform; words similarity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Wavelet Analysis and Pattern Recognition, 2007. ICWAPR '07. International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-1065-1
Electronic_ISBN :
978-1-4244-1066-8
Type :
conf
DOI :
10.1109/ICWAPR.2007.4421692
Filename :
4421692
Link To Document :
بازگشت