Title of article :
A novel dual wing harmonium model aided by 2-D wavelet transform subbands for document data mining
Author/Authors :
Zhang، نويسنده , , Haijun and Chow، نويسنده , , Tommy W.S. and Rahman، نويسنده , , M.K.M.، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2010
Pages :
10
From page :
4403
To page :
4412
Abstract :
A novel dual wing harmonium model that integrates multiple features including term frequency features and 2-D wavelet transform features into a low dimensional semantic space is proposed for the applications of document classification and retrieval. Terms are extracted from the graph representation of document by employing weighted feature extraction method. 2-D wavelet transform is used to compress the graph due to its sparseness while preserving the basic document structure. After transform, low-pass subbands are stacked to represent the term associations in a document. We then develop a new dual wing harmonium model projecting these multiple features into low dimensional latent topics with different probability distributions assumption. Contrastive divergence algorithm is used for efficient learning and inference. We perform extensive experimental verification in document classification and retrieval, and comparative results suggest that the proposed method delivers better performance than other methods.
Keywords :
Graph representation , Multiple features , Dual wing harmonium , 2-D wavelet , Term association , Document data
Journal title :
Expert Systems with Applications
Serial Year :
2010
Journal title :
Expert Systems with Applications
Record number :
2347948
Link To Document :
بازگشت