DocumentCode :
2089811
Title :
Tensor Space Model for Hypertext Representation
Author :
Saha, Suman ; Murthy, C.A. ; Pal, Sankar K.
Author_Institution :
Center for Soft Comput. Res., Indian Stat. Inst., India
fYear :
2008
fDate :
17-20 Dec. 2008
Firstpage :
261
Lastpage :
266
Abstract :
We investigate the basics of tensor based hypertext representation and perform experiments this novel hypertext representation model. Most documents have an inherent hierarchical structure that render the desirable use of multidimensional representations such as those offered by tensor objects. We focus on the advantages of Tensor Space Model, in which documents are represented using second-order tensors. We exploit the local-structure and neighborhood recommendation encapsulated by the proposed representation. We define the distance metric on tensor space of hypertext documents, which is a generalization of distance metric defined on vector space model. Our results provide evidence that tensor based model is very efficient for clustering and classification of hypertext documents compared to traditional vector based model.
Keywords :
classification; data structures; hypermedia; tensors; distance metric; hypertext categorization system; hypertext document representation model; inherent hierarchical structure; second-order tensor; tensor space model; Computational complexity; Extraterrestrial measurements; Feature extraction; HTML; Information technology; Performance evaluation; Space technology; Tensile stress; Uniform resource locators; Web pages; hypertext; internal structure; similarity measure; tensor space;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology, 2008. ICIT '08. International Conference on
Conference_Location :
Bhubaneswar
Print_ISBN :
978-1-4244-3745-0
Type :
conf
DOI :
10.1109/ICIT.2008.13
Filename :
4731339
Link To Document :
بازگشت