Title :
Application of Topic Based Vector Space Model with WordNet
Author :
Wibowo, Adi ; Handojo, Andreas ; Halim, Albert
Author_Institution :
Inf. Dept., Petra Christian Univ., Surabaya, Indonesia
Abstract :
The Topic Based Vector Space Model (TVSM) defines a vector space whose dimensions are topics. Every term and document is represented by a vector in this space. By using topics as dimensions, TVSM tries to overcome word mismatch between terms with similar topics when finding documents relevant to a query. This study proposes building relations between terms using WordNet and a thesaurus to help TVSM calculate the similarity between documents. Relations between terms are represented by a relation score, and the study proposes a way to find the optimal relation score for a set of documents. To help index documents containing terms in multiple languages, the study also proposes using a dictionary to expand query terms.
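The idea of comparing documents in a topic space can be illustrated with a minimal sketch. It is not the authors' implementation: the topic names, the term vectors, and the helper functions `document_vector` and `similarity` are all hypothetical, and the term vectors are assigned by hand rather than derived from WordNet relation scores. It assumes a document vector is the normalized sum of its term vectors and that similarity is the dot product of two such vectors.

```python
import math

# Hypothetical topic dimensions and hand-assigned term vectors (illustrative only;
# the paper derives term relations from WordNet and a thesaurus instead).
TOPICS = ("vehicle", "finance")
TERM_VECTORS = {
    "car": (1.0, 0.0),
    "automobile": (1.0, 0.0),
    "bank": (0.0, 1.0),
    "loan": (0.0, 1.0),
}


def document_vector(terms):
    """Sum the topic vectors of the known terms and normalize to unit length."""
    total = [0.0] * len(TOPICS)
    for term in terms:
        vec = TERM_VECTORS.get(term)
        if vec is None:
            continue  # terms without a topic vector are ignored in this sketch
        for i, component in enumerate(vec):
            total[i] += component
    norm = math.sqrt(sum(x * x for x in total))
    if norm == 0.0:
        return tuple(total)  # no known terms: return the zero vector
    return tuple(x / norm for x in total)


def similarity(terms_a, terms_b):
    """Dot product of the two normalized document vectors."""
    vec_a = document_vector(terms_a)
    vec_b = document_vector(terms_b)
    return sum(x * y for x, y in zip(vec_a, vec_b))
```

Under these assumed vectors, `similarity(["car"], ["automobile"])` is maximal even though the two documents share no literal term, which is the word-mismatch case the abstract describes, while `similarity(["car"], ["bank"])` is zero because the terms lie on different topic axes.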
Keywords :
dictionaries; indexing; query processing; text analysis; thesauri; vectors; WordNet; dictionary; document indexing; document querying; multi language terms; thesaurus; topic based vector space model; word-mismatch; Business; Dictionaries; Mathematical model; Search engines; Testing; Thesauri; Weight measurement; Topic based vector space model; dictionary; wordnet;
Conference_Title :
Uncertainty Reasoning and Knowledge Engineering (URKE), 2011 International Conference on
Conference_Location :
Bali
Print_ISBN :
978-1-4244-9985-4
Electronic_ISBN :
978-1-4244-9984-7
DOI :
10.1109/URKE.2011.6007864