مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

174890

Title :

Random Manhattan Indexing

Author :

Zadeh, Behrang Q. ; Handschuh, Siegfried

Author_Institution :

Insight Centre for Data Analytics, Nat. Univ. of Ireland, Galway, Ireland

fYear :

2014

fDate :

1-5 Sept. 2014

Firstpage :

203

Lastpage :

208

Abstract :

Vector space models (VSMs) are mathematically well-defined frameworks that have been widely used in text processing. In these models, high-dimensional, often sparse vectors represent text units. In an application, the similarity of vectors -- and hence the text units that they represent -- is computed by a distance formula. The high dimensionality of vectors, however, is a barrier to the performance of methods that employ VSMs. Consequently, a dimensionality reduction technique is employed to alleviate this problem. This paper introduces a new method, called Random Manhattan Indexing (RMI), for the construction of L1 normed VSMs at reduced dimensionality. RMI combines the construction of a VSM and dimension reduction into an incremental, and thus scalable, procedure. In order to attain its goal, RMI employs the sparse Cauchy random projections.

Keywords :

data reduction; indexing; text analysis; L1 normed VSM; RMI; dimensionality reduction technique; natural language text; random Manhattan indexing; sparse Cauchy random projections; vector space model; Computational modeling; Context; Equations; Indexing; Mathematical model; Vectors; Manhattan distance; dimensionality reduction; random projection; retrieval models; vector space model;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Database and Expert Systems Applications (DEXA), 2014 25th International Workshop on

Conference_Location :

Munich

ISSN :

1529-4188

Print_ISBN :

978-1-4799-5721-7

Type :

conf

DOI :

10.1109/DEXA.2014.51

Filename :

6974850

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=174890