Title :
EKSS: An efficient approach for similarity search
Author :
Gupta, Shelley ; Dwivedi, Avinash ; Issac, R.K. ; Agrawal, Sachin Kumar
Author_Institution :
Dept. of Comput. Sci. & Eng, KEC, Ghaziabad, India
Abstract :
Similarity search has long been a crucial task in mining large multidimensional data. It involves both subsequence matching and whole-sequence matching. In this paper, we present an approach that considers the number of dimensions in which a data point is similar to the query point, the average distance between the data point and the query point over those dimensions, and efficiency in time and space as data size grows dramatically. The proposed approach dynamically selects its input parameters, covers both subsequence matching and whole-sequence matching, and suppresses the impact of high dissimilarity in a few dimensions. It can thus help improve the performance of existing data analysis technologies, such as financial market analysis, medical diagnosis, and scientific and engineering database analysis, since these disciplines generate tremendous amounts of data.
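The two quantities named in the abstract (the number of dimensions in which a data point is similar to the query, and the average distance over those dimensions) can be illustrated with a minimal sketch. This is not the EKSS algorithm itself, which the abstract does not specify. The per-dimension threshold `eps`, the function names `match_profile` and `rank_by_similarity`, and the ranking rule (more matching dimensions first, then smaller average distance) are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple


def match_profile(point: Sequence[float], query: Sequence[float],
                  eps: float) -> Tuple[int, float]:
    """Return (number of matching dimensions, mean distance over them).

    A dimension "matches" when its absolute difference is at most eps.
    eps is a hypothetical threshold, not a parameter taken from the paper.
    """
    diffs = [abs(p - q) for p, q in zip(point, query)]
    close = [d for d in diffs if d <= eps]
    # With no matching dimension there is nothing to average over,
    # so the point is pushed to the end of any ranking.
    avg = sum(close) / len(close) if close else float("inf")
    return len(close), avg


def rank_by_similarity(data: Sequence[Sequence[float]],
                       query: Sequence[float],
                       eps: float, k: int) -> List[int]:
    """Return the indices of the k best points under the assumed ordering."""
    scored = []
    for idx, point in enumerate(data):
        count, avg = match_profile(point, query, eps)
        # Negate count so that an ascending sort puts larger counts first.
        scored.append((-count, avg, idx))
    scored.sort()
    return [idx for _, _, idx in scored[:k]]
```

Because only the matching dimensions contribute to the average, a point that is very far from the query in a single dimension loses one count but is not otherwise penalized, which is one simple way to suppress the impact of high dissimilarity in a few dimensions.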
Keywords :
computational complexity; data mining; query processing; EKSS framework; data analysis technology performance improvement; data dimension dissimilarities; data size; engineering database analysis; financial market analysis; input parameter dynamic selection; medical diagnosis; multidimensional data mining; query point; scientific database analysis; similarity search problems; space analysis; subsequence matching; time analysis; whole-sequence matching; Bismuth; Computer science; Computers; Data analysis; Data mining; Nearest neighbor searches; Search problems; Efficient; K Similarity Search; subsequence matching; whole sequence matching;
Conference_Title :
Communication, Information & Computing Technology (ICCICT), 2012 International Conference on
Conference_Location :
Mumbai
Print_ISBN :
978-1-4577-2077-2
DOI :
10.1109/ICCICT.2012.6398194