مرکز منطقه ای اطلاع رساني علوم و فناوري - iSAX 2.0: Indexing and Mining One Billion Time Series

DocumentCode :

2207235

Title :

iSAX 2.0: Indexing and Mining One Billion Time Series

Author :

Camerra, Alessandro ; Palpanas, Themis ; Shieh, Jin ; Keogh, Eamonn

Author_Institution :

Univ. of Trento, Trento, Italy

fYear :

2010

fDate :

13-17 Dec. 2010

Firstpage :

Lastpage :

Abstract :

There is an increasingly pressing need, by several applications in diverse domains, for developing techniques able to index and mine very large collections of time series. Examples of such applications come from astronomy, biology, the web, and other domains. It is not unusual for these applications to involve numbers of time series in the order of hundreds of millions to billions. However, all relevant techniques that have been proposed in the literature so far have not considered any data collections much larger than one-million time series. In this paper, we describe iSAX 2.0, a data structure designed for indexing and mining truly massive collections of time series. We show that the main bottleneck in mining such massive datasets is the time taken to build the index, and we thus introduce a novel bulk loading mechanism, the first of this kind specifically tailored to a time series index. We show how our method allows mining on datasets that would otherwise be completely untenable, including the first published experiments to index one billion time series, and experiments in mining massive data from domains as diverse as entomology, DNA and web-scale image collections.

Keywords :

data mining; indexing; time series; data collection; data mining; data structure; iSAX 2.0; indexable symbolic aggregate approximation; indexing; time series; data mining; indexing; representations; time series;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Data Mining (ICDM), 2010 IEEE 10th International Conference on

Conference_Location :

Sydney, NSW

ISSN :

1550-4786

Print_ISBN :

978-1-4244-9131-5

Electronic_ISBN :

1550-4786

Type :

conf

DOI :

10.1109/ICDM.2010.124

Filename :

5693959

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2207235