Title :
IndexedFCP - An Index Based Approach to Identify Frequent Contiguous Patterns (FCP) in Big Data
Author :
Devikarubi, R. ; Rubi, R. Devika ; Arockiam, L.
Author_Institution :
R & D Centre, Bharathiar Univ., Coimbatore, India
Abstract :
The process of identifying the repeated contiguous patterns existing in a Big data environment, especially in a Sequence Database (SDB) is called Frequent Contiguous Patterns (FCP) mining. The existing FCP algorithms repeatedly scan the SDB to find FCP, which requires high time complexity and large storage space. In this paper, an Index based algorithm has been presented which minimizes the scanning time of SDB, while finding FCP in the SDB. We define IndexedFCP() which uses index based technique to divide the SDB into sub SDBs and helps directly to search the required FCP in its possible sub SDBs. FCP algorithms are used in many SDB applications such as finding Motif, Regulatory Regions and internal repeats in genomic SDB, identifying required items in Inventory SDB and finding trading patterns in stack trading etc.
Keywords :
Big Data; computational complexity; data mining; database indexing; Big Data; FCP algorithms; IndexedFCP; Motif; SDB applications; frequent contiguous pattern identification; frequent contiguous pattern mining; genomic SDB; index based technique; inventory SDB; large storage space; regulatory regions; sequence database; stack trading patterns; time complexity; Algorithm design and analysis; Bioinformatics; DNA; Data mining; Indexes; Proteins; Big Data; Biomarkers; Contiguous Pattern Mining; DNA Data Sequences; Motif; Regulatory Regions; Sequence Database;
Conference_Titel :
Intelligent Computing Applications (ICICA), 2014 International Conference on
Conference_Location :
Coimbatore
DOI :
10.1109/ICICA.2014.15