DocumentCode :
2692013
Title :
CTGR-Span: Efficient mining of cross-timepoint gene regulation sequential patterns from microarray datasets
Author :
Cheng, Chun-Pei ; Tsai, Yi-Lin ; Tseng, Vincent S.
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
fYear :
2012
fDate :
4-7 Oct. 2012
Firstpage :
1
Lastpage :
4
Abstract :
Sequential pattern mining techniques have been widely used in different topics of interest, such as mining customer purchasing sequences from a transactional database. Notably, observation of gene expressions to discover gene regulations during biological or clinical progression via microarray approaches has become the dominant trend. By converting microarray datasets into the format of transactional databases, sequential patterns implying gene regulations could be identified. However, there exists no effective method in current studies that can handle such kind of dataset as every transaction may contain too many items/genes and the resultant patterns are very susceptible to item order. We propose a new method called CTGR-Span (Cross-Timepoint Gene Regulation Sequential Patterns) to efficiently mine CTGR-SPs (cross-timepoint gene regulation sequential patterns). The proposed method was experimented with two publicly available human time course microarray datasets and it outperformed traditional methods over 2,000 times in terms of the execution efficiency. Furthermore, via a Gene Ontology enrichment analysis, the resultant patterns are more meaningful biologically compared to previous literature reports. Hence, it could provide biologists more insights into the mechanisms of novel gene regulations in certain disease progressions.
Keywords :
data mining; diseases; genetics; lab-on-a-chip; medical computing; CTGR-span; cross-timepoint gene regulation sequential patterns; disease progressions; gene expressions; gene ontology enrichment analysis; gene regulations; human time course microarray datasets; mining customer purchasing sequences; sequential pattern mining techniques; sequential patterns; transactional database; Blood; Data mining; Databases; Diseases; Gene expression; Immune system; Cross timepoint; Gene regulation; Long transaction; Sequential pattern; Time course microarray;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-1-4673-2559-2
Electronic_ISBN :
978-1-4673-2558-5
Type :
conf
DOI :
10.1109/BIBM.2012.6392736
Filename :
6392736
Link To Document :
بازگشت