DocumentCode :
1772946
Title :
Mining correlation patterns of taxa, pathways and environmental factors with an improved weighted network community detection algorithm
Author :
Xiao-Ying Yan ; Shao-Wu Zhang ; Ze-Gang Wei ; Wei-Feng Guo
Author_Institution :
Key Lab. of Inf. Fusion Technol., Northwestern Polytech. Univ., Xi´an, China
fYear :
2014
fDate :
24-27 Oct. 2014
Firstpage :
141
Lastpage :
145
Abstract :
With the development of high-throughput and low-cost sequencing technology, a large amount of marine microbial sequences is generated. So, it is possible to research more uncultivated marine microbes. Generally, the functional capability and taxa structure are highly related with environment factors in microbial communities, which are hidden in these large amount sequences. However, most works used the canonical correlation analysis (CCA) method to research the correlative relationship among taxa, pathways and environmental factors. CCA is difficult to find which environmental factors are the major determinants of some special taxa and pathway. In this paper, we integrated 14 ocean metagenomes with geographical, meteorological and geophysicochemical data to construct the correlative weighted networks with Spearman correlation. By using an improved weighted network community detection algorithm, named as IWNCD, we find some special correlation patterns among taxa, pathways and environmental factors. Analysis of these patterns shows that the climatic factors such as temperature, sunlight, and correlated CO2, and the nutrients such as chlorophyII and primary production are the main determining factors of the functional community composition; The growth and development of some special taxa are dependent on some main environmental factors such as sunlight, temperature, CO2, primary production, dissolved oxygen, dissolved silicate; In addition, sampling sites more similar in geographic location have a greater tendency to be closer together based on their metabolic pathways.
Keywords :
cellular biophysics; data mining; environmental factors; genomics; microorganisms; CCA method; CO2 production; IWNCD; Spearman correlation; canonical correlation analysis; chlorophyII; climatic factors; correlation pattern mining; dissolved oxygen; dissolved silicate; environment factors; functional community composition; geographical data; geophysicochemical data; high-throughput sequencing technology; low-cost sequencing technology; marine microbial sequences; metabolic pathways; meteorological data; nutrients; ocean metagenomes; primary production; sunlight; taxa structure; uncultivated marine microbes; weighted network community detection algorithm; Biochemistry; Communities; Correlation; Environmental factors; Image edge detection; Production; Temperature; correlation pattern; marine microbe; pathway; taxa; weighted network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems Biology (ISB), 2014 8th International Conference on
Conference_Location :
Qingdao
Type :
conf
DOI :
10.1109/ISB.2014.6990746
Filename :
6990746
Link To Document :
بازگشت