Title :
A sparse integrative cluster analysis for understanding soybean phenotypes
Author :
Jinbo Bi ; Jiangwen Sun ; Tingyang Xu ; Jin Lu ; Yansong Ma ; Lijuan Qiu
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Connecticut, Storrs, CT, USA
Abstract :
Soybean is one of the most important crops for food, feed and bio-energy world-wide. The study of soybean phenotypic variation at different geographical locations can help the understanding of soybean domestication, population structure of soybean, and the conservation of soybean biodiversity. We investigate if soybean varieties can be identified that they differ from other varieties on multiple traits even when growing at different geographical locations. When a collection of traits are observed for the same soybean type at different locations (different views), joint analysis of the multiple-view data is required in order to identify the same soybean clusters based on data from different locations. We employ a new multi-view singular value decomposition approach that simultaneously decomposes the data matrix gathered at each location into sparse singular vectors. This approach is able to group soybean samples consistently across the different locations and simultaneously identify the phenotypes at each location on which the soybean samples within a cluster are the most similar. Comparison with several latest multi-view co-clustering methods demonstrates the superior performance of the proposed approach.
Keywords :
crops; singular value decomposition; bio-energy; crops; data matrix; feed; food; geographical locations; group soybean sample; multiple-view data; multiview co-clustering method; multiview singular value decomposition approach; soybean biodiversity conservation; soybean clusters; soybean domestication; soybean phenotypes; soybean phenotypic variation; soybean population structure; soybean varieties; sparse integrative cluster analysis; sparse singular vectors; Clustering algorithms; Matrix decomposition; Optimization; Sociology; Sparse matrices; Statistics; Vectors; multi-view clustering; multi-view data analysis; soybean population structure; soybean trait analysis;
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
Conference_Location :
Belfast
DOI :
10.1109/BIBM.2014.6999290