Title
A sparse integrative cluster analysis for understanding soybean phenotypes
Abstract
Soybean is one of the most important crops for food, feed and bio-energy world-wide. The study of soybean phenotypic variation at different geographical locations can help the understanding of soybean domestication, population structure of soybean, and the conservation of soybean biodiversity. We investigate if soybean varieties can be identified that they differ from other varieties on multiple traits even when growing at different geographical locations. When a collection of traits are observed for the same soybean type at different locations (different views), joint analysis of the multiple-view data is required in order to identify the same soybean clusters based on data from different locations. We employ a new multi-view singular value decomposition approach that simultaneously decomposes the data matrix gathered at each location into sparse singular vectors. This approach is able to group soybean samples consistently across the different locations and simultaneously identify the phenotypes at each location on which the soybean samples within a cluster are the most similar. Comparison with several latest multi-view co-clustering methods demonstrates the superior performance of the proposed approach.
Year
DOI
Venue
2014
10.1109/BIBM.2014.6999290
BIBM
Keywords
Field
DocType
multi-view clustering,soybean clusters,soybean domestication,soybean population structure,soybean phenotypic variation,sparse singular vectors,group soybean sample,soybean trait analysis,sparse integrative cluster analysis,multiple-view data,bio-energy,crops,soybean biodiversity conservation,soybean varieties,food,feed,geographical locations,multiview singular value decomposition approach,multi-view data analysis,multiview co-clustering method,soybean phenotypes,data matrix,singular value decomposition,matrix decomposition,sociology,statistics,clustering algorithms,optimization,vectors,sparse matrices
Biodiversity,Domestication,Phenotype,Computer science,Bioinformatics,Cluster analysis,Population structure,Sparse matrix
Conference
ISSN
Citations 
PageRank 
2156-1125
0
0.34
References 
Authors
8
6
Name
Order
Citations
PageRank
Jinbo Bi11432104.24
Jiangwen Sun2668.73
Tingyang Xu300.34
Jin Lu4324.46
Yansong Ma500.34
Lijuan Qiu600.34