DocumentCode :
78271
Title :
A Probabilistic Latent Semantic Analysis Model for Coclustering the Mouse Brain Atlas
Author :
Shuiwang Ji ; Wenlu Zhang ; Rongjian Li
Author_Institution :
Dept. of Comput. Sci., Old Dominion Univ., Norfolk, VA, USA
Volume :
10
Issue :
6
fYear :
2013
fDate :
Nov.-Dec. 2013
Firstpage :
1460
Lastpage :
1468
Abstract :
The mammalian brain contains cells of a large variety of types. The phenotypic properties of cells of different types are largely the results of distinct gene expression patterns. Therefore, it is of critical importance to characterize the gene expression patterns in the mammalian brain. The Allen Developing Mouse Brain Atlas provides spatiotemporal in situ hybridization gene expression data across multiple stages of mouse brain development. It provides a framework to explore spatiotemporal regulation of gene expression during development. We employ a graph approximation formulation to cocluster the genes and the brain voxels simultaneously for each time point. We show that this formulation can be expressed as a probabilistic latent semantic analysis (PLSA) model, thereby allowing us to use the expectation-maximization algorithm for PLSA to estimate the coclustering parameters. To provide a quantitative comparison with prior methods, we evaluate the coclustering method on a set of standard synthetic data sets. Results indicate that our method consistently outperforms prior methods. We apply our method to cocluster the Allen Developing Mouse Brain Atlas data. Results indicate that our clustering of voxels is more consistent with classical neuroanatomy than those of prior methods. Our analysis also yields sets of genes that are co-expressed in a subset of the brain voxels.
Keywords :
bioinformatics; brain; cellular biophysics; expectation-maximisation algorithm; genetics; genomics; neurophysiology; pattern clustering; probability; semantic networks; spatiotemporal phenomena; Allen Developing Mouse Brain Atlas data; PLSA; brain voxel subsets; cell phenotypic properties; classical neuroanatomy; coclustering method; coclustering parameters; distinct gene expression patterns; expectation-maximization algorithm; gene coclustering; gene sets; graph approximation formulation; mammalian brain; mouse brain development; multiple stages; probabilistic latent semantic analysis model; spatiotemporal in situ hybridization gene expression data; spatiotemporal regulation; standard synthetic data sets; voxel clustering; Approximation methods; Bipartite graph; Brain modeling; Gene expression; Mice; Brain imaging; coclustering; developing mouse brain; in situ hybridization; spatiotemporal gene expression;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2013.135
Filename :
6654140
Link To Document :
بازگشت