DocumentCode
2206569
Title
Dependency detection with similarity constraints
Author
Lahti, Leo ; Myllykangas, Samuel ; Knuutila, Sakari ; Kaski, Samuel
fYear
2009
fDate
1-4 Sept. 2009
Firstpage
1
Lastpage
6
Abstract
Unsupervised two-view learning, or detection of dependencies between two paired data sets, is typically done by some variant of canonical correlation analysis (CCA). CCA searches for a linear projection for each view, such that the correlations between the projections are maximized. The solution is invariant to any linear transformation of either or both of the views; for tasks with small sample size such flexibility implies overfitting, which is even worse for more flexible nonparametric or kernel-based dependency discovery methods. We develop variants which reduce the degrees of freedom by assuming constraints on similarity of the projections in the two views. A particular example is provided by a cancer gene discovery application where chromosomal distance affects the dependencies between gene copy number and activity levels. Similarity constraints are shown to improve detection performance of known cancer genes.
Keywords
bioinformatics; data handling; unsupervised learning; cancer gene discovery; canonical correlation analysis; chromosomal distance; dependency detection; gene activity levels; gene copy number; similarity constraints; unsupervised two-view learning; Bayesian methods; Bioinformatics; Cancer; Computer science; Gene expression; Genetic mutations; Genomics; Multidimensional systems; Particle measurements; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning for Signal Processing, 2009. MLSP 2009. IEEE International Workshop on
Conference_Location
Grenoble
Print_ISBN
978-1-4244-4947-7
Electronic_ISBN
978-1-4244-4948-4
Type
conf
DOI
10.1109/MLSP.2009.5306192
Filename
5306192
Link To Document