• DocumentCode
    2206569
  • Title

    Dependency detection with similarity constraints

  • Author

    Lahti, Leo ; Myllykangas, Samuel ; Knuutila, Sakari ; Kaski, Samuel

  • fYear
    2009
  • fDate
    1-4 Sept. 2009
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Unsupervised two-view learning, or detection of dependencies between two paired data sets, is typically done by some variant of canonical correlation analysis (CCA). CCA searches for a linear projection for each view, such that the correlations between the projections are maximized. The solution is invariant to any linear transformation of either or both of the views; for tasks with small sample size such flexibility implies overfitting, which is even worse for more flexible nonparametric or kernel-based dependency discovery methods. We develop variants which reduce the degrees of freedom by assuming constraints on similarity of the projections in the two views. A particular example is provided by a cancer gene discovery application where chromosomal distance affects the dependencies between gene copy number and activity levels. Similarity constraints are shown to improve detection performance of known cancer genes.
  • Keywords
    bioinformatics; data handling; unsupervised learning; cancer gene discovery; canonical correlation analysis; chromosomal distance; dependency detection; gene activity levels; gene copy number; similarity constraints; unsupervised two-view learning; Bayesian methods; Bioinformatics; Cancer; Computer science; Gene expression; Genetic mutations; Genomics; Multidimensional systems; Particle measurements; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning for Signal Processing, 2009. MLSP 2009. IEEE International Workshop on
  • Conference_Location
    Grenoble
  • Print_ISBN
    978-1-4244-4947-7
  • Electronic_ISBN
    978-1-4244-4948-4
  • Type

    conf

  • DOI
    10.1109/MLSP.2009.5306192
  • Filename
    5306192