• DocumentCode
    3748895
  • Title

    Unsupervised Learning of Spatiotemporally Coherent Metrics

  • Author

    Ross Goroshin;Joan Bruna;Jonathan Tompson;David Eigen;Yann LeCun

  • Author_Institution
    Courant Inst., NYU, New York, NY, USA
  • fYear
    2015
  • Firstpage
    4086
  • Lastpage
    4093
  • Abstract
    Current state-of-the-art classification and detection algorithms train deep convolutional networks using labeled data. In this work we study unsupervised feature learning with convolutional networks in the context of temporally coherent unlabeled data. We focus on feature learning from unlabeled video data, using the assumption that adjacent video frames contain semantically similar information. This assumption is exploited to train a convolutional pooling auto-encoder regularized by slowness and sparsity priors. We establish a connection between slow feature learning and metric learning. Using this connection we define "temporal coherence" -- a criterion which can be used to set hyper-parameters in a principled and automated manner. In a transfer learning experiment, we show that the resulting encoder can be used to define a more semantically coherent metric without the use of labels.
  • Keywords
    "Feature extraction","Measurement","Dictionaries","Unsupervised learning","Training","Convolution","Video sequences"
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision (ICCV), 2015 IEEE International Conference on
  • Electronic_ISBN
    2380-7504
  • Type

    conf

  • DOI
    10.1109/ICCV.2015.465
  • Filename
    7410822