• DocumentCode
    3672270
  • Title

    Ranking and retrieval of image sequences from multiple paragraph queries

  • Author

    Gunhee Kim; Seungwhan Moon;Leonid Sigal

  • Author_Institution
    Seoul National University, Korea
  • fYear
    2015
  • fDate
    6/1/2015 12:00:00 AM
  • Firstpage
    1993
  • Lastpage
    2001
  • Abstract
    We propose a method to rank and retrieve image sequences from a natural language text query, consisting of multiple sentences or paragraphs. One of the method´s key applications is to visualize visitors´ text-only reviews on TRIPADVISOR or YELP, by automatically retrieving the most illustrative image sequences. While most previous work has dealt with the relations between a natural language sentence and an image or a video, our work extends to the relations between paragraphs and image sequences. Our approach leverages the vast user-generated resource of blog posts and photo streams on the Web. We use blog posts as text-image parallel training data that co-locate informative text with representative images that are carefully selected by users. We exploit large-scale photo streams to augment the image samples for retrieval. We design a latent structural SVM framework to learn the semantic relevance relations between text and image sequences. We present both quantitative and qualitative results on the newly created DISNEYLAND dataset.
  • Keywords
    "Image segmentation","Blogs","Image sequences","Streaming media","Semantics","Training","Natural languages"
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2015.7298810
  • Filename
    7298810