• DocumentCode
    1791682
  • Title

    Multiresolution analysis of incomplete rankings with applications to prediction

  • Author

    Sibony, Eric ; Clemencon, Stephan ; Jakubowicz, Jeremie

  • Author_Institution
    Inst. Mines-Telecom, Telecom ParisTech, Paris, France
  • fYear
    2014
  • fDate
    27-30 Oct. 2014
  • Firstpage
    88
  • Lastpage
    95
  • Abstract
    Data representing preferences of users are at the core of many Big Data modern applications, such as recommender systems or search engines. While most of the introduced machine learning approaches are designed to handle preference data under the form of cardinal scores, such as ratings given by the users to the items, many situations require to deal with ordinal preferences, coming from implicit feedback data for instance. Methods relying on the analysis of ranking data are best suited for these situations, but they face a great computational challenge insofar as the number of ways to express ordinal preferences on a catalog of n items explodes with n. It is the main purpose of this paper to promote a new representation of preference data when they come under the form of incomplete rankings, that is to say ordinal preferences on small subsets of items. The representation exploits the “multiscale” structure of incomplete rankings and though it relies on recent results in algebraic topology, it is used and interpreted similar to classic wavelet multiresolution analysis on a Euclidean space. We apply it to the problem of incomplete rankings prediction and show at the same time that it is statistically consistent and that it can be computed at a reasonable cost given the complexity of the original data. It is illustrated by very encouraging empirical work based on real datasets.
  • Keywords
    Big Data; learning (artificial intelligence); Big Data modern applications; Euclidean space; algebraic topology; cardinal scores; data representing preferences; implicit feedback data; incomplete rankings; machine learning; ordinal preferences; preference data; real datasets; recommender systems; search engines; wavelet multiresolution analysis; Equations; Estimation; Mathematical model; Multiresolution analysis; Probability distribution; Ranking (statistics); Tin; incomplete rankings; multiresolution analysis; preference data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data (Big Data), 2014 IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Type

    conf

  • DOI
    10.1109/BigData.2014.7004361
  • Filename
    7004361