• Title of article

    Lightweight methods for large-scale product categorization

  • Author/Authors

    Eli Cortez, Mauro Rojas Herrera, Altigran S. da Silva, Edleno S. de Moura, Marden Neubert

  • Issue Information
    Monthly, by serial issue number, year 2011
  • Pages
    10
  • From page
    1839
  • To page
    1848
  • Abstract
    In this article, we present a study about classification methods for large-scale categorization of product offers on e-shopping web sites. We studied the performance of previously proposed approaches and deployed a probabilistic approach to model the classification problem. We also studied an alternative way of modeling information about the description of product offers and investigated the usage of price and store of product offers as features adopted in the classification process. Our experiments used two collections of over a million product offers previously categorized by human editors and taxonomies of hundreds of categories from a real e-shopping web site. In these experiments, our method achieved an improvement of up to 9% in the quality of the categorization in comparison with the best baseline we have found.
  • Journal title
    Journal of the American Society for Information Science and Technology
  • Serial Year
    2011
  • Record number

    994511