Title of article

Preferences in Wikipedia abstracts: Empirical findings and implications for automatic entity summarization

Author/Authors

Danyun Xu، نويسنده , , Gong Cheng، نويسنده , , Yuzhong Qu، نويسنده ,

Issue Information

دوماهنامه با شماره پیاپی سال 2014

Pages

13

From page

284

To page

296

Abstract

The volume of entity-centric structured data grows rapidly on the Web. The description of an entity, composed of property-value pairs (a.k.a. features), has become very large in many applications. To avoid information overload, efforts have been made to automatically select a limited number of features to be shown to the user based on certain criteria, which is called automatic entity summarization. However, to the best of our knowledge, there is a lack of extensive studies on how humans rank and select features in practice, which can provide empirical support and inspire future research. In this article, we present a large-scale statistical analysis of the descriptions of entities provided by DBpedia and the abstracts of their corresponding Wikipedia articles, to empirically study, along several different dimensions, which kinds of features are preferable when humans summarize. Implications for automatic entity summarization are drawn from the findings.

Keywords

DBpedia , Entity summarization , feature selection , Property ranking , Wikipedia

Journal title

Information Processing and Management

Serial Year

2014

Journal title

Information Processing and Management

Record number

Preferences in Wikipedia abstracts: Empirical findings and implications for automatic entity summarization

Danyun Xu، نويسنده , , Gong Cheng، نويسنده , , Yuzhong Qu، نويسنده ,

1229497