DocumentCode
589178
Title
Learning to Extract Entity Uniqueness from Web for Helping User Decision Making
Author
Wenhan Wang ; Ning Liu ; Yiran Xie
Author_Institution
Dept. of Math., Univ. of Washington, Seattle, WA, USA
fYear
2012
fDate
10-10 Dec. 2012
Firstpage
850
Lastpage
857
Abstract
Web entities are the building blocks of human knowledge and users are making decisions among vast varieties of entities. For example, recommendation systems generate lists of entities to users, but seldom show the reasons of recommendation such as the uniqueness of each item to assist user decision making. In this paper, we mathematically define Web entity uniqueness and uniqueness patterns, based on which we propose a novel unsupervised natural language learning algorithm for entity uniqueness extraction. We leverage the bootstrapping strategy to recognize uniqueness from the free-text Web corpus with assistance from semi-structured Web such as lists, tables and query logs. To avoid extracting the subjective entity uniqueness, which may bias user decision making, we propose the probabilistic likelihood of a uniqueness property using bipartite graph models over entities and properties. Experiments verify that our algorithms have higher accuracy and coverage of entity uniqueness extraction technique compared to other related algorithms. We also show by conducting a user study survey that entity uniqueness information indeed positively supports user decision making.
Keywords
Internet; decision making; graph theory; human computer interaction; information retrieval; learning (artificial intelligence); natural language processing; recommender systems; unsupervised learning; user interfaces; Web entities; Web entity uniqueness extraction technique; bipartite graph models; bootstrapping strategy; free-text Web corpus; probabilistic likelihood; recommendation systems; semi structured Web; subjective entity uniqueness extraction; uniqueness property; uniqueness recognition; unsupervised natural language learning algorithm; user decision making; Airports; Algorithm design and analysis; Bipartite graph; Decision making; Humans; Measurement; Natural languages; Web entity; decision making; entity uniqueness; information extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
Conference_Location
Brussels
Print_ISBN
978-1-4673-5164-5
Type
conf
DOI
10.1109/ICDMW.2012.127
Filename
6406528
Link To Document