DocumentCode
2539629
Title
Word sense distribution in a web corpus
Author
Chen, Ping ; Brown, David ; Tran, Andrew ; Ozoka, Noble ; Ortiz, Rafael
Author_Institution
Dept. of Comput. & Math. Sci., Univ. of Houston-Downtown, Houston, TX, USA
fYear
2010
fDate
7-9 July 2010
Firstpage
449
Lastpage
453
Abstract
World Wide Web has become an important knowledge source for many research fields, and quality of Web-acquired knowledge has direct impact on their performance. While evaluation of the vast amount of Web resources is out of question, in this paper we examined thousands of sentences containing twelve preselected words and produced several quality measures including sentence coherence and sense distribution information. Our goal is to provide some insight to several Computational Linguistics areas that acquire knowledge from the Web.
Keywords
Internet; Web sites; computational linguistics; knowledge acquisition; word processing; Web corpus; Web resources; Web-acquired knowledge quality; World Wide Web; computational linguistics; knowledge source; sentence coherence; word sense distribution information; Coherence; Computational linguistics; Knowledge engineering; Search engines; Semantics; Speech; Syntactics; Computational Linguistics; Sense annotation; Web corpus acquisition and quality analysis; Word sense distribution;
fLanguage
English
Publisher
ieee
Conference_Titel
Cognitive Informatics (ICCI), 2010 9th IEEE International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-8041-8
Type
conf
DOI
10.1109/COGINF.2010.5599697
Filename
5599697
Link To Document