• DocumentCode
    2539629
  • Title

    Word sense distribution in a web corpus

  • Author

    Chen, Ping ; Brown, David ; Tran, Andrew ; Ozoka, Noble ; Ortiz, Rafael

  • Author_Institution
    Dept. of Comput. & Math. Sci., Univ. of Houston-Downtown, Houston, TX, USA
  • fYear
    2010
  • fDate
    7-9 July 2010
  • Firstpage
    449
  • Lastpage
    453
  • Abstract
    World Wide Web has become an important knowledge source for many research fields, and quality of Web-acquired knowledge has direct impact on their performance. While evaluation of the vast amount of Web resources is out of question, in this paper we examined thousands of sentences containing twelve preselected words and produced several quality measures including sentence coherence and sense distribution information. Our goal is to provide some insight to several Computational Linguistics areas that acquire knowledge from the Web.
  • Keywords
    Internet; Web sites; computational linguistics; knowledge acquisition; word processing; Web corpus; Web resources; Web-acquired knowledge quality; World Wide Web; computational linguistics; knowledge source; sentence coherence; word sense distribution information; Coherence; Computational linguistics; Knowledge engineering; Search engines; Semantics; Speech; Syntactics; Computational Linguistics; Sense annotation; Web corpus acquisition and quality analysis; Word sense distribution;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cognitive Informatics (ICCI), 2010 9th IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-8041-8
  • Type

    conf

  • DOI
    10.1109/COGINF.2010.5599697
  • Filename
    5599697