Title of article :
WHIRL: A word-based information representation language (Original Research Article)
Author/Authors :
William W. Cohen
Issue Information :
Serial-numbered journal issue, year 2000
Pages :
34
From page :
163
To page :
196
Abstract :
We describe WHIRL, an “information representation language” that synergistically combines properties of logic-based and text-based representation systems. WHIRL is a subset of Datalog that has been extended by introducing an atomic type for textual entities, an atomic operation for computing textual similarity, and a “soft” semantics; that is, inferences in WHIRL are associated with numeric scores, and presented to the user in decreasing order by score. This paper briefly describes WHIRL, and then surveys a number of applications. We show that WHIRL strictly generalizes both ranked retrieval of documents, and logical deduction; that nontrivial queries about large databases can be answered efficiently; that WHIRL can be used to accurately integrate data from heterogeneous information sources, such as those found on the Web; that WHIRL can be used effectively for inductive classification of text; and finally, that WHIRL can be used to semi-automatically generate extraction programs for structured documents.
Keywords :
Knowledge representation, Information retrieval, Textual similarity, Heterogeneous databases, Information integration, Text categorization, Information extraction
Journal title :
Artificial Intelligence
Serial Year :
2000
Record number :
1206836