Title :
Web-Based Verification on the Representativeness of Terms Extracted from Single Short Documents
Author_Institution :
Nat. Taipei Univ. of Technol., Taipei, Taiwan
Abstract :
Single-document summarization is useful for extracting the major ideas from the huge amount of daily information. However, it's a challenge to distinguish the relative importance among terms. In this paper, we propose a Web-based approach to term verification. Search results of extracted terms are utilized as their expanded representation, and their similarity with the original document is calculated as an estimate of term representativeness. We experimented with term extraction methods on multilingual news extracts and compared the effectiveness of term verification with various Jaccard similarity measures. The experimental results show the feasibility of Web-based verification on the representativeness of extracted terms.
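The verification idea in the abstract can be illustrated with a minimal sketch. It assumes the plain set-based Jaccard coefficient (the abstract mentions several Jaccard variants without specifying them), a simplistic word tokenizer, and search-result snippets that have already been fetched. The function names and the sample data are hypothetical and are not taken from the paper.

```python
import re


def tokenize(text):
    """Lowercase the text and return its set of word tokens (assumed tokenizer)."""
    return set(re.findall(r"\w+", text.lower()))


def jaccard(a, b):
    """Plain Jaccard coefficient |A & B| / |A | B|; 0.0 when both sets are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def term_representativeness(document, snippets):
    """Score one term by comparing the document with the term's search-result text.

    `snippets` stands in for the search results of the term; how they are
    retrieved is outside this sketch.
    """
    expanded = set()
    for snippet in snippets:
        expanded |= tokenize(snippet)
    return jaccard(tokenize(document), expanded)


def rank_terms(document, search_results):
    """Return (term, score) pairs sorted from most to least representative."""
    scores = {
        term: term_representativeness(document, snippets)
        for term, snippets in search_results.items()
    }
    return sorted(scores.items(), key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical document and pre-fetched snippets, for illustration only.
    doc = "web search term verification"
    results = {
        "verification": ["term verification on the web"],
        "weather": ["sunny weather today"],
    }
    for term, score in rank_terms(doc, results):
        print(term, round(score, 3))
```

In this sketch, a term whose search results share more vocabulary with the original document receives a higher score, which is one way to read "similarity with the original document" as an estimate of representativeness.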
Keywords :
Internet; document handling; formal verification; information retrieval; linguistics; Jaccard similarity measures; Web-based verification; multilingual news extraction; single short document summarization; term extraction methods; term representativeness estimation; term verification; Artificial intelligence; Conferences; Data mining; Estimation; Feature extraction; Google; Search engines; Web mining; short text
Conference_Title :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
Conference_Location :
Lyon
Print_ISBN :
978-1-4577-1373-6
Electronic_ISBN :
978-0-7695-4513-4
DOI :
10.1109/WI-IAT.2011.258