DocumentCode
2860973
Title
Ontology-Based Structured Cosine Similarity in Speech Document Summarization
Author
Yuan, Soe-Tsyr ; Sun, Jerry
Author_Institution
National Chengchi University, Taipei, Taiwan
fYear
2004
fDate
20-24 Sept. 2004
Firstpage
508
Lastpage
513
Abstract
Development of algorithms for automated text categorization in massive text document sets is an important research area of data mining and knowledge discovery. Most of the text-clustering methods were grounded in the term-based measurement of distance or similarity, ignoring the structure of terms in documents. In this paper we present a novel method named Structured Cosine Similarity that furnishes document clustering with a new way of modeling on document summarization, considering the structure of terms in documents in order to improve the quality of speech document clustering.
Keywords
Clustering methods; Data mining; Frequency; Large scale integration; Management information systems; Ontologies; Size measurement; Speech; Sun; Text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
Print_ISBN
0-7695-2100-2
Type
conf
DOI
10.1109/WI.2004.10091
Filename
1410855
Link To Document