• DocumentCode
    2860973
  • Title

    Ontology-Based Structured Cosine Similarity in Speech Document Summarization

  • Author

    Yuan, Soe-Tsyr ; Sun, Jerry

  • Author_Institution
    National Chengchi University, Taipei, Taiwan
  • fYear
    2004
  • fDate
    20-24 Sept. 2004
  • Firstpage
    508
  • Lastpage
    513
  • Abstract
    Development of algorithms for automated text categorization in massive text document sets is an important research area of data mining and knowledge discovery. Most of the text-clustering methods were grounded in the term-based measurement of distance or similarity, ignoring the structure of terms in documents. In this paper we present a novel method named Structured Cosine Similarity that furnishes document clustering with a new way of modeling on document summarization, considering the structure of terms in documents in order to improve the quality of speech document clustering.
  • Keywords
    Clustering methods; Data mining; Frequency; Large scale integration; Management information systems; Ontologies; Size measurement; Speech; Sun; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
  • Print_ISBN
    0-7695-2100-2
  • Type

    conf

  • DOI
    10.1109/WI.2004.10091
  • Filename
    1410855