• DocumentCode
    3425040
  • Title

    Detection of trends of technical phrases in text mining

  • Author

    Abe, Hidenao ; Tsumoto, Shusaku

  • Author_Institution
    Sch. of Med., Shimane Univ., Matsue, Japan
  • fYear
    2009
  • fDate
    17-19 Aug. 2009
  • Firstpage
    7
  • Lastpage
    12
  • Abstract
    In text mining processes, the importance indices of the technical terms play a key role in finding valuable patterns from various documents. Further, methods for finding emergent terms have attracted considerable attention as an important issue called temporal text mining. However, many conventional methods are not robust against changes in technical terms. In order to detect remarkable temporal trends of technical terms in given textual datasets robustly, we propose a method based on temporal changes in several importance indices by assuming the importance indices of the terms to be a dataset. The method consists of an automatic term extraction method in given documents, three importance indices from text mining studies, and temporal trends detection based on results of linear regression analysis. Empirical studies show that the three importance indices are applied to the titles of four annual conferences about data mining field as sets of documents. After detecting the temporal trends of automatically extracted phrases, we compared the trends of the technical phrases among the titles of the annual conferences.
  • Keywords
    data mining; regression analysis; text analysis; automatic term extraction method; data mining; linear regression analysis; technical phrases; text mining; trend detection; Automata; Data mining; Frequency; Hidden Markov models; Humans; Linear regression; Pattern recognition; Robustness; Text analysis; Text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Granular Computing, 2009, GRC '09. IEEE International Conference on
  • Conference_Location
    Nanchang
  • Print_ISBN
    978-1-4244-4830-2
  • Type

    conf

  • DOI
    10.1109/GRC.2009.5255172
  • Filename
    5255172