DocumentCode
3425040
Title
Detection of trends of technical phrases in text mining
Author
Abe, Hidenao ; Tsumoto, Shusaku
Author_Institution
Sch. of Med., Shimane Univ., Matsue, Japan
fYear
2009
fDate
17-19 Aug. 2009
Firstpage
7
Lastpage
12
Abstract
In text mining processes, the importance indices of the technical terms play a key role in finding valuable patterns from various documents. Further, methods for finding emergent terms have attracted considerable attention as an important issue called temporal text mining. However, many conventional methods are not robust against changes in technical terms. In order to detect remarkable temporal trends of technical terms in given textual datasets robustly, we propose a method based on temporal changes in several importance indices by assuming the importance indices of the terms to be a dataset. The method consists of an automatic term extraction method in given documents, three importance indices from text mining studies, and temporal trends detection based on results of linear regression analysis. Empirical studies show that the three importance indices are applied to the titles of four annual conferences about data mining field as sets of documents. After detecting the temporal trends of automatically extracted phrases, we compared the trends of the technical phrases among the titles of the annual conferences.
Keywords
data mining; regression analysis; text analysis; automatic term extraction method; data mining; linear regression analysis; technical phrases; text mining; trend detection; Automata; Data mining; Frequency; Hidden Markov models; Humans; Linear regression; Pattern recognition; Robustness; Text analysis; Text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Granular Computing, 2009, GRC '09. IEEE International Conference on
Conference_Location
Nanchang
Print_ISBN
978-1-4244-4830-2
Type
conf
DOI
10.1109/GRC.2009.5255172
Filename
5255172
Link To Document