DocumentCode :
1999710
Title :
A Case Study: News Classification Based on Term Frequency
Author :
Kroha, Petr ; Baeza-Yates, Ricardo
Author_Institution :
Fac. of Comput. Sci., Univ. of Technol., Chemnitz
fYear :
2005
fDate :
26-26 Aug. 2005
Firstpage :
428
Lastpage :
432
Abstract :
In this paper, we investigate how much similarity good news and bad news have in context of long-terms market trends and we discuss the relation between information retrieval and text mining. We have analyzed about 400 thousand news stories coming from the years 1999 to 2002 and we argue that classification methods of information retrieval are not strong enough to solve problems like this one because the meaning of news is given not only by the used words and their frequency but also by the structure of sentences and their context. We present results of our experiments and examples of news that support this statement
Keywords :
business data processing; classification; data mining; information retrieval; stock markets; text analysis; business news classification; financial markets; information retrieval; term frequency; text mining; Business; Chemical technology; Computer aided software engineering; Computer science; Data mining; Databases; Engines; Frequency; Information retrieval; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications, 2005. Proceedings. Sixteenth International Workshop on
Conference_Location :
Copenhagen
ISSN :
1529-4188
Print_ISBN :
0-7695-2424-9
Type :
conf
DOI :
10.1109/DEXA.2005.6
Filename :
1508310
Link To Document :
بازگشت