DocumentCode :
2253866
Title :
Knowledge discovery in an earthquake text database: correlation between significant earthquakes and the time of day
Author :
Goldman, Jeffrey A. ; Parker, D. Stott ; Chu, Wesley W.
Author_Institution :
Dept. of Comput. Sci., California Univ., Los Angeles, CA, USA
fYear :
1997
fDate :
11-13 Aug 1997
Firstpage :
12
Lastpage :
21
Abstract :
The authors take a real world application from a text database and present a case history. The techniques ultimately led to a discovery contradicting an accepted paradigm in seismology. Using simple, tailored, keyword extraction, they examined a text collection of earthquake data. A discovery was made when an unusual pattern emerged from the text. They then tested a more comprehensive numerical database, treating the the text discovery as a hypothesis. It was verified using a standard χ2 statistic. The hypothesis was significant earthquakes in the longitude regions that include California, occur more often in the morning hours than any other time of day
Keywords :
earthquakes; full-text databases; geophysics computing; query processing; scientific information systems; seismology; χ2 statistic; California; earthquake data; earthquake text database; keyword extraction; knowledge discovery; longitude regions; numerical database; pattern; seismology; significant earthquakes; text collection; text discovery; time of day; Application software; Computer science; Data mining; Databases; Dictionaries; Earthquakes; Frequency; Routing; Statistics; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Scientific and Statistical Database Management, 1997. Proceedings., Ninth International Conference on
Conference_Location :
Olympia, WA
Print_ISBN :
0-8186-7952-2
Type :
conf
DOI :
10.1109/SSDM.1997.621144
Filename :
621144
Link To Document :
بازگشت