Title :
Research on Sports Game News Information Extraction
Author :
Yang, Yonggui ; Li, Lei
Author_Institution :
Beijing Univ. of Posts & Telecomm., Beijing
fDate :
Aug. 30 2007-Sept. 1 2007
Abstract :
With the development of Internet and the development of information technology, a tremendous amount of news information appears everyday. How to extract the useful knowledge is a burning problem. The sports game news-oriented information extraction system introduced in the paper combined the statistics-based hidden Markov model (HMM) and rule-based method based on the technology of natural language processing and information extraction. The system could transform Chinese sports game news in free text or html pages into structured data of useful knowledge automatically. In addition, we studied named entity recognition for knowledge candidates using rule-based method. Testing results have shown that the performance is good.
Keywords :
hidden Markov models; humanities; information management; Internet; hidden Markov model; natural language processing; rule-based method; sports game news-oriented information extraction system; Data mining; Databases; Finance; HTML; Helium; Information analysis; Information technology; Internet; Personnel; System testing;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-1611-0
Electronic_ISBN :
978-1-4244-1611-0
DOI :
10.1109/NLPKE.2007.4368017