DocumentCode
1909325
Title
Research on Sports Game News Information Extraction
Author
Yang, Yonggui ; Li, Lei
Author_Institution
Beijing Univ. of Posts & Telecomm., Beijing
fYear
2007
fDate
Aug. 30 2007-Sept. 1 2007
Firstpage
96
Lastpage
101
Abstract
With the development of Internet and the development of information technology, a tremendous amount of news information appears everyday. How to extract the useful knowledge is a burning problem. The sports game news-oriented information extraction system introduced in the paper combined the statistics-based hidden Markov model (HMM) and rule-based method based on the technology of natural language processing and information extraction. The system could transform Chinese sports game news in free text or html pages into structured data of useful knowledge automatically. In addition, we studied named entity recognition for knowledge candidates using rule-based method. Testing results have shown that the performance is good.
Keywords
hidden Markov models; humanities; information management; Internet; hidden Markov model; natural language processing; rule-based method; sports game news-oriented information extraction system; Data mining; Databases; Finance; HTML; Helium; Information analysis; Information technology; Internet; Personnel; System testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-1611-0
Electronic_ISBN
978-1-4244-1611-0
Type
conf
DOI
10.1109/NLPKE.2007.4368017
Filename
4368017
Link To Document