• DocumentCode
    1909325
  • Title

    Research on Sports Game News Information Extraction

  • Author

    Yang, Yonggui ; Li, Lei

  • Author_Institution
    Beijing Univ. of Posts & Telecomm., Beijing
  • fYear
    2007
  • fDate
    Aug. 30 2007-Sept. 1 2007
  • Firstpage
    96
  • Lastpage
    101
  • Abstract
    With the development of Internet and the development of information technology, a tremendous amount of news information appears everyday. How to extract the useful knowledge is a burning problem. The sports game news-oriented information extraction system introduced in the paper combined the statistics-based hidden Markov model (HMM) and rule-based method based on the technology of natural language processing and information extraction. The system could transform Chinese sports game news in free text or html pages into structured data of useful knowledge automatically. In addition, we studied named entity recognition for knowledge candidates using rule-based method. Testing results have shown that the performance is good.
  • Keywords
    hidden Markov models; humanities; information management; Internet; hidden Markov model; natural language processing; rule-based method; sports game news-oriented information extraction system; Data mining; Databases; Finance; HTML; Helium; Information analysis; Information technology; Internet; Personnel; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-1611-0
  • Electronic_ISBN
    978-1-4244-1611-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2007.4368017
  • Filename
    4368017