• DocumentCode
    3301240
  • Title

    A Neural Networks-Based graph algorithm for cross-document coreference resolution

  • Author

    He, Saike ; Dong, Yuan ; Wang, Haila

  • Author_Institution
    Sch. of Inf. Eng., Beijing Univ. of Posts & Telecommun., Beijing
  • fYear
    2008
  • fDate
    19-22 Oct. 2008
  • Firstpage
    1
  • Lastpage
    9
  • Abstract
    Cross-document coreference resolution, which is an important subtask in natural language processing systems, focus on the problem of determining if two mentions from different documents refer to the same entity in the world. In this paper we present a two-step approach, employing a classification and clusterization phase. In a novel way, the clusterization is produced as a graph cutting algorithm, namely, neural networks-based BestCut (NBCut). To our knowledge, our system is the first that employs a statistical model in graph partitioning. We evaluate our approach on ACE 2008 cross-document coreference resolution data sets and obtain encouraging result, indicating that on named noun phrase coreference task, the approach holds promise and achieves competitive performance.
  • Keywords
    document handling; graph theory; natural language processing; neural nets; pattern classification; pattern clustering; statistical analysis; ACE 2008 cross-document coreference resolution data sets; classification phase; clusterization phase; graph cutting algorithm; graph partitioning; named noun phrase coreference task; natural language processing systems; neural networks-based BestCut; statistical model; Clustering algorithms; Entropy; Helium; Natural language processing; Neural networks; Noise measurement; Partitioning algorithms; Performance analysis; Research and development; Telecommunications; Maximum-Entropy; Min-Cut BestCut; Neural-Networks NBCut;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-4515-8
  • Electronic_ISBN
    978-1-4244-2780-2
  • Type

    conf

  • DOI
    10.1109/NLPKE.2008.4906800
  • Filename
    4906800