• DocumentCode
    2352398
  • Title

    K-Similar Conditional Random Fields for Semi-supervised Sequence Labeling

  • Author

    Chen, Xi ; Chen, Shihong ; Xiao, Kun

  • Author_Institution
    Comput. Sch., Wuhan Univ., Wuhan
  • fYear
    2008
  • fDate
    23-25 July 2008
  • Firstpage
    21
  • Lastpage
    26
  • Abstract
    Sequence labeling tasks, such as named entity recognition and part of speech tagging, are the fundamental compositions of the information extraction system, and thus received attentions these years. This paper proposes k-similar conditional random fields for semi-supervised sequence labeling, and makes use of unlabeled data to calculate the similarity between words with distributional clustering. The named entity recognition experiments show that this method can improve the performance through unlabeled data.
  • Keywords
    knowledge acquisition; random processes; K-similar conditional random field; distributional clustering; information extraction system; named entity recognition; semisupervised sequence labeling; Data mining; Entropy; Hidden Markov models; Inference algorithms; Information technology; Labeling; Natural language processing; Semisupervised learning; Speech recognition; Tagging;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on
  • Conference_Location
    Dalian Liaoning
  • Print_ISBN
    978-0-7695-3273-8
  • Type

    conf

  • DOI
    10.1109/ALPIT.2008.16
  • Filename
    4584335