K-Similar Conditional Random Fields for Semi-supervised Sequence Labeling

Author

Chen, Xi ; Chen, Shihong ; Xiao, Kun

Author_Institution

Comput. Sch., Wuhan Univ., Wuhan

fYear

2008

fDate

23-25 July 2008

Firstpage

Lastpage

Abstract

Sequence labeling tasks, such as named entity recognition and part of speech tagging, are the fundamental compositions of the information extraction system, and thus received attentions these years. This paper proposes k-similar conditional random fields for semi-supervised sequence labeling, and makes use of unlabeled data to calculate the similarity between words with distributional clustering. The named entity recognition experiments show that this method can improve the performance through unlabeled data.

Keywords

knowledge acquisition; random processes; K-similar conditional random field; distributional clustering; information extraction system; named entity recognition; semisupervised sequence labeling; Data mining; Entropy; Hidden Markov models; Inference algorithms; Information technology; Labeling; Natural language processing; Semisupervised learning; Speech recognition; Tagging;

fLanguage

English

Publisher

ieee

Conference_Titel

Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on

Conference_Location

Dalian Liaoning

Print_ISBN

978-0-7695-3273-8

Type

conf

DOI

10.1109/ALPIT.2008.16

Filename

4584335

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=2352398