Title :
Combining local and non-local information with dual decomposition for named entity recognition from text
Author :
Chieu, Hai Leong ; Teow, Loo-Nin
Author_Institution :
DSO Nat. Labs., Singapore, Singapore
Abstract :
Named entity recognition (NER) is the task of segmenting and classifying occurrences of names in text. In NER, local contextual cues provide important evidence, but non-local information from the whole document can also be useful: for example, knowing that “Mary Kay Inc.” has been mentioned in a document helps classify subsequent mentions of “Mary Kay” as an organization rather than a person. Previous work on NER typically models the task as a sequence labeling problem, coupling the predictions of neighboring words with a Markov model such as a conditional random field. We propose applying dual decomposition to combine a local sentential model and a non-local label consistency model for NER. Dual decomposition is a fusion approach that combines two models by constraining them to agree on their predictions on the test data. Empirically, we show that this approach outperforms the local sentential models on four out of five data sets.
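The agreement constraint described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the local model is reduced to independent per-token label scores (the paper uses a sentential sequence model), and the consistency model is reduced to a hard constraint that all mentions in a group share one label. The function name `dual_decompose`, the parameters `groups`, `max_iter` and `step0`, and the toy scores are all hypothetical.

```python
def dual_decompose(local_scores, groups, max_iter=50, step0=1.0):
    """Subgradient dual decomposition over two toy models (illustrative only).

    local_scores[i][l]: assumed score of label l at position i (local model).
    groups: lists of positions that the consistency model forces to share a label.
    Returns (label indices, converged flag).
    """
    n = len(local_scores)
    num_labels = len(local_scores[0])
    # Positions outside every group are treated as singleton groups.
    covered = {i for g in groups for i in g}
    all_groups = [list(g) for g in groups]
    all_groups += [[i] for i in range(n) if i not in covered]
    # One Lagrange multiplier per (position, label) pair.
    u = [[0.0] * num_labels for _ in range(n)]
    y = [0] * n
    for t in range(max_iter):
        # Local model: best label per position under scores + u.
        y = [max(range(num_labels),
                 key=lambda l, i=i: local_scores[i][l] + u[i][l])
             for i in range(n)]
        # Consistency model: one label per group, scored by -u.
        z = [0] * n
        for g in all_groups:
            best = max(range(num_labels),
                       key=lambda l, g=g: sum(-u[i][l] for i in g))
            for i in g:
                z[i] = best
        if y == z:
            return y, True  # the two models agree
        # Subgradient step with a decaying step size.
        step = step0 / (t + 1)
        for i in range(n):
            u[i][y[i]] -= step
            u[i][z[i]] += step
    return y, False  # no agreement: fall back to the local model's labels


# Toy example: position 0 is "Mary Kay Inc." (strongly ORG locally),
# position 1 is a later "Mary Kay" (mildly PER locally); both form one group.
LABELS = ["PER", "ORG"]
local_scores = [[0.0, 3.0], [1.0, 0.0]]
groups = [[0, 1]]
pred, converged = dual_decompose(local_scores, groups)
print([LABELS[k] for k in pred], converged)
```

In this toy setting the multipliers gradually penalize the local model's PER preference for the second mention until both models settle on the same labeling. A real system would replace the per-token argmax with decoding over a sequence model.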
Keywords :
data integrity; pattern classification; text analysis; NER; documents; dual-decomposition approach; fusion approach; local contextual cues; local information; local sentential model; name occurrence classification; name occurrence segmentation; named entity recognition; nonlocal information; nonlocal label consistency model; Data models; Hidden Markov models; Organizations; Predictive models; Shape; Training; Training data
Conference_Title :
Information Fusion (FUSION), 2012 15th International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4673-0417-7
Electronic_ISBN :
978-0-9824438-4-2