Title :
Experimental Evaluation of the Value of Structure: How to Efficiently Exploit Interdependencies in Sequence Labeling
Author :
Wisniewski, Guillaume ; Gallinari, Patrick
Author_Institution :
LIP6, UPMC, Paris
Abstract :
Many problems in natural language processing, information extraction or bioinformatics consist in predicting a label for each element of a sequence of observations. The sequence of labels generally presents multiple dependencies that restrict the possible labels the elements can take. Therefore, relations between labels intuitively provide information valuable for the prediction. Several approaches have been proposed to take advantage of this additional information. However, experimental results show that taking relations into account does not always improve prediction performances, while it significantly increases the computational cost of both learning and prediction. In this work, we aim at both explaining these surprising results and proposing a simple but computationally efficient approach for labeling sequences.
Keywords :
natural language processing; information extraction; multiple dependencies; natural language processing; sequence labeling; Bioinformatics; Buildings; Computational efficiency; Data mining; Dynamic programming; Inference algorithms; Labeling; Learning systems; Machine learning algorithms; Natural language processing;
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-0-7695-3502-9
DOI :
10.1109/ICDM.2008.96