Title of article :
A bio-inspired application of natural language processing: A case study in extracting multiword expression
Author/Authors :
Duan, Jianyong and Li, Ru and Hu, Yi
Issue Information :
Journal with serial issue number, year 2009
Abstract :
Multiple sequence alignment (MSA), a technique motivated by gene recognition, is proposed for multiword expression (MWE) extraction, because textual sequences resemble gene sequences in pattern analysis. The MSA technique is combined with error-driven rules and is more efficient than traditional methods. It helps guarantee MWE recall. It uses dynamic programming to prevent a combinatorial explosion of candidates, and it provides a global solution for pattern extraction rather than producing redundant sub-patterns. Consequently, it measures flexible patterns accurately. In the experiments, several advanced statistical measures are used to rank candidates. In the comparison experiment, the MSA approach achieved better results.
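The dynamic-programming alignment mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' MSA method: it is a plain pairwise global alignment (Needleman-Wunsch style) over word tokens, and the function names `align` and `shared_pattern`, the scoring values, and the `*` wildcard for flexible slots are all assumptions made for illustration.

```python
from typing import List, Optional, Tuple

Pair = Tuple[Optional[str], Optional[str]]


def align(a: List[str], b: List[str],
          match: int = 2, mismatch: int = -1, gap: int = -1) -> List[Pair]:
    """Globally align two token sequences; None marks a gap.

    Scoring values are illustrative assumptions, not taken from the paper.
    """
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right cell to recover one optimal alignment.
    pairs: List[Pair] = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                pairs.append((a[i - 1], b[j - 1]))
                i, j = i - 1, j - 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            pairs.append((a[i - 1], None))
            i -= 1
        else:
            pairs.append((None, b[j - 1]))
            j -= 1
    pairs.reverse()
    return pairs


def shared_pattern(pairs: List[Pair]) -> List[str]:
    """Keep tokens both sequences agree on; collapse the rest into one '*'."""
    pattern: List[str] = []
    for x, y in pairs:
        tok = x if x == y else "*"
        if tok == "*" and pattern and pattern[-1] == "*":
            continue  # merge adjacent flexible slots
        pattern.append(tok)
    return pattern


if __name__ == "__main__":
    s1 = "take the matter into account".split()
    s2 = "take it into account".split()
    print(shared_pattern(align(s1, s2)))
```

Because the table has only (n+1) x (m+1) cells, the alignment is found without enumerating every candidate sub-pattern, which is the property the abstract attributes to dynamic programming. Extending the sketch from two sequences to many would require a genuine multiple-alignment strategy, which is not shown here.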
Keywords :
Error-driven rule, Text mining, Multiple sequence alignment, Multiword expression
Journal title :
Expert Systems with Applications