Title :
Acronym extraction using SVM with Uneven Margins
Author :
Weijian Ni ; Jun Xu ; Yalou Huang ; Tong Liu ; Jianye Ge
Abstract :
Extracting acronyms and their expansions from plain text is an important problem in text mining. Previous research shows that the problem can be solved via machine learning approaches. That is, converting the problem of acronym extraction to binary classification. We investigate the classification problem and find that the classes are highly unbalanced (the positive instances are very rare compared to negative ones). So we try to tackle the problem using an uneven margin classifier - SVM with Uneven Margins. Experimental results showed that our approach can get better results than baseline methods of using heuristic rules and conventional SVM models. Experimental results also showed how uneven margins classifier made the tradeoff between the precision and recall of extraction.
Keywords :
data mining; pattern classification; support vector machines; text analysis; SVM; acronym extraction; binary classification; heuristic rules; machine learning; text mining; uneven margin classifier; Classification algorithms; Context; Mathematical model; Optimization; Read only memory; Support vector machines; Training;
Conference_Titel :
Web Society (SWS), 2010 IEEE 2nd Symposium on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-6356-5
DOI :
10.1109/SWS.2010.5607463