DocumentCode :
3445597
Title :
Chinese Base Noun Phrase Based on Multi-Class Support Vector Machines and Rules of Post-Processing
Author :
Gan Runsheng ; Shi Shuicai ; Wang Meihua ; Wang Tao
Author_Institution :
Chinese Inf. Process. Res. Cerner, BISTU, Beijing, China
fYear :
2010
fDate :
27-28 Nov. 2010
Firstpage :
1
Lastpage :
4
Abstract :
In the paper, Chinese base noun phrase chunking is considered as a classification problem, and the paper proposes an approach, combines SVM-based method and rules of post-processing method, to distinguish Chinese base noun phrase. But the paper introduce threshold in multi-class SVM algorithm and study the usefulness of threshold for Chinese base noun phrase chunking, complete analyses of the result in this paper, then according to the special structures of Chinese base noun phrase, customize some appropriate rules to process the result. From overall experiments, the method achieves a higher accuracy in the final results.
Keywords :
natural language processing; pattern classification; support vector machines; Chinese base noun phrase chunking; classification problem; multiclass support vector machines; post processing rules; Algorithm design and analysis; Classification algorithms; Hidden Markov models; Support vector machines; Tagging; Testing; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database Technology and Applications (DBTA), 2010 2nd International Workshop on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-6975-8
Electronic_ISBN :
978-1-4244-6977-2
Type :
conf
DOI :
10.1109/DBTA.2010.5658598
Filename :
5658598
Link To Document :
بازگشت