DocumentCode :
1846998
Title :
Bio Named Entity Recognition Based on Co-training Algorithm
Author :
Munkhdalai, Tsendsuren ; Li, Meijing ; Kim, Taewook ; Namsrai, Oyun-Erdene ; Jeong, Seon-phil ; Shin, Jungpil ; Ryu, Keun Ho
Author_Institution :
Database/Bioinf. Lab., Chungbuk Nat. Univ., Cheongju, South Korea
fYear :
2012
fDate :
26-29 March 2012
Firstpage :
857
Lastpage :
862
Abstract :
One essential task in extracting information from biomedical literature is the bio Named Entity Recognition (NER) process, which basically defines the boundaries between typical words and biomedical terminology in particular text data, and assigns them based on domain knowledge. This paper presents a semi supervised integration of completely different classifiers to cover knowledge from unlabeled data to recognize bio named entities in text. We modified the original co-training, a semi supervised learning algorithm, with a scalable feature processing schema, which extracts the bio NER feature from a number of unlabeled data and converts different types of feature sets. Our base result shows that the classifiers of co-training achieve significant learning from unlabeled data.
Keywords :
bioinformatics; data mining; feature extraction; learning (artificial intelligence); pattern classification; text analysis; bio NER feature extraction; bio named entity recognition; bio-text mining; biomedical literature; biomedical terminology; cotraining algorithm; domain knowledge; feature processing; information extraction; semisupervised classifier integration; semisupervised learning algorithm; text data; unlabeled data; Abstracts; Classification algorithms; Context; Data mining; Dictionaries; Feature extraction; Training; Bio named entity recognition; co-training; feature processing; semisupervised learning; text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Information Networking and Applications Workshops (WAINA), 2012 26th International Conference on
Conference_Location :
Fukuoka
Print_ISBN :
978-1-4673-0867-0
Type :
conf
DOI :
10.1109/WAINA.2012.75
Filename :
6185353
Link To Document :
بازگشت