DocumentCode :
3781844
Title :
The Symptoms and Pathogenesis Entity Recognition of TCM Medical Records Based on CRF
Author :
Liu Honglan;Qin Xiaona;Fu Bin
Author_Institution :
Beijing Key Lab. of Knowledge Eng. for Mater. Sci., Beijing, China
fYear :
2015
Firstpage :
1479
Lastpage :
1484
Abstract :
TCM (Traditional Chinese Medicine) medical records are the great medical wealth of the Chinese nation. Since 1980s, China has begun to attach importance to the heritage of TCM. That how to effectively and maximize use these valuable resources is a problem for the TCM informationization. However, the dialectical information that includes the core idea of the famous doctors is still stored in the form of natural language. Obtaining structural information must rely on information extraction technology. With the development of science and technology information, the symptoms and pathogenesis entity recognition of TCM medical records is the key to build the TCM information extraction system. Conditional Random Fields (CRF) proposed by Lafferty et al in 2001, which combines the features of the Maximum Entropy Model and Hidden Markov Model. In recent years, it achieved good results in word segmentation, part of speech tagging and named entity recognition sequence labeling tasks [1]. This paper use the latest CRFidea, formulate appropriate feature template, using 500 marked TCM medical records from"11th five-year plan" medical records database to train the CRF model that suit for the symptoms and pathogenesis entity recognition. For verifying the model accuracy, we use this CRF model to label the symptoms and pathogenesis entity. After ten-fold cross validation, its symptoms entity F1 measure reached 81.53%, the pathogenesis entity F1 measure was 83.98%. Experimental results show that its performance is very high, it is suitable for the identification of TCM medical records information extraction.
Keywords :
"Hidden Markov models","Medical diagnostic imaging","Information retrieval","Training","Speech","Parameter estimation","Entropy"
Publisher :
ieee
Conference_Titel :
Ubiquitous Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UIC-ATC-ScalCom), 2015 IEEE 12th Intl Conf on
Type :
conf
DOI :
10.1109/UIC-ATC-ScalCom-CBDCom-IoP.2015.267
Filename :
7518446
Link To Document :
بازگشت