• DocumentCode
    2691546
  • Title

    A semi-supervised learning method for Names of Traditional Chinese Prescriptions and Drugs recognition

  • Author

    Cai, Dongfeng ; Ding, Changlin ; Zuo, Junjun ; Bai, Yu

  • Author_Institution
    Res. Center for Knowledge Eng., Shenyang Aerosp. Univ., Shenyang, China
  • fYear
    2012
  • fDate
    4-7 Oct. 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Knowledge discovery of Ancient Medical Literatures (AMLs) is a research focus due to wide applications of computer technology in Traditional Chinese Medicine (TCM). The foundation of the knowledge discovery research is to get semantic labels within the AMLs and to restructure the text. Due to the diversity of AMLs, low coverage rate of current semantic lexicons and the ambiguities of the lexicon words, low recall rate and low accuracy are resulted by using only lexicons to recognize TCM terms. This paper presents a semi-supervised learning and Bootstrapping based approach, Barpidusk, which aims at using semantic lexicons and simple features to recognize Names of Traditional Chinese Prescriptions and Drugs (NTCPDs) in un-annotated AMLs. And human-computer interaction is added to the Bootstrapping, which increases recognition accuracy without much loss of efficiency. Experiments show that the F values of recognizing Names of Traditional Chinese Prescriptions (NTCPs) and Names of Traditional Chinese Drugs (NTCDs) reaches 44.9% and 51.3% respectively without interaction with humans. By gradually adding human-computer interactions 10 times during recognition process, these values are increased to 74.9% and 90.6% respectively.
  • Keywords
    data mining; drugs; human computer interaction; learning (artificial intelligence); medical computing; medical information systems; text analysis; AML diversity; NTCPD; TCM recognition; ancient medical literatures; barpidusk approach; bootstrapping-based approach; computer technology; coverage rate; human-computer interaction; knowledge discovery; semantic lexicons; semisupervised learning method; text restructure; traditional Chinese medicine; traditional Chinese prescription-and-drug recognition; Accuracy; Context; Drugs; Human computer interaction; Manuals; Semantics; Syntactics; Bootstrapping; ancient medical literature; human-computer interaction; recognition for names of traditional Chinese prescriptions and drugs;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    978-1-4673-2559-2
  • Electronic_ISBN
    978-1-4673-2558-5
  • Type

    conf

  • DOI
    10.1109/BIBM.2012.6392707
  • Filename
    6392707