Title :
Performance Improvement of Drug Effects Extraction System from Japanese Blogs
Author :
Kitajima, S. ; Rzepka, Rafal ; Araki, Kotaro
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo, Japan
Abstract :
Information disclosed to the public by patients is very important for people who are suffering from same illness because such information can be a source of knowledge and encouragement. Our aim is to make a system that extracts, organizes and visually represents information from patients´ blogs. As the first step, the purpose of this paper is to extract descriptions of the effects caused by taking drugs as a triplet of expressions - drug name, object of change, and its effect - from illness survival blogs. However, conventional extraction methods are not suitable since these blogs are written in free natural language. Therefore, this paper proposes a method to extract the triplets using specific clue words and parsing the results. An evaluation experiment confirmed that medication usage information can be extracted with high accuracy using our proposed method, in comparison to existing methods. Moreover, recall was improved by combining our proposed method and a baseline system.
Keywords :
Web sites; data mining; drugs; electronic health records; grammars; natural language processing; text analysis; Japanese blogs; drug effects extraction system; drug name; free natural language; illness survival blogs; medication usage information; parsing; patients blogs; Accuracy; Blogs; Data mining; Dictionaries; Diseases; Drugs; Information extraction; Medication usage information; Opinion mining; Text mining;
Conference_Titel :
Semantic Computing (ICSC), 2013 IEEE Seventh International Conference on
Conference_Location :
Irvine, CA
DOI :
10.1109/ICSC.2013.71