DocumentCode :
3437424
Title :
MedCat: A Framework for High Level Conceptualization of Medical Notes
Author :
Fodeh, Samah Jamal ; Zirkle, Maryan ; Finch, Dezon ; Brandt, Christian ; Erdos, Joseph ; Reeves, R.
Author_Institution :
Sch. of Med., Yale Univ., New Haven, CT, USA
fYear :
2013
fDate :
7-10 Dec. 2013
Firstpage :
274
Lastpage :
280
Abstract :
In this paper we introduce a new framework called MedCat to delineate and demonstrate an approach for projecting representations of concept-derived content in clinical notes into a new categorization space to reduce dimensionality and noise in the data. Constructing MedCat framework required several steps including manual annotation, knowledge base expansion using MetaMap, concept category construction, automated annotation using NLP to generate a bag of concepts, and finally concept conversion to higher level abstracted categories. The framework was applied to Post Traumatic Stress Disorder (PTSD) clinical notes for evaluation. A random sample of PTSD clinical note content was automatically recategorized into six PTSD treatment categories using MedCat. Using existing annotations from PTSD notes that were categorized by content experts into treatment categories as the reference standard, the sensitivity of the framework in detecting the treatment categories was greater than 90%. The results suggest that representations of concept-derived content when categorized by relevance features can be used to reliably understand and summarize clinical notes.
Keywords :
data reduction; knowledge based systems; medical computing; medical disorders; natural language processing; MedCat framework; MetaMap; NLP; PTSD clinical note content; PTSD treatment categories; automated annotation; clinical notes; concept category construction; concept-derived content representation; dimensionality reduction; knowledge base expansion; manual annotation; natural language processing; noise reduction; post traumatic stress disorder clinical notes; random sample; Data mining; Feature extraction; Manuals; Natural language processing; Noise; Sensitivity; Unified modeling language; MedCat; Natural Language Processing; PTSD; categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4799-3143-9
Type :
conf
DOI :
10.1109/ICDMW.2013.89
Filename :
6753931
Link To Document :
بازگشت