• DocumentCode
    2059430
  • Title

    Dependency Parsing for Extracting Family History

  • Author

    Lewis, Neal ; Gruhl, Daniel ; Yang, Hui

  • Author_Institution
    Almaden Res. Center, IBM, San Jose, CA, USA
  • fYear
    2011
  • fDate
    26-29 July 2011
  • Firstpage
    237
  • Lastpage
    242
  • Abstract
    Family history is an important part of the clinical record. Automatically extracting family history from clinical reports requires an understanding of how clinicians describe family history as well as an understanding of how they discuss diseases. We perform a characteristic analysis of family history sentences and compare them with sentences which contain a family member term but no family history. We observe the frequency of family history in clinical reports, the reliability of using family member mentions to indicate family history, as well as the frequency that family history is labeled. We develop a top down processing syntactic approach to extracting family history, perform an analysis on the distribution of syntactic context in family history, and compare the use of a phrase structure parser versus a dependency parser for extraction.
  • Keywords
    computational linguistics; data handling; grammars; information retrieval; medical administrative data processing; medical computing; clinical record; clinical reports; dependency parser; dependency parsing; family history extraction; family history frequency; family history sentences; family member mentions; phrase structure parser; top-down processing syntactic approach; Cancer; Context; Diabetes; Diseases; History; Syntactics; Training; Dependency Parsing; Family History; Text Mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Healthcare Informatics, Imaging and Systems Biology (HISB), 2011 First IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • Print_ISBN
    978-1-4577-0325-6
  • Electronic_ISBN
    978-0-7695-4407-6
  • Type

    conf

  • DOI
    10.1109/HISB.2011.23
  • Filename
    6061405