• DocumentCode
    2724421
  • Title

    Influence of a priori Knowledge on Medical Document Categorization

  • Author

    Itert, Lukasz ; Duch, Wlodzislaw ; Pestian, John

  • Author_Institution
    Div. of Rheumatology, Children´´s Hosp. Res. Found., Cincinnati, OH
  • fYear
    2007
  • fDate
    March 1 2007-April 5 2007
  • Firstpage
    163
  • Lastpage
    170
  • Abstract
    A significant part of medical data remains stored as unstructured texts. Semantic search requires introduction of markup tags. Medical concepts discovered in hospital discharge summaries are used to create several feature spaces. Experts use their background knowledge to categorize new documents, and knowing category of the document disambiguate words and acronyms. A model of document similarity to reference sources that captures some intuitions of an expert is introduced. Parameters of the model are evaluated using linear programming techniques. This approach is applied to categorization of the medical discharge summaries providing simpler and more accurate model than alternative text categorization approaches
  • Keywords
    document handling; linear programming; medical administrative data processing; a priori knowledge; document similarity; feature spaces; hospital discharge summaries; linear programming; markup tags; medical data; medical document categorization; semantic search; Biomedical informatics; Computational intelligence; Data mining; Fault location; Hospitals; Linear programming; Medical diagnostic imaging; Natural language processing; Pediatrics; Unified modeling language;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0705-2
  • Type

    conf

  • DOI
    10.1109/CIDM.2007.368868
  • Filename
    4221292