• DocumentCode
    655110
  • Title

    Automatic Extraction of Personal Experiences from Patients´ Blogs: A Case Study in Chronic Obstructive Pulmonary Disease

  • Author

    Greenwood, Mark ; Elwyn, Glyn ; Francis, Nick ; Preece, Alun ; Spasic, Irena

  • Author_Institution
    Sch. of Comput. Sci. & Inf., Cardiff Univ., Cardiff, UK
  • fYear
    2013
  • fDate
    Sept. 30 2013-Oct. 2 2013
  • Firstpage
    377
  • Lastpage
    382
  • Abstract
    People with long-term illness such as chronic obstructive pulmonary disease (COPD) often use social media to document and share information, opinions and their experiences with others. Analysing the self-reported experiences of patients shared online has the potential to help medical researchers gain insight into some of the key issues affecting patients. However, the scale of health conversation taking place online poses considerable challenges to traditional content analysis. In this paper, we present a system which automates extraction of patient statements which refer to a personal experience. We applied a crowd sourcing methodology to create a set of 1770 annotated sentences from blog posts written by COPD patients. Our machine learning approach trained on lexical features successfully extracted sentences about patient experience with 93% precision and 80% recall (F-measure: 86%). Automatic annotation of sentences about patient experience can facilitate subsequent content analysis by highlighting the most relevant sentences to this particular problem.
  • Keywords
    Web sites; diseases; learning (artificial intelligence); medical administrative data processing; medical computing; COPD; automatic extraction; chronic obstructive pulmonary disease; content analysis; crowd sourcing methodology; health conversation; machine learning approach; patient statements; patients blogs; personal experiences; social media; Blogs; Diseases; Feature extraction; Internet; Media; Training; blog mining; blogs; health informatics; machine learning; natural language processing; social media; text processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud and Green Computing (CGC), 2013 Third International Conference on
  • Conference_Location
    Karlsruhe
  • Type

    conf

  • DOI
    10.1109/CGC.2013.66
  • Filename
    6686058