• DocumentCode
    1831587
  • Title

    Towards a minimum description length based stopping criterion for semi-supervised time series classification

  • Author

    Begum, Nurjahan ; Bing Hu ; Rakthanmanon, Thanawin ; Keogh, Eamonn

  • fYear
    2013
  • fDate
    14-16 Aug. 2013
  • Firstpage
    333
  • Lastpage
    340
  • Abstract
    In the last decade the plunging costs of sensors/storage have made it possible to obtain vast amounts of medical telemetry. However for this data to be useful, it must be annotated. This annotation, requiring the attention of medical experts is very expensive and time consuming, and remains the critical bottleneck in medical analysis. Semi-supervised learning is an obvious way to mitigate the need for human labor, however, most such algorithms are designed for intrinsically discrete objects, and do not work well in this domain, which requires the ability to deal with real-valued objects arriving in a streaming fashion. In this work we make two contributions. First, we demonstrate that in many cases just a handful of human annotated examples are sufficient to perform accurate classification. Second, we devise a novel parameter-free stopping criterion for semi-supervised learning. We evaluate our work with a comprehensive set of experiments on diverse medical data sources including electrocardiograms. Our experimental results show that our approach can construct accurate classifiers even if given only a single annotated instance.
  • Keywords
    biomedical telemetry; learning (artificial intelligence); medical information systems; pattern classification; time series; electrocardiograms; human annotated examples; human labor; intrinsically discrete objects; medical analysis; medical data sources; medical experts; medical telemetry; minimum description length based stopping criterion; parameter-free stopping criterion; semisupervised learning; semisupervised time series classification; single annotated instance; streaming fashion; Classification algorithms; Encoding; Euclidean distance; Semisupervised learning; Supervised learning; Time series analysis; Training; MDL; Semi-Supervised Learning; Time Series;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration (IRI), 2013 IEEE 14th International Conference on
  • Conference_Location
    San Francisco, CA
  • Type

    conf

  • DOI
    10.1109/IRI.2013.6642490
  • Filename
    6642490