DocumentCode :
1305270
Title :
A Mining Technique Using N n-Grams and Motion Transcripts for Body Sensor Network Data Repository
Author :
Loseu, Vitali ; Ghasemzadeh, Hassan ; Jafari, Roozbeh
Author_Institution :
Comput. Eng. Dept., Univ. of Texas at Dallas, Richardson, TX, USA
Volume :
100
Issue :
1
fYear :
2012
Firstpage :
107
Lastpage :
121
Abstract :
Recent years have witnessed a large influx of applications in the field of cyber-physical systems. An important class of these systems is body sensor networks (BSNs) where lightweight embedded processors and communication systems are tightly coupled with the human body. BSNs can provide researchers, care providers and clinicians access to tremendously valuable information extracted from data that are collected in users´ natural environment. With this information, one can monitor the progression of a disease, identify its early onset, or simply assess user´s wellness. One major obstacle is managing repositories that store the large amount of sensing data. To address this issue, we propose a data mining approach inspired by the experience in the areas of text and natural language processing. We represent sensor readings with a sequence of characters, called motion transcripts. Transcripts reduce complexity of the data significantly while maintaining morphological and structural properties of the physiological signals. To further take advantage of the physiological signal´s structure, our data mining technique focuses on the characteristic transitions in the signals. These transitions are efficiently captured using the concept of n-grams. To facilitate a lightweight and fast mining approach, we reduce the overwhelmingly large number of n-grams via information gain (IG) feature selection. We report the effectiveness of the proposed approach in terms of the speed of mining while maintaining an acceptable accuracy in terms of the F-score combining both precision and recall.
Keywords :
body sensor networks; data mining; diseases; information retrieval; medical signal processing; natural language processing; patient monitoring; physiological models; text analysis; body sensor network; cyber physical systems; data mining; data repository; disease; feature selection; information extraction; information gain; motion transcripts; n-grams; natural language processing; patient monitoring; physiological signals; text analysis; Biomedical monitoring; Body sensor networks; Cyberspace; Data mining; Data models; Network topology; $n$-grams; Body sensor networks (BSNs); Patricia tree; data mining; string templates;
fLanguage :
English
Journal_Title :
Proceedings of the IEEE
Publisher :
ieee
ISSN :
0018-9219
Type :
jour
DOI :
10.1109/JPROC.2011.2161238
Filename :
5995280
Link To Document :
بازگشت