• DocumentCode
    2098072
  • Title

    Dynamic minimum pause threshold estimation for speech analysis in studies of cognitive function in ageing

  • Author

    Rochford, I. ; Rapcan, V. ; D´arcy, Shona ; Reilly, Richard B.

  • Author_Institution
    Trinity Centre for Bioeng., Trinity Coll. Dublin, Dublin, Ireland
  • fYear
    2012
  • fDate
    Aug. 28 2012-Sept. 1 2012
  • Firstpage
    3700
  • Lastpage
    3703
  • Abstract
    Cognitive decline represents the biggest limiting factor to independence in older adults. Speech analysis has emerged as an alternative to standard cognitive assessment tools. Temporal segmentation of speech is reported in many studies and typically employs a static threshold to define a pause. This study investigated the effect of using pause and utterance duration distribution data in differentiating between cognitively healthy and impaired older adults. Three sets of features were extracted from 187 speech recordings: temporal features using a static 250ms threshold; temporal features using a dynamic threshold; and pause and utterance duration distribution parameters. The ability of each of these sets to differentiate between cognitively healthy and cognitively impaired participants was investigated using a Linear Discriminant Analysis (LDA) classifier. Improvements of 0.22% (to 64.20%) in sensitivity, 6.33% (73.12%) in specificity, and 3.27% (68.66%) in overall accuracy were observed in the performance of the classifier using the pause and utterance duration distribution parameters when compared to the static temporal features. The use of the dynamic threshold had a negative impact on the classifier performance, with a decrease of 5.73% (to 58.25%) in sensitivity, 1.10% (65.69%) in specificity, and 3.42% (61.97%) in accuracy.
  • Keywords
    cognition; feature extraction; geriatrics; pattern classification; speech processing; LDA; ageing; cognitive decline; cognitive function; dynamic minimum pause threshold estimation; feature extraction; impaired older adults; linear discriminant analysis classifier; speech analysis; speech recordings; static temporal feature; static threshold; temporal speech segmentation; utterance duration distribution data; Accuracy; Dementia; Estimation; Feature extraction; Sensitivity; Speech; Speech analysis; Aged; Aged, 80 and over; Aging; Cognition; Discriminant Analysis; Female; Humans; Male; Middle Aged; ROC Curve; Sensory Thresholds; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE
  • Conference_Location
    San Diego, CA
  • ISSN
    1557-170X
  • Print_ISBN
    978-1-4244-4119-8
  • Electronic_ISBN
    1557-170X
  • Type

    conf

  • DOI
    10.1109/EMBC.2012.6346770
  • Filename
    6346770