Title :
Phonetic and anthropometric conditioning of MSA-KST cognitive impairment characterization system
Author :
Ivanov, A.V. ; Jalalvand, Shahab ; Gretter, Roberto ; Falavigna, Daniele
Author_Institution :
Fondazione Bruno Kessler, Povo, Italy
Abstract :
We explore the impact of speech- and speaker-specific modeling onto the Modulation Spectrum Analysis - Kolmogorov-Smirnov feature Testing (MSA-KST) characterization method in the task of automated prediction of the cognitive impairment diagnosis, namely dysphasia and pervasive development disorder. Phoneme-synchronous capturing of speech dynamics is a reasonable choice for a segmental speech characterization system as it allows comparing speech dynamics in the similar phonetic contexts. Speaker-specific modeling aims at reducing the “within-the-class” variability of the characterized speech or speaker population by removing the effect of speaker properties that should have no relation to the characterization. Specifically the vocal tract length of a speaker has nothing to do with the diagnosis attribution and, thus, the feature set shall be normalized accordingly. The resulting system compares favorably to the baseline system of the Interspeech´2013 Computational Paralinguistics Challenge.
Keywords :
anthropometry; cognition; feature selection; handicapped aids; medical disorders; modulation spectra; speech processing; speech recognition; MSA-KST characterization method; MSA-KST cognitive impairment characterization system; anthropometric conditioning; automated cognitive impairment diagnosis prediction; dysphasia; feature set; modulation spectrum analysis Kolmogorov-Smirnov feature testing characterization method; pervasive development disorder; phoneme-synchronous speech dynamics capturing; phonetic conditioning; segmental speech characterization system; speaker-specific modeling; speech-specific modeling; within-the-class variability; Autism; Correlation; Feature extraction; Modulation; Speech; Speech processing; Training; feature selection; modulation spectrum; speech characterization;
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
Conference_Location :
Olomouc
DOI :
10.1109/ASRU.2013.6707734