مرکز منطقه ای اطلاع رساني علوم و فناوري - Modeling pathological speech perception from data with similarity labels

DocumentCode :

177741

Title :

Modeling pathological speech perception from data with similarity labels

Author :

Berisha, Visar ; Liss, Julie ; Sandoval, Steven ; Utianski, Rene ; Spanias, A.

Author_Institution :

Dept. of Speech & Hearing Sci., Arizona State Univ., Tempe, AZ, USA

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

915

Lastpage :

919

Abstract :

The current state of the art in judging pathological speech intelligibility is subjective assessment performed by trained speech pathologists (SLP). These tests, however, are inconsistent, costly and, oftentimes suffer from poor intra- and inter-judge reliability. As such, consistent, reliable, and perceptually-relevant objective evaluations of pathological speech are critical. Here, we propose a data-driven approach to this problem. We propose new cost functions for examining data from a series of experiments, whereby we ask certified SLPs to rate pathological speech along the perceptual dimensions that contribute to decreased intelligibility. We consider qualitative feedback from SLPs in the form of comparisons similar to statements “Is Speaker A´s rhythm more similar to Speaker B or Speaker C?” Data of this form is common in behavioral research, but is different from the traditional data structures expected in supervised (data matrix + class labels) or unsupervised (data matrix) machine learning. The proposed method identifies relevant acoustic features that correlate with the ordinal data collected during the experiment. Using these features, we show that we are able to develop objective measures of the speech signal degradation that correlate well with SLP responses.

Keywords :

acoustic signal processing; feature extraction; speech intelligibility; unsupervised learning; SLP; acoustic features; cost functions; data-driven approach; interjudge reliability; intrajudge reliability; pathological speech intelligibility; pathological speech perception modeling; perceptual dimensions; similarity label data; speech signal degradation; trained speech pathologists; unsupervised machine learning; Acoustics; Cost function; Feature extraction; Pathology; Prediction algorithms; Speech; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6853730

Filename :

6853730

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=177741