DocumentCode
1887704
Title
Automatic assessment of voice signals according to the GRBAS scale using modulation spectra, Mel frequency Cepstral Coefficients and Noise parameters
Author
Villa-Canas, T. ; Orozco-Arroyave, J.R. ; Arias-Londono, J.D. ; Vargas-Bonilla, J.F. ; Godino-Llorente, J.I.
Author_Institution
Dept. of Electron. & Telecommun. Eng., Univ. de Antioquia, Antioquia, Colombia
fYear
2013
fDate
11-13 Sept. 2013
Firstpage
1
Lastpage
5
Abstract
This paper presents a system for the automatic assessment of voice quality, according to the GRBAS scale, which considers different speech measures. The set of features includes the centroids and the energy content of different frequency bands in the modulation spectra of the recordings, Mel-frequency Cepstral Coefficients, Harmonics to Noise Ratio, Normalizes Noise Energy and Glottal to Noise Excitation Ratio. Additionally, with the aim of eliminate possible redundance in the information provided by the features, two different feature extraction techniques are applied, Principal Component Analysis and Linear Discriminant Analysis. The multiclass classification is done by means of K Nearest Neighbors classifier. The performance of the system is measured in terms of efficiency and statistical agreement index Kappa. The results show that this approach provides acceptable results for this purpose, with the best efficiency around 89.3% for Asthenia (A).
Keywords
cepstral analysis; modulation; principal component analysis; speech synthesis; GRBAS scale; K nearest neighbors classifier; Mel frequency cepstral coefficients; asthenia; centroids; feature extraction techniques; frequency bands; linear discriminant analysis; modulation spectra; multiclass classification; noise energy; noise excitation ratio; noise parameters; noise ratio; principal component analysis; statistical agreement index Kappa; voice quality automatic assessment; voice signal automatic assessment; Frequency modulation; Mel frequency cepstral coefficient; Noise; Pathology; Principal component analysis; Speech; MFCC; Modulation spectra; Noise measures; Scale of voice quality; Voice Quality Assessment;
fLanguage
English
Publisher
ieee
Conference_Titel
Image, Signal Processing, and Artificial Vision (STSIVA), 2013 XVIII Symposium of
Conference_Location
Bogota
Print_ISBN
978-1-4799-1120-2
Type
conf
DOI
10.1109/STSIVA.2013.6644930
Filename
6644930
Link To Document