DocumentCode :
312351
Title :
The Nemours database of dysarthric speech
Author :
Menéndez-Pidal, Xavier ; Polikoff, James B. ; Peters, Shirley M. ; Leonzio, Jennie E. ; Bunnel, H.T.
Author_Institution :
Appl. Sci. & Eng. Lab., A.I. duPont Inst., Wilmington, DE, USA
Volume :
3
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
1962
Abstract :
The Nemours database is a collection of 814 short nonsense sentences; 74 sentences spoken by each of 11 male speakers with varying degrees of dysarthria. Additionally, the database contains two connected-speech paragraphs produced by each of the 11 speakers. The database was designed to test the intelligibility of dysarthric speech before and after enhancement by various signal processing methods, and is available on CD-ROM. It can also be used to investigate general characteristics of dysarthric speech such as production error patterns. The entire database has been marked at the word level and sentences for 10 of the 11 talkers have been marked at the phoneme level as well. The paper describes the database structure and techniques adopted to improve the performance of a Discrete Hidden Markov Model (DHMM) labeler used to assign initial phoneme labels to the elements of the database. These techniques may be useful in the design of automatic recognition systems for persons with speech disorders, especially when limited amounts of training data are available
Keywords :
hidden Markov models; scientific information systems; signal processing; speech; speech enhancement; speech intelligibility; speech recognition; CD-ROM; Nemours database; automatic recognition systems; connected-speech paragraphs; database structure; discrete hidden Markov model labeler; dysarthria; dysarthric speech; dysarthric speech enhancement; dysarthric speech intelligibility; initial phoneme labels; male speakers; phoneme level marking; production error patterns; sentence marking; short nonsense sentences; signal processing methods; speech disorders; word level marking; Automatic speech recognition; CD-ROMs; Databases; Hidden Markov models; Signal design; Signal processing; Speech enhancement; Speech processing; Testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.608020
Filename :
608020
Link To Document :
بازگشت