DocumentCode
312351
Title
The Nemours database of dysarthric speech
Author
Menéndez-Pidal, Xavier ; Polikoff, James B. ; Peters, Shirley M. ; Leonzio, Jennie E. ; Bunnel, H.T.
Author_Institution
Appl. Sci. & Eng. Lab., A.I. duPont Inst., Wilmington, DE, USA
Volume
3
fYear
1996
fDate
3-6 Oct 1996
Firstpage
1962
Abstract
The Nemours database is a collection of 814 short nonsense sentences; 74 sentences spoken by each of 11 male speakers with varying degrees of dysarthria. Additionally, the database contains two connected-speech paragraphs produced by each of the 11 speakers. The database was designed to test the intelligibility of dysarthric speech before and after enhancement by various signal processing methods, and is available on CD-ROM. It can also be used to investigate general characteristics of dysarthric speech such as production error patterns. The entire database has been marked at the word level and sentences for 10 of the 11 talkers have been marked at the phoneme level as well. The paper describes the database structure and techniques adopted to improve the performance of a Discrete Hidden Markov Model (DHMM) labeler used to assign initial phoneme labels to the elements of the database. These techniques may be useful in the design of automatic recognition systems for persons with speech disorders, especially when limited amounts of training data are available
Keywords
hidden Markov models; scientific information systems; signal processing; speech; speech enhancement; speech intelligibility; speech recognition; CD-ROM; Nemours database; automatic recognition systems; connected-speech paragraphs; database structure; discrete hidden Markov model labeler; dysarthria; dysarthric speech; dysarthric speech enhancement; dysarthric speech intelligibility; initial phoneme labels; male speakers; phoneme level marking; production error patterns; sentence marking; short nonsense sentences; signal processing methods; speech disorders; word level marking; Automatic speech recognition; CD-ROMs; Databases; Hidden Markov models; Signal design; Signal processing; Speech enhancement; Speech processing; Testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.608020
Filename
608020
Link To Document