The Nemours database of dysarthric speech

Author

Menéndez-Pidal, Xavier ; Polikoff, James B. ; Peters, Shirley M. ; Leonzio, Jennie E. ; Bunnel, H.T.

Author_Institution

Appl. Sci. & Eng. Lab., A.I. duPont Inst., Wilmington, DE, USA

Volume

3

fYear

1996

fDate

3-6 Oct 1996

Firstpage

1962

Abstract

The Nemours database is a collection of 814 short nonsense sentences; 74 sentences spoken by each of 11 male speakers with varying degrees of dysarthria. Additionally, the database contains two connected-speech paragraphs produced by each of the 11 speakers. The database was designed to test the intelligibility of dysarthric speech before and after enhancement by various signal processing methods, and is available on CD-ROM. It can also be used to investigate general characteristics of dysarthric speech such as production error patterns. The entire database has been marked at the word level and sentences for 10 of the 11 talkers have been marked at the phoneme level as well. The paper describes the database structure and techniques adopted to improve the performance of a Discrete Hidden Markov Model (DHMM) labeler used to assign initial phoneme labels to the elements of the database. These techniques may be useful in the design of automatic recognition systems for persons with speech disorders, especially when limited amounts of training data are available

Keywords

hidden Markov models; scientific information systems; signal processing; speech; speech enhancement; speech intelligibility; speech recognition; CD-ROM; Nemours database; automatic recognition systems; connected-speech paragraphs; database structure; discrete hidden Markov model labeler; dysarthria; dysarthric speech; dysarthric speech enhancement; dysarthric speech intelligibility; initial phoneme labels; male speakers; phoneme level marking; production error patterns; sentence marking; short nonsense sentences; signal processing methods; speech disorders; word level marking; Automatic speech recognition; CD-ROMs; Databases; Hidden Markov models; Signal design; Signal processing; Speech enhancement; Speech processing; Testing; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on

Conference_Location

Philadelphia, PA

Print_ISBN

0-7803-3555-4

Type

conf

DOI

10.1109/ICSLP.1996.608020

Filename

608020