Title :
In search of optimal data selection for training of automatic speech recognition systems
Author :
Nagroski, A. ; Boves, Lou ; Steeneken, Herman
Author_Institution :
TNO Human Factors Res. Inst., Soesterberg, Netherlands
fDate :
30 Nov.-3 Dec. 2003
Abstract :
This paper presents an extended study in the topic of optimal selection of speech data from a database for efficient training of ASR systems. We reconsider a method of optimal selection introduced in our previous work and introduce variosearch as an alternative selection method developed in order to find a representative sample of speech data with a simultaneous control of acoustical and statistical parameters of data selected. Next, we present experiments in which the performance of a standard ASR system trained with data sets selected from a Dutch digits database via different selection methods was compared. The results show that the length of training utterances has a dominant impact on the recognition performance. Therefore, the length of the utterances is a factor that must be taken into account when interpreting phoneme recognition scores.
Keywords :
optimisation; search problems; speech processing; speech recognition; statistical analysis; ASR system training; Dutch digits database; acoustical parameters; automatic speech recognition systems; optimal data selection; performance; phoneme recognition scores; recognition performance; representative sample; statistical parameters; training utterance length; variosearch; Automatic speech recognition; Hidden Markov models; Human factors; Natural languages; Optimal control; Proposals; Spatial databases; Speech recognition; System testing; Training data;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
DOI :
10.1109/ASRU.2003.1318405