Title :
Data collection for investigating speech variability in a specific speaker over long and short time periods
Author :
Tsuge, Satoru ; Shishibori, Masami ; Ren, Fuji ; Kita, Kenji ; Kuroiwa, Shingo
Author_Institution :
Fac. of Eng., Tokushima Univ., Japan
fDate :
30 Oct.-1 Nov. 2005
Abstract :
In this paper, we describe a Japanese speech corpus collected for investigating the speech variability of a specific speaker over short and long time periods. Although speakers use a speaker-dependent speech recognition system, it is known that speech recognition performance varies pending when the utterance was uttered. This is because speech varies even if the speaker utters a specific sentence. However, the relationship between intra-speaker speech variability and speech recognition performance is not clear. We have not seen a corpus of Japanese speech data of a specific speaker over a long time period. Hence, since 2002, we have been collecting speech data for investigating the relationships between speech variability and speech recognition performance. In this paper, we introduce our speech corpus and conduct speech recognition experiments. Experimental results show that the variability of recognition performance over different days is larger than variability of recognition performance within a day.
Keywords :
natural languages; speech processing; speech recognition; Japanese speech corpus; speaker-dependent speech recognition system; speech variability; Automatic speech recognition; Background noise; Cellular phones; Data engineering; Databases; Degradation; Information technology; Navigation; Speech processing; Speech recognition;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598725