Title :
European speech databases for telephone applications
Author :
Hoge, Harald ; Tropf, Herbert S. ; Winski, Richard ; Van den Heuvel, Henk ; Haeb-Umbach, Reinhold ; Choukri, Khalid
Abstract :
The SpeechDat project aims to produce speech databases for all official languages of the European Union and some major dialectal variants and minority languages resulting in 28 speech databases. They will be recorded over fixed and mobile telephone networks. This will provide a realistic basis for training and assessment of both isolated and continuous-speech utterances, employing whole-word or subword approaches, and thus can be used for developing voice driven teleservices including speaker verification. The specification of the databases has been developed jointly, and is essentially the same for each language to facilitate dissemination and use. There will be a controlled variation among the speakers concerning sex, age, dialect, environment of call, etc. The validation of all databases will be carried out centrally. The SpeechDat databases will be transferred to ELRA for distribution. The next databases to be recorded will cover East European languages
Keywords :
database management systems; information services; speech recognition; telephony; East European languages; European Union; European speech databases; SpeechDat project; assessment; continuous-speech utterances; dialect; dialectal variants; dissemination; isolated utterances; minority languages; official languages; speaker verification; subword approaches; telephone applications; training; voice driven teleservices; whole-word approaches; Costs; Distributed databases; Natural languages; Recruitment; Speech recognition; System testing; Telecommunications; Telematics; Telephony;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.598873