Title :
Introducing a Framework to Create Telephony Speech Databases from Direct Ones
Author :
Momtazi, Saeedeh ; Sameti, Hossein ; Vaisipour, Saman ; Tefagh, Meysam
Author_Institution :
Sharif Univ. of Technol., Tehran
Abstract :
A comprehensive speech database is one of the important tools for developing speech recognition systems; these tools are necessary for telephony recognition, too. Although adequate databases for direct speech recognizers exist, there is not an appropriate database for telephony speech recognizers. Most methods suggested for solving this problem are based on building new databases which tends to consume much time and many resources; or they used a filter which simulates circuit switch behavior to transform direct databases to telephony ones, in this case resulted databases have many differences with real telephony databases. In this paper we introduce a framework for creating telephony speech database from direct ones in order to reduce the costs of other existing methods. We apply this framework to FARSDAT and produce a telephony database which was used in a telephony command recognizer.
Keywords :
audio databases; speech recognition; telephony; FARSDAT; filter; speech recognition systems; telephony speech databases; Buildings; Data engineering; Degradation; Filters; Loudspeakers; Spatial databases; Speech recognition; Switches; Telephony; Testing; FARSDAT; Speech Databases; Speech Recognition; Telephony Speech Recognition;
Conference_Titel :
Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech and Image Processing, Multimedia Communications and Services. 14th International Workshop on
Conference_Location :
Maribor
Print_ISBN :
978-961-248-029-5
Electronic_ISBN :
978-961-248-029-5
DOI :
10.1109/IWSSIP.2007.4381108