DocumentCode
2073704
Title
Introducing a Framework to Create Telephony Speech Databases from Direct Ones
Author
Momtazi, Saeedeh ; Sameti, Hossein ; Vaisipour, Saman ; Tefagh, Meysam
Author_Institution
Sharif Univ. of Technol., Tehran
fYear
2007
fDate
27-30 June 2007
Firstpage
327
Lastpage
330
Abstract
A comprehensive speech database is an important tool for developing speech recognition systems, and telephony speech recognition is no exception. Although adequate databases exist for direct speech recognizers, no appropriate database is available for telephony speech recognizers. Most methods proposed to solve this problem either build new databases, which consumes much time and many resources, or use a filter that simulates circuit-switch behavior to transform direct databases into telephony ones, in which case the resulting databases differ in many respects from real telephony databases. In this paper we introduce a framework for creating telephony speech databases from direct ones in order to reduce the costs incurred by existing methods. We apply this framework to FARSDAT and produce a telephony database, which was used in a telephony command recognizer.
Keywords
audio databases; speech recognition; telephony; FARSDAT; filter; speech recognition systems; telephony speech databases; Buildings; Data engineering; Degradation; Filters; Loudspeakers; Spatial databases; Switches; Testing; Speech Databases; Telephony Speech Recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech and Image Processing, Multimedia Communications and Services. 14th International Workshop on
Conference_Location
Maribor
Print_ISBN
978-961-248-029-5
Electronic_ISBN
978-961-248-029-5
Type
conf
DOI
10.1109/IWSSIP.2007.4381108
Filename
4381108