DocumentCode
2788522
Title
Multipass strategies for improving accuracy in a voice search application
Author
Zhang, Tianhe ; Rose, Richard ; Dahan, Jean
fYear
2010
fDate
14-19 March 2010
Firstpage
5354
Lastpage
5357
Abstract
This paper describes a set of techniques for improving the performance of automated voice search services intended for mobile users accessing these services over a range of portable devices. Voice search is implemented as a two-stage search procedure where string candidates generated by an automatic speech recognition (ASR) system are re-scored in order to identify the best matching entry from a potentially very large application-specific database. The work in this paper deals specifically with user utterances that contain spoken letter sequences corresponding to spelled instances of search terms. Methods are investigated for identifying the most likely database entry associated with the decoded utterance. An experimental study is presented describing the characteristics of actual user utterances obtained from a prototype voice search service. The impact of these methods on word error rate is presented.
Keywords
mobile computing; speech recognition; very large databases; automated voice search services; automatic speech recognition system; mobile users; multipass strategies; portable devices; spoken letter sequences; two stage search procedure; very large application specific database; Application software; Automatic speech recognition; Data analysis; Databases; Decoding; Displays; Humans; Prototypes; Search engines; Speech recognition; String matching
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5494949
Filename
5494949