DocumentCode :
3123780
Title :
Using Semantics for Speech Annotation of Images
Author :
Desai, Chaitanya ; Kalashnikov, Dmitri V. ; Mehrotra, Sharad ; Venkatasubramanian, Nalini
Author_Institution :
Comput. Sci. Dept., Univ. of California, Irvine, Irvine, CA
fYear :
2009
fDate :
March 29 2009-April 2 2009
Firstpage :
1227
Lastpage :
1230
Abstract :
In this paper, we have postulated the problem of using discrete speech utterances to annotate an image as that of disambiguation across multiple N-best lists. Our solution is based on the Maximum Entropy approach and uses correlations between tags in an existing corpus of images to set up the constrains of the corresponding constrained optimization problem. Our experiments suggest that the proposed approach gives a significant improvement in quality as compared to an approach that considers the best answer suggested by a popular off-the-shelf recognizer.
Keywords :
maximum entropy methods; speech processing; constrained optimization problem; discrete speech utterances; maximum entropy; off-the-shelf recognizer; semantics; speech annotation; Computer science; Data engineering; Digital cameras; Image recognition; Image retrieval; Microphones; Object detection; Speech recognition; Tagging; Working environment noise; Disambiguation; Image Tagging; Speech annotation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
ISSN :
1084-4627
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
Type :
conf
DOI :
10.1109/ICDE.2009.207
Filename :
4812507
Link To Document :
بازگشت