Title :
Using Semantics for Speech Annotation of Images
Author :
Desai, Chaitanya ; Kalashnikov, Dmitri V. ; Mehrotra, Sharad ; Venkatasubramanian, Nalini
Author_Institution :
Comput. Sci. Dept., Univ. of California, Irvine, Irvine, CA
fDate :
March 29 2009-April 2 2009
Abstract :
In this paper, we have postulated the problem of using discrete speech utterances to annotate an image as that of disambiguation across multiple N-best lists. Our solution is based on the Maximum Entropy approach and uses correlations between tags in an existing corpus of images to set up the constrains of the corresponding constrained optimization problem. Our experiments suggest that the proposed approach gives a significant improvement in quality as compared to an approach that considers the best answer suggested by a popular off-the-shelf recognizer.
Keywords :
maximum entropy methods; speech processing; constrained optimization problem; discrete speech utterances; maximum entropy; off-the-shelf recognizer; semantics; speech annotation; Computer science; Data engineering; Digital cameras; Image recognition; Image retrieval; Microphones; Object detection; Speech recognition; Tagging; Working environment noise; Disambiguation; Image Tagging; Speech annotation;
Conference_Titel :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
DOI :
10.1109/ICDE.2009.207