Spoken document retrieval by discriminative modeling in a high dimensional feature space

Author

Oba, Takanobu ; Hori, Takaaki ; Nakamura, Atsushi ; Ito, Akinori

Author_Institution

NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan

fYear

2012

fDate

25-30 March 2012

Firstpage

5153

Lastpage

5156

Abstract

This paper proposes discriminative modeling in a high dimensional feature space for spoken document retrieval (SDR). To estimate the parameters of a high dimensional model properly, a large quantity of data is necessary, but there is no such large corpus for document retrieval. This paper employs two approaches to overcome this problem. One is a reranking approach. A baseline system first gives each document a score and then the score is compensated by employing a high dimensional model. The other approach is automatic query generation. A large number of queries are automatically generated and used for parameter estimation. Our experimental result shows that our proposed method can greatly improve SDR performance.

Keywords

parameter estimation; query processing; speech processing; SDR; automatic query generation; baseline system; discriminative modeling; high dimensional feature space; high dimensional model properly; parameter estimation; reranking approach; spoken document retrieval; Computational modeling; Hidden Markov models; Indexes; Speech recognition; Training; Training data; Vectors; Discriminative model; Linear model; Spoken document retrieval;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location

Kyoto

ISSN

1520-6149

Print_ISBN

978-1-4673-0045-2

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2012.6289080

Filename

6289080