• DocumentCode
    1695268
  • Title

    System combination and score normalization for spoken term detection

  • Author

    Mamou, Jonathan ; Jia Cui ; Xiaodong Cui ; Gales, Mark J.F. ; Kingsbury, Brian ; Knill, Kate ; Mangu, Lidia ; Nolden, David ; Picheny, Michael ; Ramabhadran, Bhuvana ; Schluter, Ralf ; Sethy, Abhinav ; Woodland, Philip C.

  • Author_Institution
    IBM Haifa Res. Labs., Haifa, Israel
  • fYear
    2013
  • Firstpage
    8272
  • Lastpage
    8276
  • Abstract
    Spoken content in languages of emerging importance needs to be searchable to provide access to the underlying information. In this paper, we investigate the problem of extending data fusion methodologies from Information Retrieval for Spoken Term Detection on low-resource languages in the framework of the IARPA Babel program. We describe a number of alternative methods improving keyword search performance. We apply these methods to Cantonese, a language that presents some new issues in terms of reduced resources and shorter query lengths. First, we show score normalization methodology that improves in average by 20% keyword search performance. Second, we show that properly combining the outputs of diverse ASR systems performs 14% better than the best normalized ASR system.
  • Keywords
    information retrieval; natural language processing; sensor fusion; speech recognition; Cantonese language; IARPA Babel program; automatic speech recognition; data fusion methodology; diverse ASR systems; information retrieval; keyword search performance; low-resource languages; normalized ASR system; query lengths; score normalization methodology; spoken term detection; system combination; Data integration; Hidden Markov models; Indexes; Lattices; Speech; Training; Tuning; data fusion; keyword search; score normalization; spoken term detection; system combination;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639278
  • Filename
    6639278