• DocumentCode
    2788831
  • Title

    Balancing false alarms and hits in Spoken Term Detection

  • Author

    Parada, Carolina ; Sethy, Abhinav ; Ramabhadran, Bhuvana

  • Author_Institution
    Human Language Technol. Center of Excellence, Johns Hopkins Univ., Baltimore, MD, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    5286
  • Lastpage
    5289
  • Abstract
    This paper presents methods to improve retrieval of Out-Of-Vocabulary (OOV) terms in a Spoken Term Detection (STD) system. We demonstrate that automated tagging of OOV regions helps to reduce false alarms while incorporating phonetic confusability increases the hits. Additional features that boost the probability of a hit in accordance with the number of neighboring hits for the same query and query-length normalization also improve the overall performance of the spoken-term detection system. We show that these methods can be combined effectively to provide a relative improvement of 21% in Average Term Weighted Value (ATWV) on a 100-hour corpus with 1290 OOV-only queries and 2% relative on the NIST 2006 STD task, where only 16 of the 1107 queries were OOV terms. Lastly, we present results to show that the proposed methods are general enough to work well in query-by-example based spoken-term detection, and in mismatched situations when the representation of the index being searched through and the queries are not generated by the same system.
  • Keywords
    speech processing; speech recognition; automated tagging; average term weighted value; false alarms; out-of-vocabulary terms; phonetic confusability; query-length normalization; spoken term detection; Decoding; Humans; Lattices; NIST; Natural languages; Speech processing; Speech recognition; Tagging; Transducers; Vocabulary; OOV Detection; Spoken Term Detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5494966
  • Filename
    5494966