• DocumentCode
    661290
  • Title

    Using acoustic dissimilarity measures based on state-level distance vector representation for improved spoken term detection

  • Author

    Yamamoto, Naoji ; Kai, Atsuhiko

  • Author_Institution
    Grad. Sch. of Eng., Shizuoka Univ., Hamamatsu, Japan
  • fYear
    2013
  • fDate
    Oct. 29 2013-Nov. 1 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper proposes a simple approach to subword-based spoken term detection (STD) which uses improved acoustic dissimilarity measures based on a distance-vector representation at the state-level. Our approach assumes that both the query term and spoken documents are represented by subword units and then converted to the sequence of HMM states. A set of all distributions in subword-based HMMs is used for generating distance-vector representation of each state of all subword units. The element of a distance-vector corresponds to the distance between distributions of two different states, and thus a vector represents a structural feature at the state-level. The experimental result showed that the proposed method significantly outperforms the baseline method, which employs a conventional acoustic dissimilarity measure based on subword unit, with very little increase in the required search time.
  • Keywords
    hidden Markov models; signal representation; speech processing; improved acoustic dissimilarity measures; improved subword-based spoken term detection; query term; spoken documents; state-level distance vector representation; subword-based HMM unit; Acoustic measurements; Acoustics; Hidden Markov models; Robustness; Speech; Speech recognition; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
  • Conference_Location
    Kaohsiung
  • Type

    conf

  • DOI
    10.1109/APSIPA.2013.6694151
  • Filename
    6694151