• DocumentCode
    730819
  • Title

    Enhancing automatically discovered multi-level acoustic patterns considering context consistency with applications in spoken term detection

  • Author

    Cheng-Tao Chung ; Wei-Ning Hsu ; Cheng-Yi Lee ; Lin-shan Lee

  • Author_Institution
    Grad. Inst. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    5231
  • Lastpage
    5235
  • Abstract
    This paper presents a novel approach for enhancing the multiple sets of acoustic patterns automatically discovered from a given corpus. In a previous work it was proposed that different HMM configurations (number of states per model, number of distinct models) for the acoustic patterns form a two-dimensional space. Multiple sets of acoustic patterns automatically discovered with the HMM configurations properly located on different points over this two-dimensional space were shown to be complementary to one another, jointly capturing the characteristics of the given corpus. By representing the given corpus as sequences of acoustic patterns on different HMM sets, the pattern indices in these sequences can be relabeled considering the context consistency across the different sequences. Good improvements were observed in preliminary experiments of pattern spoken term detection (STD) performed on both TIMIT and Mandarin Broadcast News with such enhanced patterns.
  • Keywords
    hidden Markov models; speech enhancement; speech recognition; HMM configurations; Mandarin Broadcast News; STD; TIMIT; multilevel acoustic patterns; spoken term detection; two-dimensional space; Acoustics; Conferences; Context; Hidden Markov models; Impurities; Speech; Training; acoustic patterns; hidden Markov models; spoken term detection; unsupervised learning; zero-resourced speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178969
  • Filename
    7178969