• DocumentCode
    1413367
  • Title

    Semantic Analysis and Organization of Spoken Documents Based on Parameters Derived From Latent Topics

  • Author

    Kong, Sheng-Yi ; Lee, Lin-shan

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • Volume
    19
  • Issue
    7
  • fYear
    2011
  • Firstpage
    1875
  • Lastpage
    1889
  • Abstract
    Spoken documents are audio signals and are thus not easily displayed on-screen and not easily scanned and browsed by the user. It is therefore highly desirable to automatically construct summaries, titles, latent topic trees and key term-based topic labels for these spoken documents to aid the user in browsing. We refer to this as semantic analysis and organization. Also, as network content is both copious and dynamic, with topics and domains changing everyday, the approaches here must be primarily unsupervised. We propose a framework for unsupervised semantic analysis and organization of spoken documents and for this purpose propose two measures derived from latent topic analysis: latent topic significance and latent topic entropy. We show that these can be integrated into an application system, with which the user can more easily navigate archives of spoken documents. Probabilistic latent semantic analysis is used as a typical example approach for unsupervised topic analysis in most experiments, although latent Dirichlet allocation is also used in some experiments to show that the proposed measures are equally applicable for different analysis approaches. All of the experiments were performed on Mandarin Chinese broadcast news.
  • Keywords
    information retrieval; probability; speech processing; text analysis; unsupervised learning; Mandarin Chinese broadcast news; audio signal; key term-based topic label; latent Dirichlet allocation; latent topic analysis; latent topic entropy; latent topic significance; latent topic trees; probabilistic latent semantic analysis; spoken document; unsupervised semantic analysis; unsupervised topic analysis; Multimedia communication; Organizations; Probabilistic logic; Semantics; Speech; Speech recognition; Training; Latent semantic analysis (LSA); latent topic organization; spoken document understanding;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2102592
  • Filename
    5676182