DocumentCode
1413367
Title
Semantic Analysis and Organization of Spoken Documents Based on Parameters Derived From Latent Topics
Author
Kong, Sheng-Yi ; Lee, Lin-shan
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume
19
Issue
7
fYear
2011
Firstpage
1875
Lastpage
1889
Abstract
Spoken documents are audio signals and are thus not easily displayed on-screen and not easily scanned and browsed by the user. It is therefore highly desirable to automatically construct summaries, titles, latent topic trees and key term-based topic labels for these spoken documents to aid the user in browsing. We refer to this as semantic analysis and organization. Also, as network content is both copious and dynamic, with topics and domains changing everyday, the approaches here must be primarily unsupervised. We propose a framework for unsupervised semantic analysis and organization of spoken documents and for this purpose propose two measures derived from latent topic analysis: latent topic significance and latent topic entropy. We show that these can be integrated into an application system, with which the user can more easily navigate archives of spoken documents. Probabilistic latent semantic analysis is used as a typical example approach for unsupervised topic analysis in most experiments, although latent Dirichlet allocation is also used in some experiments to show that the proposed measures are equally applicable for different analysis approaches. All of the experiments were performed on Mandarin Chinese broadcast news.
Keywords
information retrieval; probability; speech processing; text analysis; unsupervised learning; Mandarin Chinese broadcast news; audio signal; key term-based topic label; latent Dirichlet allocation; latent topic analysis; latent topic entropy; latent topic significance; latent topic trees; probabilistic latent semantic analysis; spoken document; unsupervised semantic analysis; unsupervised topic analysis; Multimedia communication; Organizations; Probabilistic logic; Semantics; Speech; Speech recognition; Training; Latent semantic analysis (LSA); latent topic organization; spoken document understanding;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2010.2102592
Filename
5676182
Link To Document