• DocumentCode
    1696255
  • Title

    Unsupervised topic model for broadcast program segmentation

  • Author

    Boulianne, Gilles ; Dumouchel, P.

  • Author_Institution
    Centre de Rech. Inf. de Montreal (CRIM), Montréal, QC, Canada
  • fYear
    2013
  • Firstpage
    8455
  • Lastpage
    8459
  • Abstract
    Several unsupervised methods have been proposed to segment a continuous text stream into individual topics. A simple HMM formulation of the most successful of these methods exposes their underlying assumptions and suggests the use of a new prior for segmentation probability. Under this formulation, we explore the space of possible modeling choices on databases of English and French TV and radio programs. We show that the proposed prior improves segmentation results and can also accommodate additional knowledge sources within the HMM efficient dynamic programming.
  • Keywords
    computational linguistics; hidden Markov models; probability; speech processing; English databases; French TV; HMM efficient dynamic programming; broadcast program segmentation; continuous text stream segmentation; radio programs; segmentation probability; unsupervised topic model; Dynamic programming; Equations; Hidden Markov models; Length measurement; Mathematical model; Space exploration; TV; Bayesian methods; Graphical models; HMM; Story segmentation; Topic models;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639315
  • Filename
    6639315