• DocumentCode
    62316
  • Title

    Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation

  • Author

    Raczynski, Stanislaw A. ; Vincent, Emmanuel

  • Author_Institution
    Inria Rennes, Rennes, France
  • Volume
    22
  • Issue
    3
  • fYear
    2014
  • fDate
    Mar-14
  • Firstpage
    672
  • Lastpage
    681
  • Abstract
    In this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor process priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bigram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of n-grams with a topic model, while smoothing them with the state-of-the-art method. Our model is evaluated by measuring its perplexity on a dataset of musical genre and harmony annotations 3 Genre Database (3GDB) and by measuring its ability to predict musical genre from chord sequences. In terms of perplexity, for a 262-chord dictionary we achieve a value of 2.74, compared to 18.05 for trigrams and 7.73 for a unigram topic model. In terms of genre prediction accuracy with 9 genres, the proposed approach performs about 33% better in relative terms than genre-dependent n-grams, achieving 60.4% of accuracy.
  • Keywords
    Bayes methods; music; natural language processing; 3 Genre Database; Bayesian topic model; bigram topic model; genre based music language modeling; harmony annotations; hierarchical Pitman-Yor topic model; latent Dirichlet allocation; latent hierarchical Pitman-Yor process allocation; musical genre; topic distribution; word distribution; Context; Context modeling; IEEE transactions; Resource management; Smoothing methods; Speech; Speech processing; Chinese restaurant process; chord model; genre model; hierarchical Pitman-Yor process; music information retrieval; musical genre recognition; topic models;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2014.2300344
  • Filename
    6714400