• DocumentCode
    3641667
  • Title
    Model based audio sequence alignment
  • Author
    Doğaç Başaran; Emin Anarım; Ali Taylan Cemgil
  • Author_Institution
    Elektrik ve Elektronik Mü
  • fYear
    2011
  • fDate
    4/1/2011 12:00:00 AM
  • Firstpage
    606
  • Lastpage
    609
  • Abstract
    We formulate the alignment of multiple audio sequences in a probabilistic framework. Our approach defines a generative model for time-varying features extracted from audio clips that are recorded independently and asynchronously. The model handles missing data and multiple clips, including cases where no single clip covers the entire material. Matching is achieved via approximate Bayesian inference. Here, we illustrate a simulated tempering approach for sampling from the exact posterior density of the clip offsets. Simulation results on synthetic and real data suggest that the framework can handle difficult, ambiguous scenarios or partial matchings.
  • Keywords
    "Markov processes","Conferences","Bayesian methods","Speech processing","Feature extraction","Speech"
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing and Communications Applications (SIU), 2011 IEEE 19th Conference on
  • ISSN
    2165-0608
  • Print_ISBN
    978-1-4577-0462-8
  • Type
    conf
  • DOI
    10.1109/SIU.2011.5929723
  • Filename
    5929723