Model based audio sequence alignment

Author

Doğaç Başaran;Emin Anarım;Ali Taylan Cemgil

Author_Institution

Elektrik ve Elektronik Mü

fYear

2011

fDate

4/1/2011 12:00:00 AM

Firstpage

606

Lastpage

609

Abstract

We formulate alignment of multiple audio sequences in a probabilistic framework. Our approach defines a generative model for time varying features extracted from audio clips that are recorded independently and asynchronously. We are able to handle missing data and multiple clips where no clip is covering the entire material. The matching is achieved via approximate Bayesian inference. Here, we illustrate a simulated tempering approach for sampling from the exact posterior density of the clip offsets. The simulation results on synthetic and real data suggest that the framework is able to handle difficult ambiguous scenarios or partial matchings.

Keywords

"Markov processes","Conferences","Bayesian methods","Speech processing","Feature extraction","Speech"

Publisher

ieee

Conference_Titel

Signal Processing and Communications Applications (SIU), 2011 IEEE 19th Conference on

ISSN

2165-0608

Print_ISBN

978-1-4577-0462-8

Type

conf

DOI

10.1109/SIU.2011.5929723

Filename

5929723

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3641667