DocumentCode :
1749723
Title :
Experiments on speech tracking in audio documents using Gaussian mixture modeling
Author :
Seck, Mouhamadou ; Magrin-Chagnolleau, Ivan ; Bimbot, Frédéric
Author_Institution :
IRISA, Rennes, France
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
601
Abstract :
This paper deals with the tracking of speech segments in audio documents. We use a cepstral-based acoustic analysis and Gaussian mixture models for the representation of the training data. Three ways of scoring an audio document based on a frame-level likelihood calculation are proposed and compared. Our experiments are done on a database composed of television programs including news reports, advertisements, and documentaries. The best equal error rate obtained is approximately 12%.
Keywords :
Gaussian processes; acoustic signal processing; audio signal processing; cepstral analysis; signal representation; speech processing; tracking; Gaussian mixture modeling; advertisements; audio document scoring; audio documents; cepstral-based acoustic analysis; covariance matrices; database; equal error rate; frame-level likelihood; music; news reports; noise segments; smoothed log-likelihood ratio; speech segments tracking; television programs; training data representation; Cepstral analysis; Covariance matrix; Databases; Error analysis; Indexing; Smoothing methods; Speech enhancement; TV; Testing; Training data;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.940903
Filename :
940903