Commentator's Speech Extraction in Audio Stream of Sports Games

Author

Lu, Li ; Ge, Fengpei ; Zhao, Qingwei ; Yan, Yonghong

Author_Institution

ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China

fYear

2009

fDate

28-29 Dec. 2009

Firstpage

64

Lastpage

67

Abstract

This paper proposes a method for extracting the commentator's speech from the audio stream of live sports games. First, a two-pass metric-based audio segmentation module divides the audio stream into short segments with homogeneous acoustic features. Then a model-based classification module extracts the speech segments; various audio features are used to make the classification robust. Finally, a music scene analysis (Music-CASA) method removes speech in advertisements with minimal loss of the commentator's speech. With all techniques integrated, an average F value of 94.79% is achieved on the commentator's speech extraction task, evaluated on eleven games covering six kinds of sports.
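
The abstract reports an average F value but does not define it. A minimal sketch follows, assuming the F value is the usual balanced F-measure (the harmonic mean of precision and recall) computed per game over seconds of audio and then averaged across games. The function names, the duration-based counts, and the per-game averaging are illustrative assumptions, not details taken from the paper.

```python
def f_value(true_positive_sec, false_positive_sec, false_negative_sec):
    """Balanced F value: harmonic mean of precision and recall.

    Assumption: counts are durations in seconds of commentator speech that was
    correctly extracted, wrongly extracted, and missed, respectively.
    """
    extracted = true_positive_sec + false_positive_sec
    reference = true_positive_sec + false_negative_sec
    if extracted == 0 or reference == 0:
        return 0.0
    precision = true_positive_sec / extracted
    recall = true_positive_sec / reference
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def average_f_value(per_game_counts):
    """Unweighted mean of per-game F values (assumed averaging scheme)."""
    scores = [f_value(*counts) for counts in per_game_counts]
    return sum(scores) / len(scores)
```

Under these assumptions, a game with 90 s correctly extracted, 10 s wrongly extracted, and 10 s missed has precision and recall of 0.9 each, so its F value is 0.9.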

Keywords

audio signal processing; audio streaming; music; signal classification; speech processing; sport; audio stream; commentator speech extraction; homogeneous acoustic features; live sports games; model-based classification module; music scene analysis method; robust audio classification; speech segments; two-pass metric-based audio segmentation module; Computer science; Data mining; Image analysis; Information retrieval; Robustness; Speech analysis; Speech recognition; Streaming media; Support vector machines; Technology management

fLanguage

English

Publisher

IEEE

Conference_Titel

Research Challenges in Computer Science, 2009. ICRCCS '09. International Conference on

Conference_Location

Shanghai

Print_ISBN

978-0-7695-3927-0

Electronic_ISBN

978-1-4244-5410-5

Type

conf

DOI

10.1109/ICRCCS.2009.24

Filename

5401297