• DocumentCode
    3320789
  • Title

    Commentator´s Speech Extraction in Audio Stream of Sports Games

  • Author

    Lu, Li ; Ge, Fengpei ; Zhao, Qingwei ; Yan, Yonghong

  • Author_Institution
    ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
  • fYear
    2009
  • fDate
    28-29 Dec. 2009
  • Firstpage
    64
  • Lastpage
    67
  • Abstract
    This paper proposes a method to deal with the problem of extracting commentator´s speech in audio stream of live sports games. First, a two-pass metric-based audio segmentation module is developed to segment the audio stream into short ones with homogeneous acoustic features. Then a model-based classification module is adopted to extract the speech segments. For robust audio classification, various audio features have been used in this paper. Finally, a music scene analysis (Music-CASA) method is adopted to remove the speech in the advertisements with minimum loss of commentator´s speech. By integrating all the techniques, an average F value of 94.79% is achieved in the commentator´s speech extraction task evaluated on eleven games of six kinds of sports.
  • Keywords
    audio signal processing; audio streaming; music; signal classification; speech processing; sport; audio stream; commentator speech extraction; homogeneous acoustic features; live sports games; model-based classification module; music scene analysis method; robust audio classification; speech segments; two-pass metric-based audio segmentation module; Computer science; Data mining; Image analysis; Information retrieval; Robustness; Speech analysis; Speech recognition; Streaming media; Support vector machines; Technology management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Research Challenges in Computer Science, 2009. ICRCCS '09. International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3927-0
  • Electronic_ISBN
    978-1-4244-5410-5
  • Type

    conf

  • DOI
    10.1109/ICRCCS.2009.24
  • Filename
    5401297