• DocumentCode
    2139923
  • Title
    Speech and Song Search on the Web: System Design and Implementation

  • Author
    Yaguchi, Yuichi ; Watanabe, Yoshiyuki ; Naruse, Keitaro ; Oka, Ryuichi

  • Author_Institution
    Univ. of Aizu, Aizuwakamatsu
  • fYear
    2007
  • fDate
    16-19 Oct. 2007
  • Firstpage
    270
  • Lastpage
    278
  • Abstract
    This paper proposes a novel search system for speech and song segments. The amount of video data accumulated on the World Wide Web is growing, and its content is varied. Video content includes both natural voices and singing voices, which differ in their phoneme lengths. Our system combines frame-wise phoneme recognition with continuous dynamic programming (CDP). First, each target and query waveform is divided into fixed-length short-time frames; second, a phoneme label is estimated for each frame using Bayes estimation; third, the query sequence of phoneme labels is searched for in the target sequences using time-robust CDP; and finally, the system returns candidate answers. The method is robust to variation along the time dimension, which is a significant advantage for both natural speech and song. This paper also introduces an implementation of the system, published on the Web as a secondary search engine for YouTube data.
  • Keywords
    Bayes methods; Internet; dynamic programming; video retrieval; Bayes estimation; World Wide Web; continuous dynamic programming; frame-wise phoneme recognition; query waveform; song retrieval; speech retrieval; video content retrieval; Acoustic noise; Dynamic programming; Hidden Markov models; Information retrieval; Music information retrieval; Rhythm; Search engines; Speech recognition; Web sites; YouTube;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer and Information Technology, 2007. CIT 2007. 7th IEEE International Conference on
  • Conference_Location
    Aizu-Wakamatsu, Fukushima
  • Print_ISBN
    978-0-7695-2983-7
  • Type
    conf
  • DOI
    10.1109/CIT.2007.128
  • Filename
    4385093
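
The matching step described in the abstract (searching a query sequence of frame-wise phoneme labels inside a longer target sequence) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `spot_query`, the 0/1 local distance between labels, and the simple three-way step pattern are all assumptions made for illustration, and the recurrence is a plain subsequence dynamic-programming scheme standing in for the paper's time-robust CDP. The Bayes estimation step is omitted; labels are assumed to be given already.

```python
def spot_query(query, target):
    """Find where `query` best matches inside `target`.

    Both arguments are sequences of per-frame phoneme labels.
    Returns (normalized_cost, end_index), where end_index is the target
    frame at which the best match ends. Illustrative sketch only:
    0/1 label distance and a simple step pattern are assumptions.
    """
    if not query or not target:
        raise ValueError("query and target must be non-empty")

    n = len(query)
    inf = float("inf")
    # prev[i]: cost of the best alignment of query[:i+1] that ends
    # at the previous target frame.
    prev = [inf] * n
    best_cost, best_end = inf, -1

    for t, label in enumerate(target):
        cur = [inf] * n
        for i in range(n):
            d = 0 if query[i] == label else 1  # assumed local distance
            if i == 0:
                # Free start: a match may begin at any target frame.
                cur[i] = d
            else:
                cur[i] = d + min(
                    prev[i],      # target phoneme lasts longer than in query
                    prev[i - 1],  # both sequences advance one frame
                    cur[i - 1],   # query phoneme lasts longer than in target
                )
        cost = cur[n - 1] / n  # normalize by query length
        if cost < best_cost:
            best_cost, best_end = cost, t
        prev = cur

    return best_cost, best_end


if __name__ == "__main__":
    # The same phonemes, held for different durations, still match.
    print(spot_query(list("ab"), list("xaaaabbbx")))
```

Because the start point is free and each phoneme may be stretched or compressed, a short spoken query can align with a sung passage in which the same phonemes are held much longer, which is the kind of time robustness the abstract refers to.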