• DocumentCode
    3167906
  • Title

    Automatic caption generation for video data. Time alignment between caption and acoustic signal

  • Author

    Watanabe, K. ; Sugiyama, M.

  • Author_Institution
    Graduate Sch. of Comput. Sci. & Eng., Aizu Univ., Fukushima, Japan
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    65
  • Lastpage
    70
  • Abstract
    This paper discusses automatic caption generation, focusing on the correspondence between Japanese text and its speech data. It proposes a time alignment module implemented with DP matching and evaluates its performance. With the weights and DP path optimized, the gap between the correct and estimated caption display times is less than 39.0 ms at phoneme boundaries. The effects of using another speaker's phoneme templates and of text phrase deletion are also evaluated.
  • Keywords
    handicapped aids; speech recognition; video signal processing; DP matching; Japanese text; acoustic signal; automatic caption generation; performance evaluation; phoneme templates; speech data; text phrase deletion; time alignment; video data; Acoustical engineering; Auditory system; Computer science; Costs; Data engineering; Displays; Signal generators; Speech; TV broadcasting; Timing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia Signal Processing, 1999 IEEE 3rd Workshop on
  • Conference_Location
    Copenhagen
  • Print_ISBN
    0-7803-5610-1
  • Type

    conf

  • DOI
    10.1109/MMSP.1999.793799
  • Filename
    793799
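
The abstract describes aligning caption text to speech with DP matching. The record does not give the paper's acoustic features, weights, or path constraints, so the following is only a minimal sketch of the general idea under stated assumptions: each phoneme of the text has a template, `cost[i][t]` is a hypothetical distance between phoneme template `i` and speech frame `t`, every phoneme must cover at least one frame, and phonemes appear in text order. The function name `align_phonemes` and the 10 ms frame shift are illustrative choices, not taken from the paper.

```python
import math


def align_phonemes(cost):
    """Return the start frame of each phoneme for a monotonic DP alignment.

    cost[i][t] is an assumed distance between phoneme template i and frame t.
    Allowed moves: stay in the same phoneme, or advance to the next phoneme.
    """
    n_ph, n_fr = len(cost), len(cost[0])
    if n_ph > n_fr:
        raise ValueError("need at least one frame per phoneme")

    D = [[math.inf] * n_fr for _ in range(n_ph)]   # accumulated cost
    adv = [[False] * n_fr for _ in range(n_ph)]    # True if phoneme i starts at t

    # The first phoneme must start at frame 0 and can only be extended.
    D[0][0] = cost[0][0]
    for t in range(1, n_fr):
        D[0][t] = D[0][t - 1] + cost[0][t]

    for i in range(1, n_ph):
        for t in range(i, n_fr):
            stay = D[i][t - 1]        # frame t-1 was already phoneme i
            move = D[i - 1][t - 1]    # frame t-1 was the previous phoneme
            adv[i][t] = move <= stay
            D[i][t] = min(stay, move) + cost[i][t]

    # Backtrack from the last phoneme at the last frame.
    starts = [0] * n_ph
    i, t = n_ph - 1, n_fr - 1
    while i > 0:
        if adv[i][t]:
            starts[i] = t
            i -= 1
        t -= 1
    return starts


def starts_to_ms(starts, frame_shift_ms=10.0):
    """Convert start frames to times; the 10 ms frame shift is an assumption."""
    return [s * frame_shift_ms for s in starts]


# Toy example: 1-D "features" per frame and one scalar template per phoneme.
frames = [0, 0, 0, 5, 5, 9, 9, 9]
templates = [0, 5, 9]
toy_cost = [[abs(p - f) for f in frames] for p in templates]
toy_starts = align_phonemes(toy_cost)
toy_times = starts_to_ms(toy_starts)
```

In a captioning setting, the estimated start times would decide when each caption segment is displayed; the accuracy figure quoted in the abstract refers to the paper's own system, not to this sketch.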