• DocumentCode
    3245513
  • Title

    Experimental interactive system for telephone applications with speech recognition and synthesis functions

  • Author

    Kitai, Mikio ; Yamada, Tomokazu ; Tsukada, Hajime ; Takahashi, Satoshi ; Noda, Yoshiaki ; Takahashi, Jun-ichi ; Yoshida, Yuki ; Arai, Kazuhiro ; Imoto, Takashi ; Hakoda, Kazuo ; Hirokawa, Tomohisa ; Sagayama, Shigeki

  • Author_Institution
    NTT Human Interface Labs., Kanagawa, Japan
  • fYear
    1996
  • fDate
    30 Sep-1 Oct 1996
  • Firstpage
    25
  • Lastpage
    28
  • Abstract
    This paper describes an experimental interactive system featuring: (1) highly accurate speaker independent and large vocabulary speech recognition based on context-dependent accurate acoustic phoneme HMM models trained with speech data from more than 10000 speakers collected over a telephone network; (2) high quality text-to-speech synthesis that generates speech by concatenating triphone-context-dependent waveform segments; (3) software-based configuration that requires no special hardware except a PC equipped with a sound board and a voice modem; and (4) easy and rapid prototyping which enables the developer to build a system by writing two types of service scenarios
  • Keywords
    hidden Markov models; interactive systems; microcomputer applications; modems; speech recognition; speech synthesis; telecommunication computing; telephony; context-dependent accurate acoustic phoneme HMM models; experimental interactive system; hidden Markov model; large vocabulary speech recognition; personal computer; rapid prototyping; software-based configuration; sound board; speaker independent recognition; speech recognition; speech synthesis; telephone applications; telephone network; text-to-speech synthesis; triphone-context-dependent waveform segments; voice modem; Acoustic waves; Context modeling; Context-aware services; Hidden Markov models; Interactive systems; Loudspeakers; Speech recognition; Speech synthesis; Telephony; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Interactive Voice Technology for Telecommunications Applications, 1996. Proceedings., Third IEEE Workshop on
  • Conference_Location
    Basking Ridge, NJ
  • Print_ISBN
    0-7803-3238-5
  • Type

    conf

  • DOI
    10.1109/IVTTA.1996.552713
  • Filename
    552713