Speaker-TTS voice mapping towards natural and characteristic robot storytelling

Author

Hye-Jin Min ; Sang-Chae Kim ; Joonyeob Kim ; Jin-Woo Chung ; Park, Jong C.

Author_Institution

Dept. of Comput. Sci., KAIST, Daejeon, South Korea

fYear

2013

fDate

26-29 Aug. 2013

Firstpage

793

Lastpage

800

Abstract

Robot storytelling has the potential for its practical use in various domains such as entertainment, education, and rehabilitation. However, relying on human-recorded voices for natural storytelling is costly, and automation with text-to-speech systems is not readily applicable due to the difficulty of reflecting the full nature of stories in TTS systems. In this paper, we address the problem of automating robot storytelling with a particular focus on two issues: speaker identification and speaker-TTS voice mapping. We first conduct text analysis with rich linguistic clues to identify speakers from a given textual story. We then consider the task of speaker-TTS voice mapping as the graph coloring problem and propose effective algorithms for assigning voices to speakers given a limited number of TTS voices. Finally, we perform a user experiment on validating the usefulness of our method. The results demonstrate that our system significantly outperforms baseline systems and is also more acceptable to users.

Keywords

graph colouring; robots; speaker recognition; speech synthesis; text analysis; characteristic robot storytelling; education; entertainment; graph coloring problem; human-recorded voices; natural robot storytelling; rehabilitation; rich linguistic clues; robot storytelling automation; speaker identification; speaker-TTS voice mapping; text analysis; text-to-speech systems; textual story; Decision trees; Dictionaries; Feature extraction; Robots; Semantics; Speech; Training;

fLanguage

English

Publisher

ieee

Conference_Titel

RO-MAN, 2013 IEEE

Conference_Location

Gyeongju

ISSN

1944-9445

Type

conf

DOI

10.1109/ROMAN.2013.6628410

Filename

6628410