DocumentCode
3482394
Title
Speaker-TTS voice mapping towards natural and characteristic robot storytelling
Author
Hye-Jin Min ; Sang-Chae Kim ; Joonyeob Kim ; Jin-Woo Chung ; Park, Jong C.
Author_Institution
Dept. of Comput. Sci., KAIST, Daejeon, South Korea
fYear
2013
fDate
26-29 Aug. 2013
Firstpage
793
Lastpage
800
Abstract
Robot storytelling has the potential for its practical use in various domains such as entertainment, education, and rehabilitation. However, relying on human-recorded voices for natural storytelling is costly, and automation with text-to-speech systems is not readily applicable due to the difficulty of reflecting the full nature of stories in TTS systems. In this paper, we address the problem of automating robot storytelling with a particular focus on two issues: speaker identification and speaker-TTS voice mapping. We first conduct text analysis with rich linguistic clues to identify speakers from a given textual story. We then consider the task of speaker-TTS voice mapping as the graph coloring problem and propose effective algorithms for assigning voices to speakers given a limited number of TTS voices. Finally, we perform a user experiment on validating the usefulness of our method. The results demonstrate that our system significantly outperforms baseline systems and is also more acceptable to users.
Keywords
graph colouring; robots; speaker recognition; speech synthesis; text analysis; characteristic robot storytelling; education; entertainment; graph coloring problem; human-recorded voices; natural robot storytelling; rehabilitation; rich linguistic clues; robot storytelling automation; speaker identification; speaker-TTS voice mapping; text analysis; text-to-speech systems; textual story; Decision trees; Dictionaries; Feature extraction; Robots; Semantics; Speech; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
RO-MAN, 2013 IEEE
Conference_Location
Gyeongju
ISSN
1944-9445
Type
conf
DOI
10.1109/ROMAN.2013.6628410
Filename
6628410
Link To Document