Title :
Some aspects of synthetic elderly voices in ambient assisted living systems
Author :
Zainko, Csaba ; Toth, Brenda ; Bartalis, Matyas ; Nemeth, G. ; Fegyo, Tibor
Author_Institution :
Dept. of Telecommun. & Media Inf., Budapest Univ. of Technol. & Econ., Budapest, Hungary
Abstract :
Senior citizens are in the focus of current research in Europe. This paper investigates the usability aspects of synthetic voices intended for elderly people in Ambient Assisted Living (AAL) systems. The first topic of the study is the selection of an appropriate age of Personal Life Assistant´s (PLA) voice intended for active seniors. The second topic is whether the user´s own voice is feasible in personal messages. Third, the use of rather short speech corpora from elderly people for HMM speaker adaptation is studied. The question is whether adapted voice is categorized to the same age group by listeners as the original. Corpus based unit-selection TTS and adapted HMM-TTS voices were created from elderly speech samples and these are compared to other middle-aged and elderly voices. In listening tests the synthesized sentences were evaluated and compared to natural speech samples by elderly test subjects. The authors found that the TTS voices of more pleasant (younger) speakers are preferred, HMM-TTS adapted voices of elderly speakers retained age identification features of the original recordings and are suitable for personal messages.
Keywords :
assisted living; audio databases; hidden Markov models; speech processing; speech synthesis; AAL systems; Europe; HMM speaker adaptation; PLA voice; adapted HMM-TTS voices; age identification features; ambient assisted living systems; corpus based unit-selection TTS; elderly speech samples; listening tests; middle-aged voices; natural speech samples; personal life assistant voice; personal messages; senior citizens; short speech corpora; synthesized sentences; synthetic elderly voices; synthetic voices usability aspects; Aging; Databases; Hidden Markov models; Programmable logic arrays; Senior citizens; Speech; Speech synthesis; elderly speakers; elderly users; speaker adaptation for AAL; speech synthesis; voice talent selection;
Conference_Titel :
Speech Technology and Human - Computer Dialogue (SpeD), 2013 7th Conference on
Conference_Location :
Cluj-Napoca
DOI :
10.1109/SpeD.2013.6682670