Title :
Evaluating the effect of normalizing informal text on TTS output
Author :
Pennell, D.L. ; Yang Liu
Author_Institution :
Dept. of Comput. Sci., Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
Abbreviations in informal text, and research efforts to expand them to the standard English words from which they were derived, have become increasingly common. These methods are almost solely evaluated using the final word error rate (WER) after normalization; however, this metric may not be reasonable for a text-to-speech (TTS) system where words may be pronounced correctly despite being misspelled. This paper shows that normalization of informal text improves the output of TTS not only in terms of WER but also in terms of phoneme error rate (PER) and human perceptual experiments.
Keywords :
error statistics; speech synthesis; PER; TTS output; WER; final word error rate; human perceptual experiments; informal text normalization; phoneme error rate; standard English words; text-to-speech system; Accuracy; Dictionaries; Error analysis; Humans; Speech; Standards; Twitter; abbreviation expansion; noisy text; normalization; text-to-speech;
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4673-5125-6
Electronic_ISBN :
978-1-4673-5124-9
DOI :
10.1109/SLT.2012.6424271