DocumentCode :
3399393
Title :
Optimization of letter to sound rules construction
Author :
Vasek, Marie ; Rozinaj, Gregor
Author_Institution :
Inst. of Telecommun., Slovak Univ. of Technol., Bratislava, Slovakia
fYear :
2013
fDate :
5-7 June 2013
Firstpage :
1
Lastpage :
6
Abstract :
This article falls within the field of speech processing - speech synthesis. The content is focused on phonetic transcription of the text. The transcript rules as well as dictionary of phonetic transcriptions are important tools for the phonetic transcription. The article deals with the transcript rules called LTS (Letter-To-Sound). Automatically generated LTS rules are presented in form of binary decision trees as a solution especially suitable for flexible languages. This article introduces an idea how to improve and optimize the training process of LTS rules (data-driven approach). Optimization consists of two phases. The first phase lies in the training data preprocessing and in the training database preparation. The second phase aims to achieve an optimal size of LTS rules regarding the size of phonetic transcription dictionary. This paper introduces a method how to generate residual dictionary of exceptions. The size of generated LTS rules has a direct impact on the size of residual (delta) dictionary. The paper describes the comparison between individual iterations of LTS rules training process. Iterations differ from each other due to a different threshold parameter. Finally article presents the results achieved using this method to select the most optimal threshold parameter.
Keywords :
decision trees; optimisation; speech processing; text analysis; LTS rules; Letter-To-Sound; binary decision trees; data driven approach; flexible languages; letter optimisation; phonetic transcription dictionary; phonetic transcriptions; sound rules construction; speech processing speech synthesis; text phonetic transcription; Buildings; Classification algorithms; Databases; Dictionaries; Optimization; Speech; Training; CART; LTS rules; Phonetic Transcription; Speech Synthesis; data-driven;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Symposium (SPS), 2013
Conference_Location :
Serock
Type :
conf
DOI :
10.1109/SPS.2013.6623600
Filename :
6623600
Link To Document :
بازگشت