• DocumentCode
    2708252
  • Title

    WordNet Based Sindhi Text to Speech Synthesis System

  • Author

    Mahar, Javed Ahmed ; Memon, Ghulam Qadir ; Shah, Syed Hyder Abbass

  • Author_Institution
    Dept. of Comput. Sci., Shah Abdul Latif Univ., Khairpur, Pakistan
  • fYear
    2010
  • fDate
    7-10 May 2010
  • Firstpage
    20
  • Lastpage
    24
  • Abstract
    Text-to-speech (TTS) synthesis technology enables a machine to convert text into audible speech and is used throughout the world to make information more accessible. An important component of any TTS synthesis system is its database of sounds. In this study, three types of sound units (phonemes, diphones and syllables) are concatenated to produce natural-sounding speech for a good-quality Sindhi text-to-speech (STTS) system. The paper treats phonemes, diphones and syllables from the perspective of the lexicon. The methodology used in STTS exploits acoustic representations of speech for synthesis, together with linguistic analyses of text. Sindhi is a highly homographic language, and in real-life applications its text is written without diacritics, which creates lexical and morphological ambiguity. The problem of understanding non-diacritic words can be solved using semantic knowledge. This paper describes a Sindhi TTS synthesis system that relies on a WordNet to identify the analogical relations between words in the text. The proposed approach focuses on the use of WordNet structures for the task of synthesis. An architecture and a novel algorithm for STTS are proposed. Experiments using WordNet show promising results, and the accuracy of the proposed approach is acceptable.
  • Keywords
    linguistics; speech synthesis; Sindhi text to speech synthesis system; WordNet; homographic language; lexicon; linguistic text analyses; sounds database; Computer science; Concatenated codes; Databases; Joining processes; Natural languages; Research and development; Signal synthesis; Speech analysis; Speech enhancement; Speech synthesis; Diphone; Phoneme; Syllabification; Text-to-Speech; WordNet;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Research and Development, 2010 Second International Conference on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-0-7695-4043-6
  • Type

    conf

  • DOI
    10.1109/ICCRD.2010.31
  • Filename
    5489422