• DocumentCode
    3723547
  • Title

    Development of Assamese Text-to-speech synthesis system

  • Author

    Bidisha Sharma;Nagaraj Adiga;S. R. Mahadeva Prasanna

  • Author_Institution
    Dept. of Electronics and Electrical Engineering, Indian Institute of Technology Guwahati, India
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper presents the design and development of Assamese Text to speech (TTS) synthesis system. In particular, work focused on designing language specific rules, developing quality database, data segmentation, and to handle bilingual sound units. In Assamese language, till now no study is done to construct the grapheme to phoneme conversion rules. In this work, grapheme to phoneme conversion rules are proposed for Assamese language. The database is recorded by checking the speaking rate, variation in amplitude level, dc wandering, and clipping during data collection. A significant improvement in the synthesized voice is observed by ensuring uniform speaking rate, controlling variation in the signal amplitude level, and avoiding dc wandering and clipping during data collection. A semi-automatic segmentation approach is developed for data segmentation. Initially, segmentation is done by automatic process and later manual correction of segmentation boundaries is done to improve quality and intelligibility. It also reduce time required for the segmentation process. The developed TTS can work in bilingual mode. It can switch between Assamese and English language smoothly and maintains the sentence level intonation even for mixed texts.
  • Keywords
    "Speech","Databases","Hidden Markov models","High-temperature superconductors","Data collection","Buildings","Switches"
  • Publisher
    ieee
  • Conference_Titel
    TENCON 2015 - 2015 IEEE Region 10 Conference
  • ISSN
    2159-3442
  • Print_ISBN
    978-1-4799-8639-2
  • Electronic_ISBN
    2159-3450
  • Type

    conf

  • DOI
    10.1109/TENCON.2015.7372786
  • Filename
    7372786