• DocumentCode
    2910721
  • Title

    Adopting Malay Syllable Structure for Syllable Based Speech Synthesizer for Iban and Bidayuh Languages

  • Author

    Juan, Sarah F S ; Edwin, Vyonne ; Chai Yeen Cheong ; Lee Jun Choi ; Yeo, Alvin W.

  • Author_Institution
    Fac. of Comput. Sci. & Inf. Technol., Univ. Malaysia Sarawak (UNIMAS), Kota Samarahan, Malaysia
  • fYear
    2011
  • fDate
    15-17 Nov. 2011
  • Firstpage
    216
  • Lastpage
    219
  • Abstract
    Sarawak, Malaysia, has many under-resourced languages, which stand to become extinct if measures are not taken to preserve and maintain them. These languages are mostly spoken by indigenous groups, and not all of them are documented or studied. As a preservation initiative, a Text to Speech (TTS) system has been built for the Iban and Bidayuh languages, two of the 44 living languages in Sarawak. To expedite development, we employed knowledge of a closely related language, Malay, which is the first language in Malaysia. In this paper, we employed a syllabification algorithm based on Malay syllable structure to build the Iban and Bidayuh syllable list and speech corpus. An accuracy test was conducted for the algorithm, and the quality of the output from the TTS system was determined using Categorical Estimation (CE). The tests showed a high percentage of accuracy, and quality had a mean score of 3.07 out of 5, suggesting that the approach works.
  • Keywords
    natural language processing; speech synthesis; Bidayuh language; Iban language; Malay syllable structure; Malaysia; Sarawak; accuracy test; categorical estimation; speech corpus; syllabification algorithm; syllable based speech synthesizer; syllable list; text-to-speech system; Encyclopedias; Gold; Pragmatics; Speech; Speech synthesis; Synthesizers; text to speech system
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Asian Language Processing (IALP), 2011 International Conference on
  • Conference_Location
    Penang
  • Print_ISBN
    978-1-4577-1733-8
  • Type

    conf

  • DOI
    10.1109/IALP.2011.21
  • Filename
    6121506