• DocumentCode
    172561
  • Title

    Development of language resources for speech application in Gujarati and Marathi

  • Author

    Madhavi, Maulik C. ; Sharma, Shantanu ; Patil, Hemant A.

  • Author_Institution
    Dhirubhai Ambani Inst. of Inf. & Commun. Technol. (DA-IICT), Gandhinagar, India
  • fYear
    2014
  • fDate
    20-22 Oct. 2014
  • Firstpage
    115
  • Lastpage
    118
  • Abstract
    This paper discusses development of resources using linguistics and signal processing aspects for two low resource Indian languages, viz., Gujarati and Marathi. Speech resource development discusses the details of data collection, transcription at phone and syllable level and corresponding linguistic units such as phones and syllables. In order to analyze the performance at different fluency levels, three types of recording modes, viz., read, conversation and lecture are considered in this paper. Manual annotation of speech in terms of International Phonetic Alphabet (IPA) symbols is presented. In the later section, we discuss speech segmentation at syllable level and prosodic level marking (pitch marking). Short-term Energy contour is smoothened using group-delay-based algorithm in order to detect syllable units in the speech signal. Detection rate obtained for syllable marking within 20 % agreement duration is of the order of 60 % in case of read mode speech. Prosody pitch marks are analyzed via Fo pattern of a speech signal. The key strength of this study is the analysis for different kinds of recording modes, viz., read, conversation and lecture mode. It is found that CV (where, Consonant is followed by Vowel) type of syllables have highest occurrence (more than 50 %) in both the languages. Read speech is observed to perform better than spontaneous speech in terms of automatic prosodic marking.
  • Keywords
    natural language processing; speech processing; Gujarati language; IPA symbols; Indian languages; Marathi language; group-delay-based algorithm; international phonetic alphabet; language resource development; linguistics aspect; pitch marking; prosodic level marking; short-term energy contour; signal processing aspect; speech annotation; speech application; syllable level; Engines; Manuals; Pragmatics; Signal processing algorithms; Speech; Speech processing; Phonetic transcription; low resource language; pitch marking; syllabification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2014 International Conference on
  • Conference_Location
    Kuching
  • Type

    conf

  • DOI
    10.1109/IALP.2014.6973517
  • Filename
    6973517