• DocumentCode
    672840
  • Title
    Development of speech corpora in Gujarati and Marathi for phonetic transcription
  • Author
    Malde, Kewal D. ; Vachhani, Bhavik B. ; Madhavi, Maulik C. ; Chhayani, Nirav H. ; Patil, Hemant A.
  • Author_Institution
    Dhirubhai Ambani Inst. of Inf. & Commun. Technol., Gandhinagar, India
  • fYear
    2013
  • fDate
    25-27 Nov. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    There has been growing interest in using speech technology in rural areas. In this context, this paper describes the development of speech corpora in two Indian languages, Gujarati and Marathi, recorded in remote villages, for the task of phonetic transcription. It also presents a related analysis of the phonetic transcription. Manual phonetic transcription was carried out in both languages for 8 hours of field-recorded speech data collected in real-life settings. Dialectal variations are also analyzed using spectrograms and phonetic transcription. In addition, among consonant sounds, plosives were found to have large coverage within the broad phonetic categories. The collected speech corpora can be useful for speech and speaker recognition tasks.
  • Keywords
    speaker recognition; speech processing; Gujarati language; Indian languages; Marathi language; consonant sounds; dialectal variation analysis; field recorded speech data; phonetic transcription; plosive sounds; remote villages; rural areas; speaker recognition task; spectrograms; speech corpora development; speech recognition task; speech technology; Data collection; Manuals; Materials; Pragmatics; Spectrogram; Speech; Time-domain analysis; Database collection; Indian languages; dialectal variation; phonetic transcription
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
  • Conference_Location
    Gurgaon
  • Type
    conf
  • DOI
    10.1109/ICSDA.2013.6709865
  • Filename
    6709865