DocumentCode
672840
Title
Development of speech corpora in Gujarati and Marathi for phonetic transcription
Author
Malde, Kewal D. ; Vachhani, Bhavik B. ; Madhavi, Maulik C. ; Chhayani, Nirav H. ; Patil, Hemant A.
Author_Institution
Dhirubhai Ambani Inst. of Inf. & Commun. Technol., Gandhinagar, India
fYear
2013
fDate
25-27 Nov. 2013
Firstpage
1
Lastpage
6
Abstract
There has been growing interest in using speech technology in rural areas. In this context, this paper describes the development of speech corpora in two Indian languages, viz., Gujarati and Marathi, collected from remote villages, for the task of phonetic transcription, and presents a related analysis of that transcription. Manual phonetic transcription was carried out for 8 hours of field-recorded speech data captured in real-life settings. Dialectal variations are also analyzed using spectrograms and phonetic transcription. In addition, it was found that, among consonant sounds, plosives have large coverage within the broad phonetic categories. The collected speech corpora can be very useful for speech and speaker recognition tasks.
Keywords
speaker recognition; speech processing; Gujarati language; Indian languages; Marathi language; consonant sounds; dialectal variation analysis; field recorded speech data; phonetic transcription; plosive sounds; remote villages; rural areas; speaker recognition task; spectrograms; speech corpora development; speech recognition task; speech technology; Data collection; Manuals; Materials; Pragmatics; Spectrogram; Speech; Time-domain analysis; Database collection; Indian languages; dialectal variation; phonetic transcription;
fLanguage
English
Publisher
ieee
Conference_Titel
Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
Conference_Location
Gurgaon
Type
conf
DOI
10.1109/ICSDA.2013.6709865
Filename
6709865