DocumentCode
172561
Title
Development of language resources for speech application in Gujarati and Marathi
Author
Madhavi, Maulik C. ; Sharma, Shantanu ; Patil, Hemant A.
Author_Institution
Dhirubhai Ambani Inst. of Inf. & Commun. Technol. (DA-IICT), Gandhinagar, India
fYear
2014
fDate
20-22 Oct. 2014
Firstpage
115
Lastpage
118
Abstract
This paper discusses development of resources using linguistics and signal processing aspects for two low resource Indian languages, viz., Gujarati and Marathi. Speech resource development discusses the details of data collection, transcription at phone and syllable level and corresponding linguistic units such as phones and syllables. In order to analyze the performance at different fluency levels, three types of recording modes, viz., read, conversation and lecture are considered in this paper. Manual annotation of speech in terms of International Phonetic Alphabet (IPA) symbols is presented. In the later section, we discuss speech segmentation at syllable level and prosodic level marking (pitch marking). Short-term Energy contour is smoothened using group-delay-based algorithm in order to detect syllable units in the speech signal. Detection rate obtained for syllable marking within 20 % agreement duration is of the order of 60 % in case of read mode speech. Prosody pitch marks are analyzed via Fo pattern of a speech signal. The key strength of this study is the analysis for different kinds of recording modes, viz., read, conversation and lecture mode. It is found that CV (where, Consonant is followed by Vowel) type of syllables have highest occurrence (more than 50 %) in both the languages. Read speech is observed to perform better than spontaneous speech in terms of automatic prosodic marking.
Keywords
natural language processing; speech processing; Gujarati language; IPA symbols; Indian languages; Marathi language; group-delay-based algorithm; international phonetic alphabet; language resource development; linguistics aspect; pitch marking; prosodic level marking; short-term energy contour; signal processing aspect; speech annotation; speech application; syllable level; Engines; Manuals; Pragmatics; Signal processing algorithms; Speech; Speech processing; Phonetic transcription; low resource language; pitch marking; syllabification;
fLanguage
English
Publisher
ieee
Conference_Titel
Asian Language Processing (IALP), 2014 International Conference on
Conference_Location
Kuching
Type
conf
DOI
10.1109/IALP.2014.6973517
Filename
6973517
Link To Document