DocumentCode
137184
Title
Automatic Phonetic Transcription for read, extempore and conversation speech for an Indian language: Bengali
Author
Manjunath, K.E. ; Rao, K. Sreenivasa
Author_Institution
Sch. of Inf. Technol., Indian Inst. of Technol., Kharagpur, Kharagpur, India
fYear
2014
fDate
Feb. 28 2014-March 2 2014
Firstpage
1
Lastpage
6
Abstract
In this work, we have analyzed the proposed Automatic Phonetic Transcription (APT) approach for read, extempore and conversation modes of speech for Bengali language. In our earlier work, the APT was carried out using read speech. In this paper, main focus is on deriving APT for Extempore and Conversation modes of speech in Bengali language and their analysis. This framework of deriving APT can be extended to any Indian language. The Automatic Phonetic Transcription Systems (APTS) were developed separately for read, extempore and conversation modes of speech. In this study, APT has been carried out on read, extempore and conversation modes of speech using 35, 33 and 30 phones respectively. APT has been carried out using Hidden Markov Models (HMMs) and FeedForward Neural Networks (FFNNs). Mel-frequency Cepstral Coefficients are used as features for building the models. The best obtained performance accuracies using HMMs for read, extempore and conversation modes are 41.65%, 29.20% and, 23.48% respectively. Using FFNNs, the recognition accuracies for read, extempore and conversation modes are 53.87%, 46.19% and 33.63% respectively.
Keywords
feedforward neural nets; hidden Markov models; natural language processing; speech processing; APT approach; Bengali language; FFNN; HMM; Indian language; Mel-frequency cepstral coefficients; automatic phonetic transcription approach; feedforward neural networks; hidden Markov models; Accuracy; Feature extraction; Hidden Markov models; Silicon; Speech; Speech recognition; Training; APT; Automatic Phonetic Transcription; Conversation speech; Extempore speech; FeedForward Neural Network; Hidden Markov Model; IPA; International Phonetic Alphabet; Read Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications (NCC), 2014 Twentieth National Conference on
Conference_Location
Kanpur
Type
conf
DOI
10.1109/NCC.2014.6811347
Filename
6811347
Link To Document