Title :
Predicting vowel substitution in code-switched speech
Author :
Thipe I. Modipa;Marelie H. Davel
Author_Institution :
HLT Research Group, CSIR Meraka, South Africa
Abstract :
The accuracy of automatic speech recognition (ASR) systems typically degrades when encountering code-switched speech. Some of this degradation is due to the unexpected pronunciation effects introduced when languages are mixed. Embedded (foreign) phonemes typically show more variation than phonemes from the matrix language: either approximating the embedded language pronunciation fairly closely, or realised as any of a set of phonemic counterparts from the matrix language. In this paper we describe a technique for predicting the phoneme substitutions that are expected to occur during code-switching, using non-acoustic features only. As case study we consider Sepedi/English code switching and analyse the different realisations of the English schwa. A code-switched speech corpus is used as input and vowel substitutions identified by auto-tagging this corpus based on acoustic characteristics. We first evaluate the accuracy of our auto-tagging process, before determining the predictability of our auto-tagged corpus, using non-acoustic features.
Keywords :
"Speech","Hidden Markov models","Switches","Speech coding","Speech recognition","Matrices","Acoustics"
Conference_Titel :
Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech), 2015
DOI :
10.1109/RoboMech.2015.7359515