• DocumentCode
    2173344
  • Title

    Language identification using a combined articulatory prosody framework

  • Author

    Sangwan, Abhijeet ; Mehrabani, Mahnoosh ; Hansen, John H L

  • Author_Institution
    Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4400
  • Lastpage
    4403
  • Abstract
    This study presents new advancements in our articulatory-based language identification (LID) system. Our LID system automatically identifies language-features (LFs) from a phonological features (PFs) based representation of speech. While our baseline system uses a static PF-representation for extracting LFs, die new system is based on a dynamic PF representation for feature extraction. Interestingly, the new LFs outperform our baseline system by 11.8% absolute in a difficult 5-way classification task of South Indian Languages. Additionally, we incorporate pitch and energy based features in our new system to leverage prosody in classification. In particular, we employ a Legendre polynomial based contour-estimation to capture shape parameters which are used in classification. Additionally, die fusion of PF and prosody-based LFs further improves die overall classification result by 16.5% absolute over die baseline system Finally, die proposed articulatory language ID system is combined with a PPRLM (parallel phone recognition language model) system to obtain an overall classification accuracy of 86.6%.
  • Keywords
    Legendre polynomials; feature extraction; speech recognition; LID system; PF-representation; South Indian languages; articulatory prosody framework; capture shape parameters; feature extraction; language identification; language-features; legendre polynomial based contour-estimation; parallel phone recognition language model; phonological features; Accuracy; Feature extraction; Least squares approximation; Polynomials; Shape; Speech; Articulatory Features; Language Identification; Phonological Features; Prosodical Features;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947329
  • Filename
    5947329