• DocumentCode
    3113384
  • Title

    Sparse representation based language identification using prosodic features for Indian languages

  • Author

    Singh, O.P. ; Haris, B.C. ; Sinha, Roopak ; Chettri, Bhusan ; PRADHAN, ASHOK KUMAR

  • Author_Institution
    Dept. of Electron. & Electr. Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
  • fYear
    2013
  • fDate
    13-15 Dec. 2013
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This work explores the use of prosodic feature based sparse representation classification (SRC) system for language identification (LID) task. The prosodic features are computed by extracting syllable like unit with the help of a vowel onset points detection algorithm and mapped to i-vector domain for SRC using an exemplar dictionary. This work is a motivation from recently reported LID approach using low-dimensional i-vectors. The experiments are performed on a locally collected dataset consisting of five Indian languages. On comparing the SRC system performance with that of a contrast system based on cosine distance scoring (CDS), it is noted that the former one performs significantly better than the latter one. The performance of the best system with session/channel compensation in terms of equal error rate and minimum detection cost function turns out to be 7.46% and 0.1338, respectively.
  • Keywords
    natural language processing; signal classification; signal representation; speech processing; vectors; Indian languages; LID task; SRC system; channel compensation; equal error rate; exemplar dictionary; i-vector domain; language identification task; low-dimensional i-vectors; minimum detection cost function; prosodic features; session compensation; sparse representation classification system; speech signal segmentation; syllable like unit extraction; vowel onset points detection algorithm; Error analysis; Feature extraction; Rhythm; Speech; Support vector machines; Training; Vectors; Language identification; i-vector; prosodic features; sparse representation; vowel onset point;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    India Conference (INDICON), 2013 Annual IEEE
  • Conference_Location
    Mumbai
  • Print_ISBN
    978-1-4799-2274-1
  • Type

    conf

  • DOI
    10.1109/INDCON.2013.6726123
  • Filename
    6726123