• DocumentCode
    672839
  • Title
    Language identification using Hilbert envelope and phase information of linear prediction residual
  • Author
    Nandi, Dipanjan ; Pati, Debadatta ; Rao, K. Sreenivasa
  • Author_Institution
    Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
  • fYear
    2013
  • fDate
    25-27 Nov. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper explores the magnitude and phase components of excitation source information for language identification (LID). The linear prediction (LP) residual of the speech signal represents the excitation source information. The magnitude and phase components of the LP residual are processed individually at the sub-segmental, segmental, and supra-segmental levels. Evidence from both components is combined to capture language-specific excitation source information. The LID studies are carried out on the IITKGP-MLILSC speech database. Segmental-level information yields better performance than sub-segmental or supra-segmental information. The combined evidence from the three levels represents the excitation source information. The study shows that both the magnitude and the phase of the LP residual contain significant language-specific excitation source information. The LID results further indicate that the phase component of the LP residual contains more language-discriminative information than the magnitude component.
  • Keywords
    audio databases; natural language processing; speech processing; Hilbert envelope; IITKGP-MLILSC speech database; LID studies; LP residual; language discriminative information; language identification study; language-specific excitation source information; linear prediction residual; magnitude components; phase components; phase information; segmental level information; speech signal; subsegmental level information; suprasegmental level information; Accuracy; Databases; Educational institutions; Feature extraction; Speech; Speech recognition; Vibrations; IITKGP-MLILSC; LP residual; segmental; sub-segmental; supra-segmental;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
  • Conference_Location
    Gurgaon
  • Type
    conf
  • DOI
    10.1109/ICSDA.2013.6709864
  • Filename
    6709864
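
The abstract in this record treats the LP residual as the excitation source signal and splits it into a magnitude part (the Hilbert envelope) and a phase part. The pure-Python sketch below illustrates that decomposition under stated assumptions; it is not the authors' implementation. The autocorrelation-method LP analysis, the LP order, the naive DFT, and the use of the cosine of the analytic-signal phase as the "residual phase" are all assumptions. The function names (`lpc`, `dft`, `residual_envelope_phase`) are hypothetical. The record gives no frame sizes and no details of the sub-segmental, segmental, and supra-segmental processing, so those steps are left out.

```python
import cmath
import math


def lpc(x, order):
    """Autocorrelation-method LP analysis via Levinson-Durbin.

    Returns [a_1, ..., a_order] for the predictor
    x_hat[n] = sum_k a_k * x[n - k].
    """
    n = len(x)
    r = [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(order + 1)]
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # degenerate (e.g. all-zero) input: stop early
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        prev = a[:]
        a[i] = k
        for j in range(1, i):
            a[j] = prev[j] - k * prev[i - j]
        err *= 1.0 - k * k
    return a[1:]


def dft(x, inverse=False):
    """Naive O(N^2) DFT, adequate for short illustrative frames."""
    n = len(x)
    sign = 1.0 if inverse else -1.0
    out = [
        sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
    return [v / n for v in out] if inverse else out


def residual_envelope_phase(x, order=10):
    """Return (LP residual, Hilbert envelope, cosine of residual phase)."""
    a = lpc(x, order)
    # Residual: prediction error, with samples before the start taken as zero.
    res = [
        x[n] - sum(a[k] * x[n - 1 - k] for k in range(min(order, n)))
        for n in range(len(x))
    ]
    # Analytic signal: zero the negative-frequency bins, double the positive.
    n = len(res)
    half = n // 2
    spec = dft(res)
    for k in range(n):
        if k == 0 or (n % 2 == 0 and k == half):
            gain = 1.0
        elif k <= half:
            gain = 2.0
        else:
            gain = 0.0
        spec[k] *= gain
    z = dft(spec, inverse=True)
    env = [abs(v) for v in z]  # magnitude component (Hilbert envelope)
    # Phase component, taken here (by assumption) as cos(theta) = Re(z) / |z|.
    cos_phase = [v.real / e if e > 1e-12 else 0.0 for v, e in zip(z, env)]
    return res, env, cos_phase
```

Calling `residual_envelope_phase(frame, order=10)` on a short list of samples returns three lists of equal length. In practice an FFT-based routine would replace the naive DFT, which is used here only to keep the sketch dependency-free.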