• DocumentCode
    2199974
  • Title

    Improving speech intelligibility in cochlear implants using vocoder-centric acoustic models

  • Author

    Gladston, Anushiya Rachel ; Vijayalakshmi, P. ; Thangavelu, Nagarajan

  • Author_Institution
    Dept. of Electron. & Commun. Eng., SSN Coll. of Eng., Chennai, India
  • fYear
    2012
  • fDate
    19-21 April 2012
  • Firstpage
    66
  • Lastpage
    71
  • Abstract
    The cochlear implant is a prosthetic device, used to replace a damaged inner ear. It consists of an externally worn speech processor and an internal receiver stimulator. The cochlear implant is patient specific and system specific and so in the current work, a lab model for the speech processor, based on various vocoder models is designed to analyse the effect of system specific parameters such as filter bandwidth, number of channels and vocal excitation, on the speech intelligibility. Initially a formant vocoder is designed and used in the analysis and synthesis of English vowels. A channel vocoder is then developed for the same and extended to perform the analysis and synthesis of words from the Lexical Neighbourhood Test and sentences from the TIMIT database. The effect of number of channels on the synthetic speech quality is analysed and a 21-channel vocoder is found to yield the best response with a mean opinion score (MOS) of 4 out of 5 for vowels and 3.4 for sentences. The formant trajectories and CosH distance are also used to validate the speech intelligibility. The influence of glottal pulse on speech intelligibility is analysed and the synthetic speech is found to sound more natural with a glottal pulse train than an impulse train with an MOS of 4.2 for vowels and 4 for sentences.
  • Keywords
    cochlear implants; speech intelligibility; vocoders; CosH distance; English vowels; Lexical Neighbourhood Test; TIMIT database; cochlear implants; damaged inner ear; filter bandwidth; internal receiver stimulator; mean opinion score; prosthetic device; speech intelligibility; speech processor; vocal excitation; vocoder centric acoustic model; Band pass filters; Bandwidth; Cochlear implants; Natural languages; Speech; Trajectory; Vocoders; channel vocoder; cochlear implant; formant vocoder; glottal pulse; speech intelligibility;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Recent Trends In Information Technology (ICRTIT), 2012 International Conference on
  • Conference_Location
    Chennai, Tamil Nadu
  • Print_ISBN
    978-1-4673-1599-9
  • Type

    conf

  • DOI
    10.1109/ICRTIT.2012.6206795
  • Filename
    6206795