• DocumentCode
    1365228
  • Title

    On Normalized MSE Analysis of Speech Fundamental Frequency in the Cochlear Implant-Like Spectrally Reduced Speech

  • Author

    Do, Cong-Thanh ; Pastor, Dominique ; Goalic, André

  • Author_Institution
    Inst. TELECOM, Univ. Europeenne de Bretagne, Brest, France
  • Volume
    57
  • Issue
    3
  • fYear
    2010
  • fDate
    3/1/2010 12:00:00 AM
  • Firstpage
    572
  • Lastpage
    577
  • Abstract
    In this paper, we present a quantitative study on the speech fundamental frequency (F0) of the cochlear implant-like spectrally reduced speech (SRS). The SRS was synthesized from the subband amplitude and frequency modulations (AM and FM) of original clean speech utterances selected from the TI-digits database. The SRS synthesis algorithm was derived from the frequency amplitude modulation encoding (FAME) strategy, proposed by Nie et al., 2005. The normalized mses (NMSEs), calculated between the F0 of the original clean speech and that of the SRSs, were analyzed. The NMSEs analysis of F0 revealed the greater F0 distortion in the AM-based SRS, which is the acoustic simulation of present-day cochlear implants, compared to the FAME-based SRS. This evidence supports the fact that current cochlear implant users have difficulty in the speaker recognition task as reported by Zeng et al., 2005. Further, the analysis results showed that it is better to keep the rapidly varying FM components to reduce the F0 distortion in the FAME-based SRS at low spectral resolution.
  • Keywords
    cochlear implants; speaker recognition; speech synthesis; FAME; SRS synthesis algorithm; TI-digits database; clean speech utterances; cochlear implant-like spectrally reduced speech; frequency amplitude modulation encoding; frequency modulations; normalized MSE analysis; speaker recognition; speech fundamental frequency; subband amplitude; Acoustic distortion; Amplitude modulation; Analytical models; Cochlear implants; Databases; Encoding; Frequency modulation; Frequency synthesizers; Speech analysis; Speech synthesis; Amplitude modulation (AM); cochlear implant; frequency modulation (FM); spectrally reduced speech (SRS); speech fundamental frequency (F0); Adult; Algorithms; Child; Cochlear Implants; Computer Simulation; Female; Humans; Male; Models, Statistical; Signal Processing, Computer-Assisted; Sound Spectrography; Speech;
  • fLanguage
    English
  • Journal_Title
    Biomedical Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9294
  • Type

    jour

  • DOI
    10.1109/TBME.2009.2031097
  • Filename
    5233802