• DocumentCode
    431624
  • Title

    Relating the acoustic space of vowels to the perceptual space in cochlear implant simulations

  • Author

    Liu, Chuping ; Fu, Qian-Jie

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
  • Volume
    3
  • fYear
    2005
  • fDate
    18-23 March 2005
  • Abstract
    In automatic speech recognition, speech features are measured and used as reference templates for machine learning and recognition. The differences between these features can be used to calculate relative acoustic distances between phonemes. However, when speech signals are spectrally degraded, as in electric hearing with cochlear implants, it is unclear whether these acoustic distances can predict speech recognition performance. The present study measured acoustic distances between spectrally degraded vowel tokens and investigated the relation between acoustic vowel space and perceptual vowel space. After processing vowel tokens using a cochlear implant simulation, the Mel-frequency cepstrum was extracted from each token; features were then time aligned and the weighted Euclidean distance was calculated between all tokens. Results demonstrate a significant correlation between vowel perception data and averaged acoustic distance between vowel tokens, for a variety of experimental conditions. These results suggest that acoustic distance between phonemes may well predict the recognition performance of spectrally degraded speech.
  • Keywords
    acoustic signal processing; digital simulation; hearing aids; speech processing; speech recognition; Mel-frequency cepstrum; acoustic distance; acoustic distances; acoustic vowel space; automatic speech recognition; cochlear implant; machine learning; perceptual space; perceptual vowel space; spectrally degraded speech signals; spectrally degraded vowel tokens; vowel perception data; weighted Euclidean distance; Acoustic measurements; Auditory system; Automatic speech recognition; Cepstrum; Cochlear implants; Data mining; Degradation; Extraterrestrial measurements; Machine learning; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8874-7
  • Type

    conf

  • DOI
    10.1109/ICASSP.2005.1415639
  • Filename
    1415639