• DocumentCode
    2266925
  • Title

    A new speech enhancement: speech stream segregation

  • Author

    Okuno, Hiroshi G. ; Nakatani, Tomohiro ; Kawabata, Takeshi

  • Author_Institution
    NTT Basic Res. Labs., Atsugi, Japan
  • Volume
    4
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    2356
  • Abstract
    Speech stream segregation is presented as a new speech enhancement method for automatic speech recognition. Two issues are addressed: speech stream segregation from a mixture of sounds, and interfacing speech stream segregation with automatic speech recognition. Speech stream segregation is modeled as a process of extracting harmonic fragments, grouping these extracted harmonic fragments, and substituting non-harmonic residue for the non-harmonic parts of groups. The main problem in interfacing speech stream segregation with HMM-based speech recognition is how to reduce the degradation of recognition performance due to spectral distortion of segregated sounds, which is caused mainly by the transfer function of a binaural input. Our solution is to retrain the HMM parameters with training data binauralized for four directions. Experiments with 500 mixtures of two women's utterances of a word showed that the cumulative accuracy of word recognition up to the 10th candidate for each woman's utterance is, on average, 75%.
  • Keywords
    feature extraction; hidden Markov models; speech enhancement; speech recognition; HMM based speech recognition; automatic speech recognition; binaural input; harmonic fragment extraction; interfacing speech stream segregation; non harmonic residue; recognition performance; segregated sounds; spectral distortion; speech enhancement; training data; transfer function; word recognition; Auditory system; Automatic speech recognition; Hearing aids; Hidden Markov models; Human voice; Laboratories; Layout; Psychoacoustic models; Speech enhancement; Speech processing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607281
  • Filename
    607281