• DocumentCode
    3560598
  • Title

    Degradation Type Classifier for Full Band Speech Contaminated With Echo, Broadband Noise, and Reverberation

  • Author

    Nunes, Leonardo O. ; Biscainho, Luiz W. P. ; Lee, Bowon ; Said, Amir ; Kalker, T. ; Schafer, Ronald W.

  • Author_Institution
    Signal Processing Laboratory, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
  • Volume
    19
  • Issue
    8
  • fYear
    2011
  • Firstpage
    2516
  • Lastpage
    2526
  • Abstract
    This paper addresses the problem of identifying impairment types that might be present in a speech signal. In particular, three acoustically induced degradation types that occur in teleconference systems are considered: acoustic echo, reverberation, and broadband noise, as well as combinations among them. The proposed system is double-ended (full reference) and is developed using a database of degraded full-band speech signals created according to a model for teleconference systems. A set of features obtained from both the degraded and non-degraded signals is proposed and shown to adequately capture information associated with each degradation type. A random forest classifier and a support vector machine are successfully employed, achieving a classification error below 2%. Such classifiers can be used to select an appropriate quality assessment tool for a given degraded signal.
  • Keywords
    echo; pattern classification; reverberation; speech processing; support vector machines; teleconferencing; acoustic echo; broadband noise; classification error; degradation type classifier; full band speech; impairment types; quality assessment tool; random forest classifier; speech signal; support vector machine; teleconference systems; Degradation; Feature extraction; Noise; Oral communication; Quality assessment; speech communication; speech quality assessment
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • Conference_Location
    4/21/2011 12:00:00 AM
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2011.2144973
  • Filename
    5753923