• DocumentCode
    989998
  • Title

    Auditory Segmentation Based on Onset and Offset Analysis

  • Author

    Hu, Guoning ; Wang, DeLiang

  • Author_Institution
    Ohio State Univ., Columbus, OH
  • Volume
    15
  • Issue
    2
  • fYear
    2007
  • Firstpage
    396
  • Lastpage
    405
  • Abstract
    A typical auditory scene in a natural environment contains multiple sources. Auditory scene analysis (ASA) is the process in which the auditory system segregates a scene into streams corresponding to different sources. Segmentation is a major stage of ASA by which an auditory scene is decomposed into segments, each containing signal mainly from one source. We propose a system for auditory segmentation by analyzing onsets and offsets of auditory events. The proposed system first detects onsets and offsets, and then generates segments by matching corresponding onset and offset fronts. This is achieved through a multiscale approach. A quantitative measure is suggested for segmentation evaluation. Systematic evaluation shows that most of target speech, including unvoiced speech, is correctly segmented, and target speech and interference are well separated into different segments
  • Keywords
    speech processing; auditory scene analysis; auditory segmentation; interference; multiscale approach; offset analysis; onset analysis; target speech; unvoiced speech; Auditory system; Filtering; Hidden Markov models; Image analysis; Interference; Layout; Speech analysis; Speech enhancement; Speech processing; Time frequency analysis; Auditory segmentation; event detection; multiscale analysis; onset and offset;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2006.881700
  • Filename
    4067035