• DocumentCode
    2984565
  • Title

    A visualized acoustic saliency feature extraction method for environment sound signal processing

  • Author

    Jingyu Wang ; Ke Zhang ; Madani, Kurash ; Sabourin, Christophe

  • Author_Institution
    Sch. of Astronaut., Northwestern Polytech. Univ., Xi´an, China
  • fYear
    2013
  • fDate
    22-25 Oct. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Environment perception is an important research issue for both unmanned ground vehicles and robots. To improve the capacity of perception, a visualized acoustic saliency feature extraction (VASFE) method based on both the short-time Fourier transform (STFT) and the Mel-Frequency Cepstrum Coefficient (MFCC) for environment sound signal processing is proposed in this paper. Sound signal is visualized by using the STFT algorithm as local image feature and the Mel-Frequency Cepstrum Coefficient (MFCC) is used to represent the local acoustic feature of the signal. The proposed VASFE method is tested by the natural sound data which collected from real world of both indoor and outdoor environment. The results show that this method is able to extract the saliency features of both long-term and short-term sound signal successfully and clearly, and conducts to very distinguishable features for future processing of environment sound information.
  • Keywords
    Fourier transforms; acoustic signal processing; cepstral analysis; data visualisation; feature extraction; MFCC; STFT algorithm; VASFE method; environment perception; environment sound information; environment sound signal processing; indoor environment; local acoustic feature; local image feature; long-term sound signal; mel-frequency cepstrum coefficient; natural sound data; outdoor environment; perception capacity; robots; short-term sound signal; short-time Fourier transform; sound signal visualization; unmanned ground vehicles; visualized acoustic saliency feature extraction method; Equations; Feature extraction; Mel frequency cepstral coefficient; Signal processing algorithms; Spectrogram; Visualization; MFCC; STFT algorithm; environment perception; natural sound; spectrogram; visualized acoustic saliency;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    TENCON 2013 - 2013 IEEE Region 10 Conference (31194)
  • Conference_Location
    Xi´an
  • ISSN
    2159-3442
  • Print_ISBN
    978-1-4799-2825-5
  • Type

    conf

  • DOI
    10.1109/TENCON.2013.6718918
  • Filename
    6718918