• DocumentCode
    2811699
  • Title

    Feature extraction by incremental parsing for music indexing

  • Author

    Almoosa, Nawaf I. ; Bae, Soo Hyun ; Juang, Biing-Hwang

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    2410
  • Lastpage
    2413
  • Abstract
    In this paper, we employ a linguistic-processing approach to the content-based retrieval of music information. Central to the approach is the use of a lossy version of the Lempel-Ziv incremental parsing (LZIP) algorithm, which constructs a dictionary by incrementally parsing music feature vectors. LZIP is adopted as a source characterization technique owing to it´s universal-coding nature, and asymptotic convergence to the entropy of the source. The dictionary is composed of variable-length parsed representations, which are used to construct a highly sparse co-occurrence matrix, which counts the occurrence of the parsed representations in each music. As a feature analysis framework, Latent Semantic Analysis (LSA) is then applied to the co-occurrence matrix to generate a lower-dimensional approximation that exposes the most salient features of the represented audio documents. The aforementioned approach, in addition to adopting reduced sampling rates and quantized feature vectors, yields a system with reduced requirements in terms of processing and storage, and increases the tolerance to noisy queries. We demonstrate the performance of the system in the music genre classification problem, and analyze its robustness to perturbed queries. Moreover, we demonstrate that using the incremental parsing algorithm in forming the audio dictionary has superior retrieval performance compared to techniques yielding a dictionary with fixed-length entries such as vector quantization.
  • Keywords
    computational linguistics; content-based retrieval; dictionaries; feature extraction; music; sparse matrices; vector quantisation; Lempel-Ziv incremental parsing algorithm; asymptotic convergence; audio dictionary; audio document; content based retrieval; feature extraction; latent semantic analysis; linguistic processing approach; music feature vector; music indexing; source characterization technique; sparse cooccurrence matrix; universal coding nature; variable length parsed representation; vector quantization; Content based retrieval; Convergence; Dictionaries; Entropy; Feature extraction; Indexing; Music information retrieval; Noise reduction; Sampling methods; Sparse matrices; Lempel-Ziv; Music retrieval; feature extraction; incremental parsing; vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5496245
  • Filename
    5496245