• DocumentCode
    3157990
  • Title

    Sparse image representations with shift and rotation invariance constraints

  • Author

    Tomokusa, Yu-ki ; Nakashizuka, Makoto ; Iiguni, Youji

  • Author_Institution
    Grad. Sch. of Eng. Sci., Osaka Univ., Toyonaka, Japan
  • fYear
    2009
  • fDate
    7-9 Jan. 2009
  • Firstpage
    256
  • Lastpage
    259
  • Abstract
    This paper presents a sparse image representation and its dictionary learning under shift and rotation invariance constraints. Sparse coding is a generative signal model that approximates signals by linear combinations of atoms in a dictionary. Since a sparsity penalty is introduced during signal approximation and dictionary learning, the dictionary represents primal structures of the signals. Under the shift and rotation invariance, the atoms in the dictionary are generated by the rotation and translation of the two-dimensional basic functions that indicate primal local structures of an image. The number of atoms for representation of an image is a product of the numbers of translation positions, rotation angles and basic functions. For the decomposition and dictionary learning, the huge storage capacity is required to store the coefficients, which are assigned to the atoms. In order to reduce the number of the coefficients and the computational burden, we propose a restricted image generative model for the shift and rotation invariant sparse representation. In experiment, the dictionary learning for synthetic and natural images is demonstrated. The results show that the sparse decomposition using the dictionary learnt by the proposed method can decompose images into parts which have different features.
  • Keywords
    image coding; image representation; image texture; unsupervised learning; vector quantisation; 2D basic functions; dictionary learning; generative signal model; image texture analysis; restricted image generative model; rotation invariance constraints; shift invariance constraints; signal approximation; sparse coding; sparse image representation; sparsity penalty; unsupervised learning; vector quantisation; Cost function; Dictionaries; Image generation; Image representation; Image texture analysis; Linear approximation; Signal generators; Signal processing; Signal representations; Unsupervised learning; Image texture analysis; signal representation; unsupervised learning; vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Signal Processing and Communication Systems, 2009. ISPACS 2009. International Symposium on
  • Conference_Location
    Kanazawa
  • Print_ISBN
    978-1-4244-5015-2
  • Electronic_ISBN
    978-1-4244-5016-9
  • Type

    conf

  • DOI
    10.1109/ISPACS.2009.5383854
  • Filename
    5383854