• DocumentCode
    2534920
  • Title
    A psychovisually tuned image codec
  • Author
    Zhai, Guangtao ; Wu, Xiaolin ; Niu, Yi
  • Author_Institution
    ECE Dept., McMaster Univ., Hamilton, ON, Canada
  • fYear
    2011
  • fDate
    17-19 Oct. 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    A psychovisual quality driven image codec exploiting the psychological and neurological process of visual perception is proposed in this paper. Recent findings in brain theory and neuroscience suggest that visual perception is a process of fitting the brain's internal generative model to the outside retinal stimuli, and that psychovisual quality is related to how accurately visual sensory data can be explained by the internal generative model. Therefore, the design criterion of our psychovisually tuned image compression system is to find a compact description of the optimal generative model from the input image on the encoding end, which is then used to regenerate the output image on the decoding end. By noting an important finding from empirical natural image statistics, namely that natural images have scale invariant features in the pixels' high order statistics, the generative model can be efficiently compressed through model preserving spatial downsampling on the encoder. The decoder can then reverse the process with a model preserving upsampling module to generate the decoded image. The proposed system is fully standard compliant because the downsampled image can be compressed with any existing codec (JPEG2000 in this work). The proposed algorithm is shown to systematically outperform JPEG2000 over a wide bit rate range in terms of both subjective and objective quality.
  • Keywords
    data compression; decoding; image coding; psychology; JPEG2000; decoded image; internal generative model; natural image statistics; neurological process; optimal generative model; psychological process; psychovisual quality driven image codec; psychovisually tuned image codec; psychovisually tuned image compression system; retina stimuli; spatial preserving downsampling model; visual perception; Adaptation models; Brain modeling; Computational modeling; Decoding; Image coding; Transform coding; Visualization;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia Signal Processing (MMSP), 2011 IEEE 13th International Workshop on
  • Conference_Location
    Hangzhou
  • Print_ISBN
    978-1-4577-1432-0
  • Electronic_ISBN
    978-1-4577-1433-7
  • Type
    conf
  • DOI
    10.1109/MMSP.2011.6093772
  • Filename
    6093772
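
The pipeline outlined in the abstract (downsample on the encoder, compress the small image with an off-the-shelf codec, upsample on the decoder) can be illustrated with a rough sketch. Everything below is an assumption made for illustration: the function names are hypothetical, `downsample2x` is a plain 2x2 box filter and `upsample2x` is pixel replication rather than the model preserving modules the paper proposes, and `toy_codec` is a uniform quantizer standing in for JPEG2000.

```python
def downsample2x(img):
    """Average each 2x2 block.

    A plain box filter, used only as a placeholder for the paper's
    model preserving downsampler. A trailing odd row/column is dropped.
    """
    h, w = len(img), len(img[0])
    return [
        [
            (img[y][x] + img[y][x + 1] + img[y + 1][x] + img[y + 1][x + 1]) / 4.0
            for x in range(0, w - 1, 2)
        ]
        for y in range(0, h - 1, 2)
    ]


def upsample2x(img):
    """Replicate every pixel into a 2x2 block.

    Placeholder for the paper's model preserving upsampler.
    """
    out = []
    for row in img:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def toy_codec(img, step=8):
    """Stand-in for the third-party codec (JPEG2000 in the paper).

    Quantizes and immediately dequantizes with a hypothetical step size,
    so the result is what a decoder would see after lossy coding.
    """
    return [[round(v / step) * step for v in row] for row in img]


def encode(img, step=8):
    # Encoder side: shrink first, then hand the small image to the codec.
    return toy_codec(downsample2x(img), step)


def decode(coded):
    # Decoder side: restore the original resolution.
    return upsample2x(coded)


if __name__ == "__main__":
    image = [
        [10, 10, 50, 50],
        [10, 10, 50, 50],
        [90, 90, 130, 130],
        [90, 90, 130, 130],
    ]
    for row in decode(encode(image)):
        print(row)
```

Because the codec only ever sees the downsampled image, any standard codec can be dropped in for `toy_codec` without changing the surrounding structure, which is the sense in which the abstract calls the system standard compliant.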