DocumentCode :
1423368
Title :
Census-based vision for auditory depth images and speech navigation of visually impaired users
Author :
Pei, Soo-Chang ; Wang, Yu-Ying
Author_Institution :
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume :
57
Issue :
4
fYear :
2011
fDate :
11/1/2011 12:00:00 AM
Firstpage :
1883
Lastpage :
1890
Abstract :
In neuroscience and psychology, visual imagery is the subjective experience of seeing in the absence of visual stimulation. Someone may experience touch or sound as a result of visual imagery. In this paper, a new visual image aid which can provide a different way to visualize the image for visually impaired users is proposed. It is done by applying the depth image to an Image-To-Sound Mapping (ITSM) system. The proposed algorithm utilizes a sparse Census transform (SCT) and color segmentation to obtain an illuminationinvariant depth image. The depth image is applied to the ITSM system and then a clear and simple sound output is obtained for constructing a mental image. Moreover, the reliable three-dimensional (3D) data of close objects are extracted and interpreted as a semantic speech output. Experimental results show that visually impaired users can perceive the image easily and without training by adding verbal description to the visually image aid. In good and poor illuminated environments, the performance is 82% and 80% respectively. The performance of our proposed systems was not influenced by various lighting. All subjects also commented that the systems would be potentially useful1.
Keywords :
feature extraction; handicapped aids; image segmentation; psychology; speech processing; vision defects; auditory depth images; census-based vision; color segmentation; illumination- invariant depth image; image-to-sound mapping system; mental image. constructing; neuroscience; psychology; sparse Census transform; speech navigation; verbal description; visual image aid; visual imagery; visually impaired users; Cameras; Computed tomography; Image segmentation; Speech; Three dimensional displays; Transforms; Visualization; Image-To-Sound Mapping (ITSM); Visual imagery; depth image; sparse Census transform (SCT).;
fLanguage :
English
Journal_Title :
Consumer Electronics, IEEE Transactions on
Publisher :
ieee
ISSN :
0098-3063
Type :
jour
DOI :
10.1109/TCE.2011.6131167
Filename :
6131167
Link To Document :
بازگشت