مرکز منطقه ای اطلاع رساني علوم و فناوري - TISNet-Enhanced Fully Convolutional Network with EncoderDecoder Structure for Tongue Image Segmentation in Traditional Chinese Medicine

Title of article :

TISNet-Enhanced Fully Convolutional Network with EncoderDecoder Structure for Tongue Image Segmentation in Traditional Chinese Medicine

Author/Authors :

Huang, Xiaodong Faculty of Information Technology - Beijing University of Technology - Beijing, China , Zhang, Hui Faculty of Information Technology - Beijing University of Technology - Beijing, China , Zhuo, Li Faculty of Information Technology - Beijing University of Technology - Beijing, China , Li, Xiaoguang Faculty of Information Technology - Beijing University of Technology - Beijing, China , Zhang, Jing Faculty of Information Technology - Beijing University of Technology - Beijing, China

Pages :

From page :

To page :

Abstract :

Extracting the tongue body accurately from a digital tongue image is a challenge for automated tongue diagnoses, as the blurred edge of the tongue body, interference of pathological details, and the huge difference in the size and shape of the tongue. In this study, an automated tongue image segmentation method using enhanced fully convolutional network with encoder-decoder structure was presented. In the frame of the proposed network, the deep residual network was adopted as an encoder to obtain dense feature maps, and a Receptive Field Block was assembled behind the encoder. Receptive Field Block can capture adequate global contextual prior because of its structure of the multibranch convolution layers with varying kernels. Moreover, the Feature Pyramid Network was used as a decoder to fuse multiscale feature maps for gathering sufficient positional information to recover the clear contour of the tongue body. The quantitative evaluation of the segmentation results of 300 tongue images from the SIPL-tongue dataset showed that the average Hausdorff Distance, average Symmetric Mean Absolute Surface Distance, average Dice Similarity Coefficient, average precision, average sensitivity, and average specificity were 11.2963, 3.4737, 97.26%, 95.66%, 98.97%, and 98.68%, respectively. The proposed method achieved the best performance compared with the other four deep-learning-based segmentation methods (including SegNet, FCN, PSPNet, and DeepLab v3+). There were also similar results on the HIT-tongue dataset. The experimental results demonstrated that the proposed method can achieve accurate tongue image segmentation and meet the practical requirements of automated tongue diagnoses.

Keywords :

TISNet-Enhanced , Chine , Traditional

Journal title :

Computational and Mathematical Methods in Medicine

Serial Year :

2020

Full Text URL :

journals/cmmm/2020/6029258.pdf

Record number :

2613386

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=2613386