Citation
Luo, Zhiqiang and Lim, Tek Yong and Hua, Xia (2024) Segmentation of Plain CT Image of Ischemic Lesion based on Trans-Swin-UNet. JOIV : International Journal on Informatics Visualization, 8 (3-2). pp. 1573-1581. ISSN 2549-9610
Text
3028-8998-1-PB.pdf - Published Version (3MB). Restricted to Repository staff only.
Abstract
The present study aims to build a hybrid convolutional neural network and transformer UNet-based model, Trans-Swin-UNet, to segment ischemic lesions in plain computed tomography (CT) images. The model architecture is based on TransUNet and makes four main improvements: first, it replaces the decoder of TransUNet with a Swin transformer; second, it adds a Max Attention module to the skip connection; third, it uses a comprehensive loss function; and last, it speeds up segmentation. The study designs two experiments to evaluate the model on both a self-collected and a public plain CT image dataset. The model optimization experiment evaluates the improvements of Trans-Swin-UNet over TransUNet. Its results show that each improvement achieves better performance than TransUNet in terms of dice similarity coefficient (DSC), Jaccard coefficient (JAC), and accuracy (ACC). The comparison experiment compares the built model with four existing UNet-based models. Its results show that the built model had a DSC of 0.72±0.01, a JAC of 0.78±0.04, and an ACC of 0.75±0.03 on the self-collected plain CT image dataset, and a DSC of 0.73±0.02, a JAC of 0.79±0.03, and an ACC of 0.76±0.02 on the public plain CT image dataset, achieving the best segmentation performance among the five UNet-based neural network models. Together, the two experiments indicate that the built model could accurately segment ischemic lesions in plain CT images. The limitations and future work of this study are also discussed.
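For readers unfamiliar with the three reported metrics, the following is a minimal sketch of how DSC, JAC, and ACC are commonly computed for a pair of binary masks. It uses the standard textbook definitions and is not taken from the paper; the function names `confusion_counts` and `segmentation_metrics`, the flat 0/1 list representation of a mask, and the convention of scoring two empty masks as 1.0 are all assumptions made for illustration.

```python
from typing import Dict, Sequence, Tuple


def confusion_counts(pred: Sequence[int], truth: Sequence[int]) -> Tuple[int, int, int, int]:
    """Count TP, FP, FN, TN over two flattened binary masks (hypothetical helper)."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of pixels")
    if len(pred) == 0:
        raise ValueError("masks must not be empty")
    tp = fp = fn = tn = 0
    for p, t in zip(pred, truth):
        if p and t:
            tp += 1      # lesion pixel correctly predicted
        elif p and not t:
            fp += 1      # background predicted as lesion
        elif t:
            fn += 1      # lesion pixel missed
        else:
            tn += 1      # background correctly predicted
    return tp, fp, fn, tn


def segmentation_metrics(pred: Sequence[int], truth: Sequence[int]) -> Dict[str, float]:
    """Return DSC, JAC and ACC using the standard definitions (assumed, not from the paper)."""
    tp, fp, fn, tn = confusion_counts(pred, truth)
    overlap_denominator = tp + fp + fn
    # Assumption: when neither mask contains lesion pixels, overlap is scored as perfect.
    dsc = 2 * tp / (2 * tp + fp + fn) if overlap_denominator else 1.0
    jac = tp / overlap_denominator if overlap_denominator else 1.0
    acc = (tp + tn) / (tp + fp + fn + tn)
    return {"DSC": dsc, "JAC": jac, "ACC": acc}


if __name__ == "__main__":
    predicted = [1, 1, 0, 0, 1, 0]   # toy flattened prediction mask
    reference = [1, 0, 0, 0, 1, 1]   # toy flattened ground-truth mask
    print(segmentation_metrics(predicted, reference))
```

In practice the masks would be flattened 2D or 3D arrays, and the per-image scores would be averaged over a dataset to obtain mean ± standard deviation figures like those quoted above.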
Item Type: | Article |
---|---|
Uncontrolled Keywords: | TransUNet; Medical image segmentation; Ischemic lesion; Swin transformer; Attention gate |
Subjects: | T Technology > TR Photography > TR624-835 Applied photography Including artistic, commercial, medical photography, photocopying processes |
Divisions: | Faculty of Computing and Informatics (FCI) |
Depositing User: | Ms Nurul Iqtiani Ahmad |
Date Deposited: | 03 Jan 2025 05:40 |
Last Modified: | 03 Jan 2025 05:40 |
URI: | http://shdl.mmu.edu.my/id/eprint/13280 |