2010 IEEE 25th International Symposium on Defect and Fault Tolerance in VLSI Systems
Download PDF

Abstract

Cochlear implantation is the most effective treatment for severe deafness, which requires accurate localization of temporal bone anatomy. Preoperative CT image segmentation is an essential technique to determine the location of relevant tissues in the temporal bone. However, manual segmentation is usually time-consuming and suffers from low accuracy due to the complex and small structures of these tissues in temporal bone CT. To address this issue, we proposed a CNN-Transformer structure-based 3D multi-structured model for the automatic segmentation of fine and complex tissues such as the cochlea, facial nerve, ossicles, vestibule and semicircular canal in the temporal bone CT. Our model adopts a new Transformer deformation structure, which effectively utilizes the spatial attention mechanism to capture feature dependencies, and uses the channel attention mechanism to fuse different channel semantic representations to improve segmentation accuracy. Extensive experiments on a private temporal bone CT dataset show that our model achieves higher DSS and JSS scores, and lower HD95 and ASSD scores for all targets compared with other existing segmentation methods, demonstrating its superior performance.
Like what you’re reading?
Already a member?
Get this article FREE with a new membership!

Similar Articles