CvTMorph: Improving Local Feature Extraction in Medical Image Registration for Respiratory Motion Modeling with Convolutional Vision Transformer

Peizhi Chen; Xupeng Zou; Yifan Gou

doi:10.2174/0115734056302592240828074013

ISSN: 1573-4056
E-ISSN: 1875-6603

HTML

oa CvTMorph: Improving Local Feature Extraction in Medical Image Registration for Respiratory Motion Modeling with Convolutional Vision Transformer
Authors: Peizhi Chen¹, Xupeng Zou¹ and Yifan Gou¹
View Affiliations Hide Affiliations

¹ College of Computer and Information Engineering, Xiamen University of Technology, Xiamen 361024, China
Source: Current Medical Imaging, Volume 20, Issue 1, Jan 2024, e15734056302592
DOI: https://doi.org/10.2174/0115734056302592240828074013
- Received: 17 Mar 2024
- Accepted: 05 Aug 2024
- Available online: 01 Jan 2024

Abstract

Background

Accurately modeling respiratory motion in medical images is crucial for various applications, including radiation therapy planning. However, existing registration methods often struggle to extract local features effectively, limiting their performance.

Objective

In this paper, we aimed to propose a new framework called CvTMorph, which utilizes a Convolutional vision Transformer (CvT) and Convolutional Neural Networks (CNN) to improve local feature extraction.

Methods

CvTMorph integrates CvT and CNN to construct a hybrid model that combines the strengths of both approaches. Additionally, scaling and square layers are added to enhance the registration performance. We have evaluated the performance of CvTMorph on the 4D-Lung and DIR-Lab datasets and compared it with state-of-the-art methods to demonstrate its effectiveness.

Results

The experimental results have demonstrated CvTMorph to outperform the existing methods in terms of accuracy and robustness for respiratory motion modeling in 4D images. The incorporation of the convolutional vision transformer has significantly improved the registration performance and enhanced the representation of local structures.

Conclusion

CvTMorph offers a promising solution for accurately modeling respiratory motion in 4D medical images. The hybrid model, leveraging convolutional vision transformer and convolutional neural networks, has proven effective in extracting local features and improving registration performance. The results have highlighted the potential of CvTMorph for various applications, such as radiation therapy planning, and provided a basis for further research in this field.

This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International Public License (CC-BY 4.0), a copy of which is available at: https://creativecommons.org/licenses/by/4.0/legalcode. This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Article metrics loading...

/content/journals/cmir/10.2174/0115734056302592240828074013

2024-01-01

2026-02-18

From This Site

/content/journals/cmir/10.2174/0115734056302592240828074013

dcterms_title,dcterms_subject,pub_keyword

-contentType:Contributor -contentType:Concept -contentType:Institution

10

5

Full text loading...

/deliver/fulltext/cmir/20/1/CMIR-20-E15734056302592.html?itemId=/content/journals/cmir/10.2174/0115734056302592240828074013&mimeType=html&fmt=ahah

References

TanZ. LiuC. ZhouY. ShenW. Preliminary comparison of the registration effect of 4D-CBCT and 3D-CBCT in image-guided radiotherapy of Stage IA non–small-cell lung cancer.J. Radiat. Res.201758685486110.1093/jrr/rrx04028992047
[Google Scholar]
NakamotoM. AburayaN. SatoY. KonishiK. YoshinoI. HashizumeM. TamuraS. Surgical navigation system for cancer localization in collapsed lung based on estimation of lung deformation, medical image computing and computer-assisted intervention.Medical Image Computing and Computer-Assisted Intervention – MICCAI 2007: 10th International Conference, Brisbane, Australia, October 29 - November 2, 2007 pp. 68–76.
[Google Scholar]
BrostA. WimmerA. LiaoR. BourierF. KochM. StrobelN. KurzidimK. HorneggerJ. Constrained registration for motion compensation in atrial fibrillation ablation procedures.IEEE Trans. Med. Imaging201231487088110.1109/TMI.2011.218118422203705
[Google Scholar]
KleinS. StaringM. MurphyK. ViergeverM.A. PluimJ. A toolbox for intensity-based medical image registration.IEEE Trans. Med. Imaging201029119620510.1109/TMI.2009.203561619923044
[Google Scholar]
ShenD. DavatzikosC. HAMMER: hierarchical attribute matching mechanism for elastic registration.IEEE Trans. Med. Imaging200221111421143910.1109/TMI.2002.80311112575879
[Google Scholar]
AvantsB. EpsteinC. GrossmanM. GeeJ. Symmetric diffeomorphic image registration with cross-correlation: Evaluating automated labeling of elderly and neurodegenerative brain.Med. Image Anal.2008121264110.1016/j.media.2007.06.00417659998
[Google Scholar]
RonnebergerO. FischerP. BroxT. U-Net: Convolutional networks for biomedical image segmentation.arXiv2015
[Google Scholar]
MilletariF. NavabN. AhmadiS-A. V-Net: Fully convolutional neural networks for volumetric medical image segmentation.Fourth International Conference on 3D Vision (3DV), Stanford, CA, USA, 25-28 October 2016, pp.565-571.
[Google Scholar]
MehtaR. SivaswamyJ. M-net: A Convolutional Neural Network for deep brain structure segmentation.2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI)Melbourne, VIC, 18-21 April 2017, pp.437-440.2017
[Google Scholar]
BalakrishnanG. ZhaoA. SabuncuM.R. GuttagJ.V. DalcaA.V. An Unsupervised learning model for deformable medical image registration2018 IEEE/CVF Conference on Computer Vision and Pattern RecognitionSalt Lake City, UT, USA, 18-23 June 2018, 9252-9260.10.1109/CVPR.2018.00964
[Google Scholar]
BalakrishnanG. ZhaoA. SabuncuM.R. GuttagJ. DalcaA.V. VoxelMorph: A learning framework for deformable medical image registration.IEEE Trans. Med. Imaging20193881788180010.1109/TMI.2019.289753830716034
[Google Scholar]
DosovitskiyA. BeyerL. KolesnikovA. WeissenbornD. ZhaiX. UnterthinerT. DehghaniM. MindererM. HeigoldG. GellyS. UszkoreitJ. HoulsbyN. An image is worth 16x16 words: Transformers for image recognition at scale.ArXiv2021
[Google Scholar]
ChenJ. HeY. FreyE.C. LiY. DuY. ViT-V-Net: Vision transformer for unsupervised volumetric medical image registration.ArXiv2021
[Google Scholar]
WuH. XiaoB. CodellaN. C. F. LiuM. DaiX. YuanL. ZhangL. Introducing convolutions to vision transformersArXiv2021
[Google Scholar]
ChenJ. LuY. YuQ. LuoX. AdeliE. WangY. LuL. YuilleA.L. ZhouY. TransUNet: Transformers make strong encoders for medical image segmentation.ArXiv2021
[Google Scholar]
AshburnerJ. A fast diffeomorphic image registration algorithm.Neuroimage20073819511310.1016/j.neuroimage.2007.07.00717761438
[Google Scholar]
DalcaA.V. BalakrishnanG. GuttagJ. SabuncuM.R. Unsupervised learning of probabilistic diffeomorphic registration for images and surfaces.Med. Image Anal.20195722623610.1016/j.media.2019.07.00631351389
[Google Scholar]
JaderbergM. SimonyanK. ZissermanA. KavukcuogluK. Spatial transformer networks.ArXiv2015
[Google Scholar]
BalikS. WeissE. JanN. RomanN. SleemanW.C. FatygaM. ChristensenG.E. ZhangC. MurphyM.J. LuJ. KeallP. WilliamsonJ.F. HugoG.D. Evaluation of 4-dimensional computed tomography to 4-dimensional cone-beam computed tomography deformable image registration for lung cancer adaptive radiation therapy.Int. J. Radiat. Oncol. Biol. Phys.201386237237910.1016/j.ijrobp.2012.12.02323462422
[Google Scholar]
ClarkK. VendtB. SmithK. FreymannJ. KirbyJ. KoppelP. MooreS. PhillipsS. MaffittD. PringleM. TarboxL. PriorF. The Cancer imaging archive (TCIA): Maintaining and operating a public information repository.J. Digit. Imaging20132661045105710.1007/s10278‑013‑9622‑723884657
[Google Scholar]
HugoG.D. WeissE. SleemanW.C. BalikS. KeallP.J. LuJ. WilliamsonJ.F. A longitudinal four‐dimensional computed tomography and cone beam computed tomography dataset for image‐guided radiation therapy research in lung cancer.Med. Phys.201744276277110.1002/mp.1205927991677
[Google Scholar]
RomanN.O. ShepherdW. MukhopadhyayN. HugoG.D. WeissE. Interfractional positional variability of fiducial markers and primary tumors in locally advanced non-small-cell lung cancer during audiovisual biofeedback radiotherapy.Int. J. Radiat. Oncol. Biol. Phys.20128351566157210.1016/j.ijrobp.2011.10.05122391105
[Google Scholar]
CastilloR. CastilloE. GuerraR. JohnsonV.E. McPhailT. GargA.K. GuerreroT. A framework for evaluation of deformable image registration spatial accuracy using large landmark point sets.Phys. Med. Biol.20095471849187010.1088/0031‑9155/54/7/00119265208
[Google Scholar]
CastilloE. CastilloR. MartinezJ. ShenoyM. GuerreroT. Four-dimensional deformable image registration using trajectory modeling.Phys. Med. Biol.201055130532710.1088/0031‑9155/55/1/01820009196
[Google Scholar]
DiceL.R. Measures of the amount of ecologic association between species.Ecology194526329730210.2307/1932409
[Google Scholar]
ZhouZ. SiddiqueeM.M.R. TajbakhshN. LiangJ. UNet++: A Nested u-net architecture for medical image segmentation, deep learning in medical image analysis and multimodal learning for clinical decision support.ArXiv2018
[Google Scholar]
ZhouZ. SiddiqueeM.M. TajbakhshN. LiangJ. UNet++: Redesigning skip connections to exploit multiscale features in image segmentation.IEEE Trans. Med. Imaging20203961856186710.1109/TMI.2019.295960931841402
[Google Scholar]
FedorovA. BeichelR. Kalpathy-CramerJ. FinetJ. Fillion-RobinJ.C. PujolS. BauerC. JenningsD. FennessyF. SonkaM. BuattiJ. AylwardS. MillerJ.V. PieperS. KikinisR. 3D Slicer as an image computing platform for the Quantitative Imaging Network.Magn. Reson. Imag.20123091323134110.1016/j.mri.2012.05.00122770690
[Google Scholar]

/content/journals/cmir/10.2174/0115734056302592240828074013

CvTMorph: Improving Local Feature Extraction in Medical Image Registration for Respiratory Motion Modeling with Convolutional Vision Transformer

Curr. Med. Imaging 20, e15734056302592 (2024); https://doi.org/10.2174/0115734056302592240828074013

/content/journals/cmir/10.2174/0115734056302592240828074013

Data & Media loading...

Article Type: Research Article

Keyword(s): Convolutional neural network; Convolutional vision transformer; Feature extraction; Image registration; Medical image analysis; Respiratory motion modeling; Vision transformer

oa CvTMorph: Improving Local Feature Extraction in Medical Image Registration for Respiratory Motion Modeling with Convolutional Vision Transformer

Abstract

From This Site

Most Read This Month

Most Cited Most Cited RSS feed

Small Animal Computed Tomography Imaging

Brain Tumor Detection Using Machine Learning and Deep Learning: A Review

Low-dose COVID-19 CT Image Denoising Using CNN and its Method Noise Thresholding

How to Collect and Interpret Medical Pictures Captured in Highly Challenging Environments that Range from Nanoscale to Hyperspectral Imaging

SegEIR-Net: A Robust Histopathology Image Analysis Framework for Accurate Breast Cancer Classification

An Efficient Ensemble-based Machine Learning approach for Predicting Chronic Kidney Disease

Automated Diagnosis of Bone Metastasis by Classifying Bone Scintigrams Using a Self-defined Deep Learning Model

Prediction of Lumbar Pedicle Screw Loosening Using Hounsfield Units in Computed Tomography

Thyroid Nodules Classification using Weighted Average Ensemble and D-CRITIC Based TOPSIS Methods for Ultrasound Images

AI-assisted Method for Efficiently Generating Breast Ultrasound Screening Reports