Skip to main navigation Skip to search Skip to main content

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

  • Ye Tian
  • , Jingqiang Zhu
  • , Lei Zhang
  • , Lichao Mou
  • , Xiaoxiang Zhu
  • , Yilei Shi
  • , Buyun Ma
  • , Wanjun Zhao
  • Sichuan University
  • MedAI Technology (Wuxi) Co. Ltd.

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

In recent years, the incidence of thyroid cancer has been increasing. Thyroid nodule detection is critical for both the detection and treatment of thyroid cancer. Convolutional neural networks (CNNs) have achieved good results in thyroid ultrasound image analysis tasks. However, due to the limited valid receptive field of convolutional layers, CNNs fail to capture long-range contextual dependencies, which are important for identifying thyroid nodules in ultrasound images. Transformer networks are effective in capturing long-range contextual information. Inspired by this, we propose a novel thyroid nodule detection method that combines the Swin Transformer backbone and Faster R-CNN. Specifically, an ultrasound image is first projected into a 1D sequence of embeddings, which are then fed into a hierarchical Swin Transformer. The Swin Transformer backbone extracts features at five different scales by utilizing shifted windows for the computation of self-attention. Subsequently, a feature pyramid network (FPN) is used to fuse the features from different scales. Finally, a detection head is used to predict bounding boxes and the corresponding confidence scores. Data collected from 2,680 patients were used to conduct the experiments, and the results showed that this method achieved the best mAP score of 44.8%, outperforming CNN-based baselines. In addition, we gained better sensitivity (90.5%) than the competitors. This indicates that context modeling in this model is effective for thyroid nodule detection.

Original languageEnglish
Article numbere64480
JournalJournal of Visualized Experiments
Volume2023
Issue number194
DOIs
StatePublished - Apr 2023
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Fingerprint

Dive into the research topics of 'A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images'. Together they form a unique fingerprint.

Cite this