Skip to main navigation Skip to search Skip to main content

Transformer-based CNNs: Mining Temporal Context Information for Multi-sound COVID-19 Diagnosis

  • Imperial College London
  • University Hospital Augsburg

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Scopus citations

Abstract

Due to the COronaVIrus Disease 2019 (COVID-19) pandemic, early screening of COVID-19 is essential to prevent its transmission. Detecting COVID-19 with computer audition techniques has in recent studies shown the potential to achieve a fast, cheap, and ecologically friendly diagnosis. Respiratory sounds and speech may contain rich and complementary information about COVID-19 clinical conditions. Therefore, we propose training three deep neural networks on three types of sounds (breathing/counting/vowel) and assembling these models to improve the performance. More specifically, we employ Convolutional Neural Networks (CNNs) to extract spatial representations from log Mel spectrograms and a multi-head attention mechanism in the transformer to mine temporal context information from the CNNs' outputs. The experimental results demonstrate that the transformer-based CNNs can effectively detect COVID-19 on the DiCOVA Track-2 database (AUC: 70.0%) and outperform simple CNNs and hybrid CNN-RNNs.

Original languageEnglish
Title of host publication43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2335-2338
Number of pages4
ISBN (Electronic)9781728111797
DOIs
StatePublished - 2021
Externally publishedYes
Event43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021 - Virtual, Online, Mexico
Duration: 1 Nov 20215 Nov 2021

Publication series

NameProceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
Volume2021-January
ISSN (Print)1557-170X

Conference

Conference43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021
Country/TerritoryMexico
CityVirtual, Online
Period1/11/215/11/21

Fingerprint

Dive into the research topics of 'Transformer-based CNNs: Mining Temporal Context Information for Multi-sound COVID-19 Diagnosis'. Together they form a unique fingerprint.

Cite this