Audio-based Kinship Verification Using Age Domain Conversion

Qiyang Sun, Alican Akman, Xin Jing, Manuel Milling, Bjorn W. Schuller

Research output: Contribution to journalArticlepeer-review

Abstract

Audio-based kinship verification (AKV) is important in many domains, such as home security monitoring, forensic identification, and social network analysis. A key challenge in the task arises from differences in age across samples from different individuals, which can be interpreted as a domain bias in a cross-domain verification task. To address this issue, we design the notion of an "age-standardised domain"wherein we utilise the optimised CycleGAN-VC3 network to perform age-audio conversion to generate the in-domain audio. The generated audio dataset is employed to extract a range of features, which are then fed into a metric learning architecture to verify kinship. Experiments are conducted on the KAN_AV audio dataset. The results demonstrate that the method markedly enhances the accuracy of kinship verification, while also offering novel insights for future kinship verification research.

Original languageEnglish
JournalIEEE Signal Processing Letters
DOIs
StateAccepted/In press - 2024

Keywords

  • Audio Kinship Verification
  • Deep Learning
  • GAN
  • Voice Conversion

Fingerprint

Dive into the research topics of 'Audio-based Kinship Verification Using Age Domain Conversion'. Together they form a unique fingerprint.

Cite this