Abstract
Audio-based kinship verification (AKV) is important in many domains, such as home security monitoring, forensic identification, and social network analysis. A key challenge in the task arises from differences in age across samples from different individuals, which can be interpreted as a domain bias in a cross-domain verification task. To address this issue, we design the notion of an "age-standardised domain"wherein we utilise the optimised CycleGAN-VC3 network to perform age-audio conversion to generate the in-domain audio. The generated audio dataset is employed to extract a range of features, which are then fed into a metric learning architecture to verify kinship. Experiments are conducted on the KAN_AV audio dataset. The results demonstrate that the method markedly enhances the accuracy of kinship verification, while also offering novel insights for future kinship verification research.
| Original language | English |
|---|---|
| Journal | IEEE Signal Processing Letters |
| DOIs | |
| State | Accepted/In press - 2024 |
Keywords
- Audio Kinship Verification
- Deep Learning
- GAN
- Voice Conversion
Fingerprint
Dive into the research topics of 'Audio-based Kinship Verification Using Age Domain Conversion'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver