Abstract
Audio-based kinship verification (AKV) is important in many domains, such as home security monitoring, forensic identification, and social network analysis. A key challenge in the task arises from differences in age across samples from different individuals, which can be interpreted as a domain bias in a cross-domain verification task. To address this issue, we design the notion of an "age-standardised domain"wherein we utilise the optimised CycleGAN-VC3 network to perform age-audio conversion to generate the in-domain audio. The generated audio dataset is employed to extract a range of features, which are then fed into a metric learning architecture to verify kinship. Experiments are conducted on the KAN_AV audio dataset.The results demonstrate that the method markedly enhances the accuracy of kinship verification, while also offering novel insights for future kinship verification research.
| Original language | English |
|---|---|
| Pages (from-to) | 301-305 |
| Number of pages | 5 |
| Journal | IEEE Signal Processing Letters |
| Volume | 32 |
| DOIs | |
| State | Published - 2025 |
| Externally published | Yes |
Keywords
- Audio kinship verification
- GAN
- deep learning
- voice conversion
Fingerprint
Dive into the research topics of 'Audio-Based Kinship Verification Using Age Domain Conversion'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver