Predicting Biological Signals from Speech: Introducing a Novel Multimodal Dataset and Results

Alice Baird, Shahin Amiriparian, Miriam Berschneider, Maximilian Schmitt, Björn Schuller

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

In recent years, diagnosis and awareness of mental health conditions, e. g., chronic stress, have been increasing globally. Biological signals can be an effective way to monitor such conditions, yet acquisition can be cumbersome and invasive. Alternatively, acoustic features offer non-invasive and efficient monitoring of an array of health and wellbeing characteristics. This study presents the BioSpeech Database (BioS-DB), a novel database of audio and biological signals - blood volume pulse (BVP) and skin conductance (SC) - from 55 individuals speaking aloud in front of others, whilst having their emotional state annotated in real time. Through a variation of conventional and state-of-the-art approaches, initial experiments have shown for the first time that acoustic features can be applied for the task of BVP prediction. Notably, using deep representations of audio and a sequence-to-sequence auto-encoders with a GRU-RNN as a time-dependent regressor achieved at best 0.075 and 0.123 RMSE for [0; 1] normalised BVP and SC, respectively.

Original languageEnglish
Title of host publicationIEEE 21st International Workshop on Multimedia Signal Processing, MMSP 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728118178
DOIs
StatePublished - Sep 2019
Externally publishedYes
Event21st IEEE International Workshop on Multimedia Signal Processing, MMSP 2019 - Kuala Lumpur, Malaysia
Duration: 27 Sep 201929 Sep 2019

Publication series

NameIEEE 21st International Workshop on Multimedia Signal Processing, MMSP 2019

Conference

Conference21st IEEE International Workshop on Multimedia Signal Processing, MMSP 2019
Country/TerritoryMalaysia
CityKuala Lumpur
Period27/09/1929/09/19

Fingerprint

Dive into the research topics of 'Predicting Biological Signals from Speech: Introducing a Novel Multimodal Dataset and Results'. Together they form a unique fingerprint.

Cite this