TY - GEN
T1 - A bag-of-audio-words approach for snore sounds' excitation localisation
AU - Schmitt, Maximilian
AU - Janott, Christoph
AU - Pandit, Vedhas
AU - Qian, Kun
AU - Heiser, Clemens
AU - Hemmert, Werner
AU - Schuller, Björn
N1 - Publisher Copyright:
© 2016 VDE VERLAG GMBH.
PY - 2016
Y1 - 2016
N2 - Habitual snoring and Obstructive Sleep Apnea are serious conditions that can affect the health of the snorer. For a targeted surgical treatment, it is crucial to identify the exact location of the vibration within the upper airways. As opposed to earlier work, we present the first unsupervised feature learning approach to this task based on bags-of-audio-words. Likewise, we cluster feature values within a given time-segment into acoustic 'words'. The frequency of occurrence per such word is then represented in a histogram per sound chunk to classify between four excitation locations. In extensive test runs based on snore sound data of 24 patients labelled by experts, we evaluated several feature sets as basis for audio word creation. In the result, we find audio words based on wavelet features, formants, and MFCC to be highly suited and outperform previous experiments based on the same data set.
AB - Habitual snoring and Obstructive Sleep Apnea are serious conditions that can affect the health of the snorer. For a targeted surgical treatment, it is crucial to identify the exact location of the vibration within the upper airways. As opposed to earlier work, we present the first unsupervised feature learning approach to this task based on bags-of-audio-words. Likewise, we cluster feature values within a given time-segment into acoustic 'words'. The frequency of occurrence per such word is then represented in a histogram per sound chunk to classify between four excitation locations. In extensive test runs based on snore sound data of 24 patients labelled by experts, we evaluated several feature sets as basis for audio word creation. In the result, we find audio words based on wavelet features, formants, and MFCC to be highly suited and outperform previous experiments based on the same data set.
KW - Bag-of-audio-words
KW - Drug induced sleep endoscopy
KW - Obstructive sleep apnea
KW - Snoring
KW - Unsupervised feature learning
UR - http://www.scopus.com/inward/record.url?scp=85073247180&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85073247180
T3 - Speech Communication - 12. ITG-Fachtagung Sprachkommunikation
SP - 230
EP - 234
BT - Speech Communication - 12. ITG-Fachtagung Sprachkommunikation
PB - VDE VERLAG GMBH
T2 - 12. ITG-Fachtagung Sprachkommunikation - 12th ITG Conference on Speech Communication
Y2 - 5 October 2016 through 7 October 2016
ER -