TY - JOUR
T1 - BRINGING THE DISCUSSION OF MINIMA SHARPNESS TO THE AUDIO DOMAIN
T2 - 49th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024
AU - Milling, Manuel
AU - Triantafyllopoulos, Andreas
AU - Tsangko, Iosif
AU - Rampp, Simon David Noel
AU - Schuller, Björn Wolfgang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The correlation between the sharpness of loss minima and generalisation in the context of deep neural networks has been subject to discussion for a long time. Whilst mostly investigated in the context of selected benchmark data sets in the area of computer vision, we explore this aspect for the acoustic scene classification task of the DCASE2020 challenge data. Our analysis is based on two-dimensional filter-normalised visualisations and a derived sharpness measure. Our exploratory analysis shows that sharper minima tend to show better generalisation than flat minima –even more so for out-of-domain data, recorded from previously unseen devices–, thus adding to the dispute about better generalisation capabilities of flat minima. We further find that, in particular, the choice of optimisers is a main driver of the sharpness of minima and we discuss resulting limitations with respect to comparability. Our code, trained model states and loss landscape visualisations are publicly available.
AB - The correlation between the sharpness of loss minima and generalisation in the context of deep neural networks has been subject to discussion for a long time. Whilst mostly investigated in the context of selected benchmark data sets in the area of computer vision, we explore this aspect for the acoustic scene classification task of the DCASE2020 challenge data. Our analysis is based on two-dimensional filter-normalised visualisations and a derived sharpness measure. Our exploratory analysis shows that sharper minima tend to show better generalisation than flat minima –even more so for out-of-domain data, recorded from previously unseen devices–, thus adding to the dispute about better generalisation capabilities of flat minima. We further find that, in particular, the choice of optimisers is a main driver of the sharpness of minima and we discuss resulting limitations with respect to comparability. Our code, trained model states and loss landscape visualisations are publicly available.
KW - acoustic scene classification
KW - deep neural networks
KW - generalisation
KW - loss landscape
KW - sharp minima
UR - http://www.scopus.com/inward/record.url?scp=85198560577&partnerID=8YFLogxK
U2 - 10.1109/ICASSP48485.2024.10446177
DO - 10.1109/ICASSP48485.2024.10446177
M3 - Conference article
AN - SCOPUS:85198560577
SN - 1520-6149
SP - 391
EP - 395
JO - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
JF - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Y2 - 14 April 2024 through 19 April 2024
ER -