TY - JOUR
T1 - Comparative analysis of machine learning algorithms for multi-syndrome classification of neurodegenerative syndromes
AU - FTLD-Consortium Germany
AU - German Atypical Parkinson Consortium Study Group
AU - Lampe, Leonie
AU - Niehaus, Sebastian
AU - Huppertz, Hans Jürgen
AU - Merola, Alberto
AU - Reinelt, Janis
AU - Mueller, Karsten
AU - Anderl-Straub, Sarah
AU - Fassbender, Klaus
AU - Fliessbach, Klaus
AU - Jahn, Holger
AU - Kornhuber, Johannes
AU - Lauer, Martin
AU - Prudlo, Johannes
AU - Schneider, Anja
AU - Synofzik, Matthis
AU - Danek, Adrian
AU - Diehl-Schmid, Janine
AU - Otto, Markus
AU - Villringer, Arno
AU - Egger, Karl
AU - Hattingen, Elke
AU - Hilker-Roggendorf, Rüdiger
AU - Schnitzler, Alfons
AU - Südmeyer, Martin
AU - Oertel, Wolfgang
AU - Kassubek, Jan
AU - Höglinger, Günter
AU - Schroeter, Matthias L.
N1 - Publisher Copyright:
© 2022, The Author(s).
PY - 2022/12
Y1 - 2022/12
N2 - Importance: The entry of artificial intelligence into medicine is pending. Several methods have been used for the predictions of structured neuroimaging data, yet nobody compared them in this context. Objective: Multi-class prediction is key for building computational aid systems for differential diagnosis. We compared support vector machine, random forest, gradient boosting, and deep feed-forward neural networks for the classification of different neurodegenerative syndromes based on structural magnetic resonance imaging. Design, setting, and participants: Atlas-based volumetry was performed on multi-centric T1-weighted MRI data from 940 subjects, i.e., 124 healthy controls and 816 patients with ten different neurodegenerative diseases, leading to a multi-diagnostic multi-class classification task with eleven different classes. Interventions: N.A. Main outcomes and measures: Cohen’s kappa, accuracy, and F1-score to assess model performance. Results: Overall, the neural network produced both the best performance measures and the most robust results. The smaller classes however were better classified by either the ensemble learning methods or the support vector machine, while performance measures for small classes were comparatively low, as expected. Diseases with regionally specific and pronounced atrophy patterns were generally better classified than diseases with widespread and rather weak atrophy. Conclusions and relevance: Our study furthermore underlines the necessity of larger data sets but also calls for a careful consideration of different machine learning methods that can handle the type of data and the classification task best.
AB - Importance: The entry of artificial intelligence into medicine is pending. Several methods have been used for the predictions of structured neuroimaging data, yet nobody compared them in this context. Objective: Multi-class prediction is key for building computational aid systems for differential diagnosis. We compared support vector machine, random forest, gradient boosting, and deep feed-forward neural networks for the classification of different neurodegenerative syndromes based on structural magnetic resonance imaging. Design, setting, and participants: Atlas-based volumetry was performed on multi-centric T1-weighted MRI data from 940 subjects, i.e., 124 healthy controls and 816 patients with ten different neurodegenerative diseases, leading to a multi-diagnostic multi-class classification task with eleven different classes. Interventions: N.A. Main outcomes and measures: Cohen’s kappa, accuracy, and F1-score to assess model performance. Results: Overall, the neural network produced both the best performance measures and the most robust results. The smaller classes however were better classified by either the ensemble learning methods or the support vector machine, while performance measures for small classes were comparatively low, as expected. Diseases with regionally specific and pronounced atrophy patterns were generally better classified than diseases with widespread and rather weak atrophy. Conclusions and relevance: Our study furthermore underlines the necessity of larger data sets but also calls for a careful consideration of different machine learning methods that can handle the type of data and the classification task best.
KW - Comparative analysis
KW - Deep neural network
KW - Gradient boosting
KW - Multi-syndrome classification
KW - Neurodegenerative syndromes
KW - Random forest
KW - Support vector machine
UR - https://www.scopus.com/pages/publications/85129997063
U2 - 10.1186/s13195-022-00983-z
DO - 10.1186/s13195-022-00983-z
M3 - Article
C2 - 35505442
AN - SCOPUS:85129997063
SN - 1758-9193
VL - 14
JO - Alzheimer's Research and Therapy
JF - Alzheimer's Research and Therapy
IS - 1
M1 - 62
ER -