TY - JOUR
T1 - Federated machine learning for a facilitated implementation of Artificial Intelligence in healthcare - a proof of concept study for the prediction of coronary artery calcification scores
AU - Wolff, Justus
AU - Matschinske, Julian
AU - Baumgart, Dietrich
AU - Pytlik, Anne
AU - Keck, Andreas
AU - Natarajan, Arunakiry
AU - von Schacky, Claudio E.
AU - Pauling, Josch K.
AU - Baumbach, Jan
N1 - Publisher Copyright:
© 2022 the author(s), published by De Gruyter, Berlin/Boston.
PY - 2022/12/1
Y1 - 2022/12/1
N2 - The implementation of Artificial Intelligence (AI) still faces significant hurdles and one key factor is the access to data. One approach that could support that is federated machine learning (FL) since it allows for privacy preserving data access. For this proof of concept, a prediction model for coronary artery calcification scores (CACS) has been applied. The FL was trained based on the data in the different institutions, while the centralized machine learning model was trained on one allocation of data. Both algorithms predict patients with risk scores ≥5 based on age, biological sex, waist circumference, dyslipidemia and HbA1c. The centralized model yields a sensitivity of c. 66% and a specificity of c. 70%. The FL slightly outperforms that with a sensitivity of 67% while slightly underperforming it with a specificity of 69%. It could be demonstrated that CACS prediction is feasible via both, a centralized and an FL approach, and that both show very comparable accuracy. In order to increase accuracy, additional and a higher volume of patient data is required and for that FL is utterly necessary. The developed "CACulator" serves as proof of concept, is available as research tool and shall support future research to facilitate AI implementation.
AB - The implementation of Artificial Intelligence (AI) still faces significant hurdles and one key factor is the access to data. One approach that could support that is federated machine learning (FL) since it allows for privacy preserving data access. For this proof of concept, a prediction model for coronary artery calcification scores (CACS) has been applied. The FL was trained based on the data in the different institutions, while the centralized machine learning model was trained on one allocation of data. Both algorithms predict patients with risk scores ≥5 based on age, biological sex, waist circumference, dyslipidemia and HbA1c. The centralized model yields a sensitivity of c. 66% and a specificity of c. 70%. The FL slightly outperforms that with a sensitivity of 67% while slightly underperforming it with a specificity of 69%. It could be demonstrated that CACS prediction is feasible via both, a centralized and an FL approach, and that both show very comparable accuracy. In order to increase accuracy, additional and a higher volume of patient data is required and for that FL is utterly necessary. The developed "CACulator" serves as proof of concept, is available as research tool and shall support future research to facilitate AI implementation.
KW - artificial intelligence
KW - coronary artery calcification
KW - federated machine learning
KW - privacy-preserving data processing
UR - http://www.scopus.com/inward/record.url?scp=85145242087&partnerID=8YFLogxK
U2 - 10.1515/jib-2022-0032
DO - 10.1515/jib-2022-0032
M3 - Article
C2 - 36054833
AN - SCOPUS:85145242087
SN - 1613-4516
VL - 19
JO - Journal of integrative bioinformatics
JF - Journal of integrative bioinformatics
IS - 4
ER -