TY - JOUR
T1 - Screening/diagnosis of pediatric endocrine disorders through the artificial intelligence model in different language settings
AU - Ying, Lingwen
AU - Li, Sichen
AU - Chen, Chunyang
AU - Yang, Fan
AU - Li, Xin
AU - Chen, Yao
AU - Ding, Yu
AU - Chang, Guoying
AU - Li, Juan
AU - Wang, Xiumin
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/6
Y1 - 2024/6
N2 - This study is aimed at examining the impact of ChatGPT on pediatric endocrine and metabolic conditions, particularly in the areas of screening and diagnosis, in both Chinese and English modes. A 40-question questionnaire covering the four most common pediatric endocrine and metabolic conditions was posed to ChatGPT in both Chinese and English three times each. Six pediatric endocrinologists evaluated the responses. ChatGPT performed better when responding to questions in English, with an unreliable rate of 7.5% compared to 27.5% for Chinese questions, indicating a more consistent response pattern in English. Among the reliable questions, the answers were more comprehensive and satisfactory in the English mode. We also found disparities in ChatGPT’s performance when interacting with different target groups and diseases, with improved performance for questions posed by clinicians in English and better performance for questions related to diabetes and overweight/obesity in Chinese for both clinicians and patients. Language comprehension, providing incomprehensive answers, and errors in key data were the main contributors to the low scores, according to reviewer feedback. Conclusion: Despite these limitations, as ChatGPT continues to evolve and expand its network, it has significant potential as a practical and effective tool for clinical diagnosis and treatment. (Table presented.)
AB - This study is aimed at examining the impact of ChatGPT on pediatric endocrine and metabolic conditions, particularly in the areas of screening and diagnosis, in both Chinese and English modes. A 40-question questionnaire covering the four most common pediatric endocrine and metabolic conditions was posed to ChatGPT in both Chinese and English three times each. Six pediatric endocrinologists evaluated the responses. ChatGPT performed better when responding to questions in English, with an unreliable rate of 7.5% compared to 27.5% for Chinese questions, indicating a more consistent response pattern in English. Among the reliable questions, the answers were more comprehensive and satisfactory in the English mode. We also found disparities in ChatGPT’s performance when interacting with different target groups and diseases, with improved performance for questions posed by clinicians in English and better performance for questions related to diabetes and overweight/obesity in Chinese for both clinicians and patients. Language comprehension, providing incomprehensive answers, and errors in key data were the main contributors to the low scores, according to reviewer feedback. Conclusion: Despite these limitations, as ChatGPT continues to evolve and expand its network, it has significant potential as a practical and effective tool for clinical diagnosis and treatment. (Table presented.)
KW - Artificial intelligence
KW - ChatGPT
KW - Language mode
KW - Pediatric endocrine and metabolism
KW - Physician and patients
KW - Screening and diagnosis
UR - http://www.scopus.com/inward/record.url?scp=85188070131&partnerID=8YFLogxK
U2 - 10.1007/s00431-024-05527-1
DO - 10.1007/s00431-024-05527-1
M3 - Article
C2 - 38502320
AN - SCOPUS:85188070131
SN - 0340-6199
VL - 183
SP - 2655
EP - 2661
JO - European Journal of Pediatrics
JF - European Journal of Pediatrics
IS - 6
ER -