TY - JOUR
T1 - Feature selection and syndrome classification for rheumatoid arthritis patients with Traditional Chinese Medicine treatment
AU - Xie, Jingui
AU - Li, Yan
AU - Wang, Ning
AU - Xin, Ling
AU - Fang, Yanyan
AU - Liu, Jian
N1 - Publisher Copyright:
© 2020 Elsevier GmbH
PY - 2020/2
Y1 - 2020/2
N2 - Introduction: The classification of TCM syndromes is central to understanding the nature of diseases and improving treatment. This study focuses on selecting critical features of demographic information, personal medical history and symptoms and improving the accuracy of syndrome classification. Methods: A total of 1713 records were collected from the First Affiliated Hospital of Anhui Chinese Medicine University. Five rules for feature selection and six models were applied to classify TCM syndromes. Results: Patients with rheumatoid arthritis were diagnosed with one of four TCM syndromes: damp-heat obstruction syndrome (DHO, 60.5 %), phlegm and blood stagnation syndrome (PBS, 19.8 %), liver and kidney deficiency syndrome (LKD, 15.8 %), or wind-cold obstruction syndrome (WCO, 4 %). In total, 200 features were extracted from electronic medical records. From these, 42 were selected as critical features. The classification accuracy of using feature selection was higher than when using all features, with a maximum value of 0.88 for the Artificial neural network (ANN). Conclusions: Feature selection methods and classification techniques were applied to mine data on TCM syndromes. Feature selection improved the performance of the classification models. Of six algorithms, ANN had the highest accuracy for syndrome classification.
AB - Introduction: The classification of TCM syndromes is central to understanding the nature of diseases and improving treatment. This study focuses on selecting critical features of demographic information, personal medical history and symptoms and improving the accuracy of syndrome classification. Methods: A total of 1713 records were collected from the First Affiliated Hospital of Anhui Chinese Medicine University. Five rules for feature selection and six models were applied to classify TCM syndromes. Results: Patients with rheumatoid arthritis were diagnosed with one of four TCM syndromes: damp-heat obstruction syndrome (DHO, 60.5 %), phlegm and blood stagnation syndrome (PBS, 19.8 %), liver and kidney deficiency syndrome (LKD, 15.8 %), or wind-cold obstruction syndrome (WCO, 4 %). In total, 200 features were extracted from electronic medical records. From these, 42 were selected as critical features. The classification accuracy of using feature selection was higher than when using all features, with a maximum value of 0.88 for the Artificial neural network (ANN). Conclusions: Feature selection methods and classification techniques were applied to mine data on TCM syndromes. Feature selection improved the performance of the classification models. Of six algorithms, ANN had the highest accuracy for syndrome classification.
KW - Feature selection
KW - Machine learning
KW - Syndrome classification
KW - TCM
UR - http://www.scopus.com/inward/record.url?scp=85078656784&partnerID=8YFLogxK
U2 - 10.1016/j.eujim.2020.101059
DO - 10.1016/j.eujim.2020.101059
M3 - Article
AN - SCOPUS:85078656784
SN - 1876-3820
VL - 34
JO - European Journal of Integrative Medicine
JF - European Journal of Integrative Medicine
M1 - 101059
ER -