An Ensemble Learning Based Framework for Traditional Chinese Medicine Data Analysis with ICD-10 Labels

Objective. This study aims to establish a model to analyze clinical experience of TCM veteran doctors. We propose an ensemble learning based framework to analyze clinical records with ICD-10 labels information for effective diagnosis and acupoints recommendation. Methods. We propose an ensemble lear...

Full description

Saved in:
Bibliographic Details
Main Authors: Gang Zhang, Yonghui Huang, Ling Zhong, Shanxing Ou, Yi Zhang, Ziping Li
Format: Article
Language:English
Published: Wiley 2015-01-01
Series:The Scientific World Journal
Online Access:http://dx.doi.org/10.1155/2015/507925
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832560320456949760
author Gang Zhang
Yonghui Huang
Ling Zhong
Shanxing Ou
Yi Zhang
Ziping Li
author_facet Gang Zhang
Yonghui Huang
Ling Zhong
Shanxing Ou
Yi Zhang
Ziping Li
author_sort Gang Zhang
collection DOAJ
description Objective. This study aims to establish a model to analyze clinical experience of TCM veteran doctors. We propose an ensemble learning based framework to analyze clinical records with ICD-10 labels information for effective diagnosis and acupoints recommendation. Methods. We propose an ensemble learning framework for the analysis task. A set of base learners composed of decision tree (DT) and support vector machine (SVM) are trained by bootstrapping the training dataset. The base learners are sorted by accuracy and diversity through nondominated sort (NDS) algorithm and combined through a deep ensemble learning strategy. Results. We evaluate the proposed method with comparison to two currently successful methods on a clinical diagnosis dataset with manually labeled ICD-10 information. ICD-10 label annotation and acupoints recommendation are evaluated for three methods. The proposed method achieves an accuracy rate of 88.2%  ±  2.8% measured by zero-one loss for the first evaluation session and 79.6%  ±  3.6% measured by Hamming loss, which are superior to the other two methods. Conclusion. The proposed ensemble model can effectively model the implied knowledge and experience in historic clinical data records. The computational cost of training a set of base learners is relatively low.
format Article
id doaj-art-78313f201fb641938b474e9adfea5e14
institution Kabale University
issn 2356-6140
1537-744X
language English
publishDate 2015-01-01
publisher Wiley
record_format Article
series The Scientific World Journal
spelling doaj-art-78313f201fb641938b474e9adfea5e142025-02-03T01:27:51ZengWileyThe Scientific World Journal2356-61401537-744X2015-01-01201510.1155/2015/507925507925An Ensemble Learning Based Framework for Traditional Chinese Medicine Data Analysis with ICD-10 LabelsGang Zhang0Yonghui Huang1Ling Zhong2Shanxing Ou3Yi Zhang4Ziping Li5School of Automation, Guangdong University of Technology, Guangzhou 510006, ChinaSchool of Automation, Guangdong University of Technology, Guangzhou 510006, ChinaSchool of Automation, Guangdong University of Technology, Guangzhou 510006, ChinaDepartment of Radiology, Guangzhou General Hospital of Guangzhou Military Command, Guangzhou 510010, ChinaDepartment of Plastic and Reconstructive Surgery, The First Affiliated Hospital of Sun Yat-Sen University, Guangzhou 510080, ChinaThe Second Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou 510120, ChinaObjective. This study aims to establish a model to analyze clinical experience of TCM veteran doctors. We propose an ensemble learning based framework to analyze clinical records with ICD-10 labels information for effective diagnosis and acupoints recommendation. Methods. We propose an ensemble learning framework for the analysis task. A set of base learners composed of decision tree (DT) and support vector machine (SVM) are trained by bootstrapping the training dataset. The base learners are sorted by accuracy and diversity through nondominated sort (NDS) algorithm and combined through a deep ensemble learning strategy. Results. We evaluate the proposed method with comparison to two currently successful methods on a clinical diagnosis dataset with manually labeled ICD-10 information. ICD-10 label annotation and acupoints recommendation are evaluated for three methods. The proposed method achieves an accuracy rate of 88.2%  ±  2.8% measured by zero-one loss for the first evaluation session and 79.6%  ±  3.6% measured by Hamming loss, which are superior to the other two methods. Conclusion. The proposed ensemble model can effectively model the implied knowledge and experience in historic clinical data records. The computational cost of training a set of base learners is relatively low.http://dx.doi.org/10.1155/2015/507925
spellingShingle Gang Zhang
Yonghui Huang
Ling Zhong
Shanxing Ou
Yi Zhang
Ziping Li
An Ensemble Learning Based Framework for Traditional Chinese Medicine Data Analysis with ICD-10 Labels
The Scientific World Journal
title An Ensemble Learning Based Framework for Traditional Chinese Medicine Data Analysis with ICD-10 Labels
title_full An Ensemble Learning Based Framework for Traditional Chinese Medicine Data Analysis with ICD-10 Labels
title_fullStr An Ensemble Learning Based Framework for Traditional Chinese Medicine Data Analysis with ICD-10 Labels
title_full_unstemmed An Ensemble Learning Based Framework for Traditional Chinese Medicine Data Analysis with ICD-10 Labels
title_short An Ensemble Learning Based Framework for Traditional Chinese Medicine Data Analysis with ICD-10 Labels
title_sort ensemble learning based framework for traditional chinese medicine data analysis with icd 10 labels
url http://dx.doi.org/10.1155/2015/507925
work_keys_str_mv AT gangzhang anensemblelearningbasedframeworkfortraditionalchinesemedicinedataanalysiswithicd10labels
AT yonghuihuang anensemblelearningbasedframeworkfortraditionalchinesemedicinedataanalysiswithicd10labels
AT lingzhong anensemblelearningbasedframeworkfortraditionalchinesemedicinedataanalysiswithicd10labels
AT shanxingou anensemblelearningbasedframeworkfortraditionalchinesemedicinedataanalysiswithicd10labels
AT yizhang anensemblelearningbasedframeworkfortraditionalchinesemedicinedataanalysiswithicd10labels
AT zipingli anensemblelearningbasedframeworkfortraditionalchinesemedicinedataanalysiswithicd10labels
AT gangzhang ensemblelearningbasedframeworkfortraditionalchinesemedicinedataanalysiswithicd10labels
AT yonghuihuang ensemblelearningbasedframeworkfortraditionalchinesemedicinedataanalysiswithicd10labels
AT lingzhong ensemblelearningbasedframeworkfortraditionalchinesemedicinedataanalysiswithicd10labels
AT shanxingou ensemblelearningbasedframeworkfortraditionalchinesemedicinedataanalysiswithicd10labels
AT yizhang ensemblelearningbasedframeworkfortraditionalchinesemedicinedataanalysiswithicd10labels
AT zipingli ensemblelearningbasedframeworkfortraditionalchinesemedicinedataanalysiswithicd10labels