Artificial intelligence based Chinese clinical trials eligibility criteria classification_Journal of Biomedical Engineering

Authors：

ZONG Hui ¹ , ZHANG Zeyu ¹ , YANG Jinxuan ¹ , LEI Jianbo ² , LI Zuofeng ³ , HAO Tianyong ⁴ ,  ZHANG Xiaoyan ¹

1. School of Life Sciences and Technology, Tongji University, Shanghai 200092, P.R.China;
2. Center for Medical Informatics, Peking University, Beijing 100080, P.R.China;
3. Philips Research China, Shanghai 200072, P.R.China;
4. School of Computer Science, South China Normal University, Guangzhou 510631, P.R.China;

Corresponding?author：

ZHANG Xiaoyan, Email: xyzhang@#edu.cn

Keywords：

clinical trial; eligibility criteria; text classification; artificial intelligence; natural language processing

DOI：

10.7507/1001-5515.202006035

Video：

Export PDF Favorites Scan Get Citation

Abstract Full text Figures/Tables Video References Cited by

Subject recruitment is a key component that affects the progress and results of clinical trials, and generally conducted with eligibility criteria (includes inclusion criteria and exclusion criteria). The semantic category analysis of eligibility criteria can help optimizing clinical trials design and building automated patient recruitment system. This study explored the automatic semantic categories classification of Chinese eligibility criteria based on artificial intelligence by academic shared task. We totally collected 38 341 annotated eligibility criteria sentences and predefined 44 semantic categories. A total of 75 teams participated in competition, with 27 teams having submitted system outputs. Based on the results, we found out that most teams adopted mixed models. The mainstream resolution was applying pre-trained language models capable of providing rich semantic representation, which were combined with neural network models and used to fine-tune the models with reference to classifier tasks, and finally improved classification performance could be obtained by ensemble modeling. The best-performing system achieved a macro F1 score of 0.81 by using a pre-trained language model, i.e. bidirectional encoder representations from transformers (BERT) and ensemble modeling. With the error analysis we found out that from the point of data processing steps the data pre-processing and post-processing were very important for classification, while from the point of data volume these categories with less data volume showed lower classification performance. Finally, we hope that this study could provide a valuable dataset and state-of-the-art result for the research of Chinese medical short text classification.

Citation： ZONG Hui, ZHANG Zeyu, YANG Jinxuan, LEI Jianbo, LI Zuofeng, HAO Tianyong, ZHANG Xiaoyan. Artificial intelligence based Chinese clinical trials eligibility criteria classification. Journal of Biomedical Engineering, 2021, 38(1): 105-110, 121. doi: 10.7507/1001-5515.202006035 Copy

1.	Hao T, Rusanov A, Boland M R, et al. Clustering clinical trials with similar eligibility criteria features. J Biomed Inform, 2014, 52: 112-120.
2.	Zhang K, Ma H, Zhao Y, et al. The comparative experimental study of multilabel classification for diagnosis assistant based on Chinese obstetric EMRs. J Healthc Eng, 2018, 2018: 1-9.
3.	Yao Liang, Zhang Yin, Wei Baogang, et al. Traditional Chinese medicine clinical records classification using knowledge-powered document embedding//2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2016: 1926-1928.
4.	Zhang N L, Fu C, Liu T F, et al. A data-driven method for syndrome type identification and classification in traditional Chinese medicine. J Integr Med, 2017, 15(2): 110-123.
5.	Fridsma D B, Evans J, Hastak S, et al. The BRIDG project: a technical report. J Am Med Inform Assoc, 2008, 15(2): 130-137.
6.	Luo Z, Johnson S B, Weng C. Semi-automatically inducing semantic classes of clinical research eligibility criteria using UMLS and hierarchical clustering, 2010, 2010: 487-491.
7.	Luo Z, Yetisgen-Yildiz M, Weng C. Dynamic categorization of clinical research eligibility criteria by hierarchical clustering. J Biomed Inform, 2011, 44(6): 927-935.
8.	Zhang K, Demner-Fushman D. Automated classification of eligibility criteria in clinical trials to facilitate patient-trial matching for specific patient populations. J Am Med Inform Assoc, 2017, 24(4): 781-787.
9.	Stubbs A, Filannino M, Soysal E, et al. Cohort selection for clinical trials: n2c2 2018 shared task track 1. J Am Med Inform Assoc, 2019, 26(11): 1163-1171.
10.	Oleynik M, Kugic A, Kasá? Z, et al. Evaluating shallow and deep learning strategies for the 2018 n2c2 shared task on clinical text classification. J Am Med Inform Assoc, 2019, 26(11): 1247-1254.
11.	Gore L, Ivy S P, Balis F M, et al. Modernizing clinical trial eligibility: recommendations of the American society of clinical oncology-friends of cancer research minimum age working group. J Clin Oncol, 2017, 35(33): 3781-3787.
12.	Uldrick T S, Ison G, Rudek M, et al. Modernizing clinical trial eligibility criteria: recommendations of the American society of clinical oncology-friends of cancer research HIV working group. J Clin Oncol, 2017, 35(33): 3774-3780.
13.	Lichtman S M, Harvey R D, Damiette S A, et al. Modernizing clinical trial eligibility criteria: recommendations of the American society of clinical oncology-friends of cancer research organ dysfunction, prior or concurrent malignancy, and comorbidities working group. J Clin Oncol, 2017, 35(33): 3753-3759.
14.	Lin N U, Prowell T, Tan A R, et al. Modernizing clinical trial eligibility criteria: recommendations of the American society of clinical oncology-friends of cancer research brain metastases working group. J Clin Oncol, 2017, 35(33): 3760-3773.

1. Hao T, Rusanov A, Boland M R, et al. Clustering clinical trials with similar eligibility criteria features. J Biomed Inform, 2014, 52: 112-120.
2. Zhang K, Ma H, Zhao Y, et al. The comparative experimental study of multilabel classification for diagnosis assistant based on Chinese obstetric EMRs. J Healthc Eng, 2018, 2018: 1-9.
3. Yao Liang, Zhang Yin, Wei Baogang, et al. Traditional Chinese medicine clinical records classification using knowledge-powered document embedding//2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2016: 1926-1928.
4. Zhang N L, Fu C, Liu T F, et al. A data-driven method for syndrome type identification and classification in traditional Chinese medicine. J Integr Med, 2017, 15(2): 110-123.
5. Fridsma D B, Evans J, Hastak S, et al. The BRIDG project: a technical report. J Am Med Inform Assoc, 2008, 15(2): 130-137.
6. Luo Z, Johnson S B, Weng C. Semi-automatically inducing semantic classes of clinical research eligibility criteria using UMLS and hierarchical clustering, 2010, 2010: 487-491.
7. Luo Z, Yetisgen-Yildiz M, Weng C. Dynamic categorization of clinical research eligibility criteria by hierarchical clustering. J Biomed Inform, 2011, 44(6): 927-935.
8. Zhang K, Demner-Fushman D. Automated classification of eligibility criteria in clinical trials to facilitate patient-trial matching for specific patient populations. J Am Med Inform Assoc, 2017, 24(4): 781-787.
9. Stubbs A, Filannino M, Soysal E, et al. Cohort selection for clinical trials: n2c2 2018 shared task track 1. J Am Med Inform Assoc, 2019, 26(11): 1163-1171.
10. Oleynik M, Kugic A, Kasá? Z, et al. Evaluating shallow and deep learning strategies for the 2018 n2c2 shared task on clinical text classification. J Am Med Inform Assoc, 2019, 26(11): 1247-1254.
11. Gore L, Ivy S P, Balis F M, et al. Modernizing clinical trial eligibility: recommendations of the American society of clinical oncology-friends of cancer research minimum age working group. J Clin Oncol, 2017, 35(33): 3781-3787.
12. Uldrick T S, Ison G, Rudek M, et al. Modernizing clinical trial eligibility criteria: recommendations of the American society of clinical oncology-friends of cancer research HIV working group. J Clin Oncol, 2017, 35(33): 3774-3780.
13. Lichtman S M, Harvey R D, Damiette S A, et al. Modernizing clinical trial eligibility criteria: recommendations of the American society of clinical oncology-friends of cancer research organ dysfunction, prior or concurrent malignancy, and comorbidities working group. J Clin Oncol, 2017, 35(33): 3753-3759.
14. Lin N U, Prowell T, Tan A R, et al. Modernizing clinical trial eligibility criteria: recommendations of the American society of clinical oncology-friends of cancer research brain metastases working group. J Clin Oncol, 2017, 35(33): 3760-3773.

Journal of Biomedical Engineering

Artificial intelligence based Chinese clinical trials eligibility criteria classification

Abstract Full text Figures/Tables Video References Cited by

Previous Article

Next Article

Format

Content