Data visualization of multiple linear regression analysis practiced by R Studio software_Chinese Journal of Evidence-Based Medicine

Authors：

LI Duoduo ¹ ,  YU Xing ¹ , HAN Sheng ^2,3 , ZHU He ^2,3 , YUAN Yi ¹ , SHEN Jie ⁴ , LIN Jingfeng ⁵ , LI Xia ¹ , GAN Yena ¹ ,  LIU Jianping ⁶

1. Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, P.R.China;
2. School of Pharmaceutical Science, Peking University, Beijing 100191, P.R.China;
3. International Research Center for Medicinal Administration, Peking University, Beijing 100191, P.R.China;
4. iHealth Labs Inc, Shanghai 200235, P.R.China;
5. Dongfang Hospital, Beijing University of Chinese Medicine, Beijing 100078, P.R.China;
6. Centre for Evidence-Based Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, P.R.China;

Corresponding?author：

YU Xing, Email: yuxing34@sina.com; LIU Jianping, Email: liujp@bucm.edu.cn

Keywords：

R language; Data visualization; R Studio; Medicine; Multiple linear regression analysis (MLRA)

DOI：

10.7507/1672-2531.202008172

Video：

Export PDF Favorites Scan Get Citation

Abstract Full text Figures/Tables Video References Cited by

Objective To provide method references for data visualization of multiple linear regression analysis.Methods After importing data to R Studio, this paper conducted general descriptive statistics analysis, then constructed a linear model between independent variables and the target. After checking independence of observations, the normality of the target, and the linearity between variables, this paper estimated coefficients of independent variables, dealt with multicollinearity, tested significance of estimates and performed residual analysis to guarantee that the regression met its assumptions, and eventually used the fitted model for prediction.Results The multiple linear regression analysis implemented by R Studio software had better visualization functions and easier operation than traditional R language software.Conclusions R Studio software has good application value in realizing multiple linear regression analysis data visualization.

Citation： LI Duoduo, YU Xing, HAN Sheng, ZHU He, YUAN Yi, SHEN Jie, LIN Jingfeng, LI Xia, GAN Yena, LIU Jianping. Data visualization of multiple linear regression analysis practiced by R Studio software. Chinese Journal of Evidence-Based Medicine, 2021, 21(4): 482-490. doi: 10.7507/1672-2531.202008172 Copy

1.	許茜, 黃子杰, 蔡晶, 等. 基于大數據研究的醫學數據可視化. 中國衛生統計, 2017, 34(2): 347-349.
2.	王藝, 任淑霞. 醫療大數據可視化研究綜述. 計算機科學與探索, 2017, 11(5): 681-699.
3.	孫振球, 徐勇勇. 醫學統計學(第4版). 北京: 人民衛生出版社, 2014.
4.	張華, 王曉曉, 曾琳, 等. 斷點回歸設計在臨床治療性研究中的應用. 中國循證醫學雜志, 2018, 18(11): 1207-1211.
5.	郭梓鑫, 馮雨嘉, 王清華, 等. 應用R軟件metaplus程序包實現Meta分析. 中國循證醫學雜志, 2018, 18(7): 763-768.
6.	張天嵩. Stata和R軟件在生存資料Meta分析中的正確使用. 中國循證醫學雜志, 2017, 17(10): 1237-1240.
7.	馬桂峰, 盛紅旗, 馬安寧, 等. 基于logistic函數模型的我國衛生總費用發展階段分析. 中國衛生統計, 2017, 34(6): 976-978.
8.	王予, 徐洪斌. 基于多元線性回歸模型的社區居民孤獨感影響因素研究. 中國衛生統計, 2017, 34(2): 309-311.

1. 許茜, 黃子杰, 蔡晶, 等. 基于大數據研究的醫學數據可視化. 中國衛生統計, 2017, 34(2): 347-349.
2. 王藝, 任淑霞. 醫療大數據可視化研究綜述. 計算機科學與探索, 2017, 11(5): 681-699.
3. 孫振球, 徐勇勇. 醫學統計學(第4版). 北京: 人民衛生出版社, 2014.
4. 張華, 王曉曉, 曾琳, 等. 斷點回歸設計在臨床治療性研究中的應用. 中國循證醫學雜志, 2018, 18(11): 1207-1211.
5. 郭梓鑫, 馮雨嘉, 王清華, 等. 應用R軟件metaplus程序包實現Meta分析. 中國循證醫學雜志, 2018, 18(7): 763-768.
6. 張天嵩. Stata和R軟件在生存資料Meta分析中的正確使用. 中國循證醫學雜志, 2017, 17(10): 1237-1240.
7. 馬桂峰, 盛紅旗, 馬安寧, 等. 基于logistic函數模型的我國衛生總費用發展階段分析. 中國衛生統計, 2017, 34(6): 976-978.
8. 王予, 徐洪斌. 基于多元線性回歸模型的社區居民孤獨感影響因素研究. 中國衛生統計, 2017, 34(2): 309-311.

Previous Article
Methods for safety signal detection in healthcare databases: a literature review
Next Article
An introduction to the development methods and cases of living guidelines

Chinese Journal of Evidence-Based Medicine

Data visualization of multiple linear regression analysis practiced by R Studio software

Abstract Full text Figures/Tables Video References Cited by

Previous Article

Next Article

Format

Content