Skip to main content

Table 3 Random forest feature importance (top 15 variables)

From: Association between breastfeeding duration and diabetes mellitus in menopausal women: a machine-learning analysis using population-based retrospective study

(a) Random-forest feature importance of prediction model for diabetes mellitus

Rank

Variables

Variable importance

1

HbA1c

0.26

2

Fasting glucose

0.18

3

LDL

0.05

4

Total cholesterol

0.05

5

HDL

0.03

6

Waist circumference

0.03

7

DBP

0.03

8

Triglyceride

0.03

9

BMI at enrollment

0.03

10

SBP

0.02

11

Age at enrollment

0.02

12

Age at menopause

0.02

13

Age at last delivery

0.02

14

Age at first delivery

0.02

15

Total breastfeeding duration

0.02

(b) Random-forest feature importance of prediction model for HbA1c

Rank

Variables

Variable importance

1

Diabetes mellitus

2757

2

BMI at enrollment

543

3

Age at enrollment

350

4

Age at menopause

307

5

Age at last delivery

275

6

Frequency of alcohol consumption

269

7

Age at menarche

255

8

Age at first delivery

236

9

Total breastfeeding duration

229

10

Occupation

217

11

Gravidity

202

12

Average breastfeeding duration

179

13

Household income

169

14

Stress awareness

164

15

Breastfeeding duration group

150

  1. DBP Diastolic blood pressure, HbA1c Hemoglobin A1c, LDL Low-density lipoprotein, HDL High-density lipoprotein, SBP Systolic blood pressure, BMI Body mass index, HbA1c Hemoglobin A1c