Construction of a prediction model for moderate to severe perimenopausal syndrome based on machine learning algorithms
Objective To identify risk factors for perimenopausal syndrome (PMS) among perimenopausal women using machine learning algorithms, and to construct a predictive model for the risk of developing moderate to severe PMS in perimenopausal women. Methods Perimenopausal women from 48 communites in Pudong...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | zho |
| Published: |
Editorial Office of New Medicine
2024-08-01
|
| Series: | Yixue xinzhi zazhi |
| Subjects: | |
| Online Access: | https://yxxz.whuznhmedj.com/futureApi/storage/attach/2408/VkKxzxor1ZuT53BfGiqnmvp0XscH9CmKpOwhQWY2.pdf |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Objective To identify risk factors for perimenopausal syndrome (PMS) among perimenopausal women using machine learning algorithms, and to construct a predictive model for the risk of developing moderate to severe PMS in perimenopausal women. Methods Perimenopausal women from 48 communites in Pudong New Area, Shanghai were selected as the study subjects. Based on the Kupperman index, participants were divided into the normal or mild PMS group and the moderate to severe PMS group. The data was randomly divided into training set and testing set, and feature selection was performed using the Boruta algorithm and SHAP algorithm. Logistic regression (LR), random forest (RF), support vector machine (SVM), and gradient boosting decision tree (GBDT) were constructed, and model performances were evaluated using accuracy, precision, recall, area under curve(AUC) of the receiver operating characteristic curve, and F1-score.Results A total of 856 perimenopausal women were included in the study, of which 557 were in the normal or mild PMS group and 299 were in the moderate to severe PMS group; 599 were in the training set and 257 were in the testing set. 9 features (employment status, exercise, age, menstrual condition, medical history, obesity, residence area, history of health education, household register) were selected as predictors for the final model using the Boruta algorithm and SHAP analysis. After parameter tuning, the 10-fold cross-validation AUC of LR, RF, SVM, and GBDT models based on the training set were 0.64, 0.77, 0.74, and 0.77, respectively. The AUC of the LR, RF, SVM, and GBDT models based on the testing set were 0.63, 0.69, 0.69, and 0.73, respectively, with recall rates of 0.59, 0.55, 0.55, and 0.62.Conclusion Among the constructed predictive models for the risk of developing moderate to severe PMS in perimenopausal women, the GBDT model demonstrated the best predictive performance and has potential clinical value. This study provides a new approach for the early identification and intervention of moderate to severe PMS in perimenopausal women. |
|---|---|
| ISSN: | 1004-5511 |