Establishment and assessment of an early screening model for cervical cancer based on single-cell Raman spectroscopy combined with machine learning algorithms
Objective To establish an early screening model for cervical cancer based on single-cell Raman spectroscopy (SCRS) combined with machine learning algorithms, and to assess the performance of the model. Methods Cervical exfoliated cell samples were collected from 128 patients who were treated in our...
| Main Author: | |
|---|---|
| Format: | Article |
| Language: | zho |
| Published: | Editorial Office of Journal of Precision Medicine, 2025-08-01 |
| Series: | 精准医学杂志 |
| Subjects: | |
| Online Access: | https://jpmed.qdu.edu.cn/fileup/2096-529X/PDF/1754471584044-1741177256.pdf |
| _version_ | 1849335570706202624 |
|---|---|
| author | MA Dongmei, ZHAO Wenjie, LIU Shihai, XU Haicang, CAI Duo, JI Yuetong, XU Jian, GUO Cancan, MA Bo, PAN Huazheng |
| collection | DOAJ |
| description | Objective: To establish an early screening model for cervical cancer based on single-cell Raman spectroscopy (SCRS) combined with machine learning algorithms, and to assess the model's performance. Methods: Cervical exfoliated cell samples were collected from 128 patients treated in our hospital from September 2023 to June 2024; 65 had normal ThinPrep cytologic test (TCT) results, 35 had abnormal TCT results, and 28 had not undergone TCT. The 100 samples with TCT results were divided in R into a training set and a test set at a ratio of 8:2, and SCRS was performed on all samples. Based on the SCRS data of the training set, seven machine learning algorithms (KNN, PLS, LDA, RF, SVM, SVMRBF, and Stack) were used to build early screening models for cervical cancer, and the models were applied to the test set to identify the optimal one. The optimal model was then used to predict the TCT results of the 100 samples in the training and test sets, and the predictions were compared with the actual TCT results. The remaining 28 samples without prior TCT results served as a validation set: they underwent TCT, the optimal model predicted their TCT results, and the predictions were compared with the actual results. Results: The relative intensities of the characteristic Raman peaks at 874, 935, 1024, 1119, 1250, 1328, 1569, and 1642 cm⁻¹ differed significantly among cervical exfoliated cells negative for intraepithelial lesion or malignancy, atypical squamous cells of undetermined significance, and cells of low-grade squamous intraepithelial lesion. Among the seven algorithms, the stacking (Stack) model performed best, with an AUC of 0.987, an accuracy of 99.2%, a sensitivity of 98.9%, and a specificity of 99.3%. Across the training and test sets, the Stack model's predictions were largely consistent with the actual TCT results, with an accuracy of 91.0%, a sensitivity of 91.0%, a specificity of 87.4%, and an F1-score of 90.3%. In the validation set, the Stack model predicted TCT results with an accuracy of 96.4%, a sensitivity of 100.0%, a specificity of 95.5%, and an F1-score of 92.3%. Conclusion: The early screening model for cervical cancer based on SCRS and machine learning algorithms performs well and can serve as a noninvasive, efficient, and rapid tool to facilitate early screening of cervical cancer. |
| format | Article |
| id | doaj-art-edab72853e5e4a7ca7f3464bb90f788e |
| institution | Kabale University |
| issn | 2096-529X |
| language | zho |
| publishDate | 2025-08-01 |
| publisher | Editorial Office of Journal of Precision Medicine |
| record_format | Article |
| series | 精准医学杂志 |
| spelling | doaj-art-edab72853e5e4a7ca7f3464bb90f788e; 2025-08-20T03:45:14Z; zho; Editorial Office of Journal of Precision Medicine; 精准医学杂志; 2096-529X; 2025-08-01; 40; 4; 348; 352; 10.13362/j.jpmed.202540079; Establishment and assessment of an early screening model for cervical cancer based on single-cell Raman spectroscopy combined with machine learning algorithms; MA Dongmei, ZHAO Wenjie, LIU Shihai, XU Haicang, CAI Duo, JI Yuetong, XU Jian, GUO Cancan, MA Bo, PAN Huazheng; Department of Clinical Laboratory, The Affiliated Hospital of Qingdao University, Qingdao 266003, China; Objective To establish an early screening model for cervical cancer based on single-cell Raman spectroscopy (SCRS) combined with machine learning algorithms, and to assess the performance of the model. Methods Cervical exfoliated cell samples were collected from 128 patients who were treated in our hospital from September 2023 to June 2024, among whom 65 had normal results of ThinPrep cytologic test (TCT), 35 had abnormal TCT results, and 28 did not receive TCT. R language was used to divide the 100 cervical exfoliated cell samples with TCT results into training set and test set at a ratio of 8∶2, and SCRS was performed for all samples. Based on the SCRS data of the training set, 7 machine learning algorithms (KNN, PLS, LDA, RF, SVM, SVMRBF, and Stack) were used to establish an early screening model for cervical cancer, which was applied in the test set to identify the optimal model. The optimal model was then used to predict the TCT results of 100 cervical exfoliated cell samples in the training and test sets, which were compared with the actual TCT results. The remaining 28 samples without prior TCT results were used as a validation set and were subjected to TCT, and the optimal model was used to predict the TCT results of these samples, which were compared with the actual TCT results. Results There were significant differences in the relative intensities of characteristic Raman peaks at 874, 935, 1 024, 1 119, 1 250, 1 328, 1 569, and 1 642 cm-1 between the cervical exfoliated cells negative for intraepithelial lesion or malignancy, atypical squamous cells of undetermined significance, and the cervical exfoliated cells of low-grade squamous intraepithelial lesion. Among the 7 algorithms, the stacking model showed the best performance, with an AUC of 0.987, an accuracy of 99.2%, a sensitivity of 98.9%, and a specificity of 99.3%. In both training and test sets, the results predicted by the Stack model were relatively highly consistent with actual TCT results, with an accuracy of 91.0%, a sensitivity of 91.0%, a specificity of 87.4%, and an F1-score of 90.3%. In the validation set, the Stack model achieved an accuracy of 96.4%, a sensitivity of 100.0%, a specificity of 95.5%, and an F1-score of 92.3% in predicting TCT results. Conclusion The early screening model for cervical cancer based on SCRS and machine learning algorithms has a good performance and can be used as a noninvasive, efficient, and rapid tool to facilitate the early screening of cervical cancer.; https://jpmed.qdu.edu.cn/fileup/2096-529X/PDF/1754471584044-1741177256.pdf; uterine cervical neoplasms|single-cell analysis|spectrum analysis, raman|machine learning|algorithms|early detection of cancer |
| spellingShingle | MA Dongmei, ZHAO Wenjie, LIU Shihai, XU Haicang, CAI Duo, JI Yuetong, XU Jian, GUO Cancan, MA Bo, PAN Huazheng Establishment and assessment of an early screening model for cervical cancer based on single-cell Raman spectroscopy combined with machine learning algorithms 精准医学杂志 uterine cervical neoplasms|single-cell analysis|spectrum analysis, raman|machine learning|algorithms|early detection of cancer |
| title | Establishment and assessment of an early screening model for cervical cancer based on single-cell Raman spectroscopy combined with machine learning algorithms |
| title_sort | establishment and assessment of an early screening model for cervical cancer based on single cell raman spectroscopy combined with machine learning algorithms |
| topic | uterine cervical neoplasms|single-cell analysis|spectrum analysis, raman|machine learning|algorithms|early detection of cancer |
| url | https://jpmed.qdu.edu.cn/fileup/2096-529X/PDF/1754471584044-1741177256.pdf |
| work_keys_str_mv | AT madongmeizhaowenjieliushihaixuhaicangcaiduojiyuetongxujianguocancanmabopanhuazheng establishmentandassessmentofanearlyscreeningmodelforcervicalcancerbasedonsinglecellramanspectroscopycombinedwithmachinelearningalgorithms |
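
The accuracy, sensitivity, specificity, and F1-score quoted in the description field all derive from a 2×2 confusion matrix of predicted versus actual TCT results. The sketch below shows the standard formulas in Python. The record does not report the underlying counts, so the counts used here (6 true positives, 1 false positive, 21 true negatives, 0 false negatives) are a hypothetical reconstruction, chosen only because they sum to the 28 validation samples and reproduce the reported validation-set percentages. The function name `screening_metrics` is likewise illustrative and does not come from the article.

```python
def screening_metrics(tp: int, fp: int, tn: int, fn: int) -> dict[str, float]:
    """Compute screening metrics (in percent, one decimal) from a 2x2 confusion matrix.

    Assumes every denominator is nonzero: at least one actual positive,
    one actual negative, and one predicted positive.
    """
    sensitivity = tp / (tp + fn)   # share of actual positives that were detected
    specificity = tn / (tn + fp)   # share of actual negatives that were cleared
    precision = tp / (tp + fp)     # share of predicted positives that were correct
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    # F1 is the harmonic mean of precision and sensitivity (recall).
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    values = {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f1": f1,
    }
    return {name: round(100 * value, 1) for name, value in values.items()}


# Hypothetical counts (NOT reported in the record): inferred so that they total
# 28 samples and match the validation-set percentages in the abstract.
validation = screening_metrics(tp=6, fp=1, tn=21, fn=0)
print(validation)
```

With these assumed counts the function returns 96.4, 100.0, 95.5, and 92.3 for accuracy, sensitivity, specificity, and F1-score, matching the validation-set figures in the abstract. Other count combinations may not be ruled out by the record alone, so this is a consistency check rather than the authors' data.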