Privacy protection of sexually transmitted infections information from Chinese electronic medical records

Abstract The comprehensive adoption of Electronic Medical Records (EMRs) offers numerous benefits but also introduces risks of privacy leakage, particularly for patients with Sexually Transmitted Infections (STI) who need protection from social secondary harm. Despite advancements in privacy protect...

Full description

Saved in:
Bibliographic Details
Main Authors: Mengchun Gong, Yue Yu, Zihao Ouyang, Wenzhao Shi, Chao Liu, Qilin Wang, Jiale Nan, Endi Cai, Fen Ding, Sheng Nie
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-024-84658-9
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841544742071959552
author Mengchun Gong
Yue Yu
Zihao Ouyang
Wenzhao Shi
Chao Liu
Qilin Wang
Jiale Nan
Endi Cai
Fen Ding
Sheng Nie
author_facet Mengchun Gong
Yue Yu
Zihao Ouyang
Wenzhao Shi
Chao Liu
Qilin Wang
Jiale Nan
Endi Cai
Fen Ding
Sheng Nie
author_sort Mengchun Gong
collection DOAJ
description Abstract The comprehensive adoption of Electronic Medical Records (EMRs) offers numerous benefits but also introduces risks of privacy leakage, particularly for patients with Sexually Transmitted Infections (STI) who need protection from social secondary harm. Despite advancements in privacy protection research, the effectiveness of these strategies in real-world data remains debatable. The objective is to develop effective information extraction and privacy protection strategies to safeguard STI patients in the Chinese healthcare environment and prevent unnecessary privacy leakage during the data-sharing process of EMRs. The research was conducted at a national healthcare data center, where a committee of experts designed rule-based protocols utilizing natural language processing techniques to extract STI information. Extraction Protocol of Sexually Transmitted Infections Information (EPSTII), designed specifically for the Chinese EMRs system, enables accurate and complete identification and extraction of STI-related information, ensuring high protection performance. The protocol was refined multiple times based on the calculated precision and recall. Final protocol was applied to 5,000 randomly selected EMRs to calculate the success rate of privacy protection. A total of 3,233,174 patients were selected based on the inclusion criteria and a 50% entry ratio. Of these, 148,856 patients with sensitive STI information were identified from disease history. The identification frequency varied, with the diagnosis sub-dataset being the highest at 4.8%. Both the precision and recall rates have reached over 95%, demonstrating the effectiveness of our method. The success rate of privacy protection was 98.25%, ensuring the utmost privacy protection for patients with STI. Finding an effective method to protect privacy information in EMRs is meaningful. We demonstrated the feasibility of applying the EPSTII method to EMRs. Our protocol offers more comprehensive results compared to traditional methods of including STI information.
format Article
id doaj-art-578436ece30e442ba4c52121ad31ee05
institution Kabale University
issn 2045-2322
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-578436ece30e442ba4c52121ad31ee052025-01-12T12:17:47ZengNature PortfolioScientific Reports2045-23222025-01-0115111410.1038/s41598-024-84658-9Privacy protection of sexually transmitted infections information from Chinese electronic medical recordsMengchun Gong0Yue Yu1Zihao Ouyang2Wenzhao Shi3Chao Liu4Qilin Wang5Jiale Nan6Endi Cai7Fen Ding8Sheng Nie9School of Biomedical Engineering, Guangdong Medical UniversityDigital Health China Technologies Co., Ltd.Digital Health China Technologies Co., Ltd.Digital Health China Technologies Co., Ltd.Digital Health China Technologies Co., Ltd.Digital Health China Technologies Co., Ltd.Digital Health China Technologies Co., Ltd.Digital Health China Technologies Co., Ltd.Digital Health China Technologies Co., Ltd.Nanfang Hospital, Southern Medical UniversityAbstract The comprehensive adoption of Electronic Medical Records (EMRs) offers numerous benefits but also introduces risks of privacy leakage, particularly for patients with Sexually Transmitted Infections (STI) who need protection from social secondary harm. Despite advancements in privacy protection research, the effectiveness of these strategies in real-world data remains debatable. The objective is to develop effective information extraction and privacy protection strategies to safeguard STI patients in the Chinese healthcare environment and prevent unnecessary privacy leakage during the data-sharing process of EMRs. The research was conducted at a national healthcare data center, where a committee of experts designed rule-based protocols utilizing natural language processing techniques to extract STI information. Extraction Protocol of Sexually Transmitted Infections Information (EPSTII), designed specifically for the Chinese EMRs system, enables accurate and complete identification and extraction of STI-related information, ensuring high protection performance. The protocol was refined multiple times based on the calculated precision and recall. Final protocol was applied to 5,000 randomly selected EMRs to calculate the success rate of privacy protection. A total of 3,233,174 patients were selected based on the inclusion criteria and a 50% entry ratio. Of these, 148,856 patients with sensitive STI information were identified from disease history. The identification frequency varied, with the diagnosis sub-dataset being the highest at 4.8%. Both the precision and recall rates have reached over 95%, demonstrating the effectiveness of our method. The success rate of privacy protection was 98.25%, ensuring the utmost privacy protection for patients with STI. Finding an effective method to protect privacy information in EMRs is meaningful. We demonstrated the feasibility of applying the EPSTII method to EMRs. Our protocol offers more comprehensive results compared to traditional methods of including STI information.https://doi.org/10.1038/s41598-024-84658-9Chinese electronic medical recordsSexually transmitted infectionsPrivacy protectionInfectious diseaseNatural language processing
spellingShingle Mengchun Gong
Yue Yu
Zihao Ouyang
Wenzhao Shi
Chao Liu
Qilin Wang
Jiale Nan
Endi Cai
Fen Ding
Sheng Nie
Privacy protection of sexually transmitted infections information from Chinese electronic medical records
Scientific Reports
Chinese electronic medical records
Sexually transmitted infections
Privacy protection
Infectious disease
Natural language processing
title Privacy protection of sexually transmitted infections information from Chinese electronic medical records
title_full Privacy protection of sexually transmitted infections information from Chinese electronic medical records
title_fullStr Privacy protection of sexually transmitted infections information from Chinese electronic medical records
title_full_unstemmed Privacy protection of sexually transmitted infections information from Chinese electronic medical records
title_short Privacy protection of sexually transmitted infections information from Chinese electronic medical records
title_sort privacy protection of sexually transmitted infections information from chinese electronic medical records
topic Chinese electronic medical records
Sexually transmitted infections
Privacy protection
Infectious disease
Natural language processing
url https://doi.org/10.1038/s41598-024-84658-9
work_keys_str_mv AT mengchungong privacyprotectionofsexuallytransmittedinfectionsinformationfromchineseelectronicmedicalrecords
AT yueyu privacyprotectionofsexuallytransmittedinfectionsinformationfromchineseelectronicmedicalrecords
AT zihaoouyang privacyprotectionofsexuallytransmittedinfectionsinformationfromchineseelectronicmedicalrecords
AT wenzhaoshi privacyprotectionofsexuallytransmittedinfectionsinformationfromchineseelectronicmedicalrecords
AT chaoliu privacyprotectionofsexuallytransmittedinfectionsinformationfromchineseelectronicmedicalrecords
AT qilinwang privacyprotectionofsexuallytransmittedinfectionsinformationfromchineseelectronicmedicalrecords
AT jialenan privacyprotectionofsexuallytransmittedinfectionsinformationfromchineseelectronicmedicalrecords
AT endicai privacyprotectionofsexuallytransmittedinfectionsinformationfromchineseelectronicmedicalrecords
AT fending privacyprotectionofsexuallytransmittedinfectionsinformationfromchineseelectronicmedicalrecords
AT shengnie privacyprotectionofsexuallytransmittedinfectionsinformationfromchineseelectronicmedicalrecords