De-identification of clinical notes with pseudo-labeling using regular expression rules and pre-trained BERT

Abstract Background De-identification of clinical notes is essential to utilize the rich information in unstructured text data in medical research. However, only limited work has been done in removing personal information from clinical notes in Korea. Methods Our study utilized a comprehensive datas...

Full description

Saved in:
Bibliographic Details
Main Authors: Jiyong An, Jiyun Kim, Leonard Sunwoo, Hyunyoung Baek, Sooyoung Yoo, Seunggeun Lee
Format: Article
Language:English
Published: BMC 2025-02-01
Series:BMC Medical Informatics and Decision Making
Subjects:
Online Access:https://doi.org/10.1186/s12911-025-02913-z
Tags: Add Tag
No Tags, Be the first to tag this record!