An oversampling-undersampling strategy for large-scale data linkage

Effective record linkage in big data, particularly in imbalanced datasets, is a critical yet highly challenging task due to the inherent complexity involved. This article utilizes an oversampling-undersampling strategy to address linkage imbalances, enabling more accurate and efficient record linkag...

Bibliographic Details
Main Authors: Hossein Hassani, Mohammad Reza Entezarian, Sara Zaeimzadeh, Leila Marvian, Nadejda Komendantova
Format: Article
Language: English
Published: Frontiers Media S.A. 2025-04-01
Series: Frontiers in Big Data
Online Access: https://www.frontiersin.org/articles/10.3389/fdata.2025.1542483/full