Collaborative Data Cleaning Framework: a Pilot Case Study for Machine Learning Development
This study experiments with collaborative data cleaning, a pivotal phase in data preparation for both analysis and machine learning. We used a provenance Data Cleaning Model (DCM) for multi-user scenarios to track changes on a dataset and conduct comprehensive experiments that simulate multiple dat...
Saved in:
| Main Authors: | Nikolaus Parulian, Bertram Ludäscher |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
University of Edinburgh
2024-12-01
|
| Series: | International Journal of Digital Curation |
| Online Access: | https://ijdc.net/index.php/ijdc/article/view/942 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
TROV - A Model and Vocabulary for Describing Transparent Research Objects
by: Meng Li, et al.
Published: (2025-02-01) -
Equipment Quality Data Integration and Cleaning Based on Multiterminal Collaboration
by: Cui-Bin Ji, et al.
Published: (2021-01-01) -
Reliability-enhanced data cleaning in biomedical machine learning using inductive conformal prediction.
by: Xianghao Zhan, et al.
Published: (2025-02-01) -
Predictive Framework for Sustainable Engineering through Machine Learning and Cross-Sector Collaboration
by: Choudhary Abhik, et al.
Published: (2025-01-01) -
Improving Data Cleaning by Learning From Unstructured Textual Data
by: Rihem Nasfi, et al.
Published: (2025-01-01)