A Multisource Retrospective Audit Method for Data Quality Optimization and Evaluation

With the rapid development of information technology and the coming of the era of big data, various data are constantly emerging and present the characteristics of autonomy and heterogeneity. How to optimize data quality and evaluate the effect has become a challenging problem. Firstly, a heterogene...

Full description

Saved in:
Bibliographic Details
Main Authors: Li Jiang, Hao Chen, Yueqi Ouyang, Canbing Li
Format: Article
Language:English
Published: Wiley 2015-10-01
Series:International Journal of Distributed Sensor Networks
Online Access:https://doi.org/10.1155/2015/195015
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849415022815477760
author Li Jiang
Hao Chen
Yueqi Ouyang
Canbing Li
author_facet Li Jiang
Hao Chen
Yueqi Ouyang
Canbing Li
author_sort Li Jiang
collection DOAJ
description With the rapid development of information technology and the coming of the era of big data, various data are constantly emerging and present the characteristics of autonomy and heterogeneity. How to optimize data quality and evaluate the effect has become a challenging problem. Firstly, a heterogeneous data integration model based on retrospective audit is proposed to locate the original data source and match the data. Secondly, in order to improve the integrated data quality, a retrospective audit model and associative audit rules are proposed to fix incomplete and incorrect data from multiple heterogeneous data sources. The heterogeneous data integration model based on retrospective audit is divided into four modules including original heterogeneous data, data structure, data processing, and data retrospective audit. At last, some assessment criteria such as redundancy, sparsity, and accuracy are defined to evaluate the effect of the optimized data quality. Experimental results show that the quality of the integrated data is significantly higher than the quality of the original data.
format Article
id doaj-art-57ef983775b24392b5e2bb7ddb91f4a8
institution Kabale University
issn 1550-1477
language English
publishDate 2015-10-01
publisher Wiley
record_format Article
series International Journal of Distributed Sensor Networks
spelling doaj-art-57ef983775b24392b5e2bb7ddb91f4a82025-08-20T03:33:39ZengWileyInternational Journal of Distributed Sensor Networks1550-14772015-10-011110.1155/2015/195015195015A Multisource Retrospective Audit Method for Data Quality Optimization and EvaluationLi Jiang0Hao Chen1Yueqi Ouyang2Canbing Li3 Information and Art Design Department, Huaihua Vocational and Technical College, Huaihua 418000, China College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China College of Electrical and Information Engineering, Hunan University, Changsha 410082, ChinaWith the rapid development of information technology and the coming of the era of big data, various data are constantly emerging and present the characteristics of autonomy and heterogeneity. How to optimize data quality and evaluate the effect has become a challenging problem. Firstly, a heterogeneous data integration model based on retrospective audit is proposed to locate the original data source and match the data. Secondly, in order to improve the integrated data quality, a retrospective audit model and associative audit rules are proposed to fix incomplete and incorrect data from multiple heterogeneous data sources. The heterogeneous data integration model based on retrospective audit is divided into four modules including original heterogeneous data, data structure, data processing, and data retrospective audit. At last, some assessment criteria such as redundancy, sparsity, and accuracy are defined to evaluate the effect of the optimized data quality. Experimental results show that the quality of the integrated data is significantly higher than the quality of the original data.https://doi.org/10.1155/2015/195015
spellingShingle Li Jiang
Hao Chen
Yueqi Ouyang
Canbing Li
A Multisource Retrospective Audit Method for Data Quality Optimization and Evaluation
International Journal of Distributed Sensor Networks
title A Multisource Retrospective Audit Method for Data Quality Optimization and Evaluation
title_full A Multisource Retrospective Audit Method for Data Quality Optimization and Evaluation
title_fullStr A Multisource Retrospective Audit Method for Data Quality Optimization and Evaluation
title_full_unstemmed A Multisource Retrospective Audit Method for Data Quality Optimization and Evaluation
title_short A Multisource Retrospective Audit Method for Data Quality Optimization and Evaluation
title_sort multisource retrospective audit method for data quality optimization and evaluation
url https://doi.org/10.1155/2015/195015
work_keys_str_mv AT lijiang amultisourceretrospectiveauditmethodfordataqualityoptimizationandevaluation
AT haochen amultisourceretrospectiveauditmethodfordataqualityoptimizationandevaluation
AT yueqiouyang amultisourceretrospectiveauditmethodfordataqualityoptimizationandevaluation
AT canbingli amultisourceretrospectiveauditmethodfordataqualityoptimizationandevaluation
AT lijiang multisourceretrospectiveauditmethodfordataqualityoptimizationandevaluation
AT haochen multisourceretrospectiveauditmethodfordataqualityoptimizationandevaluation
AT yueqiouyang multisourceretrospectiveauditmethodfordataqualityoptimizationandevaluation
AT canbingli multisourceretrospectiveauditmethodfordataqualityoptimizationandevaluation