Stochastic algorithm for HDFS data theft detection based on MapReduce

To address the problems of efficient big data analysis and insider theft detection in data theft detection for distributed cloud computing storage, a stochastic algorithm for HDFS (Hadoop Distributed File System) data theft detection based on MapReduce was proposed, taking HDFS as a case study. By a...

Bibliographic Details
Main Authors: Yuanzhao GAO, Binglong LI, Xingyuan CHEN
Format: Article
Language: zho
Published: Editorial Department of Journal on Communications 2018-10-01
Series: Tongxin xuebao
Subjects:
Online Access: http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2018222/
collection DOAJ
description To address the problems of efficient big data analysis and insider theft detection in data theft detection for distributed cloud computing storage, a stochastic algorithm for HDFS (Hadoop Distributed File System) data theft detection based on MapReduce is proposed, taking HDFS as a case study. By analyzing the MAC timestamp features that folder replication generates in HDFS, a method for detecting and measuring replication behavior is established that detects all data theft modes, including insider theft. A data set that suits MapReduce task partitioning and preserves the HDFS hierarchy is designed to analyze large volumes of timestamps efficiently. The experimental results show that adopting a segment detection strategy keeps the miss rate and the number of mislabeled folders low. The algorithm is shown to be efficient and to scale well under the MapReduce framework.
id doaj-art-3179e165a0a64e949b2d852139a5ea2e
institution Kabale University
issn 1000-436X
topic stochastic detection algorithm
HDFS
MapReduce
MAC timestamp
cloud computing storage