Stochastic algorithm for HDFS data theft detection based on MapReduce
To address the problems of big data efficient analysis and insider theft detection in the data theft detection of distributed cloud computing storage,taking HDFS (hadoop distributed file system) as a case study,a stochastic algorithm for HDFS data theft detection based on MapReduce was proposed.By a...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial Department of Journal on Communications
2018-10-01
|
Series: | Tongxin xuebao |
Subjects: | |
Online Access: | http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2018222/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841539412833337344 |
---|---|
author | Yuanzhao GAO Binglong LI Xingyuan CHEN |
author_facet | Yuanzhao GAO Binglong LI Xingyuan CHEN |
author_sort | Yuanzhao GAO |
collection | DOAJ |
description | To address the problems of big data efficient analysis and insider theft detection in the data theft detection of distributed cloud computing storage,taking HDFS (hadoop distributed file system) as a case study,a stochastic algorithm for HDFS data theft detection based on MapReduce was proposed.By analyzing the MAC timestamp features of HDFS generated by folder replication,the replication behavior’s detection and measurement method was established to detect all data theft modes including insider theft.The data set which is suitable for MapReduce task partition and maintains the HDFS hierarchy was designed to achieve efficient analysis of large-volume timestamps.The experimental results show that the missed rate and the number of mislabeled folders could be kept at a low level by adopting segment detection strategy.The algorithm was proved to be efficient and had good scalability under the MapReduce framework. |
format | Article |
id | doaj-art-3179e165a0a64e949b2d852139a5ea2e |
institution | Kabale University |
issn | 1000-436X |
language | zho |
publishDate | 2018-10-01 |
publisher | Editorial Department of Journal on Communications |
record_format | Article |
series | Tongxin xuebao |
spelling | doaj-art-3179e165a0a64e949b2d852139a5ea2e2025-01-14T07:15:34ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2018-10-0139112159721049Stochastic algorithm for HDFS data theft detection based on MapReduceYuanzhao GAOBinglong LIXingyuan CHENTo address the problems of big data efficient analysis and insider theft detection in the data theft detection of distributed cloud computing storage,taking HDFS (hadoop distributed file system) as a case study,a stochastic algorithm for HDFS data theft detection based on MapReduce was proposed.By analyzing the MAC timestamp features of HDFS generated by folder replication,the replication behavior’s detection and measurement method was established to detect all data theft modes including insider theft.The data set which is suitable for MapReduce task partition and maintains the HDFS hierarchy was designed to achieve efficient analysis of large-volume timestamps.The experimental results show that the missed rate and the number of mislabeled folders could be kept at a low level by adopting segment detection strategy.The algorithm was proved to be efficient and had good scalability under the MapReduce framework.http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2018222/stochastic detection algorithmHDFSMapReduceMAC timestampcloud computing storage |
spellingShingle | Yuanzhao GAO Binglong LI Xingyuan CHEN Stochastic algorithm for HDFS data theft detection based on MapReduce Tongxin xuebao stochastic detection algorithm HDFS MapReduce MAC timestamp cloud computing storage |
title | Stochastic algorithm for HDFS data theft detection based on MapReduce |
title_full | Stochastic algorithm for HDFS data theft detection based on MapReduce |
title_fullStr | Stochastic algorithm for HDFS data theft detection based on MapReduce |
title_full_unstemmed | Stochastic algorithm for HDFS data theft detection based on MapReduce |
title_short | Stochastic algorithm for HDFS data theft detection based on MapReduce |
title_sort | stochastic algorithm for hdfs data theft detection based on mapreduce |
topic | stochastic detection algorithm HDFS MapReduce MAC timestamp cloud computing storage |
url | http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2018222/ |
work_keys_str_mv | AT yuanzhaogao stochasticalgorithmforhdfsdatatheftdetectionbasedonmapreduce AT binglongli stochasticalgorithmforhdfsdatatheftdetectionbasedonmapreduce AT xingyuanchen stochasticalgorithmforhdfsdatatheftdetectionbasedonmapreduce |