A Real Network Environment Dataset for Traffic Analysis

Abstract The objective of internet traffic analysis is to identify latent patterns and ascertain the true state of internet operations by examining traffic data. This approach is considered an effective and valuable means to achieve accurate network management. Whilst the extant network traffic data...

Full description

Saved in:
Bibliographic Details
Main Authors: Wei Jiang, Bin Zhang, Qixun Zhu, Conghui Liao, Wenyong Wang
Format: Article
Language:English
Published: Nature Portfolio 2025-05-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-025-04876-2
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850207571832471552
author Wei Jiang
Bin Zhang
Qixun Zhu
Conghui Liao
Wenyong Wang
author_facet Wei Jiang
Bin Zhang
Qixun Zhu
Conghui Liao
Wenyong Wang
author_sort Wei Jiang
collection DOAJ
description Abstract The objective of internet traffic analysis is to identify latent patterns and ascertain the true state of internet operations by examining traffic data. This approach is considered an effective and valuable means to achieve accurate network management. Whilst the extant network traffic datasets are predominantly collated within a laboratory environment, exhibiting deficiencies with regard to authenticity in terms of network scales, users, behaviours, and temporal and spatial characteristics, this paper proposes an in-situ network deployment and data collection scheme involving a large number of devices and users. The scheme involves the collection of a large real Internet traffic dataset including encrypted and non-encrypted traffic through sensors deployed on real-world network access equipment. Through desensitization, cleaning, feature engineering and labelling, an open database is created for researchers in the field of traffic analysis to use in academic and engineering.
format Article
id doaj-art-c1fa0187a1324245a44adc97e0946c91
institution OA Journals
issn 2052-4463
language English
publishDate 2025-05-01
publisher Nature Portfolio
record_format Article
series Scientific Data
spelling doaj-art-c1fa0187a1324245a44adc97e0946c912025-08-20T02:10:29ZengNature PortfolioScientific Data2052-44632025-05-0112111210.1038/s41597-025-04876-2A Real Network Environment Dataset for Traffic AnalysisWei Jiang0Bin Zhang1Qixun Zhu2Conghui Liao3Wenyong Wang4School of Computer Science and Engineering, University of Electronic Science and Technology of ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of ChinaSichuan Communication Research Planning and Design Co., LtdSichuan Communication Research Planning and Design Co., LtdSchool of Computer Science and Engineering, University of Electronic Science and Technology of ChinaAbstract The objective of internet traffic analysis is to identify latent patterns and ascertain the true state of internet operations by examining traffic data. This approach is considered an effective and valuable means to achieve accurate network management. Whilst the extant network traffic datasets are predominantly collated within a laboratory environment, exhibiting deficiencies with regard to authenticity in terms of network scales, users, behaviours, and temporal and spatial characteristics, this paper proposes an in-situ network deployment and data collection scheme involving a large number of devices and users. The scheme involves the collection of a large real Internet traffic dataset including encrypted and non-encrypted traffic through sensors deployed on real-world network access equipment. Through desensitization, cleaning, feature engineering and labelling, an open database is created for researchers in the field of traffic analysis to use in academic and engineering.https://doi.org/10.1038/s41597-025-04876-2
spellingShingle Wei Jiang
Bin Zhang
Qixun Zhu
Conghui Liao
Wenyong Wang
A Real Network Environment Dataset for Traffic Analysis
Scientific Data
title A Real Network Environment Dataset for Traffic Analysis
title_full A Real Network Environment Dataset for Traffic Analysis
title_fullStr A Real Network Environment Dataset for Traffic Analysis
title_full_unstemmed A Real Network Environment Dataset for Traffic Analysis
title_short A Real Network Environment Dataset for Traffic Analysis
title_sort real network environment dataset for traffic analysis
url https://doi.org/10.1038/s41597-025-04876-2
work_keys_str_mv AT weijiang arealnetworkenvironmentdatasetfortrafficanalysis
AT binzhang arealnetworkenvironmentdatasetfortrafficanalysis
AT qixunzhu arealnetworkenvironmentdatasetfortrafficanalysis
AT conghuiliao arealnetworkenvironmentdatasetfortrafficanalysis
AT wenyongwang arealnetworkenvironmentdatasetfortrafficanalysis
AT weijiang realnetworkenvironmentdatasetfortrafficanalysis
AT binzhang realnetworkenvironmentdatasetfortrafficanalysis
AT qixunzhu realnetworkenvironmentdatasetfortrafficanalysis
AT conghuiliao realnetworkenvironmentdatasetfortrafficanalysis
AT wenyongwang realnetworkenvironmentdatasetfortrafficanalysis