Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022

The data preprocessing step is an important step in web usage mining because of the nature of log data, which are heterogeneous, unstructured, and noisy. Given the scalability and efficiency of algorithms in pattern discovery, a preprocessing step must be applied. In this study, the sequential metho...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohammed Ali Mohammed, Hala Abdulsalam jasim, Ahmed Oday
Format: Article
Language:Arabic
Published: University of Information Technology and Communications 2025-05-01
Series:Iraqi Journal for Computers and Informatics
Subjects:
Online Access:https://ijci.uoitc.edu.iq/index.php/ijci/article/view/549
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849305576679407616
author Mohammed Ali Mohammed
Hala Abdulsalam jasim
Ahmed Oday
author_facet Mohammed Ali Mohammed
Hala Abdulsalam jasim
Ahmed Oday
author_sort Mohammed Ali Mohammed
collection DOAJ
description The data preprocessing step is an important step in web usage mining because of the nature of log data, which are heterogeneous, unstructured, and noisy. Given the scalability and efficiency of algorithms in pattern discovery, a preprocessing step must be applied. In this study, the sequential methodologies utilized in the preprocessing of data from web server logs, with an emphasis on sub-phases, such as session identification, user identification, and data cleansing, are comprehensively evaluated and meticulously examined.
format Article
id doaj-art-e7529275cc7a444a95b5209f1032dffc
institution Kabale University
issn 2313-190X
2520-4912
language Arabic
publishDate 2025-05-01
publisher University of Information Technology and Communications
record_format Article
series Iraqi Journal for Computers and Informatics
spelling doaj-art-e7529275cc7a444a95b5209f1032dffc2025-08-20T03:55:24ZaraUniversity of Information Technology and CommunicationsIraqi Journal for Computers and Informatics2313-190X2520-49122025-05-01511375110.25195/ijci.v51i1.549512Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022Mohammed Ali Mohammed0Hala Abdulsalam jasim1Ahmed Oday2University of Information Technology and CommunicationsUniversity of BaghdadUniversity of Information Technology and Communications The data preprocessing step is an important step in web usage mining because of the nature of log data, which are heterogeneous, unstructured, and noisy. Given the scalability and efficiency of algorithms in pattern discovery, a preprocessing step must be applied. In this study, the sequential methodologies utilized in the preprocessing of data from web server logs, with an emphasis on sub-phases, such as session identification, user identification, and data cleansing, are comprehensively evaluated and meticulously examined.https://ijci.uoitc.edu.iq/index.php/ijci/article/view/549web usage miningaccess log filedata pre-processing
spellingShingle Mohammed Ali Mohammed
Hala Abdulsalam jasim
Ahmed Oday
Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022
Iraqi Journal for Computers and Informatics
web usage mining
access log file
data pre-processing
title Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022
title_full Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022
title_fullStr Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022
title_full_unstemmed Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022
title_short Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022
title_sort discussion on techniques of data cleaning user identification and session identification phases of web usage mining from 2000 to 2022
topic web usage mining
access log file
data pre-processing
url https://ijci.uoitc.edu.iq/index.php/ijci/article/view/549
work_keys_str_mv AT mohammedalimohammed discussionontechniquesofdatacleaninguseridentificationandsessionidentificationphasesofwebusageminingfrom2000to2022
AT halaabdulsalamjasim discussionontechniquesofdatacleaninguseridentificationandsessionidentificationphasesofwebusageminingfrom2000to2022
AT ahmedoday discussionontechniquesofdatacleaninguseridentificationandsessionidentificationphasesofwebusageminingfrom2000to2022