Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022
The data preprocessing step is an important step in web usage mining because of the nature of log data, which are heterogeneous, unstructured, and noisy. Given the scalability and efficiency of algorithms in pattern discovery, a preprocessing step must be applied. In this study, the sequential metho...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | Arabic |
| Published: |
University of Information Technology and Communications
2025-05-01
|
| Series: | Iraqi Journal for Computers and Informatics |
| Subjects: | |
| Online Access: | https://ijci.uoitc.edu.iq/index.php/ijci/article/view/549 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849305576679407616 |
|---|---|
| author | Mohammed Ali Mohammed Hala Abdulsalam jasim Ahmed Oday |
| author_facet | Mohammed Ali Mohammed Hala Abdulsalam jasim Ahmed Oday |
| author_sort | Mohammed Ali Mohammed |
| collection | DOAJ |
| description | The data preprocessing step is an important step in web usage mining because of the nature of log data, which are heterogeneous, unstructured, and noisy. Given the scalability and efficiency of algorithms in pattern discovery, a preprocessing step must be applied. In this study, the sequential methodologies utilized in the preprocessing of data from web server logs, with an emphasis on sub-phases, such as session identification, user identification, and data cleansing, are comprehensively evaluated and meticulously examined. |
| format | Article |
| id | doaj-art-e7529275cc7a444a95b5209f1032dffc |
| institution | Kabale University |
| issn | 2313-190X 2520-4912 |
| language | Arabic |
| publishDate | 2025-05-01 |
| publisher | University of Information Technology and Communications |
| record_format | Article |
| series | Iraqi Journal for Computers and Informatics |
| spelling | doaj-art-e7529275cc7a444a95b5209f1032dffc2025-08-20T03:55:24ZaraUniversity of Information Technology and CommunicationsIraqi Journal for Computers and Informatics2313-190X2520-49122025-05-01511375110.25195/ijci.v51i1.549512Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022Mohammed Ali Mohammed0Hala Abdulsalam jasim1Ahmed Oday2University of Information Technology and CommunicationsUniversity of BaghdadUniversity of Information Technology and Communications The data preprocessing step is an important step in web usage mining because of the nature of log data, which are heterogeneous, unstructured, and noisy. Given the scalability and efficiency of algorithms in pattern discovery, a preprocessing step must be applied. In this study, the sequential methodologies utilized in the preprocessing of data from web server logs, with an emphasis on sub-phases, such as session identification, user identification, and data cleansing, are comprehensively evaluated and meticulously examined.https://ijci.uoitc.edu.iq/index.php/ijci/article/view/549web usage miningaccess log filedata pre-processing |
| spellingShingle | Mohammed Ali Mohammed Hala Abdulsalam jasim Ahmed Oday Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022 Iraqi Journal for Computers and Informatics web usage mining access log file data pre-processing |
| title | Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022 |
| title_full | Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022 |
| title_fullStr | Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022 |
| title_full_unstemmed | Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022 |
| title_short | Discussion on techniques of data cleaning, user identification, and session identification phases of web usage mining from 2000 to 2022 |
| title_sort | discussion on techniques of data cleaning user identification and session identification phases of web usage mining from 2000 to 2022 |
| topic | web usage mining access log file data pre-processing |
| url | https://ijci.uoitc.edu.iq/index.php/ijci/article/view/549 |
| work_keys_str_mv | AT mohammedalimohammed discussionontechniquesofdatacleaninguseridentificationandsessionidentificationphasesofwebusageminingfrom2000to2022 AT halaabdulsalamjasim discussionontechniquesofdatacleaninguseridentificationandsessionidentificationphasesofwebusageminingfrom2000to2022 AT ahmedoday discussionontechniquesofdatacleaninguseridentificationandsessionidentificationphasesofwebusageminingfrom2000to2022 |