Behavioural user segmentation of app users based on functionality interaction patterns

User segmentation categorises a large and complex user base into manageable similar groups of users. Existing works encounter challenges when dealing with a sparse dataset and finding insights from the generated clusters. This study has two objectives: (1) to identify an optimal clustering model tha...

Full description

Saved in:
Bibliographic Details
Main Authors: Li-Yoong Ooi, Choo-Yee Ting, Helmi Zakariah, Eashvaren Chandar
Format: Article
Language:English
Published: Taylor & Francis Group 2024-12-01
Series:Cogent Engineering
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/23311916.2024.2430430
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850107751007518720
author Li-Yoong Ooi
Choo-Yee Ting
Helmi Zakariah
Eashvaren Chandar
author_facet Li-Yoong Ooi
Choo-Yee Ting
Helmi Zakariah
Eashvaren Chandar
author_sort Li-Yoong Ooi
collection DOAJ
description User segmentation categorises a large and complex user base into manageable similar groups of users. Existing works encounter challenges when dealing with a sparse dataset and finding insights from the generated clusters. This study has two objectives: (1) to identify an optimal clustering model that can handle a sparse dataset and (2) to extract post-clustering insights via a descriptive persona for each cluster. This study deployed clustering models to handle a behavioural user-interaction dataset with a sparsity rate of 85%. The findings revealed that Density-Based Spatial Clustering of Applications with Noise that leveraged on One-hot Encoding and data representation learning via an autoencoder performed best, with a Silhouette score of 0.36. Subsequently, this study enacted techniques and tools such as classification, SHapley Additive exPlanation value, and manual analysis. Classification and SHAP values were used to identify important features that can differentiate clusters created by different clustering models. Specifically, a linear SHAP explainer object was applied to Logistic Regression had been identified to outperformed Random Forest and Light Gradient Boosting Machine, with an accuracy of 97%. A manual analysis of the central tendencies of these relatively more important features within each cluster was performed to create a descriptive persona. The findings revealed four distinctive personas, namely the “Active User,” “COVID-19 Preventer,” “Inactive User,” and “Average Joe.”
format Article
id doaj-art-74a90d575d1b4acdb145ab8d4e88f89f
institution OA Journals
issn 2331-1916
language English
publishDate 2024-12-01
publisher Taylor & Francis Group
record_format Article
series Cogent Engineering
spelling doaj-art-74a90d575d1b4acdb145ab8d4e88f89f2025-08-20T02:38:31ZengTaylor & Francis GroupCogent Engineering2331-19162024-12-0111110.1080/23311916.2024.2430430Behavioural user segmentation of app users based on functionality interaction patternsLi-Yoong Ooi0Choo-Yee Ting1Helmi Zakariah2Eashvaren Chandar3Faculty of Computing and Informatics, Multimedia University, Cyberjaya, MalaysiaFaculty of Computing and Informatics, Multimedia University, Cyberjaya, MalaysiaHayat Technologies Sdn Bhd, Kuala Lumpur, MalaysiaHayat Technologies Sdn Bhd, Kuala Lumpur, MalaysiaUser segmentation categorises a large and complex user base into manageable similar groups of users. Existing works encounter challenges when dealing with a sparse dataset and finding insights from the generated clusters. This study has two objectives: (1) to identify an optimal clustering model that can handle a sparse dataset and (2) to extract post-clustering insights via a descriptive persona for each cluster. This study deployed clustering models to handle a behavioural user-interaction dataset with a sparsity rate of 85%. The findings revealed that Density-Based Spatial Clustering of Applications with Noise that leveraged on One-hot Encoding and data representation learning via an autoencoder performed best, with a Silhouette score of 0.36. Subsequently, this study enacted techniques and tools such as classification, SHapley Additive exPlanation value, and manual analysis. Classification and SHAP values were used to identify important features that can differentiate clusters created by different clustering models. Specifically, a linear SHAP explainer object was applied to Logistic Regression had been identified to outperformed Random Forest and Light Gradient Boosting Machine, with an accuracy of 97%. A manual analysis of the central tendencies of these relatively more important features within each cluster was performed to create a descriptive persona. The findings revealed four distinctive personas, namely the “Active User,” “COVID-19 Preventer,” “Inactive User,” and “Average Joe.”https://www.tandfonline.com/doi/10.1080/23311916.2024.2430430User segmentationclusteringcluster analysissparse dataSHAPArtificial Intelligence
spellingShingle Li-Yoong Ooi
Choo-Yee Ting
Helmi Zakariah
Eashvaren Chandar
Behavioural user segmentation of app users based on functionality interaction patterns
Cogent Engineering
User segmentation
clustering
cluster analysis
sparse data
SHAP
Artificial Intelligence
title Behavioural user segmentation of app users based on functionality interaction patterns
title_full Behavioural user segmentation of app users based on functionality interaction patterns
title_fullStr Behavioural user segmentation of app users based on functionality interaction patterns
title_full_unstemmed Behavioural user segmentation of app users based on functionality interaction patterns
title_short Behavioural user segmentation of app users based on functionality interaction patterns
title_sort behavioural user segmentation of app users based on functionality interaction patterns
topic User segmentation
clustering
cluster analysis
sparse data
SHAP
Artificial Intelligence
url https://www.tandfonline.com/doi/10.1080/23311916.2024.2430430
work_keys_str_mv AT liyoongooi behaviouralusersegmentationofappusersbasedonfunctionalityinteractionpatterns
AT chooyeeting behaviouralusersegmentationofappusersbasedonfunctionalityinteractionpatterns
AT helmizakariah behaviouralusersegmentationofappusersbasedonfunctionalityinteractionpatterns
AT eashvarenchandar behaviouralusersegmentationofappusersbasedonfunctionalityinteractionpatterns