Behavioural user segmentation of app users based on functionality interaction patterns
User segmentation categorises a large and complex user base into manageable similar groups of users. Existing works encounter challenges when dealing with a sparse dataset and finding insights from the generated clusters. This study has two objectives: (1) to identify an optimal clustering model tha...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Taylor & Francis Group
2024-12-01
|
| Series: | Cogent Engineering |
| Subjects: | |
| Online Access: | https://www.tandfonline.com/doi/10.1080/23311916.2024.2430430 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850107751007518720 |
|---|---|
| author | Li-Yoong Ooi Choo-Yee Ting Helmi Zakariah Eashvaren Chandar |
| author_facet | Li-Yoong Ooi Choo-Yee Ting Helmi Zakariah Eashvaren Chandar |
| author_sort | Li-Yoong Ooi |
| collection | DOAJ |
| description | User segmentation categorises a large and complex user base into manageable similar groups of users. Existing works encounter challenges when dealing with a sparse dataset and finding insights from the generated clusters. This study has two objectives: (1) to identify an optimal clustering model that can handle a sparse dataset and (2) to extract post-clustering insights via a descriptive persona for each cluster. This study deployed clustering models to handle a behavioural user-interaction dataset with a sparsity rate of 85%. The findings revealed that Density-Based Spatial Clustering of Applications with Noise that leveraged on One-hot Encoding and data representation learning via an autoencoder performed best, with a Silhouette score of 0.36. Subsequently, this study enacted techniques and tools such as classification, SHapley Additive exPlanation value, and manual analysis. Classification and SHAP values were used to identify important features that can differentiate clusters created by different clustering models. Specifically, a linear SHAP explainer object was applied to Logistic Regression had been identified to outperformed Random Forest and Light Gradient Boosting Machine, with an accuracy of 97%. A manual analysis of the central tendencies of these relatively more important features within each cluster was performed to create a descriptive persona. The findings revealed four distinctive personas, namely the “Active User,” “COVID-19 Preventer,” “Inactive User,” and “Average Joe.” |
| format | Article |
| id | doaj-art-74a90d575d1b4acdb145ab8d4e88f89f |
| institution | OA Journals |
| issn | 2331-1916 |
| language | English |
| publishDate | 2024-12-01 |
| publisher | Taylor & Francis Group |
| record_format | Article |
| series | Cogent Engineering |
| spelling | doaj-art-74a90d575d1b4acdb145ab8d4e88f89f2025-08-20T02:38:31ZengTaylor & Francis GroupCogent Engineering2331-19162024-12-0111110.1080/23311916.2024.2430430Behavioural user segmentation of app users based on functionality interaction patternsLi-Yoong Ooi0Choo-Yee Ting1Helmi Zakariah2Eashvaren Chandar3Faculty of Computing and Informatics, Multimedia University, Cyberjaya, MalaysiaFaculty of Computing and Informatics, Multimedia University, Cyberjaya, MalaysiaHayat Technologies Sdn Bhd, Kuala Lumpur, MalaysiaHayat Technologies Sdn Bhd, Kuala Lumpur, MalaysiaUser segmentation categorises a large and complex user base into manageable similar groups of users. Existing works encounter challenges when dealing with a sparse dataset and finding insights from the generated clusters. This study has two objectives: (1) to identify an optimal clustering model that can handle a sparse dataset and (2) to extract post-clustering insights via a descriptive persona for each cluster. This study deployed clustering models to handle a behavioural user-interaction dataset with a sparsity rate of 85%. The findings revealed that Density-Based Spatial Clustering of Applications with Noise that leveraged on One-hot Encoding and data representation learning via an autoencoder performed best, with a Silhouette score of 0.36. Subsequently, this study enacted techniques and tools such as classification, SHapley Additive exPlanation value, and manual analysis. Classification and SHAP values were used to identify important features that can differentiate clusters created by different clustering models. Specifically, a linear SHAP explainer object was applied to Logistic Regression had been identified to outperformed Random Forest and Light Gradient Boosting Machine, with an accuracy of 97%. A manual analysis of the central tendencies of these relatively more important features within each cluster was performed to create a descriptive persona. The findings revealed four distinctive personas, namely the “Active User,” “COVID-19 Preventer,” “Inactive User,” and “Average Joe.”https://www.tandfonline.com/doi/10.1080/23311916.2024.2430430User segmentationclusteringcluster analysissparse dataSHAPArtificial Intelligence |
| spellingShingle | Li-Yoong Ooi Choo-Yee Ting Helmi Zakariah Eashvaren Chandar Behavioural user segmentation of app users based on functionality interaction patterns Cogent Engineering User segmentation clustering cluster analysis sparse data SHAP Artificial Intelligence |
| title | Behavioural user segmentation of app users based on functionality interaction patterns |
| title_full | Behavioural user segmentation of app users based on functionality interaction patterns |
| title_fullStr | Behavioural user segmentation of app users based on functionality interaction patterns |
| title_full_unstemmed | Behavioural user segmentation of app users based on functionality interaction patterns |
| title_short | Behavioural user segmentation of app users based on functionality interaction patterns |
| title_sort | behavioural user segmentation of app users based on functionality interaction patterns |
| topic | User segmentation clustering cluster analysis sparse data SHAP Artificial Intelligence |
| url | https://www.tandfonline.com/doi/10.1080/23311916.2024.2430430 |
| work_keys_str_mv | AT liyoongooi behaviouralusersegmentationofappusersbasedonfunctionalityinteractionpatterns AT chooyeeting behaviouralusersegmentationofappusersbasedonfunctionalityinteractionpatterns AT helmizakariah behaviouralusersegmentationofappusersbasedonfunctionalityinteractionpatterns AT eashvarenchandar behaviouralusersegmentationofappusersbasedonfunctionalityinteractionpatterns |