Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest Algorithm

The ability of hotels to differentiate themselves from competitors and continue to operate profitably depends on their ability to retain their customers by building long-term and permanent customer relationships. Technological developments in recent years have made it possible for companies to predi...

Full description

Saved in:
Bibliographic Details
Main Authors: Leila Taherkhani, Amir Daneshvar, Hossein Amoozad Khalili, Mohamad Reza Sanaei
Format: Article
Language:English
Published: Wiley 2023-01-01
Series:Advances in Civil Engineering
Online Access:http://dx.doi.org/10.1155/2023/6029121
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849698905730580480
author Leila Taherkhani
Amir Daneshvar
Hossein Amoozad Khalili
Mohamad Reza Sanaei
author_facet Leila Taherkhani
Amir Daneshvar
Hossein Amoozad Khalili
Mohamad Reza Sanaei
author_sort Leila Taherkhani
collection DOAJ
description The ability of hotels to differentiate themselves from competitors and continue to operate profitably depends on their ability to retain their customers by building long-term and permanent customer relationships. Technological developments in recent years have made it possible for companies to predict their customers’ behavior by accessing their opinions faster and preventing them from churning. Managing customer churn prediction projects has become an important issue today, especially in the hotel industry. Therefore, this research seeks to analyze projects that predict the churn of hotel customers to provide a model to help hotel managers in this field. In this research, an approach based on text mining on customers’ comments in the Persian language is presented, which uses the random forest algorithm for classification that was considered the most effective method to solve this problem. In this model, to increase the efficiency of the proposed method in compare with existing works, the gravitational search algorithm was used to select the useful features, and the differential evolution algorithm was used to adjust the parameters of the classification method. The dataset of this research is the collected data from the customer database on social networks and hotels’ websites, especially the hotels on Kish Island in Iran. The results of this research showed that after the implementation of the preprocessing operations, the method of adjusting the parameters and removing the unimportant features, the model’s accuracy increased significantly. The precision, recall, F1, and accuracy criteria were 0.77, 0.76, 0.76, and 0.77, respectively.
format Article
id doaj-art-39a81abd2eb9467385225ae680c83421
institution DOAJ
issn 1687-8094
language English
publishDate 2023-01-01
publisher Wiley
record_format Article
series Advances in Civil Engineering
spelling doaj-art-39a81abd2eb9467385225ae680c834212025-08-20T03:18:46ZengWileyAdvances in Civil Engineering1687-80942023-01-01202310.1155/2023/6029121Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest AlgorithmLeila Taherkhani0Amir Daneshvar1Hossein Amoozad Khalili2Mohamad Reza Sanaei3Department of Information Technology Management, Science and Research BranchDepartment of Industrial Management, Science and Research BranchDepartment of Industrial EngineeringDepartment of Information and Technology ManagementThe ability of hotels to differentiate themselves from competitors and continue to operate profitably depends on their ability to retain their customers by building long-term and permanent customer relationships. Technological developments in recent years have made it possible for companies to predict their customers’ behavior by accessing their opinions faster and preventing them from churning. Managing customer churn prediction projects has become an important issue today, especially in the hotel industry. Therefore, this research seeks to analyze projects that predict the churn of hotel customers to provide a model to help hotel managers in this field. In this research, an approach based on text mining on customers’ comments in the Persian language is presented, which uses the random forest algorithm for classification that was considered the most effective method to solve this problem. In this model, to increase the efficiency of the proposed method in compare with existing works, the gravitational search algorithm was used to select the useful features, and the differential evolution algorithm was used to adjust the parameters of the classification method. The dataset of this research is the collected data from the customer database on social networks and hotels’ websites, especially the hotels on Kish Island in Iran. The results of this research showed that after the implementation of the preprocessing operations, the method of adjusting the parameters and removing the unimportant features, the model’s accuracy increased significantly. The precision, recall, F1, and accuracy criteria were 0.77, 0.76, 0.76, and 0.77, respectively.http://dx.doi.org/10.1155/2023/6029121
spellingShingle Leila Taherkhani
Amir Daneshvar
Hossein Amoozad Khalili
Mohamad Reza Sanaei
Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest Algorithm
Advances in Civil Engineering
title Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest Algorithm
title_full Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest Algorithm
title_fullStr Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest Algorithm
title_full_unstemmed Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest Algorithm
title_short Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest Algorithm
title_sort analysis of the customer churn prediction project in the hotel industry based on text mining and the random forest algorithm
url http://dx.doi.org/10.1155/2023/6029121
work_keys_str_mv AT leilataherkhani analysisofthecustomerchurnpredictionprojectinthehotelindustrybasedontextminingandtherandomforestalgorithm
AT amirdaneshvar analysisofthecustomerchurnpredictionprojectinthehotelindustrybasedontextminingandtherandomforestalgorithm
AT hosseinamoozadkhalili analysisofthecustomerchurnpredictionprojectinthehotelindustrybasedontextminingandtherandomforestalgorithm
AT mohamadrezasanaei analysisofthecustomerchurnpredictionprojectinthehotelindustrybasedontextminingandtherandomforestalgorithm