Machine learning methods (tokenization) in marketing research

Field research is of particular interest in marketing because it often generates unique statistics. Closed-ended questions during data collection simplify data processing, but at the same time significantly limit the research subject depth. Open-ended questions provide a deeper understanding of resp...

Full description

Saved in:
Bibliographic Details
Main Authors: E. V. Ganebnykh, N. K. Savelieva, A. A. Sozinova, O. V. Fokina, I. G. Altsybeeva
Format: Article
Language:English
Published: Publishing House of the State University of Management 2024-06-01
Series:Вестник университета
Subjects:
Online Access:https://vestnik.guu.ru/jour/article/view/5224
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832541481823371264
author E. V. Ganebnykh
N. K. Savelieva
A. A. Sozinova
O. V. Fokina
I. G. Altsybeeva
author_facet E. V. Ganebnykh
N. K. Savelieva
A. A. Sozinova
O. V. Fokina
I. G. Altsybeeva
author_sort E. V. Ganebnykh
collection DOAJ
description Field research is of particular interest in marketing because it often generates unique statistics. Closed-ended questions during data collection simplify data processing, but at the same time significantly limit the research subject depth. Open-ended questions provide a deeper understanding of respondents’ opinions, but processing responses in the form of natural language (qualitative data) is difficult and time-consuming, as it is usually done manually. Modern machine learning techniques, particularly tokenization, can be used to automate such data processing. The purpose of the study is to test this method application to data processing of the field research “Monitoring of the competition state and development in the commodity markets of the Novosibirsk Region”. The following tasks have been set and solved: primary information has been collected and prepared for processing, and token groups identified and formed. Based on the groups, the respondents’ answers have been further combined into relatively homogeneous clusters including similar answers to open-ended questions. Subsequent quality control of the conducted research has been carried out on the basis of Precision, Recall and F-measure metrics, which showed an acceptable level of data processing quality. Information collection has been realized through sociological surveys (questionnaire distribution) and CAWI surveys and included open-ended questions. The study reveals that even extremely insignificant references were not missed. The obtained data allowed us to conclude that it is necessary to form annotated databases and token libraries for the marketing research purposes.
format Article
id doaj-art-7ae161ad1767453da6b40363aaeaf7fe
institution Kabale University
issn 1816-4277
2686-8415
language English
publishDate 2024-06-01
publisher Publishing House of the State University of Management
record_format Article
series Вестник университета
spelling doaj-art-7ae161ad1767453da6b40363aaeaf7fe2025-02-04T08:28:20ZengPublishing House of the State University of ManagementВестник университета1816-42772686-84152024-06-0104617210.26425/1816-4277-2024-4-61-723076Machine learning methods (tokenization) in marketing researchE. V. Ganebnykh0N. K. Savelieva1A. A. Sozinova2O. V. Fokina3I. G. Altsybeeva4Vyatka State UniversityVyatka State UniversityVyatka State UniversityVyatka State UniversityVyatka State UniversityField research is of particular interest in marketing because it often generates unique statistics. Closed-ended questions during data collection simplify data processing, but at the same time significantly limit the research subject depth. Open-ended questions provide a deeper understanding of respondents’ opinions, but processing responses in the form of natural language (qualitative data) is difficult and time-consuming, as it is usually done manually. Modern machine learning techniques, particularly tokenization, can be used to automate such data processing. The purpose of the study is to test this method application to data processing of the field research “Monitoring of the competition state and development in the commodity markets of the Novosibirsk Region”. The following tasks have been set and solved: primary information has been collected and prepared for processing, and token groups identified and formed. Based on the groups, the respondents’ answers have been further combined into relatively homogeneous clusters including similar answers to open-ended questions. Subsequent quality control of the conducted research has been carried out on the basis of Precision, Recall and F-measure metrics, which showed an acceptable level of data processing quality. Information collection has been realized through sociological surveys (questionnaire distribution) and CAWI surveys and included open-ended questions. The study reveals that even extremely insignificant references were not missed. The obtained data allowed us to conclude that it is necessary to form annotated databases and token libraries for the marketing research purposes.https://vestnik.guu.ru/jour/article/view/5224machine learningtokentokenizationfield researchopen-ended questionqualitative datadata processingnatural language
spellingShingle E. V. Ganebnykh
N. K. Savelieva
A. A. Sozinova
O. V. Fokina
I. G. Altsybeeva
Machine learning methods (tokenization) in marketing research
Вестник университета
machine learning
token
tokenization
field research
open-ended question
qualitative data
data processing
natural language
title Machine learning methods (tokenization) in marketing research
title_full Machine learning methods (tokenization) in marketing research
title_fullStr Machine learning methods (tokenization) in marketing research
title_full_unstemmed Machine learning methods (tokenization) in marketing research
title_short Machine learning methods (tokenization) in marketing research
title_sort machine learning methods tokenization in marketing research
topic machine learning
token
tokenization
field research
open-ended question
qualitative data
data processing
natural language
url https://vestnik.guu.ru/jour/article/view/5224
work_keys_str_mv AT evganebnykh machinelearningmethodstokenizationinmarketingresearch
AT nksavelieva machinelearningmethodstokenizationinmarketingresearch
AT aasozinova machinelearningmethodstokenizationinmarketingresearch
AT ovfokina machinelearningmethodstokenizationinmarketingresearch
AT igaltsybeeva machinelearningmethodstokenizationinmarketingresearch