Machine learning methods (tokenization) in marketing research
Field research is of particular interest in marketing because it often generates unique statistics. Closed-ended questions during data collection simplify data processing, but at the same time significantly limit the research subject depth. Open-ended questions provide a deeper understanding of resp...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Publishing House of the State University of Management
2024-06-01
|
Series: | Вестник университета |
Subjects: | |
Online Access: | https://vestnik.guu.ru/jour/article/view/5224 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832541481823371264 |
---|---|
author | E. V. Ganebnykh N. K. Savelieva A. A. Sozinova O. V. Fokina I. G. Altsybeeva |
author_facet | E. V. Ganebnykh N. K. Savelieva A. A. Sozinova O. V. Fokina I. G. Altsybeeva |
author_sort | E. V. Ganebnykh |
collection | DOAJ |
description | Field research is of particular interest in marketing because it often generates unique statistics. Closed-ended questions during data collection simplify data processing, but at the same time significantly limit the research subject depth. Open-ended questions provide a deeper understanding of respondents’ opinions, but processing responses in the form of natural language (qualitative data) is difficult and time-consuming, as it is usually done manually. Modern machine learning techniques, particularly tokenization, can be used to automate such data processing. The purpose of the study is to test this method application to data processing of the field research “Monitoring of the competition state and development in the commodity markets of the Novosibirsk Region”. The following tasks have been set and solved: primary information has been collected and prepared for processing, and token groups identified and formed. Based on the groups, the respondents’ answers have been further combined into relatively homogeneous clusters including similar answers to open-ended questions. Subsequent quality control of the conducted research has been carried out on the basis of Precision, Recall and F-measure metrics, which showed an acceptable level of data processing quality. Information collection has been realized through sociological surveys (questionnaire distribution) and CAWI surveys and included open-ended questions. The study reveals that even extremely insignificant references were not missed. The obtained data allowed us to conclude that it is necessary to form annotated databases and token libraries for the marketing research purposes. |
format | Article |
id | doaj-art-7ae161ad1767453da6b40363aaeaf7fe |
institution | Kabale University |
issn | 1816-4277 2686-8415 |
language | English |
publishDate | 2024-06-01 |
publisher | Publishing House of the State University of Management |
record_format | Article |
series | Вестник университета |
spelling | doaj-art-7ae161ad1767453da6b40363aaeaf7fe2025-02-04T08:28:20ZengPublishing House of the State University of ManagementВестник университета1816-42772686-84152024-06-0104617210.26425/1816-4277-2024-4-61-723076Machine learning methods (tokenization) in marketing researchE. V. Ganebnykh0N. K. Savelieva1A. A. Sozinova2O. V. Fokina3I. G. Altsybeeva4Vyatka State UniversityVyatka State UniversityVyatka State UniversityVyatka State UniversityVyatka State UniversityField research is of particular interest in marketing because it often generates unique statistics. Closed-ended questions during data collection simplify data processing, but at the same time significantly limit the research subject depth. Open-ended questions provide a deeper understanding of respondents’ opinions, but processing responses in the form of natural language (qualitative data) is difficult and time-consuming, as it is usually done manually. Modern machine learning techniques, particularly tokenization, can be used to automate such data processing. The purpose of the study is to test this method application to data processing of the field research “Monitoring of the competition state and development in the commodity markets of the Novosibirsk Region”. The following tasks have been set and solved: primary information has been collected and prepared for processing, and token groups identified and formed. Based on the groups, the respondents’ answers have been further combined into relatively homogeneous clusters including similar answers to open-ended questions. Subsequent quality control of the conducted research has been carried out on the basis of Precision, Recall and F-measure metrics, which showed an acceptable level of data processing quality. Information collection has been realized through sociological surveys (questionnaire distribution) and CAWI surveys and included open-ended questions. The study reveals that even extremely insignificant references were not missed. The obtained data allowed us to conclude that it is necessary to form annotated databases and token libraries for the marketing research purposes.https://vestnik.guu.ru/jour/article/view/5224machine learningtokentokenizationfield researchopen-ended questionqualitative datadata processingnatural language |
spellingShingle | E. V. Ganebnykh N. K. Savelieva A. A. Sozinova O. V. Fokina I. G. Altsybeeva Machine learning methods (tokenization) in marketing research Вестник университета machine learning token tokenization field research open-ended question qualitative data data processing natural language |
title | Machine learning methods (tokenization) in marketing research |
title_full | Machine learning methods (tokenization) in marketing research |
title_fullStr | Machine learning methods (tokenization) in marketing research |
title_full_unstemmed | Machine learning methods (tokenization) in marketing research |
title_short | Machine learning methods (tokenization) in marketing research |
title_sort | machine learning methods tokenization in marketing research |
topic | machine learning token tokenization field research open-ended question qualitative data data processing natural language |
url | https://vestnik.guu.ru/jour/article/view/5224 |
work_keys_str_mv | AT evganebnykh machinelearningmethodstokenizationinmarketingresearch AT nksavelieva machinelearningmethodstokenizationinmarketingresearch AT aasozinova machinelearningmethodstokenizationinmarketingresearch AT ovfokina machinelearningmethodstokenizationinmarketingresearch AT igaltsybeeva machinelearningmethodstokenizationinmarketingresearch |