Evaluating embedding models for text classification in apartment management

Bibliographic Details
Main Author: Changro Lee
Format: Article
Language: English
Published: Vilnius Gediminas Technical University 2025-04-01
Series: International Journal of Strategic Property Management
Online Access: https://ijspm.vgtu.lt/index.php/IJSPM/article/view/23637
Description
Summary: The recent proliferation of embedding models has enhanced the accessibility of textual data classification. However, the crucial challenge is evaluating and selecting the most effective embedding model for a specific domain from a vast number of options. In this study, we address this challenge by assessing the performance of embedding models based on their effectiveness in downstream tasks. We analyze consultation records maintained by an apartment management body in South Korea, and convert this textual data into numerical representations using various embedding models. The vectorized text is then categorized using a k-means clustering algorithm. The downstream task, specifically, the classification of consultation records, is evaluated using a quantitative metric (Silhouette score) and qualitative approaches (domain-specific knowledge and visual inspection). The qualitative approaches yield more reliable results than the quantitative approach. These findings are expected to be valuable for the various stakeholders in property management.
ISSN: 1648-715X, 1648-9179
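
Illustration: The quantitative part of the workflow described in the summary (vectorize text, cluster with k-means, score the clustering with the Silhouette score) can be sketched roughly as below. This is a minimal sketch and not the article's implementation. The two sets of 2-D vectors are made-up stand-ins for the output of two hypothetical embedding models applied to the same records, and the function names `kmeans` and `silhouette_score` are illustrative. Real embeddings have hundreds of dimensions and would normally be clustered with a library such as scikit-learn.

```python
import math
import random


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's k-means; returns one cluster label per point."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # pick k distinct points as initial centers
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest center.
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c]))
                  for p in points]
        # Recompute each center as the mean of its members.
        new_centers = []
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                new_centers.append(
                    tuple(sum(x) / len(members) for x in zip(*members)))
            else:
                new_centers.append(centers[c])  # keep an empty cluster's center
        if new_centers == centers:
            break  # assignments are stable
        centers = new_centers
    return labels


def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points (range -1 to 1)."""
    clusters = {}
    for p, l in zip(points, labels):
        clusters.setdefault(l, []).append(p)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")
    total = 0.0
    for p, l in zip(points, labels):
        own = clusters[l]
        if len(own) == 1:
            continue  # a singleton cluster contributes 0 by convention
        # a: mean distance to the other members of the point's own cluster.
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: mean distance to the members of the nearest other cluster.
        b = min(sum(math.dist(p, q) for q in other) / len(other)
                for m, other in clusters.items() if m != l)
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)


# Made-up vectors standing in for two hypothetical embedding models.
embeddings_a = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
                (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]   # tight, well separated
embeddings_b = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0),
                (1.5, 1.5), (2.5, 1.5), (1.5, 2.5)]   # looser, closer together

score_a = silhouette_score(embeddings_a, kmeans(embeddings_a, k=2))
score_b = silhouette_score(embeddings_b, kmeans(embeddings_b, k=2))
print(f"model A silhouette: {score_a:.3f}")
print(f"model B silhouette: {score_b:.3f}")
```

A higher score indicates tighter, better-separated clusters. As the summary notes, the study found qualitative checks (domain knowledge and visual inspection) more reliable than this metric, so a score like this is best read as one signal among several.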