Comparison of Language Models in Skills Extraction From Vacancies and Resumes

The ability of large language models (LLMs) to “understand” large volumes of text data allows for consistent quality selection of candidates for company openings. The purpose of the work is to consider the capabilities of language models (LLM) and the specified method using vector representations in...

Full description

Saved in:
Bibliographic Details
Main Authors: Lyubov Komarova, Vladimir Soloviev, Alexey Kolosov
Format: Article
Language:Russian
Published: The Fund for Promotion of Internet media, IT education, human development «League Internet Media» 2024-03-01
Series:Современные информационные технологии и IT-образование
Subjects:
Online Access:http://sitito.cs.msu.ru/index.php/SITITO/article/view/1078
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The ability of large language models (LLMs) to “understand” large volumes of text data allows for consistent quality selection of candidates for company openings. The purpose of the work is to consider the capabilities of language models (LLM) and the specified method using vector representations in the tasks of extracting skills from texts of vacancies and resumes. Particular attention is paid to the use of skill ranking methods using LLM and the following method using the cosine distance between vector representations of skills. The study consisted of three experiments: the first experiment aimed to extract skill phrases from described work experiences from a text resume; the second involves assigning skills from the resume text to a reference set of functions from the job requirements; The third experiment aims to evaluate the best performance between the two skill sets. The result of the study is the selection of the best model and the derivation of functions from the summary text, as well as a comparison of the two sets of functions with each other. Experiments have shown that language models are superior to numerical methods in terms of accuracy and flexibility in determining the capabilities of a text. Using LLM to rank features using cosine distance has shown poor performance and accuracy in measuring features between job openings and resumes. However, the numerical method using vector representation methods showed better results in quality ranking and stability with increasing number of parliamentary examples. The results of this study have practical implications for the development of more accurate and efficient personnel selection systems. The introduction of language models into human resource management processes can improve the quality and speed of processing large volumes of data, which will lead to a more accurate and faster selection of qualified specialists.
ISSN:2411-1473