Stock return forecasting based on the proxy variables of category factors

Abstract Stock return prediction has been in the spotlight because it involves numerous factors. Improving the accuracy of stock return prediction and quantifying the impact of individual factors on forecasting remain challenging tasks. Motivated by these challenges, we propose a novel forecasting m...

Full description

Saved in:
Bibliographic Details
Main Authors: Yuan Zhao, Xue Gong, Weiguo Zhang, Weijun Xu
Format: Article
Language:English
Published: SpringerOpen 2025-06-01
Series:Financial Innovation
Subjects:
Online Access:https://doi.org/10.1186/s40854-025-00779-8
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850102238661312512
author Yuan Zhao
Xue Gong
Weiguo Zhang
Weijun Xu
author_facet Yuan Zhao
Xue Gong
Weiguo Zhang
Weijun Xu
author_sort Yuan Zhao
collection DOAJ
description Abstract Stock return prediction has been in the spotlight because it involves numerous factors. Improving the accuracy of stock return prediction and quantifying the impact of individual factors on forecasting remain challenging tasks. Motivated by these challenges, we propose a novel forecasting method that entails proxy variables of category factors and the random forest technique. This new method aims to quantify the information and importance of category factors, thereby enhancing the predictability of stock returns. Specifically, we categorize a large set of return predictors into several category factors. We then utilize the importance of the original variables to construct proxy variables for these category factors. Subsequently, we use the proxy variables to build a random forest model for predicting stock returns. Our empirical analysis results demonstrate that the proposed method effectively quantifies the importance of both the original factors and category factors. Furthermore, we find that the fundamental information factor consistently ranks as the most crucial category factor for stock return forecasting. Additionally, the proposed method exhibits a more robust and prominent prediction performance than competing models such as single-category-factor-based random forest models, dimension-reduction, and forecast-combination methods. Most importantly, the proposed method produces forecast results that can assist investors with understanding stock market dynamics and facilitate higher investment returns.
format Article
id doaj-art-d5037d99d7b74ed8a0912c703f2a3979
institution DOAJ
issn 2199-4730
language English
publishDate 2025-06-01
publisher SpringerOpen
record_format Article
series Financial Innovation
spelling doaj-art-d5037d99d7b74ed8a0912c703f2a39792025-08-20T02:39:48ZengSpringerOpenFinancial Innovation2199-47302025-06-0111114810.1186/s40854-025-00779-8Stock return forecasting based on the proxy variables of category factorsYuan Zhao0Xue Gong1Weiguo Zhang2Weijun Xu3School of Economics and Management, Lanzhou University of TechnologySchool of Economics and Management, Nanjing University of Science and TechnologyCollege of Management, Shenzhen UniversitySchool of Business Administration, South China University of TechnologyAbstract Stock return prediction has been in the spotlight because it involves numerous factors. Improving the accuracy of stock return prediction and quantifying the impact of individual factors on forecasting remain challenging tasks. Motivated by these challenges, we propose a novel forecasting method that entails proxy variables of category factors and the random forest technique. This new method aims to quantify the information and importance of category factors, thereby enhancing the predictability of stock returns. Specifically, we categorize a large set of return predictors into several category factors. We then utilize the importance of the original variables to construct proxy variables for these category factors. Subsequently, we use the proxy variables to build a random forest model for predicting stock returns. Our empirical analysis results demonstrate that the proposed method effectively quantifies the importance of both the original factors and category factors. Furthermore, we find that the fundamental information factor consistently ranks as the most crucial category factor for stock return forecasting. Additionally, the proposed method exhibits a more robust and prominent prediction performance than competing models such as single-category-factor-based random forest models, dimension-reduction, and forecast-combination methods. Most importantly, the proposed method produces forecast results that can assist investors with understanding stock market dynamics and facilitate higher investment returns.https://doi.org/10.1186/s40854-025-00779-8Return predictionCategory factorsDimension reductionRandom forest
spellingShingle Yuan Zhao
Xue Gong
Weiguo Zhang
Weijun Xu
Stock return forecasting based on the proxy variables of category factors
Financial Innovation
Return prediction
Category factors
Dimension reduction
Random forest
title Stock return forecasting based on the proxy variables of category factors
title_full Stock return forecasting based on the proxy variables of category factors
title_fullStr Stock return forecasting based on the proxy variables of category factors
title_full_unstemmed Stock return forecasting based on the proxy variables of category factors
title_short Stock return forecasting based on the proxy variables of category factors
title_sort stock return forecasting based on the proxy variables of category factors
topic Return prediction
Category factors
Dimension reduction
Random forest
url https://doi.org/10.1186/s40854-025-00779-8
work_keys_str_mv AT yuanzhao stockreturnforecastingbasedontheproxyvariablesofcategoryfactors
AT xuegong stockreturnforecastingbasedontheproxyvariablesofcategoryfactors
AT weiguozhang stockreturnforecastingbasedontheproxyvariablesofcategoryfactors
AT weijunxu stockreturnforecastingbasedontheproxyvariablesofcategoryfactors