Financial risk forecasting with RGCT-prerisk: a relational graph and cross-temporal contrastive pretraining framework

Abstract Financial risk forecasting is critical for the early detection of corporate distress, yet traditional methods and recent deep learning models exhibit notable limitations. Prior approaches often rely on predefined financial ratios or brute-force feature combinations, which may overlook the r...

Full description

Saved in:
Bibliographic Details
Main Authors: Liyu Chen, Xiangwei Fan
Format: Article
Language:English
Published: Springer 2025-07-01
Series:Journal of King Saud University: Computer and Information Sciences
Subjects:
Online Access:https://doi.org/10.1007/s44443-025-00166-4
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849341670994214912
author Liyu Chen
Xiangwei Fan
author_facet Liyu Chen
Xiangwei Fan
author_sort Liyu Chen
collection DOAJ
description Abstract Financial risk forecasting is critical for the early detection of corporate distress, yet traditional methods and recent deep learning models exhibit notable limitations. Prior approaches often rely on predefined financial ratios or brute-force feature combinations, which may overlook the relational structures among financial items and suffer from high dimensionality. In this paper, we propose RGCT-PreRisk, a novel framework that models a company’s financial statements as a heterogeneous relational graph-where nodes represent financial accounts (e.g., assets, liabilities) and edges encode known accounting relationships (e.g., summation or ratio rules)-rather than as an unstructured feature matrix. A graph neural network (GNN) is employed to capture meaningful relationships between financial items (e.g., assets, liabilities, revenues), replacing exhaustive pairwise operations with learned propagation along true accounting dependencies. To address the challenge of limited labeled data, we introduce a cross-temporal, cross-company contrastive pretraining strategy that leverages historical data across multiple firms and time periods to learn robust and generalizable representations. Furthermore, we incorporate a prototype-attention-confidence module to enhance interpretability. This component enables the model to compare each firm’s financial state to learned prototypical risk patterns via an attention mechanism, while also producing a confidence score to quantify prediction uncertainty. Experiments on two real-world datasets demonstrate that RGCT-PreRisk consistently outperforms existing baselines in terms of accuracy and F1 score. Our approach achieves state-of-the-art predictive performance while providing human-interpretable insights into why a firm is predicted to be at risk. This work presents a new direction for interpretable financial risk forecasting by integrating graph-based representation learning, contrastive pretraining, and case-based reasoning.
format Article
id doaj-art-9cebfa0a60204829a9d926f3bed12b0c
institution Kabale University
issn 1319-1578
2213-1248
language English
publishDate 2025-07-01
publisher Springer
record_format Article
series Journal of King Saud University: Computer and Information Sciences
spelling doaj-art-9cebfa0a60204829a9d926f3bed12b0c2025-08-20T03:43:34ZengSpringerJournal of King Saud University: Computer and Information Sciences1319-15782213-12482025-07-0137612010.1007/s44443-025-00166-4Financial risk forecasting with RGCT-prerisk: a relational graph and cross-temporal contrastive pretraining frameworkLiyu Chen0Xiangwei Fan1International Business School, Qingdao Huanghai UniversityQingdao Talent and Enterprise Service Group Co., Ltd.Abstract Financial risk forecasting is critical for the early detection of corporate distress, yet traditional methods and recent deep learning models exhibit notable limitations. Prior approaches often rely on predefined financial ratios or brute-force feature combinations, which may overlook the relational structures among financial items and suffer from high dimensionality. In this paper, we propose RGCT-PreRisk, a novel framework that models a company’s financial statements as a heterogeneous relational graph-where nodes represent financial accounts (e.g., assets, liabilities) and edges encode known accounting relationships (e.g., summation or ratio rules)-rather than as an unstructured feature matrix. A graph neural network (GNN) is employed to capture meaningful relationships between financial items (e.g., assets, liabilities, revenues), replacing exhaustive pairwise operations with learned propagation along true accounting dependencies. To address the challenge of limited labeled data, we introduce a cross-temporal, cross-company contrastive pretraining strategy that leverages historical data across multiple firms and time periods to learn robust and generalizable representations. Furthermore, we incorporate a prototype-attention-confidence module to enhance interpretability. This component enables the model to compare each firm’s financial state to learned prototypical risk patterns via an attention mechanism, while also producing a confidence score to quantify prediction uncertainty. Experiments on two real-world datasets demonstrate that RGCT-PreRisk consistently outperforms existing baselines in terms of accuracy and F1 score. Our approach achieves state-of-the-art predictive performance while providing human-interpretable insights into why a firm is predicted to be at risk. This work presents a new direction for interpretable financial risk forecasting by integrating graph-based representation learning, contrastive pretraining, and case-based reasoning.https://doi.org/10.1007/s44443-025-00166-4Financial risk predictionRelational graph neural networksNatural language processingContrastive pretrainingStructured financial text modeling
spellingShingle Liyu Chen
Xiangwei Fan
Financial risk forecasting with RGCT-prerisk: a relational graph and cross-temporal contrastive pretraining framework
Journal of King Saud University: Computer and Information Sciences
Financial risk prediction
Relational graph neural networks
Natural language processing
Contrastive pretraining
Structured financial text modeling
title Financial risk forecasting with RGCT-prerisk: a relational graph and cross-temporal contrastive pretraining framework
title_full Financial risk forecasting with RGCT-prerisk: a relational graph and cross-temporal contrastive pretraining framework
title_fullStr Financial risk forecasting with RGCT-prerisk: a relational graph and cross-temporal contrastive pretraining framework
title_full_unstemmed Financial risk forecasting with RGCT-prerisk: a relational graph and cross-temporal contrastive pretraining framework
title_short Financial risk forecasting with RGCT-prerisk: a relational graph and cross-temporal contrastive pretraining framework
title_sort financial risk forecasting with rgct prerisk a relational graph and cross temporal contrastive pretraining framework
topic Financial risk prediction
Relational graph neural networks
Natural language processing
Contrastive pretraining
Structured financial text modeling
url https://doi.org/10.1007/s44443-025-00166-4
work_keys_str_mv AT liyuchen financialriskforecastingwithrgctpreriskarelationalgraphandcrosstemporalcontrastivepretrainingframework
AT xiangweifan financialriskforecastingwithrgctpreriskarelationalgraphandcrosstemporalcontrastivepretrainingframework