A Novel RMS-Driven Deep Reinforcement Learning for Optimized Portfolio Management in Stock Trading

Algorithmic stock trading has improved tremendously, with Reinforcement Learning (RL) algorithms being more adaptable than classic approaches like mean reversion and momentum. However, challenges remain in adequately depicting market events and generating suitable rewards to influence the trading de...

Full description

Saved in:
Bibliographic Details
Main Authors: Asma Sattar, Amna Sarwar, Saira Gillani, Maryam Bukhari, Seungmin Rho, Muhammad Faseeh
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10904473/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Algorithmic stock trading has improved tremendously, with Reinforcement Learning (RL) algorithms being more adaptable than classic approaches like mean reversion and momentum. However, challenges remain in adequately depicting market events and generating suitable rewards to influence the trading decisions of an agent in a dynamic environment. This study proposed an improved stock market trading framework termed the RMS-Driven Deep Reinforcement Learning (DRL) model for optimal portfolio management. The research attempts to give a more comprehensive view of the market and, as a result, enhance trading decisions by including consumer information and incorporating news sentiments into the model, in addition to data from typical earnings reports. More specifically, three kinds of DRL models are presented, combined with data from stock earnings reports, Max Drawdown rewards, and sentiment indicators (RMS), known as PPO_RMS, A2C_RMS, and DDPG_RMS, respectively. The findings of this research indicate that the integrated model, mainly the DDPG_RMS effectively outperforms the baseline ^DJI index in many risk-return analyses in ratios showing better risk management, and profitability. The proposed stock trading model generates a maximum cumulative return of 27% with a Sharpe ratio of 0.66, showing an appropriate trade-off between risk and return. This approach, which incorporates sentiment analysis and Max Drawdown rewards, significantly enhances the model’s performance in adapting to changing market conditions. Therefore, the results emphasize the appropriateness of integrating sentiment indices with traditional financial data to enhance a trader’s performance while also offering essential information to aid in the development of improved trading tactics in continuously changing financial markets.
ISSN:2169-3536