Bitcoin price direction prediction using on-chain data and feature selection

Bitcoin is the most traded cryptocurrency by volume and market cap. A number of scholars have directed their research towards characterizing Bitcoin’s speculative behavior using a myriad of techniques such as technical analysis, price regression, and direction classification. For this work, research...

Full description

Saved in:
Bibliographic Details
Main Authors: Ritwik Dubey, David Enke
Format: Article
Language:English
Published: Elsevier 2025-06-01
Series:Machine Learning with Applications
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S266682702500057X
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850222488876744704
author Ritwik Dubey
David Enke
author_facet Ritwik Dubey
David Enke
author_sort Ritwik Dubey
collection DOAJ
description Bitcoin is the most traded cryptocurrency by volume and market cap. A number of scholars have directed their research towards characterizing Bitcoin’s speculative behavior using a myriad of techniques such as technical analysis, price regression, and direction classification. For this work, research is conducted using the relatively nascent technique of on-chain data analysis. The goal of this research is to evaluate Bitcoin’s on-chain data in predicting future price direction. First, a classification process of on-chain data features that helps the reader understand their relevance is proposed. To address the curse of dimensionality, feature selection algorithms such as L1 regression, Boruta, and the dimensionality reduction algorithm Principal Component Analysis (PCA) are utilized. The research then explores advanced neural networks for next day price direction prediction, including the Convolutional Neural Network-Long-Short Term Memory (CNN-LSTM) and the Temporal Convolutional Network (TCN). Neural network models and trading strategies are then compared based on their return statistics. A comparative analysis of feature selection, learning model performance, and trading strategy performance is also conducted. Results from the research show that the Boruta feature selection algorithm combined with the CNN-LSTM model performs best compared to other combinations with a prediction accuracy of 82.03 % over the testing period. In addition, the on-chain features within the category, realized value, and unrealized value classifications have higher predictive powers for next day price direction prediction. Finally, during trade simulations, the CNN-LSTM model with a Long-Short strategy had an annualized return of 1682.7 % and a Sharpe Ratio of 6.47.
format Article
id doaj-art-39d48fcbd6834f35b3fa1c730cf61cfa
institution OA Journals
issn 2666-8270
language English
publishDate 2025-06-01
publisher Elsevier
record_format Article
series Machine Learning with Applications
spelling doaj-art-39d48fcbd6834f35b3fa1c730cf61cfa2025-08-20T02:06:20ZengElsevierMachine Learning with Applications2666-82702025-06-012010067410.1016/j.mlwa.2025.100674Bitcoin price direction prediction using on-chain data and feature selectionRitwik Dubey0David Enke1Laboratory for Investment and Financial Engineering, Department of Engineering Management and Systems Engineering, Missouri University of Science and Technology, 223 Engineering Management, 600 W. 14th Street, Rolla, MO 65409-0370, USALaboratory for Investment and Financial Engineering, Department of Engineering Management and Systems Engineering, Missouri University of Science and Technology, 221 Engineering Management, 600 W. 14th Street, Rolla, MO 65409-0370, USA; Corresponding author.Bitcoin is the most traded cryptocurrency by volume and market cap. A number of scholars have directed their research towards characterizing Bitcoin’s speculative behavior using a myriad of techniques such as technical analysis, price regression, and direction classification. For this work, research is conducted using the relatively nascent technique of on-chain data analysis. The goal of this research is to evaluate Bitcoin’s on-chain data in predicting future price direction. First, a classification process of on-chain data features that helps the reader understand their relevance is proposed. To address the curse of dimensionality, feature selection algorithms such as L1 regression, Boruta, and the dimensionality reduction algorithm Principal Component Analysis (PCA) are utilized. The research then explores advanced neural networks for next day price direction prediction, including the Convolutional Neural Network-Long-Short Term Memory (CNN-LSTM) and the Temporal Convolutional Network (TCN). Neural network models and trading strategies are then compared based on their return statistics. A comparative analysis of feature selection, learning model performance, and trading strategy performance is also conducted. Results from the research show that the Boruta feature selection algorithm combined with the CNN-LSTM model performs best compared to other combinations with a prediction accuracy of 82.03 % over the testing period. In addition, the on-chain features within the category, realized value, and unrealized value classifications have higher predictive powers for next day price direction prediction. Finally, during trade simulations, the CNN-LSTM model with a Long-Short strategy had an annualized return of 1682.7 % and a Sharpe Ratio of 6.47.http://www.sciencedirect.com/science/article/pii/S266682702500057XBitcoinOn-chain dataFeature selectionCNN-LSTMTCN
spellingShingle Ritwik Dubey
David Enke
Bitcoin price direction prediction using on-chain data and feature selection
Machine Learning with Applications
Bitcoin
On-chain data
Feature selection
CNN-LSTM
TCN
title Bitcoin price direction prediction using on-chain data and feature selection
title_full Bitcoin price direction prediction using on-chain data and feature selection
title_fullStr Bitcoin price direction prediction using on-chain data and feature selection
title_full_unstemmed Bitcoin price direction prediction using on-chain data and feature selection
title_short Bitcoin price direction prediction using on-chain data and feature selection
title_sort bitcoin price direction prediction using on chain data and feature selection
topic Bitcoin
On-chain data
Feature selection
CNN-LSTM
TCN
url http://www.sciencedirect.com/science/article/pii/S266682702500057X
work_keys_str_mv AT ritwikdubey bitcoinpricedirectionpredictionusingonchaindataandfeatureselection
AT davidenke bitcoinpricedirectionpredictionusingonchaindataandfeatureselection