Learning model combined with data clustering and dimensionality reduction for short-term electricity load forecasting

Abstract Electric load forecasting is crucial in the planning and operating electric power companies. It has evolved from statistical methods to artificial intelligence-based techniques that use machine learning models. In this study, we investigate short-term load forecasting (STLF) for large-scale...

Full description

Saved in:
Bibliographic Details
Main Authors: Hyun-Jung Bae, Jong-Seong Park, Ji-hyeok Choi, Hyuk-Yoon Kwon
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-025-86982-0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832571808485736448
author Hyun-Jung Bae
Jong-Seong Park
Ji-hyeok Choi
Hyuk-Yoon Kwon
author_facet Hyun-Jung Bae
Jong-Seong Park
Ji-hyeok Choi
Hyuk-Yoon Kwon
author_sort Hyun-Jung Bae
collection DOAJ
description Abstract Electric load forecasting is crucial in the planning and operating electric power companies. It has evolved from statistical methods to artificial intelligence-based techniques that use machine learning models. In this study, we investigate short-term load forecasting (STLF) for large-scale electricity usage datasets. We propose a new prediction model for STLF that combines data clustering and dimensionality reduction schemes to handle large-scale electricity usage data effectively. Here, we adapt k-means clustering for data clustering, kernel principal component analysis (kernel PCA), universal manifold approximation and projection (UMAP), and t-stochastic nearest neighbor (t-SNE) for dimensionality reduction. To verify the effectiveness of the proposed model, we extensively apply it to neural network-based models. We compare and analyze the performance of the proposed model with the comparisons using actual electricity usage data for 4710 households. Experimental results demonstrate that data clustering with dimensionality reduction can improve the performance of baseline models. As a result, the prediction accuracy of the proposed method outperforms those of the existing methods by 1.01–1.76 times for summer data and by 1.03–1.36 times for winter data in terms of mean absolute percentage error (MAPE).
format Article
id doaj-art-a0f6cb8dde5e46bebf63ddb2a9cf419f
institution Kabale University
issn 2045-2322
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-a0f6cb8dde5e46bebf63ddb2a9cf419f2025-02-02T12:19:21ZengNature PortfolioScientific Reports2045-23222025-01-0115111710.1038/s41598-025-86982-0Learning model combined with data clustering and dimensionality reduction for short-term electricity load forecastingHyun-Jung Bae0Jong-Seong Park1Ji-hyeok Choi2Hyuk-Yoon Kwon3Graduate School of Data Science, Seoul National University of Science and TechnologyGraduate School of Data Science, Seoul National University of Science and TechnologyDepartment of Industrial Engineering, Seoul National University of Science and TechnologyDepartment of Industrial Engineering/Graduate School of Data Science/Research Center for Electrical and Information Science, Seoul National University of Science and TechnologyAbstract Electric load forecasting is crucial in the planning and operating electric power companies. It has evolved from statistical methods to artificial intelligence-based techniques that use machine learning models. In this study, we investigate short-term load forecasting (STLF) for large-scale electricity usage datasets. We propose a new prediction model for STLF that combines data clustering and dimensionality reduction schemes to handle large-scale electricity usage data effectively. Here, we adapt k-means clustering for data clustering, kernel principal component analysis (kernel PCA), universal manifold approximation and projection (UMAP), and t-stochastic nearest neighbor (t-SNE) for dimensionality reduction. To verify the effectiveness of the proposed model, we extensively apply it to neural network-based models. We compare and analyze the performance of the proposed model with the comparisons using actual electricity usage data for 4710 households. Experimental results demonstrate that data clustering with dimensionality reduction can improve the performance of baseline models. As a result, the prediction accuracy of the proposed method outperforms those of the existing methods by 1.01–1.76 times for summer data and by 1.03–1.36 times for winter data in terms of mean absolute percentage error (MAPE).https://doi.org/10.1038/s41598-025-86982-0
spellingShingle Hyun-Jung Bae
Jong-Seong Park
Ji-hyeok Choi
Hyuk-Yoon Kwon
Learning model combined with data clustering and dimensionality reduction for short-term electricity load forecasting
Scientific Reports
title Learning model combined with data clustering and dimensionality reduction for short-term electricity load forecasting
title_full Learning model combined with data clustering and dimensionality reduction for short-term electricity load forecasting
title_fullStr Learning model combined with data clustering and dimensionality reduction for short-term electricity load forecasting
title_full_unstemmed Learning model combined with data clustering and dimensionality reduction for short-term electricity load forecasting
title_short Learning model combined with data clustering and dimensionality reduction for short-term electricity load forecasting
title_sort learning model combined with data clustering and dimensionality reduction for short term electricity load forecasting
url https://doi.org/10.1038/s41598-025-86982-0
work_keys_str_mv AT hyunjungbae learningmodelcombinedwithdataclusteringanddimensionalityreductionforshorttermelectricityloadforecasting
AT jongseongpark learningmodelcombinedwithdataclusteringanddimensionalityreductionforshorttermelectricityloadforecasting
AT jihyeokchoi learningmodelcombinedwithdataclusteringanddimensionalityreductionforshorttermelectricityloadforecasting
AT hyukyoonkwon learningmodelcombinedwithdataclusteringanddimensionalityreductionforshorttermelectricityloadforecasting