Scalable earthquake magnitude prediction using spatio-temporal data and model versioning
Abstract Earthquake magnitude prediction is critical for natural calamity prevention and mitigation, significantly reducing casualties and economic losses through timely warnings. This study introduces a novel approach by using spatio-temporal data from seismic records obtained from the Indian gover...
Saved in:
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-06-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-00804-x |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850223927121412096 |
|---|---|
| author | Rahul Singh Bholanath Roy |
| author_facet | Rahul Singh Bholanath Roy |
| author_sort | Rahul Singh |
| collection | DOAJ |
| description | Abstract Earthquake magnitude prediction is critical for natural calamity prevention and mitigation, significantly reducing casualties and economic losses through timely warnings. This study introduces a novel approach by using spatio-temporal data from seismic records obtained from the Indian government seismology department and weather data sourced via VisualCrossing to predict earthquake magnitudes. By integrating environmental and seismic variables, the study explores their interrelationships to enhance predictive capabilities. The proposed framework incorporates a machine learning operations (MLOps)-driven pipeline using MLflow for automated data ingestion, preprocessing, model versioning, tracking, and deployment. This novel integration ensures adaptability to evolving datasets and facilitates dynamic model selection for optimal performance. Multiple machine learning algorithms, including Gradient Boosting, Light Gradient Boosting Machine (LightGBM), XGBoost, and Random Forest, are evaluated on dataset sizes of 20%, 35%, 65%, and 100%, with performance metrics such as Mean Absolute Error, Mean Squared Error, Root Mean Squared Error, and R 2. The results reveal that Gradient Boosting performs optimally on smaller datasets, while LightGBM demonstrates superior accuracy with larger datasets, showcasing the pipeline’s flexibility and scalability. This research presents a scalable, robust, and resilient solution for earthquake magnitude prediction by combining diverse data sources with a dynamic and operational MLOps framework. The outcomes illustrate the potential of integrating advanced machine learning techniques with lifecycle management practices to enhance prediction accuracy and applicability in real-world seismic scenarios. |
| format | Article |
| id | doaj-art-82a5b41fbef54d08b02d2c0b2a744bcc |
| institution | OA Journals |
| issn | 2045-2322 |
| language | English |
| publishDate | 2025-06-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Reports |
| spelling | doaj-art-82a5b41fbef54d08b02d2c0b2a744bcc2025-08-20T02:05:47ZengNature PortfolioScientific Reports2045-23222025-06-0115112110.1038/s41598-025-00804-xScalable earthquake magnitude prediction using spatio-temporal data and model versioningRahul Singh0Bholanath Roy1Maulana Azad National Institute of TechnologyMaulana Azad National Institute of TechnologyAbstract Earthquake magnitude prediction is critical for natural calamity prevention and mitigation, significantly reducing casualties and economic losses through timely warnings. This study introduces a novel approach by using spatio-temporal data from seismic records obtained from the Indian government seismology department and weather data sourced via VisualCrossing to predict earthquake magnitudes. By integrating environmental and seismic variables, the study explores their interrelationships to enhance predictive capabilities. The proposed framework incorporates a machine learning operations (MLOps)-driven pipeline using MLflow for automated data ingestion, preprocessing, model versioning, tracking, and deployment. This novel integration ensures adaptability to evolving datasets and facilitates dynamic model selection for optimal performance. Multiple machine learning algorithms, including Gradient Boosting, Light Gradient Boosting Machine (LightGBM), XGBoost, and Random Forest, are evaluated on dataset sizes of 20%, 35%, 65%, and 100%, with performance metrics such as Mean Absolute Error, Mean Squared Error, Root Mean Squared Error, and R 2. The results reveal that Gradient Boosting performs optimally on smaller datasets, while LightGBM demonstrates superior accuracy with larger datasets, showcasing the pipeline’s flexibility and scalability. This research presents a scalable, robust, and resilient solution for earthquake magnitude prediction by combining diverse data sources with a dynamic and operational MLOps framework. The outcomes illustrate the potential of integrating advanced machine learning techniques with lifecycle management practices to enhance prediction accuracy and applicability in real-world seismic scenarios.https://doi.org/10.1038/s41598-025-00804-xMachine learning operations (MLOps)Machine learningDisaster predictionSpatio-temporal dataEarthquake magnitudeMLflow |
| spellingShingle | Rahul Singh Bholanath Roy Scalable earthquake magnitude prediction using spatio-temporal data and model versioning Scientific Reports Machine learning operations (MLOps) Machine learning Disaster prediction Spatio-temporal data Earthquake magnitude MLflow |
| title | Scalable earthquake magnitude prediction using spatio-temporal data and model versioning |
| title_full | Scalable earthquake magnitude prediction using spatio-temporal data and model versioning |
| title_fullStr | Scalable earthquake magnitude prediction using spatio-temporal data and model versioning |
| title_full_unstemmed | Scalable earthquake magnitude prediction using spatio-temporal data and model versioning |
| title_short | Scalable earthquake magnitude prediction using spatio-temporal data and model versioning |
| title_sort | scalable earthquake magnitude prediction using spatio temporal data and model versioning |
| topic | Machine learning operations (MLOps) Machine learning Disaster prediction Spatio-temporal data Earthquake magnitude MLflow |
| url | https://doi.org/10.1038/s41598-025-00804-x |
| work_keys_str_mv | AT rahulsingh scalableearthquakemagnitudepredictionusingspatiotemporaldataandmodelversioning AT bholanathroy scalableearthquakemagnitudepredictionusingspatiotemporaldataandmodelversioning |