Integrated AutoML-based framework for optimizing shale gas production: A case study of the Fuling shale gas field

This study introduces a comprehensive and automated framework that leverages data-driven methodologies to address various challenges in shale gas development and production. Specifically, it harnesses the power of Automated Machine Learning (AutoML) to construct an ensemble model to predict the esti...

Full description

Saved in:
Bibliographic Details
Main Authors: Tianrui Ye, Jin Meng, Yitian Xiao, Yaqiu Lu, Aiwei Zheng, Bang Liang
Format: Article
Language:English
Published: KeAi Communications Co., Ltd. 2025-03-01
Series:Energy Geoscience
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666759224000805
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832582102849159168
author Tianrui Ye
Jin Meng
Yitian Xiao
Yaqiu Lu
Aiwei Zheng
Bang Liang
author_facet Tianrui Ye
Jin Meng
Yitian Xiao
Yaqiu Lu
Aiwei Zheng
Bang Liang
author_sort Tianrui Ye
collection DOAJ
description This study introduces a comprehensive and automated framework that leverages data-driven methodologies to address various challenges in shale gas development and production. Specifically, it harnesses the power of Automated Machine Learning (AutoML) to construct an ensemble model to predict the estimated ultimate recovery (EUR) of shale gas wells. To demystify the “black-box” nature of the ensemble model, KernelSHAP, a kernel-based approach to compute Shapley values, is utilized for elucidating the influential factors that affect shale gas production at both global and local scales. Furthermore, a bi-objective optimization algorithm named NSGA-II is seamlessly incorporated to optimize hydraulic fracturing designs for production boost and cost control. This innovative framework addresses critical limitations often encountered in applying machine learning (ML) to shale gas production: the challenge of achieving sufficient model accuracy with limited samples, the multidisciplinary expertise required for developing robust ML models, and the need for interpretability in “black-box” models. Validation with field data from the Fuling shale gas field in the Sichuan Basin substantiates the framework's efficacy in enhancing the precision and applicability of data-driven techniques. The test accuracy of the ensemble ML model reached 83 % compared to a maximum of 72 % of single ML models. The contribution of each geological and engineering factor to the overall production was quantitatively evaluated. Fracturing design optimization raised EUR by 7 %–34 % under different production and cost tradeoff scenarios. The results empower domain experts to conduct more precise and objective data-driven analyses and optimizations for shale gas production with minimal expertise in data science.
format Article
id doaj-art-78c3df916e684c0b8fda9adcfaaba3b5
institution Kabale University
issn 2666-7592
language English
publishDate 2025-03-01
publisher KeAi Communications Co., Ltd.
record_format Article
series Energy Geoscience
spelling doaj-art-78c3df916e684c0b8fda9adcfaaba3b52025-01-30T05:15:05ZengKeAi Communications Co., Ltd.Energy Geoscience2666-75922025-03-0161100365Integrated AutoML-based framework for optimizing shale gas production: A case study of the Fuling shale gas fieldTianrui Ye0Jin Meng1Yitian Xiao2Yaqiu Lu3Aiwei Zheng4Bang Liang5SINOPEC Petroleum Exploration and Production Research Institute, Beijing, 100083, PR China; Corresponding author.SINOPEC Petroleum Exploration and Production Research Institute, Beijing, 100083, PR ChinaSINOPEC Petroleum Exploration and Production Research Institute, Beijing, 100083, PR ChinaSINOPEC Jianghan Oilfifield Company, Research Institute of Exploration and Development, Wuhan, Hubei, 430223, PR ChinaSINOPEC Jianghan Oilfifield Company, Research Institute of Exploration and Development, Wuhan, Hubei, 430223, PR ChinaSINOPEC Jianghan Oilfifield Company, Research Institute of Exploration and Development, Wuhan, Hubei, 430223, PR ChinaThis study introduces a comprehensive and automated framework that leverages data-driven methodologies to address various challenges in shale gas development and production. Specifically, it harnesses the power of Automated Machine Learning (AutoML) to construct an ensemble model to predict the estimated ultimate recovery (EUR) of shale gas wells. To demystify the “black-box” nature of the ensemble model, KernelSHAP, a kernel-based approach to compute Shapley values, is utilized for elucidating the influential factors that affect shale gas production at both global and local scales. Furthermore, a bi-objective optimization algorithm named NSGA-II is seamlessly incorporated to optimize hydraulic fracturing designs for production boost and cost control. This innovative framework addresses critical limitations often encountered in applying machine learning (ML) to shale gas production: the challenge of achieving sufficient model accuracy with limited samples, the multidisciplinary expertise required for developing robust ML models, and the need for interpretability in “black-box” models. Validation with field data from the Fuling shale gas field in the Sichuan Basin substantiates the framework's efficacy in enhancing the precision and applicability of data-driven techniques. The test accuracy of the ensemble ML model reached 83 % compared to a maximum of 72 % of single ML models. The contribution of each geological and engineering factor to the overall production was quantitatively evaluated. Fracturing design optimization raised EUR by 7 %–34 % under different production and cost tradeoff scenarios. The results empower domain experts to conduct more precise and objective data-driven analyses and optimizations for shale gas production with minimal expertise in data science.http://www.sciencedirect.com/science/article/pii/S2666759224000805Machine learningModel interpretationBi-objective optimizationShale gasKey factor analysisFracturing optimization
spellingShingle Tianrui Ye
Jin Meng
Yitian Xiao
Yaqiu Lu
Aiwei Zheng
Bang Liang
Integrated AutoML-based framework for optimizing shale gas production: A case study of the Fuling shale gas field
Energy Geoscience
Machine learning
Model interpretation
Bi-objective optimization
Shale gas
Key factor analysis
Fracturing optimization
title Integrated AutoML-based framework for optimizing shale gas production: A case study of the Fuling shale gas field
title_full Integrated AutoML-based framework for optimizing shale gas production: A case study of the Fuling shale gas field
title_fullStr Integrated AutoML-based framework for optimizing shale gas production: A case study of the Fuling shale gas field
title_full_unstemmed Integrated AutoML-based framework for optimizing shale gas production: A case study of the Fuling shale gas field
title_short Integrated AutoML-based framework for optimizing shale gas production: A case study of the Fuling shale gas field
title_sort integrated automl based framework for optimizing shale gas production a case study of the fuling shale gas field
topic Machine learning
Model interpretation
Bi-objective optimization
Shale gas
Key factor analysis
Fracturing optimization
url http://www.sciencedirect.com/science/article/pii/S2666759224000805
work_keys_str_mv AT tianruiye integratedautomlbasedframeworkforoptimizingshalegasproductionacasestudyofthefulingshalegasfield
AT jinmeng integratedautomlbasedframeworkforoptimizingshalegasproductionacasestudyofthefulingshalegasfield
AT yitianxiao integratedautomlbasedframeworkforoptimizingshalegasproductionacasestudyofthefulingshalegasfield
AT yaqiulu integratedautomlbasedframeworkforoptimizingshalegasproductionacasestudyofthefulingshalegasfield
AT aiweizheng integratedautomlbasedframeworkforoptimizingshalegasproductionacasestudyofthefulingshalegasfield
AT bangliang integratedautomlbasedframeworkforoptimizingshalegasproductionacasestudyofthefulingshalegasfield