A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs

Hydraulic fracturing technology has achieved remarkable results in improving the production of tight gas reservoirs, but its effectiveness is under the joint action of multiple factors of complexity. Traditional analysis methods have limitations in dealing with these complex and interrelated factors...

Full description

Saved in:
Bibliographic Details
Main Authors: Yifan Zhao, Xiaofan Li, Lei Zuo, Zhongtai Hu, Liangbin Dou, Huagui Yu, Tiantai Li, Jun Lu
Format: Article
Language:English
Published: KeAi Communications Co., Ltd. 2025-06-01
Series:Energy Geoscience
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666759225000320
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850160266037166080
author Yifan Zhao
Xiaofan Li
Lei Zuo
Zhongtai Hu
Liangbin Dou
Huagui Yu
Tiantai Li
Jun Lu
author_facet Yifan Zhao
Xiaofan Li
Lei Zuo
Zhongtai Hu
Liangbin Dou
Huagui Yu
Tiantai Li
Jun Lu
author_sort Yifan Zhao
collection DOAJ
description Hydraulic fracturing technology has achieved remarkable results in improving the production of tight gas reservoirs, but its effectiveness is under the joint action of multiple factors of complexity. Traditional analysis methods have limitations in dealing with these complex and interrelated factors, and it is difficult to fully reveal the actual contribution of each factor to the production. Machine learning-based methods explore the complex mapping relationships between large amounts of data to provide data-driven insights into the key factors driving production. In this study, a data-driven PCA-RF-VIM (Principal Component Analysis-Random Forest-Variable Importance Measures) approach of analyzing the importance of features is proposed to identify the key factors driving post-fracturing production. Four types of parameters, including log parameters, geological and reservoir physical parameters, hydraulic fracturing design parameters, and reservoir stimulation parameters, were inputted into the PCA-RF-VIM model. The model was trained using 6-fold cross-validation and grid search, and the relative importance ranking of each factor was finally obtained. In order to verify the validity of the PCA-RF-VIM model, a consolidation model that uses three other independent data-driven methods (Pearson correlation coefficient, RF feature significance analysis method, and XGboost feature significance analysis method) are applied to compare with the PCA-RF-VIM model. A comparison the two models shows that they contain almost the same parameters in the top ten, with only minor differences in one parameter. In combination with the reservoir characteristics, the reasonableness of the PCA-RF-VIM model is verified, and the importance ranking of the parameters by this method is more consistent with the reservoir characteristics of the study area. Ultimately, the ten parameters are selected as the controlling factors that have the potential to influence post-fracturing gas production, as the combined importance of these top ten parameters is 91.95 % on driving natural gas production. Analyzing and obtaining these ten controlling factors provides engineers with a new insight into the reservoir selection for fracturing stimulation and fracturing parameter optimization to improve fracturing efficiency and productivity.
format Article
id doaj-art-9fb28e71edf447f59cc31e3b750d3fa3
institution OA Journals
issn 2666-7592
language English
publishDate 2025-06-01
publisher KeAi Communications Co., Ltd.
record_format Article
series Energy Geoscience
spelling doaj-art-9fb28e71edf447f59cc31e3b750d3fa32025-08-20T02:23:11ZengKeAi Communications Co., Ltd.Energy Geoscience2666-75922025-06-016210041110.1016/j.engeos.2025.100411A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirsYifan Zhao0Xiaofan Li1Lei Zuo2Zhongtai Hu3Liangbin Dou4Huagui Yu5Tiantai Li6Jun Lu7College of Petroleum Engineering, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, China; Engineering Research Center of Development and Management for Low to Ultra-Low Permeability Oil & Gas Reservoirs in West China, Ministry of Education, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, ChinaCNOOC China Limited-Shanghai, Shanghai, 200335, ChinaCNOOC China Limited-Shanghai, Shanghai, 200335, ChinaCNOOC China Limited-Shanghai, Shanghai, 200335, ChinaCollege of Petroleum Engineering, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, China; Engineering Research Center of Development and Management for Low to Ultra-Low Permeability Oil & Gas Reservoirs in West China, Ministry of Education, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, ChinaCollege of Petroleum Engineering, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, China; Engineering Research Center of Development and Management for Low to Ultra-Low Permeability Oil & Gas Reservoirs in West China, Ministry of Education, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, ChinaCollege of Petroleum Engineering, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, China; Engineering Research Center of Development and Management for Low to Ultra-Low Permeability Oil & Gas Reservoirs in West China, Ministry of Education, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, China; Corresponding author.College of Petroleum Engineering, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, ChinaHydraulic fracturing technology has achieved remarkable results in improving the production of tight gas reservoirs, but its effectiveness is under the joint action of multiple factors of complexity. Traditional analysis methods have limitations in dealing with these complex and interrelated factors, and it is difficult to fully reveal the actual contribution of each factor to the production. Machine learning-based methods explore the complex mapping relationships between large amounts of data to provide data-driven insights into the key factors driving production. In this study, a data-driven PCA-RF-VIM (Principal Component Analysis-Random Forest-Variable Importance Measures) approach of analyzing the importance of features is proposed to identify the key factors driving post-fracturing production. Four types of parameters, including log parameters, geological and reservoir physical parameters, hydraulic fracturing design parameters, and reservoir stimulation parameters, were inputted into the PCA-RF-VIM model. The model was trained using 6-fold cross-validation and grid search, and the relative importance ranking of each factor was finally obtained. In order to verify the validity of the PCA-RF-VIM model, a consolidation model that uses three other independent data-driven methods (Pearson correlation coefficient, RF feature significance analysis method, and XGboost feature significance analysis method) are applied to compare with the PCA-RF-VIM model. A comparison the two models shows that they contain almost the same parameters in the top ten, with only minor differences in one parameter. In combination with the reservoir characteristics, the reasonableness of the PCA-RF-VIM model is verified, and the importance ranking of the parameters by this method is more consistent with the reservoir characteristics of the study area. Ultimately, the ten parameters are selected as the controlling factors that have the potential to influence post-fracturing gas production, as the combined importance of these top ten parameters is 91.95 % on driving natural gas production. Analyzing and obtaining these ten controlling factors provides engineers with a new insight into the reservoir selection for fracturing stimulation and fracturing parameter optimization to improve fracturing efficiency and productivity.http://www.sciencedirect.com/science/article/pii/S2666759225000320Data-driven methodControlling factorHydraulic fracturingGas production
spellingShingle Yifan Zhao
Xiaofan Li
Lei Zuo
Zhongtai Hu
Liangbin Dou
Huagui Yu
Tiantai Li
Jun Lu
A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs
Energy Geoscience
Data-driven method
Controlling factor
Hydraulic fracturing
Gas production
title A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs
title_full A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs
title_fullStr A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs
title_full_unstemmed A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs
title_short A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs
title_sort data driven pca rf vim method to identify key factors driving post fracturing gas production of tight reservoirs
topic Data-driven method
Controlling factor
Hydraulic fracturing
Gas production
url http://www.sciencedirect.com/science/article/pii/S2666759225000320
work_keys_str_mv AT yifanzhao adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT xiaofanli adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT leizuo adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT zhongtaihu adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT liangbindou adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT huaguiyu adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT tiantaili adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT junlu adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT yifanzhao datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT xiaofanli datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT leizuo datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT zhongtaihu datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT liangbindou datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT huaguiyu datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT tiantaili datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs
AT junlu datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs