A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs
Hydraulic fracturing technology has achieved remarkable results in improving the production of tight gas reservoirs, but its effectiveness is under the joint action of multiple factors of complexity. Traditional analysis methods have limitations in dealing with these complex and interrelated factors...
Saved in:
| Main Authors: | , , , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
KeAi Communications Co., Ltd.
2025-06-01
|
| Series: | Energy Geoscience |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2666759225000320 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850160266037166080 |
|---|---|
| author | Yifan Zhao Xiaofan Li Lei Zuo Zhongtai Hu Liangbin Dou Huagui Yu Tiantai Li Jun Lu |
| author_facet | Yifan Zhao Xiaofan Li Lei Zuo Zhongtai Hu Liangbin Dou Huagui Yu Tiantai Li Jun Lu |
| author_sort | Yifan Zhao |
| collection | DOAJ |
| description | Hydraulic fracturing technology has achieved remarkable results in improving the production of tight gas reservoirs, but its effectiveness is under the joint action of multiple factors of complexity. Traditional analysis methods have limitations in dealing with these complex and interrelated factors, and it is difficult to fully reveal the actual contribution of each factor to the production. Machine learning-based methods explore the complex mapping relationships between large amounts of data to provide data-driven insights into the key factors driving production. In this study, a data-driven PCA-RF-VIM (Principal Component Analysis-Random Forest-Variable Importance Measures) approach of analyzing the importance of features is proposed to identify the key factors driving post-fracturing production. Four types of parameters, including log parameters, geological and reservoir physical parameters, hydraulic fracturing design parameters, and reservoir stimulation parameters, were inputted into the PCA-RF-VIM model. The model was trained using 6-fold cross-validation and grid search, and the relative importance ranking of each factor was finally obtained. In order to verify the validity of the PCA-RF-VIM model, a consolidation model that uses three other independent data-driven methods (Pearson correlation coefficient, RF feature significance analysis method, and XGboost feature significance analysis method) are applied to compare with the PCA-RF-VIM model. A comparison the two models shows that they contain almost the same parameters in the top ten, with only minor differences in one parameter. In combination with the reservoir characteristics, the reasonableness of the PCA-RF-VIM model is verified, and the importance ranking of the parameters by this method is more consistent with the reservoir characteristics of the study area. Ultimately, the ten parameters are selected as the controlling factors that have the potential to influence post-fracturing gas production, as the combined importance of these top ten parameters is 91.95 % on driving natural gas production. Analyzing and obtaining these ten controlling factors provides engineers with a new insight into the reservoir selection for fracturing stimulation and fracturing parameter optimization to improve fracturing efficiency and productivity. |
| format | Article |
| id | doaj-art-9fb28e71edf447f59cc31e3b750d3fa3 |
| institution | OA Journals |
| issn | 2666-7592 |
| language | English |
| publishDate | 2025-06-01 |
| publisher | KeAi Communications Co., Ltd. |
| record_format | Article |
| series | Energy Geoscience |
| spelling | doaj-art-9fb28e71edf447f59cc31e3b750d3fa32025-08-20T02:23:11ZengKeAi Communications Co., Ltd.Energy Geoscience2666-75922025-06-016210041110.1016/j.engeos.2025.100411A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirsYifan Zhao0Xiaofan Li1Lei Zuo2Zhongtai Hu3Liangbin Dou4Huagui Yu5Tiantai Li6Jun Lu7College of Petroleum Engineering, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, China; Engineering Research Center of Development and Management for Low to Ultra-Low Permeability Oil & Gas Reservoirs in West China, Ministry of Education, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, ChinaCNOOC China Limited-Shanghai, Shanghai, 200335, ChinaCNOOC China Limited-Shanghai, Shanghai, 200335, ChinaCNOOC China Limited-Shanghai, Shanghai, 200335, ChinaCollege of Petroleum Engineering, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, China; Engineering Research Center of Development and Management for Low to Ultra-Low Permeability Oil & Gas Reservoirs in West China, Ministry of Education, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, ChinaCollege of Petroleum Engineering, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, China; Engineering Research Center of Development and Management for Low to Ultra-Low Permeability Oil & Gas Reservoirs in West China, Ministry of Education, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, ChinaCollege of Petroleum Engineering, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, China; Engineering Research Center of Development and Management for Low to Ultra-Low Permeability Oil & Gas Reservoirs in West China, Ministry of Education, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, China; Corresponding author.College of Petroleum Engineering, Xi'an Shiyou University, Xi'an, Shaanxi, 710065, ChinaHydraulic fracturing technology has achieved remarkable results in improving the production of tight gas reservoirs, but its effectiveness is under the joint action of multiple factors of complexity. Traditional analysis methods have limitations in dealing with these complex and interrelated factors, and it is difficult to fully reveal the actual contribution of each factor to the production. Machine learning-based methods explore the complex mapping relationships between large amounts of data to provide data-driven insights into the key factors driving production. In this study, a data-driven PCA-RF-VIM (Principal Component Analysis-Random Forest-Variable Importance Measures) approach of analyzing the importance of features is proposed to identify the key factors driving post-fracturing production. Four types of parameters, including log parameters, geological and reservoir physical parameters, hydraulic fracturing design parameters, and reservoir stimulation parameters, were inputted into the PCA-RF-VIM model. The model was trained using 6-fold cross-validation and grid search, and the relative importance ranking of each factor was finally obtained. In order to verify the validity of the PCA-RF-VIM model, a consolidation model that uses three other independent data-driven methods (Pearson correlation coefficient, RF feature significance analysis method, and XGboost feature significance analysis method) are applied to compare with the PCA-RF-VIM model. A comparison the two models shows that they contain almost the same parameters in the top ten, with only minor differences in one parameter. In combination with the reservoir characteristics, the reasonableness of the PCA-RF-VIM model is verified, and the importance ranking of the parameters by this method is more consistent with the reservoir characteristics of the study area. Ultimately, the ten parameters are selected as the controlling factors that have the potential to influence post-fracturing gas production, as the combined importance of these top ten parameters is 91.95 % on driving natural gas production. Analyzing and obtaining these ten controlling factors provides engineers with a new insight into the reservoir selection for fracturing stimulation and fracturing parameter optimization to improve fracturing efficiency and productivity.http://www.sciencedirect.com/science/article/pii/S2666759225000320Data-driven methodControlling factorHydraulic fracturingGas production |
| spellingShingle | Yifan Zhao Xiaofan Li Lei Zuo Zhongtai Hu Liangbin Dou Huagui Yu Tiantai Li Jun Lu A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs Energy Geoscience Data-driven method Controlling factor Hydraulic fracturing Gas production |
| title | A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs |
| title_full | A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs |
| title_fullStr | A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs |
| title_full_unstemmed | A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs |
| title_short | A data-driven PCA-RF-VIM method to identify key factors driving post-fracturing gas production of tight reservoirs |
| title_sort | data driven pca rf vim method to identify key factors driving post fracturing gas production of tight reservoirs |
| topic | Data-driven method Controlling factor Hydraulic fracturing Gas production |
| url | http://www.sciencedirect.com/science/article/pii/S2666759225000320 |
| work_keys_str_mv | AT yifanzhao adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT xiaofanli adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT leizuo adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT zhongtaihu adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT liangbindou adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT huaguiyu adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT tiantaili adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT junlu adatadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT yifanzhao datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT xiaofanli datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT leizuo datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT zhongtaihu datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT liangbindou datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT huaguiyu datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT tiantaili datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs AT junlu datadrivenpcarfvimmethodtoidentifykeyfactorsdrivingpostfracturinggasproductionoftightreservoirs |