Optimizing Xylanase Production: Bridging Statistical Design and Machine Learning for Improved Protein Production

Proteins are crucial for medicine, pharmaceuticals, food, and environmental applications since they are used in various fields such as synthesis of drugs, industrial enzyme production, biodegradable plastics, bioremediation processes, etc. Xylanase is an important and versatile enzyme with applicati...

Full description

Saved in:
Bibliographic Details
Main Authors: Merve Aslı Ergün, Başak Esin Köktürk-Güzel, Tuğba Keskin-Gündoğdu
Format: Article
Language:English
Published: MDPI AG 2025-06-01
Series:Fermentation
Subjects:
Online Access:https://www.mdpi.com/2311-5637/11/6/319
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850168518277857280
author Merve Aslı Ergün
Başak Esin Köktürk-Güzel
Tuğba Keskin-Gündoğdu
author_facet Merve Aslı Ergün
Başak Esin Köktürk-Güzel
Tuğba Keskin-Gündoğdu
author_sort Merve Aslı Ergün
collection DOAJ
description Proteins are crucial for medicine, pharmaceuticals, food, and environmental applications since they are used in various fields such as synthesis of drugs, industrial enzyme production, biodegradable plastics, bioremediation processes, etc. Xylanase is an important and versatile enzyme with applications across various industries, including pulp and paper, biofuel production, food processing, textiles, laundry detergents, and animal feed. Key parameters in biotechnological protein production include temperature, pH, and working volumes and especially medium compositions where optimization is crucial for large-scale applications due to cost considerations. Machine learning methods have emerged as effective alternatives to traditional statistical approaches in optimization. This study focuses on optimizing xylanase production via bioprocesses by employing regression analysis on datasets from various studies. The extreme gradient boosting (XGBoost) regression model was applied to predict xylanase activity under different experimental conditions, accurately predicting xylanase activity and identifying the significance of each variable. This study utilized experimentally derived datasets from peer-reviewed publications, in which the corresponding laboratory experiments had already been conducted and validated. The results demonstrate that machine learning methods can effectively optimize protein production processes, offering a strong alternative to traditional statistical approaches.
format Article
id doaj-art-ffcda37fa0944b1eb0836bc1b94141dc
institution OA Journals
issn 2311-5637
language English
publishDate 2025-06-01
publisher MDPI AG
record_format Article
series Fermentation
spelling doaj-art-ffcda37fa0944b1eb0836bc1b94141dc2025-08-20T02:20:57ZengMDPI AGFermentation2311-56372025-06-0111631910.3390/fermentation11060319Optimizing Xylanase Production: Bridging Statistical Design and Machine Learning for Improved Protein ProductionMerve Aslı Ergün0Başak Esin Köktürk-Güzel1Tuğba Keskin-Gündoğdu2Department of Operations Research, Graduate School of Natural and Applied Sciences, Izmir Demokrasi University, Izmir 35140, TürkiyeDepartment of Electrical and Electronics Engineering, Faculty of Engineering, Izmir Demokrasi University, Izmir 35140, TürkiyeDepartment of Operations Research, Graduate School of Natural and Applied Sciences, Izmir Demokrasi University, Izmir 35140, TürkiyeProteins are crucial for medicine, pharmaceuticals, food, and environmental applications since they are used in various fields such as synthesis of drugs, industrial enzyme production, biodegradable plastics, bioremediation processes, etc. Xylanase is an important and versatile enzyme with applications across various industries, including pulp and paper, biofuel production, food processing, textiles, laundry detergents, and animal feed. Key parameters in biotechnological protein production include temperature, pH, and working volumes and especially medium compositions where optimization is crucial for large-scale applications due to cost considerations. Machine learning methods have emerged as effective alternatives to traditional statistical approaches in optimization. This study focuses on optimizing xylanase production via bioprocesses by employing regression analysis on datasets from various studies. The extreme gradient boosting (XGBoost) regression model was applied to predict xylanase activity under different experimental conditions, accurately predicting xylanase activity and identifying the significance of each variable. This study utilized experimentally derived datasets from peer-reviewed publications, in which the corresponding laboratory experiments had already been conducted and validated. The results demonstrate that machine learning methods can effectively optimize protein production processes, offering a strong alternative to traditional statistical approaches.https://www.mdpi.com/2311-5637/11/6/319xylanase productionXGBoost regressionoptimizationprotein productionmachine learning
spellingShingle Merve Aslı Ergün
Başak Esin Köktürk-Güzel
Tuğba Keskin-Gündoğdu
Optimizing Xylanase Production: Bridging Statistical Design and Machine Learning for Improved Protein Production
Fermentation
xylanase production
XGBoost regression
optimization
protein production
machine learning
title Optimizing Xylanase Production: Bridging Statistical Design and Machine Learning for Improved Protein Production
title_full Optimizing Xylanase Production: Bridging Statistical Design and Machine Learning for Improved Protein Production
title_fullStr Optimizing Xylanase Production: Bridging Statistical Design and Machine Learning for Improved Protein Production
title_full_unstemmed Optimizing Xylanase Production: Bridging Statistical Design and Machine Learning for Improved Protein Production
title_short Optimizing Xylanase Production: Bridging Statistical Design and Machine Learning for Improved Protein Production
title_sort optimizing xylanase production bridging statistical design and machine learning for improved protein production
topic xylanase production
XGBoost regression
optimization
protein production
machine learning
url https://www.mdpi.com/2311-5637/11/6/319
work_keys_str_mv AT merveaslıergun optimizingxylanaseproductionbridgingstatisticaldesignandmachinelearningforimprovedproteinproduction
AT basakesinkokturkguzel optimizingxylanaseproductionbridgingstatisticaldesignandmachinelearningforimprovedproteinproduction
AT tugbakeskingundogdu optimizingxylanaseproductionbridgingstatisticaldesignandmachinelearningforimprovedproteinproduction