Machine learning-assisted rational design and evolution of novel signal peptides in Yarrowia lipolytica
Microbial proteins hold great promise as sustainable alternatives for future protein sources, and oleaginous yeast Yarrowia lipolytica has emerged as a recognized platform for heterologous protein expression and secretion. N-terminal signal peptides (SPs) are crucial for directing proteins to the se...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
KeAi Communications Co., Ltd.
2025-12-01
|
| Series: | Synthetic and Systems Biotechnology |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2405805X2500105X |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Microbial proteins hold great promise as sustainable alternatives for future protein sources, and oleaginous yeast Yarrowia lipolytica has emerged as a recognized platform for heterologous protein expression and secretion. N-terminal signal peptides (SPs) are crucial for directing proteins to the secretion pathway, which offers advantages in both academic and industrial protein production. Although some of the innate SPs of Y. lipolytica have been reported, there is a growing need to expand the genetic toolkit of SPs to support the increasing use of Y. lipolytica as a cell factory for overproduction of various secretory proteins. In this study, we employed an efficient evolutionary approach to rapidly evolve the innate SP XPR2-pre by leveraging Gibson assembly with two synthetic overlapping oligos containing high portion of degenerate nucleotides. Using Nanoluc (Nluc) luciferase as a robust reporter, we characterized the intracellular and extracellular enzymatic activity of 447 SP mutants and identified previously undescribed SPs exhibiting superior performance compared to XPR2-pre in Nluc luciferase secretion, with improvements of up to 2.91-fold of enzymatic activity in the supernatant. The generalizability of the top-performing SPs was evaluated using three additional heterologous enzymes (β-galactosidase, α-amylase, and PET hydrolase). Our results confirmed their versatility across different proteins with protein-specific efficiency. Additionally, based on our screening, we also evaluated the performance of different feature engineering strategies and machine learning models in the design and prediction of SP mutants. This study integrated rational design, directed evolution and machine learning to identify novel SPs, expanding the repertoire of signal peptides and benefiting secretory protein overexpression in Y. lipolytica. |
|---|---|
| ISSN: | 2405-805X |