Examining peptide–gold nanoparticle interactions through explainable machine learning
Abstract Peptides are promising tools for controlled nanomaterial creation. Peptides are essential for directing the process by which nanostructures develop as well as for affecting the final characteristics of the resultant nanomaterials. It is expected that integrating machine learning (ML) into t...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Springer
2025-05-01
|
| Series: | Discover Chemistry |
| Subjects: | |
| Online Access: | https://doi.org/10.1007/s44371-025-00191-2 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Abstract Peptides are promising tools for controlled nanomaterial creation. Peptides are essential for directing the process by which nanostructures develop as well as for affecting the final characteristics of the resultant nanomaterials. It is expected that integrating machine learning (ML) into the biomimetic workflow will speed up peptide discovery, increase resource efficiency, and reveal relationships between characteristics that might be helpful for peptide design. However, the need for interpretability in ML models is crucial, as it fosters deeper understanding about the underlying factors driving the predictions, thereby ensuring technical soundness and replicability. This work develops an explainable binary machine learning classifier using rough sets as the algorithm and amino acid composition as the features. The model was trained and evaluated on a dataset composed of 1720 peptides and their experimentally determined binding affinity for gold nanoparticles. The created model demonstrates reliable accuracy in distinguishing strong-binding peptides from weak binders. Specifically, the model identified sequence patterns important for peptide binding to gold, such as the residues categorized as tiny, small, and aromatic are greater than 15% in terms of composition. Similarly, strong binders were found to possess residues categorized as aliphatic to fall within 5–35%. Additionally, nonpolar amino acids were found to be greater than 15% in all instances. Taken together, using an explainable algorithm and intuitive features yielded an ML model that has provided straightforward insights regarding the composition of strong-binding peptides. This is valuable especially as rational peptide design for bionanotechnology applications remains a challenging endeavor. |
|---|---|
| ISSN: | 3005-1193 |