MINN: A metabolic-informed neural network for integrating omics data into genome-scale metabolic modeling

The understanding of cellular behavior relies on the integration of metabolism and its regulation. Multi-omics data provide a detailed snapshot of the molecular processes underpinning cellular functions and their regulation, describing the current state of the cell. While Machine Learning (ML) model...

Full description

Saved in:
Bibliographic Details
Main Authors: Gabriele Tazza, Francesco Moro, Dario Ruggeri, Bas Teusink, László Vidács
Format: Article
Language:English
Published: Elsevier 2025-01-01
Series:Computational and Structural Biotechnology Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2001037025003265
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The understanding of cellular behavior relies on the integration of metabolism and its regulation. Multi-omics data provide a detailed snapshot of the molecular processes underpinning cellular functions and their regulation, describing the current state of the cell. While Machine Learning (ML) models can uncover complex patterns and relationships within these data, they require large datasets for training and often lack interpretability. On the other hand, mathematical models, such as Genome-Scale Metabolic Models (GEMs), offer a structured framework for analyzing the organization and dynamics of specific cellular mechanisms. At the same time, they don't allow for seamless integration of omics information. Recently, a new framework to embed GEMs in a neural network has been introduced: these hybrid models combine the strengths of mechanistic and data-driven approaches, offering a promising platform for integrating different data sources with mechanistic knowledge. In this study, we present a Metabolic-Informed Neural Network (MINN) that utilizes multi-omics data to predict metabolic fluxes in Escherichia coli, under different growth rates and gene knockouts. We test its performances against pure ML and parsimonious Flux Balance Analysis (pFBA), demonstrating its efficacy in improving prediction performances. We also highlight how conflicts can emerge between the data-driven and the mechanistic objectives, and we propose different solutions to mitigate them. Finally, we illustrate a strategy to couple the MINN with pFBA, enhancing the interpretability of the solution.
ISSN:2001-0370