MINN: A metabolic-informed neural network for integrating omics data into genome-scale metabolic modeling

The understanding of cellular behavior relies on the integration of metabolism and its regulation. Multi-omics data provide a detailed snapshot of the molecular processes underpinning cellular functions and their regulation, describing the current state of the cell. While Machine Learning (ML) model...

Full description

Saved in:
Bibliographic Details
Main Authors: Gabriele Tazza, Francesco Moro, Dario Ruggeri, Bas Teusink, László Vidács
Format: Article
Language:English
Published: Elsevier 2025-01-01
Series:Computational and Structural Biotechnology Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2001037025003265
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849389654984359936
author Gabriele Tazza
Francesco Moro
Dario Ruggeri
Bas Teusink
László Vidács
author_facet Gabriele Tazza
Francesco Moro
Dario Ruggeri
Bas Teusink
László Vidács
author_sort Gabriele Tazza
collection DOAJ
description The understanding of cellular behavior relies on the integration of metabolism and its regulation. Multi-omics data provide a detailed snapshot of the molecular processes underpinning cellular functions and their regulation, describing the current state of the cell. While Machine Learning (ML) models can uncover complex patterns and relationships within these data, they require large datasets for training and often lack interpretability. On the other hand, mathematical models, such as Genome-Scale Metabolic Models (GEMs), offer a structured framework for analyzing the organization and dynamics of specific cellular mechanisms. At the same time, they don't allow for seamless integration of omics information. Recently, a new framework to embed GEMs in a neural network has been introduced: these hybrid models combine the strengths of mechanistic and data-driven approaches, offering a promising platform for integrating different data sources with mechanistic knowledge. In this study, we present a Metabolic-Informed Neural Network (MINN) that utilizes multi-omics data to predict metabolic fluxes in Escherichia coli, under different growth rates and gene knockouts. We test its performances against pure ML and parsimonious Flux Balance Analysis (pFBA), demonstrating its efficacy in improving prediction performances. We also highlight how conflicts can emerge between the data-driven and the mechanistic objectives, and we propose different solutions to mitigate them. Finally, we illustrate a strategy to couple the MINN with pFBA, enhancing the interpretability of the solution.
format Article
id doaj-art-925ec5155eee44e294fabe440151099a
institution Kabale University
issn 2001-0370
language English
publishDate 2025-01-01
publisher Elsevier
record_format Article
series Computational and Structural Biotechnology Journal
spelling doaj-art-925ec5155eee44e294fabe440151099a2025-08-20T03:41:54ZengElsevierComputational and Structural Biotechnology Journal2001-03702025-01-01273609361710.1016/j.csbj.2025.08.004MINN: A metabolic-informed neural network for integrating omics data into genome-scale metabolic modelingGabriele Tazza0Francesco Moro1Dario Ruggeri2Bas Teusink3László Vidács4Department of Software Engineering, University of Szeged, Szeged, Hungary; Corresponding author.Systems Biology Lab, AIMMS/ALIFE, Vrije Universiteit Amsterdam, Amsterdam, the NetherlandsDepartment of Software Engineering, University of Szeged, Szeged, HungarySystems Biology Lab, AIMMS/ALIFE, Vrije Universiteit Amsterdam, Amsterdam, the NetherlandsDepartment of Software Engineering, University of Szeged, Szeged, HungaryThe understanding of cellular behavior relies on the integration of metabolism and its regulation. Multi-omics data provide a detailed snapshot of the molecular processes underpinning cellular functions and their regulation, describing the current state of the cell. While Machine Learning (ML) models can uncover complex patterns and relationships within these data, they require large datasets for training and often lack interpretability. On the other hand, mathematical models, such as Genome-Scale Metabolic Models (GEMs), offer a structured framework for analyzing the organization and dynamics of specific cellular mechanisms. At the same time, they don't allow for seamless integration of omics information. Recently, a new framework to embed GEMs in a neural network has been introduced: these hybrid models combine the strengths of mechanistic and data-driven approaches, offering a promising platform for integrating different data sources with mechanistic knowledge. In this study, we present a Metabolic-Informed Neural Network (MINN) that utilizes multi-omics data to predict metabolic fluxes in Escherichia coli, under different growth rates and gene knockouts. We test its performances against pure ML and parsimonious Flux Balance Analysis (pFBA), demonstrating its efficacy in improving prediction performances. We also highlight how conflicts can emerge between the data-driven and the mechanistic objectives, and we propose different solutions to mitigate them. Finally, we illustrate a strategy to couple the MINN with pFBA, enhancing the interpretability of the solution.http://www.sciencedirect.com/science/article/pii/S2001037025003265Hybrid modelMachine learningFlux balance analysisMulti-omicsGenome-scale metabolic modelingNeural-networks
spellingShingle Gabriele Tazza
Francesco Moro
Dario Ruggeri
Bas Teusink
László Vidács
MINN: A metabolic-informed neural network for integrating omics data into genome-scale metabolic modeling
Computational and Structural Biotechnology Journal
Hybrid model
Machine learning
Flux balance analysis
Multi-omics
Genome-scale metabolic modeling
Neural-networks
title MINN: A metabolic-informed neural network for integrating omics data into genome-scale metabolic modeling
title_full MINN: A metabolic-informed neural network for integrating omics data into genome-scale metabolic modeling
title_fullStr MINN: A metabolic-informed neural network for integrating omics data into genome-scale metabolic modeling
title_full_unstemmed MINN: A metabolic-informed neural network for integrating omics data into genome-scale metabolic modeling
title_short MINN: A metabolic-informed neural network for integrating omics data into genome-scale metabolic modeling
title_sort minn a metabolic informed neural network for integrating omics data into genome scale metabolic modeling
topic Hybrid model
Machine learning
Flux balance analysis
Multi-omics
Genome-scale metabolic modeling
Neural-networks
url http://www.sciencedirect.com/science/article/pii/S2001037025003265
work_keys_str_mv AT gabrieletazza minnametabolicinformedneuralnetworkforintegratingomicsdataintogenomescalemetabolicmodeling
AT francescomoro minnametabolicinformedneuralnetworkforintegratingomicsdataintogenomescalemetabolicmodeling
AT darioruggeri minnametabolicinformedneuralnetworkforintegratingomicsdataintogenomescalemetabolicmodeling
AT basteusink minnametabolicinformedneuralnetworkforintegratingomicsdataintogenomescalemetabolicmodeling
AT laszlovidacs minnametabolicinformedneuralnetworkforintegratingomicsdataintogenomescalemetabolicmodeling