A computational approach for prediction of viscosity of chemical compounds based on molecular structures

The research paper explores the feasibility of predicting the viscosity of a diverse chemical compound by using molecular structures at 25 °C through supervised machine learning methods. In this paper, Random Forest, Gradient Boosting and CatBoost supervised algorithms were implemented. The dataset...

Full description

Saved in:
Bibliographic Details
Main Authors: Sneha Das, Ram Kishore Roy, Tulshi Bezboruah
Format: Article
Language:English
Published: Elsevier 2025-01-01
Series:Results in Chemistry
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2211715625000220
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832583065703022592
author Sneha Das
Ram Kishore Roy
Tulshi Bezboruah
author_facet Sneha Das
Ram Kishore Roy
Tulshi Bezboruah
author_sort Sneha Das
collection DOAJ
description The research paper explores the feasibility of predicting the viscosity of a diverse chemical compound by using molecular structures at 25 °C through supervised machine learning methods. In this paper, Random Forest, Gradient Boosting and CatBoost supervised algorithms were implemented. The dataset consists of the Simplified Molecular Input Line Entry System (SMILES) notation of 320 chemical compounds and their corresponding viscosities at 25 °C. The study generated ten features from the compounds to correlate and predict the viscosity values. The results suggest that Catboost algorithm (R2 = 0.94 and MSE = 0.64) performs better than Gradient Boosting (R2 = 0.90 and MSE = 1.05) and Random Forest algorithm (R2 = 0.86 and MSE = 1.39), while measuring the viscosity. Although the Random Forest model lags behind, the overall results may be useful for predicting viscosity with accuracy. The findings of the research work also reveal which feature contributes the most to a given model. The results of the proposed model can be used in several industries including food and beverage, cosmetics, medicine, pharmaceuticals, etc.
format Article
id doaj-art-1060cb7500694f389de80f4947b76830
institution Kabale University
issn 2211-7156
language English
publishDate 2025-01-01
publisher Elsevier
record_format Article
series Results in Chemistry
spelling doaj-art-1060cb7500694f389de80f4947b768302025-01-29T05:01:05ZengElsevierResults in Chemistry2211-71562025-01-0113102039A computational approach for prediction of viscosity of chemical compounds based on molecular structuresSneha Das0Ram Kishore Roy1Tulshi Bezboruah2Corresponding author.; Department of Electronics and Communication Technology, Gauhati University, Guwahati 781014, Assam, IndiaDepartment of Electronics and Communication Technology, Gauhati University, Guwahati 781014, Assam, IndiaDepartment of Electronics and Communication Technology, Gauhati University, Guwahati 781014, Assam, IndiaThe research paper explores the feasibility of predicting the viscosity of a diverse chemical compound by using molecular structures at 25 °C through supervised machine learning methods. In this paper, Random Forest, Gradient Boosting and CatBoost supervised algorithms were implemented. The dataset consists of the Simplified Molecular Input Line Entry System (SMILES) notation of 320 chemical compounds and their corresponding viscosities at 25 °C. The study generated ten features from the compounds to correlate and predict the viscosity values. The results suggest that Catboost algorithm (R2 = 0.94 and MSE = 0.64) performs better than Gradient Boosting (R2 = 0.90 and MSE = 1.05) and Random Forest algorithm (R2 = 0.86 and MSE = 1.39), while measuring the viscosity. Although the Random Forest model lags behind, the overall results may be useful for predicting viscosity with accuracy. The findings of the research work also reveal which feature contributes the most to a given model. The results of the proposed model can be used in several industries including food and beverage, cosmetics, medicine, pharmaceuticals, etc.http://www.sciencedirect.com/science/article/pii/S2211715625000220Viscosity measurementMachine learningRandom forestGradient boostingCatboost
spellingShingle Sneha Das
Ram Kishore Roy
Tulshi Bezboruah
A computational approach for prediction of viscosity of chemical compounds based on molecular structures
Results in Chemistry
Viscosity measurement
Machine learning
Random forest
Gradient boosting
Catboost
title A computational approach for prediction of viscosity of chemical compounds based on molecular structures
title_full A computational approach for prediction of viscosity of chemical compounds based on molecular structures
title_fullStr A computational approach for prediction of viscosity of chemical compounds based on molecular structures
title_full_unstemmed A computational approach for prediction of viscosity of chemical compounds based on molecular structures
title_short A computational approach for prediction of viscosity of chemical compounds based on molecular structures
title_sort computational approach for prediction of viscosity of chemical compounds based on molecular structures
topic Viscosity measurement
Machine learning
Random forest
Gradient boosting
Catboost
url http://www.sciencedirect.com/science/article/pii/S2211715625000220
work_keys_str_mv AT snehadas acomputationalapproachforpredictionofviscosityofchemicalcompoundsbasedonmolecularstructures
AT ramkishoreroy acomputationalapproachforpredictionofviscosityofchemicalcompoundsbasedonmolecularstructures
AT tulshibezboruah acomputationalapproachforpredictionofviscosityofchemicalcompoundsbasedonmolecularstructures
AT snehadas computationalapproachforpredictionofviscosityofchemicalcompoundsbasedonmolecularstructures
AT ramkishoreroy computationalapproachforpredictionofviscosityofchemicalcompoundsbasedonmolecularstructures
AT tulshibezboruah computationalapproachforpredictionofviscosityofchemicalcompoundsbasedonmolecularstructures