A computational approach for prediction of viscosity of chemical compounds based on molecular structures
The research paper explores the feasibility of predicting the viscosity of a diverse chemical compound by using molecular structures at 25 °C through supervised machine learning methods. In this paper, Random Forest, Gradient Boosting and CatBoost supervised algorithms were implemented. The dataset...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2025-01-01
|
Series: | Results in Chemistry |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2211715625000220 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832583065703022592 |
---|---|
author | Sneha Das Ram Kishore Roy Tulshi Bezboruah |
author_facet | Sneha Das Ram Kishore Roy Tulshi Bezboruah |
author_sort | Sneha Das |
collection | DOAJ |
description | The research paper explores the feasibility of predicting the viscosity of a diverse chemical compound by using molecular structures at 25 °C through supervised machine learning methods. In this paper, Random Forest, Gradient Boosting and CatBoost supervised algorithms were implemented. The dataset consists of the Simplified Molecular Input Line Entry System (SMILES) notation of 320 chemical compounds and their corresponding viscosities at 25 °C. The study generated ten features from the compounds to correlate and predict the viscosity values. The results suggest that Catboost algorithm (R2 = 0.94 and MSE = 0.64) performs better than Gradient Boosting (R2 = 0.90 and MSE = 1.05) and Random Forest algorithm (R2 = 0.86 and MSE = 1.39), while measuring the viscosity. Although the Random Forest model lags behind, the overall results may be useful for predicting viscosity with accuracy. The findings of the research work also reveal which feature contributes the most to a given model. The results of the proposed model can be used in several industries including food and beverage, cosmetics, medicine, pharmaceuticals, etc. |
format | Article |
id | doaj-art-1060cb7500694f389de80f4947b76830 |
institution | Kabale University |
issn | 2211-7156 |
language | English |
publishDate | 2025-01-01 |
publisher | Elsevier |
record_format | Article |
series | Results in Chemistry |
spelling | doaj-art-1060cb7500694f389de80f4947b768302025-01-29T05:01:05ZengElsevierResults in Chemistry2211-71562025-01-0113102039A computational approach for prediction of viscosity of chemical compounds based on molecular structuresSneha Das0Ram Kishore Roy1Tulshi Bezboruah2Corresponding author.; Department of Electronics and Communication Technology, Gauhati University, Guwahati 781014, Assam, IndiaDepartment of Electronics and Communication Technology, Gauhati University, Guwahati 781014, Assam, IndiaDepartment of Electronics and Communication Technology, Gauhati University, Guwahati 781014, Assam, IndiaThe research paper explores the feasibility of predicting the viscosity of a diverse chemical compound by using molecular structures at 25 °C through supervised machine learning methods. In this paper, Random Forest, Gradient Boosting and CatBoost supervised algorithms were implemented. The dataset consists of the Simplified Molecular Input Line Entry System (SMILES) notation of 320 chemical compounds and their corresponding viscosities at 25 °C. The study generated ten features from the compounds to correlate and predict the viscosity values. The results suggest that Catboost algorithm (R2 = 0.94 and MSE = 0.64) performs better than Gradient Boosting (R2 = 0.90 and MSE = 1.05) and Random Forest algorithm (R2 = 0.86 and MSE = 1.39), while measuring the viscosity. Although the Random Forest model lags behind, the overall results may be useful for predicting viscosity with accuracy. The findings of the research work also reveal which feature contributes the most to a given model. The results of the proposed model can be used in several industries including food and beverage, cosmetics, medicine, pharmaceuticals, etc.http://www.sciencedirect.com/science/article/pii/S2211715625000220Viscosity measurementMachine learningRandom forestGradient boostingCatboost |
spellingShingle | Sneha Das Ram Kishore Roy Tulshi Bezboruah A computational approach for prediction of viscosity of chemical compounds based on molecular structures Results in Chemistry Viscosity measurement Machine learning Random forest Gradient boosting Catboost |
title | A computational approach for prediction of viscosity of chemical compounds based on molecular structures |
title_full | A computational approach for prediction of viscosity of chemical compounds based on molecular structures |
title_fullStr | A computational approach for prediction of viscosity of chemical compounds based on molecular structures |
title_full_unstemmed | A computational approach for prediction of viscosity of chemical compounds based on molecular structures |
title_short | A computational approach for prediction of viscosity of chemical compounds based on molecular structures |
title_sort | computational approach for prediction of viscosity of chemical compounds based on molecular structures |
topic | Viscosity measurement Machine learning Random forest Gradient boosting Catboost |
url | http://www.sciencedirect.com/science/article/pii/S2211715625000220 |
work_keys_str_mv | AT snehadas acomputationalapproachforpredictionofviscosityofchemicalcompoundsbasedonmolecularstructures AT ramkishoreroy acomputationalapproachforpredictionofviscosityofchemicalcompoundsbasedonmolecularstructures AT tulshibezboruah acomputationalapproachforpredictionofviscosityofchemicalcompoundsbasedonmolecularstructures AT snehadas computationalapproachforpredictionofviscosityofchemicalcompoundsbasedonmolecularstructures AT ramkishoreroy computationalapproachforpredictionofviscosityofchemicalcompoundsbasedonmolecularstructures AT tulshibezboruah computationalapproachforpredictionofviscosityofchemicalcompoundsbasedonmolecularstructures |