An Unsupervised Learning Approach for Coal Spontaneous Combustion Warning Level Classification Using t-SNE and k-Means Clustering

Accurate prediction of coal spontaneous combustion levels is crucial for preventing and controlling spontaneous combustion in goaf areas. To address the ambiguity in classification standards of coal spontaneous combustion warning levels, 21 groups of coal samples from different mining areas were sub...

Full description

Saved in:
Bibliographic Details
Main Authors: Pengyu Zhang, Xiaokun Chen
Format: Article
Language:English
Published: MDPI AG 2025-03-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/7/3756
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849738478716190720
author Pengyu Zhang
Xiaokun Chen
author_facet Pengyu Zhang
Xiaokun Chen
author_sort Pengyu Zhang
collection DOAJ
description Accurate prediction of coal spontaneous combustion levels is crucial for preventing and controlling spontaneous combustion in goaf areas. To address the ambiguity in classification standards of coal spontaneous combustion warning levels, 21 groups of coal samples from different mining areas were subjected to experiments with programmed temperatures, generating a database of 336 sets of temperatures and data on indicator gas concentrations. An unsupervised learning approach combining t-distributed Stochastic Neighbor Embedding (t-SNE) and k-means clustering was proposed to perform dimensionality reduction and clustering of high-dimensional data features. The clustering results of the original data were compared with Principal Component Analysis (PCA) and Stochastic Neighbor Embedding (SNE) methods to determine coal spontaneous combustion warning levels. The indicator gases and warning levels were input into a trained Support Vector Classifier (SVC) to establish a classification model for coal spontaneous combustion warning levels in goaf areas. The results showed that the maximum Maximal Information Coefficients (MICs) between temperature and CO and O<sub>2</sub> concentrations were 0.95 and 0.81, respectively, indicating strong nonlinear relationships between indicator gases and warning levels. The t-SNE method effectively extracted nonlinear mapping relationships between the indicator gas features, while the k-means clustering categorized coal spontaneous combustion data using distance as a similarity measure. By combining the t-SNE and k-means methods for accurate dimensionality reduction and clustering of goaf spontaneous combustion data, the warning levels were classified into six categories: safe, low risk, moderate risk, high risk, severe risk, and extremely severe risk. The application in the Longgu mine demonstrated that the SVC method could accurately classify spontaneous combustion warning levels in field goaf areas and implement corresponding response measures based on different warning levels, providing a valuable reference for spontaneous combustion prevention in goaf areas.
format Article
id doaj-art-1b21dfef45a84739a0cc7f7217afd5e5
institution DOAJ
issn 2076-3417
language English
publishDate 2025-03-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj-art-1b21dfef45a84739a0cc7f7217afd5e52025-08-20T03:06:32ZengMDPI AGApplied Sciences2076-34172025-03-01157375610.3390/app15073756An Unsupervised Learning Approach for Coal Spontaneous Combustion Warning Level Classification Using t-SNE and k-Means ClusteringPengyu Zhang0Xiaokun Chen1School of Safety Science and Engineering, Xi’an University of Science and Technology, 58, Yanta Mid. Rd., Xi’an 710054, ChinaSchool of Safety Science and Engineering, Xi’an University of Science and Technology, 58, Yanta Mid. Rd., Xi’an 710054, ChinaAccurate prediction of coal spontaneous combustion levels is crucial for preventing and controlling spontaneous combustion in goaf areas. To address the ambiguity in classification standards of coal spontaneous combustion warning levels, 21 groups of coal samples from different mining areas were subjected to experiments with programmed temperatures, generating a database of 336 sets of temperatures and data on indicator gas concentrations. An unsupervised learning approach combining t-distributed Stochastic Neighbor Embedding (t-SNE) and k-means clustering was proposed to perform dimensionality reduction and clustering of high-dimensional data features. The clustering results of the original data were compared with Principal Component Analysis (PCA) and Stochastic Neighbor Embedding (SNE) methods to determine coal spontaneous combustion warning levels. The indicator gases and warning levels were input into a trained Support Vector Classifier (SVC) to establish a classification model for coal spontaneous combustion warning levels in goaf areas. The results showed that the maximum Maximal Information Coefficients (MICs) between temperature and CO and O<sub>2</sub> concentrations were 0.95 and 0.81, respectively, indicating strong nonlinear relationships between indicator gases and warning levels. The t-SNE method effectively extracted nonlinear mapping relationships between the indicator gas features, while the k-means clustering categorized coal spontaneous combustion data using distance as a similarity measure. By combining the t-SNE and k-means methods for accurate dimensionality reduction and clustering of goaf spontaneous combustion data, the warning levels were classified into six categories: safe, low risk, moderate risk, high risk, severe risk, and extremely severe risk. The application in the Longgu mine demonstrated that the SVC method could accurately classify spontaneous combustion warning levels in field goaf areas and implement corresponding response measures based on different warning levels, providing a valuable reference for spontaneous combustion prevention in goaf areas.https://www.mdpi.com/2076-3417/15/7/3756coal spontaneous combustionwarning levelunsupervised learningsupport vector classifierclassification model
spellingShingle Pengyu Zhang
Xiaokun Chen
An Unsupervised Learning Approach for Coal Spontaneous Combustion Warning Level Classification Using t-SNE and k-Means Clustering
Applied Sciences
coal spontaneous combustion
warning level
unsupervised learning
support vector classifier
classification model
title An Unsupervised Learning Approach for Coal Spontaneous Combustion Warning Level Classification Using t-SNE and k-Means Clustering
title_full An Unsupervised Learning Approach for Coal Spontaneous Combustion Warning Level Classification Using t-SNE and k-Means Clustering
title_fullStr An Unsupervised Learning Approach for Coal Spontaneous Combustion Warning Level Classification Using t-SNE and k-Means Clustering
title_full_unstemmed An Unsupervised Learning Approach for Coal Spontaneous Combustion Warning Level Classification Using t-SNE and k-Means Clustering
title_short An Unsupervised Learning Approach for Coal Spontaneous Combustion Warning Level Classification Using t-SNE and k-Means Clustering
title_sort unsupervised learning approach for coal spontaneous combustion warning level classification using t sne and k means clustering
topic coal spontaneous combustion
warning level
unsupervised learning
support vector classifier
classification model
url https://www.mdpi.com/2076-3417/15/7/3756
work_keys_str_mv AT pengyuzhang anunsupervisedlearningapproachforcoalspontaneouscombustionwarninglevelclassificationusingtsneandkmeansclustering
AT xiaokunchen anunsupervisedlearningapproachforcoalspontaneouscombustionwarninglevelclassificationusingtsneandkmeansclustering
AT pengyuzhang unsupervisedlearningapproachforcoalspontaneouscombustionwarninglevelclassificationusingtsneandkmeansclustering
AT xiaokunchen unsupervisedlearningapproachforcoalspontaneouscombustionwarninglevelclassificationusingtsneandkmeansclustering