Classification of Mass Spectral Data to Assist in the Identification of Novel Synthetic Cannabinoids

Detection and characterization of newly synthesized cannabinoids (NSCs) is challenging due to the lack of availability of reference standards and chemical data. In this study, a binary classification system was developed and validated using partial least square discriminant analysis (PLS-DA) by util...

Full description

Saved in:
Bibliographic Details
Main Authors: Kristopher C. Evans-Newman, Garion L. Schneider, Nuwan T. Perera
Format: Article
Language:English
Published: MDPI AG 2024-09-01
Series:Molecules
Subjects:
Online Access:https://www.mdpi.com/1420-3049/29/19/4646
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850184955638841344
author Kristopher C. Evans-Newman
Garion L. Schneider
Nuwan T. Perera
author_facet Kristopher C. Evans-Newman
Garion L. Schneider
Nuwan T. Perera
author_sort Kristopher C. Evans-Newman
collection DOAJ
description Detection and characterization of newly synthesized cannabinoids (NSCs) is challenging due to the lack of availability of reference standards and chemical data. In this study, a binary classification system was developed and validated using partial least square discriminant analysis (PLS-DA) by utilizing readily available mass spectral data of known drugs to assist in the identification of previously unknown NCSs. First, a binary classification model was developed to discriminate cannabinoids and cannabinoid-related compounds from other drug classes. Then, a classification model was developed to discriminate classical (THC-related) from synthetic cannabinoids. Additional models were developed based on the most abundant functional groups including core groups such as indole, indazole, azaindole, and naphthoylpyrrole, as well as head and tail groups including 4-fluorobenzyl (FUB) and 5-Fluoropentyl (5-F). The predictive ability of these models was tested via both cross-validation and external validation. The results show that all models developed are highly accurate. Additionally, latent variables (LVs) of each model provide useful mass to charge (<i>m</i>/<i>z</i>) for discrimination between classes, which further facilitates the identification of different functional groups of previously unknown drug molecules.
format Article
id doaj-art-c5b5b05863a04dbabca22c5342be9169
institution OA Journals
issn 1420-3049
language English
publishDate 2024-09-01
publisher MDPI AG
record_format Article
series Molecules
spelling doaj-art-c5b5b05863a04dbabca22c5342be91692025-08-20T02:16:54ZengMDPI AGMolecules1420-30492024-09-012919464610.3390/molecules29194646Classification of Mass Spectral Data to Assist in the Identification of Novel Synthetic CannabinoidsKristopher C. Evans-Newman0Garion L. Schneider1Nuwan T. Perera2Department of Chemistry and Physics, Western Carolina University, Cullowhee, NC 28723, USADepartment of Chemistry and Physics, Western Carolina University, Cullowhee, NC 28723, USADepartment of Chemistry and Physics, Western Carolina University, Cullowhee, NC 28723, USADetection and characterization of newly synthesized cannabinoids (NSCs) is challenging due to the lack of availability of reference standards and chemical data. In this study, a binary classification system was developed and validated using partial least square discriminant analysis (PLS-DA) by utilizing readily available mass spectral data of known drugs to assist in the identification of previously unknown NCSs. First, a binary classification model was developed to discriminate cannabinoids and cannabinoid-related compounds from other drug classes. Then, a classification model was developed to discriminate classical (THC-related) from synthetic cannabinoids. Additional models were developed based on the most abundant functional groups including core groups such as indole, indazole, azaindole, and naphthoylpyrrole, as well as head and tail groups including 4-fluorobenzyl (FUB) and 5-Fluoropentyl (5-F). The predictive ability of these models was tested via both cross-validation and external validation. The results show that all models developed are highly accurate. Additionally, latent variables (LVs) of each model provide useful mass to charge (<i>m</i>/<i>z</i>) for discrimination between classes, which further facilitates the identification of different functional groups of previously unknown drug molecules.https://www.mdpi.com/1420-3049/29/19/4646novel synthetic cannabinoids (NSCs)binary classification systemmass spectral datapartial least square discriminant analysis (PLS-DA)
spellingShingle Kristopher C. Evans-Newman
Garion L. Schneider
Nuwan T. Perera
Classification of Mass Spectral Data to Assist in the Identification of Novel Synthetic Cannabinoids
Molecules
novel synthetic cannabinoids (NSCs)
binary classification system
mass spectral data
partial least square discriminant analysis (PLS-DA)
title Classification of Mass Spectral Data to Assist in the Identification of Novel Synthetic Cannabinoids
title_full Classification of Mass Spectral Data to Assist in the Identification of Novel Synthetic Cannabinoids
title_fullStr Classification of Mass Spectral Data to Assist in the Identification of Novel Synthetic Cannabinoids
title_full_unstemmed Classification of Mass Spectral Data to Assist in the Identification of Novel Synthetic Cannabinoids
title_short Classification of Mass Spectral Data to Assist in the Identification of Novel Synthetic Cannabinoids
title_sort classification of mass spectral data to assist in the identification of novel synthetic cannabinoids
topic novel synthetic cannabinoids (NSCs)
binary classification system
mass spectral data
partial least square discriminant analysis (PLS-DA)
url https://www.mdpi.com/1420-3049/29/19/4646
work_keys_str_mv AT kristophercevansnewman classificationofmassspectraldatatoassistintheidentificationofnovelsyntheticcannabinoids
AT garionlschneider classificationofmassspectraldatatoassistintheidentificationofnovelsyntheticcannabinoids
AT nuwantperera classificationofmassspectraldatatoassistintheidentificationofnovelsyntheticcannabinoids