Exploring Symbolic Regression and Genetic Algorithms for Astronomical Object Classification

Bibliographic Details
Main Authors: Fabio Ricardo Llorella, José Antonio Cebrian
Format: Article
Language: English
Published: Maynooth Academic Publishing 2025-03-01
Series: The Open Journal of Astrophysics
Online Access: https://doi.org/10.33232/001c.132333
Description
Summary: This study explores the use of symbolic regression (SR) combined with genetic algorithms (GA) to classify astronomical objects. Using the SDSS17 dataset from Kaggle, which includes 100,000 observations of stars, galaxies, and quasars, we applied SR to 10% of the data to derive a mathematical expression capable of distinguishing these classes. A genetic algorithm was then employed to optimize the hyperparameters of the expression, refining the model’s performance. The final model achieved a Cohen’s kappa of 0.81, indicating strong agreement with the true classifications. Our results demonstrate that the SR+GA approach can produce interpretable and accurate models for the classification of astronomical objects, offering a promising alternative to traditional black-box machine learning methods.
ISSN: 2565-6120
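
Note on the reported metric: Cohen's kappa compares the observed agreement between predicted and true labels with the agreement expected by chance, kappa = (p_o - p_e) / (1 - p_e). The following is a minimal sketch of that calculation, not the authors' code. The function name `cohen_kappa` and the toy label lists are hypothetical and chosen only for illustration; they are not taken from the article or the dataset.

```python
from collections import Counter
import math


def cohen_kappa(y_true, y_pred):
    """Cohen's kappa for two equal-length label sequences (illustrative sketch)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")

    n = len(y_true)

    # Observed agreement: fraction of objects where both labels match.
    p_o = sum(t == p for t, p in zip(y_true, y_pred)) / n

    # Chance agreement: for each class, the product of its marginal
    # frequencies in the true and predicted labels, summed over classes.
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    p_e = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)

    # Kappa is undefined when chance agreement is already total
    # (both sequences contain one and the same class only).
    if math.isclose(p_e, 1.0):
        return float("nan")

    return (p_o - p_e) / (1.0 - p_e)


# Toy labels (hypothetical, for illustration only).
toy_true = ["GALAXY", "GALAXY", "STAR", "QSO", "STAR", "GALAXY"]
toy_pred = ["GALAXY", "GALAXY", "STAR", "QSO", "GALAXY", "GALAXY"]
toy_kappa = cohen_kappa(toy_true, toy_pred)
```

A value of 1 means perfect agreement and 0 means agreement no better than chance, which is why kappa is often preferred over plain accuracy when classes are imbalanced.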