A method for explaining individual predictions in neural networks

Background Recently, the explainability of the prediction results of machine learning models has attracted attention. Most high-performance prediction models are black boxes that cannot be explained. Artificial neural networks are also considered black box models. Although they can explain image cla...

Full description

Saved in:

Bibliographic Details
Main Author:	Sejong Oh
Format:	Article
Language:	English
Published:	PeerJ Inc. 2025-04-01
Series:	PeerJ Computer Science
Subjects:	Artificial neural network XAI Individual prediction Explanation Weights
Online Access:	https://peerj.com/articles/cs-2802.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Background Recently, the explainability of the prediction results of machine learning models has attracted attention. Most high-performance prediction models are black boxes that cannot be explained. Artificial neural networks are also considered black box models. Although they can explain image classification results to some extent, they still struggle to explain the classification and regression results for tabular data. In this study, we explain the individual prediction results derived from a neural network-based prediction model. Methods The output of a neural network is fundamentally determined by multiplying the input values by the network weights. In other words, the output is a weighted sum of the input values. The weights control how much each input value contributes to the output. The degree of influence of an input value xi on the output can be evaluated as (xi · weight value wi)/weighted sum. From this insight, we can calculate the contribution of each input value to the output as it flows through the neural network. Results With the proposed method, the neural network is no longer a black box. The proposed method effectively explains the predictions made by the neural network and is independent of the depth of the hidden layers and the number of nodes in each hidden layer. This provides a clear rationale for this interpretation. It can be applied to both regression and classification models. The proposed method is implemented as a Python library, making it easy to use.
ISSN:	2376-5992

A method for explaining individual predictions in neural networks

Similar Items