Evaluating Machine Learning Algorithms for Financial Fraud Detection: Insights from Indonesia

The study utilized Multiple Linear Regression along with advanced classification algorithms such as Logistic Regression, K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Decision Tree, and Random Forest, to detect financial statement fraud. Model performance was evaluated using key metrics,...

Full description

Saved in:
Bibliographic Details
Main Authors: Cheng-Wen Lee, Mao-Wen Fu, Chin-Chuan Wang, Muh. Irfandy Azis
Format: Article
Language:English
Published: MDPI AG 2025-02-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/13/4/600
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850231431774601216
author Cheng-Wen Lee
Mao-Wen Fu
Chin-Chuan Wang
Muh. Irfandy Azis
author_facet Cheng-Wen Lee
Mao-Wen Fu
Chin-Chuan Wang
Muh. Irfandy Azis
author_sort Cheng-Wen Lee
collection DOAJ
description The study utilized Multiple Linear Regression along with advanced classification algorithms such as Logistic Regression, K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Decision Tree, and Random Forest, to detect financial statement fraud. Model performance was evaluated using key metrics, including precision, recall, accuracy, and F1-Score. The analysis also identified significant indicators of fraud, such as Accounts Receivable Turnover, Days Outstanding Accounts Receivable, Days Payables Outstanding, Logarithm of Gross Profit, Gross Profit Margin, Inventory to Sales Ratio, and Total Asset Turnover. Among the models, Random Forest emerged as the most effective algorithm, consistently outperforming others on both training and testing datasets. Logistic Regression and SVM demonstrated strong reliability, whereas KNN and Decision Tree faced overfitting challenges, limiting their practical application. These findings emphasize the critical need for enhanced fraud detection frameworks, leveraging machine learning algorithms like Random Forest to identify fraud patterns effectively. The study highlights the importance of strengthening internal controls, implementing targeted fraud detection measures, and promoting regulatory improvements to enhance transparency and financial accountability.
format Article
id doaj-art-b45502530863452bbdb7bb7970ffbe06
institution OA Journals
issn 2227-7390
language English
publishDate 2025-02-01
publisher MDPI AG
record_format Article
series Mathematics
spelling doaj-art-b45502530863452bbdb7bb7970ffbe062025-08-20T02:03:31ZengMDPI AGMathematics2227-73902025-02-0113460010.3390/math13040600Evaluating Machine Learning Algorithms for Financial Fraud Detection: Insights from IndonesiaCheng-Wen Lee0Mao-Wen Fu1Chin-Chuan Wang2Muh. Irfandy Azis3Department of International Business, Chung Yuan Christian University, Taoyuan 320314, TaiwanPh.D. Program in Business, College of Business, Chung Yuan Christian University, Taoyuan 320314, TaiwanPh.D. Program in Business, College of Business, Chung Yuan Christian University, Taoyuan 320314, TaiwanDepartment of Accounting, Universitas Borneo Tarakan, Tarakan 77123, IndonesiaThe study utilized Multiple Linear Regression along with advanced classification algorithms such as Logistic Regression, K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Decision Tree, and Random Forest, to detect financial statement fraud. Model performance was evaluated using key metrics, including precision, recall, accuracy, and F1-Score. The analysis also identified significant indicators of fraud, such as Accounts Receivable Turnover, Days Outstanding Accounts Receivable, Days Payables Outstanding, Logarithm of Gross Profit, Gross Profit Margin, Inventory to Sales Ratio, and Total Asset Turnover. Among the models, Random Forest emerged as the most effective algorithm, consistently outperforming others on both training and testing datasets. Logistic Regression and SVM demonstrated strong reliability, whereas KNN and Decision Tree faced overfitting challenges, limiting their practical application. These findings emphasize the critical need for enhanced fraud detection frameworks, leveraging machine learning algorithms like Random Forest to identify fraud patterns effectively. The study highlights the importance of strengthening internal controls, implementing targeted fraud detection measures, and promoting regulatory improvements to enhance transparency and financial accountability.https://www.mdpi.com/2227-7390/13/4/600financial statementfraud detectionregressionclassification algorithms
spellingShingle Cheng-Wen Lee
Mao-Wen Fu
Chin-Chuan Wang
Muh. Irfandy Azis
Evaluating Machine Learning Algorithms for Financial Fraud Detection: Insights from Indonesia
Mathematics
financial statement
fraud detection
regression
classification algorithms
title Evaluating Machine Learning Algorithms for Financial Fraud Detection: Insights from Indonesia
title_full Evaluating Machine Learning Algorithms for Financial Fraud Detection: Insights from Indonesia
title_fullStr Evaluating Machine Learning Algorithms for Financial Fraud Detection: Insights from Indonesia
title_full_unstemmed Evaluating Machine Learning Algorithms for Financial Fraud Detection: Insights from Indonesia
title_short Evaluating Machine Learning Algorithms for Financial Fraud Detection: Insights from Indonesia
title_sort evaluating machine learning algorithms for financial fraud detection insights from indonesia
topic financial statement
fraud detection
regression
classification algorithms
url https://www.mdpi.com/2227-7390/13/4/600
work_keys_str_mv AT chengwenlee evaluatingmachinelearningalgorithmsforfinancialfrauddetectioninsightsfromindonesia
AT maowenfu evaluatingmachinelearningalgorithmsforfinancialfrauddetectioninsightsfromindonesia
AT chinchuanwang evaluatingmachinelearningalgorithmsforfinancialfrauddetectioninsightsfromindonesia
AT muhirfandyazis evaluatingmachinelearningalgorithmsforfinancialfrauddetectioninsightsfromindonesia