Pembentukan Model Pohon Keputusan pada Database Car Evaluation Menggunakan Statistik Chi-Square
The study discusses problems related to the formation of a decision tree based on a collection of evaluation data records obtained from a number of car buyers. This secondary data was obtained from the UCL machine learning website. The purpose of this research is to produce a prototype algorithm for...
Saved in:
| Main Author: | |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Universitas Airlangga
2022-03-01
|
| Series: | Contemporary Mathematics and Applications (ConMathA) |
| Subjects: | |
| Online Access: | https://e-journal.unair.ac.id/CONMATHA/article/view/34393 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850169493550006272 |
|---|---|
| author | Retno Maharesi |
| author_facet | Retno Maharesi |
| author_sort | Retno Maharesi |
| collection | DOAJ |
| description | The study discusses problems related to the formation of a decision tree based on a collection of evaluation data records obtained from a number of car buyers. This secondary data was obtained from the UCL machine learning website. The purpose of this research is to produce a prototype algorithm for obtaining an inductive decision tree based on Chi-square statistics. An inductive decision tree formation method based on the Chi-square contingency test was compared with a decision tree obtained using a machine learning algorithm which was done using RapidMiner5 software. The work to produce an inductive decision tree was carried out by first processing data using Microsoft excel and next processed using SPSS software, on the crosstabs descriptive menu. The results of the two methods provide some kind of similar rules, in terms of the order of priority of the variables that most influencing people's decision to accept an automotive product. The formation of the decision tree uses a random sampling of size 300 data records among 1729 respondent data records in the car evaluation database. The resulting decision tree should have a minimal structure like a binary tree. This is possible because its formation is based on the statistical inferential method, so it does not require a separate pruning process as an addition step in the C4.5 algorithm, which actually this algorithm also considers aspects of the statistical significance. |
| format | Article |
| id | doaj-art-0e9f9fda7ab443bb8a93f415709b3aed |
| institution | OA Journals |
| issn | 2686-5564 |
| language | English |
| publishDate | 2022-03-01 |
| publisher | Universitas Airlangga |
| record_format | Article |
| series | Contemporary Mathematics and Applications (ConMathA) |
| spelling | doaj-art-0e9f9fda7ab443bb8a93f415709b3aed2025-08-20T02:20:42ZengUniversitas AirlanggaContemporary Mathematics and Applications (ConMathA)2686-55642022-03-0141517110.20473/conmatha.v4i1.3439328223Pembentukan Model Pohon Keputusan pada Database Car Evaluation Menggunakan Statistik Chi-SquareRetno Maharesi0Universitas GunadarmaThe study discusses problems related to the formation of a decision tree based on a collection of evaluation data records obtained from a number of car buyers. This secondary data was obtained from the UCL machine learning website. The purpose of this research is to produce a prototype algorithm for obtaining an inductive decision tree based on Chi-square statistics. An inductive decision tree formation method based on the Chi-square contingency test was compared with a decision tree obtained using a machine learning algorithm which was done using RapidMiner5 software. The work to produce an inductive decision tree was carried out by first processing data using Microsoft excel and next processed using SPSS software, on the crosstabs descriptive menu. The results of the two methods provide some kind of similar rules, in terms of the order of priority of the variables that most influencing people's decision to accept an automotive product. The formation of the decision tree uses a random sampling of size 300 data records among 1729 respondent data records in the car evaluation database. The resulting decision tree should have a minimal structure like a binary tree. This is possible because its formation is based on the statistical inferential method, so it does not require a separate pruning process as an addition step in the C4.5 algorithm, which actually this algorithm also considers aspects of the statistical significance.https://e-journal.unair.ac.id/CONMATHA/article/view/34393c4.5 algorithmchi-square statisticsdata miningentropyinductive decision tree |
| spellingShingle | Retno Maharesi Pembentukan Model Pohon Keputusan pada Database Car Evaluation Menggunakan Statistik Chi-Square Contemporary Mathematics and Applications (ConMathA) c4.5 algorithm chi-square statistics data mining entropy inductive decision tree |
| title | Pembentukan Model Pohon Keputusan pada Database Car Evaluation Menggunakan Statistik Chi-Square |
| title_full | Pembentukan Model Pohon Keputusan pada Database Car Evaluation Menggunakan Statistik Chi-Square |
| title_fullStr | Pembentukan Model Pohon Keputusan pada Database Car Evaluation Menggunakan Statistik Chi-Square |
| title_full_unstemmed | Pembentukan Model Pohon Keputusan pada Database Car Evaluation Menggunakan Statistik Chi-Square |
| title_short | Pembentukan Model Pohon Keputusan pada Database Car Evaluation Menggunakan Statistik Chi-Square |
| title_sort | pembentukan model pohon keputusan pada database car evaluation menggunakan statistik chi square |
| topic | c4.5 algorithm chi-square statistics data mining entropy inductive decision tree |
| url | https://e-journal.unair.ac.id/CONMATHA/article/view/34393 |
| work_keys_str_mv | AT retnomaharesi pembentukanmodelpohonkeputusanpadadatabasecarevaluationmenggunakanstatistikchisquare |