Class-weighted Dempster–Shafer in dual-level fusion for multimodal fake real estate listings detection

Background Detecting fake multimodal property listings is a significant challenge in online real estate platforms due to the increasing sophistication of fraudulent activities. The existing multimodal data fusion methods have several limitations and strengths in identifying fraudulent listings. Sing...

Full description

Saved in:
Bibliographic Details
Main Authors: Maifuza Mohd Amin, Nor Samsiah Sani, Mohammad Faidzul Nasrudin
Format: Article
Language:English
Published: PeerJ Inc. 2025-05-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-2797.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849687295521718272
author Maifuza Mohd Amin
Nor Samsiah Sani
Mohammad Faidzul Nasrudin
author_facet Maifuza Mohd Amin
Nor Samsiah Sani
Mohammad Faidzul Nasrudin
author_sort Maifuza Mohd Amin
collection DOAJ
description Background Detecting fake multimodal property listings is a significant challenge in online real estate platforms due to the increasing sophistication of fraudulent activities. The existing multimodal data fusion methods have several limitations and strengths in identifying fraudulent listings. Single-level fusion models whether at the feature, decision, or intermediate level struggle with balancing the contributions of different modalities leading to suboptimal decision-making. To address these problems, a dual-level fusion from multimodal for fake real estate listings detection is proposed. The dual-level fusion allows the integration of detailed features from text and image data to be performed at an early stage, followed by the metadata fusion at the decision stage in order to obtain a more comprehensive final classification. Furthermore, a new weighting scheme is introduced to optimize Dempster–Shafer in decision fusion to help the model achieve optimal performance and as a result, our method improves the classification. The Dempster–Shafer without class weightage lacks the flexibility to adapt to varying levels of uncertainty or importance across different classes. Methods In Class Weighted Dempster–Shafer in Dual Level Fusion (CWDS-DLF), we employ advanced models (XLNet for text and ResNet101 for images) for feature extraction and use the Dempster–Shafer theory for decision fusion. A new weighting scheme, based on Bayesian optimization, was used to assign optimal weights to the ‘fake’ and ‘not fake’ classes, thereby enhancing the Dempster–Shafer theory in the decision fusion process. Results The CWDS-DLF was evaluated on the property listing website dataset and achieved an F1 score of 96% and an accuracy of 93%. A t-test confirms the significance of these improvements (p < 0.05), demonstrating the effectiveness of our method in detecting fake property listings. Compared to other models, including 2D-convolutional neural network (CNN), XGBoost, and various multimodal approaches, our model consistently outperforms in precision, recall, and F1-score. This underscores the potential of integrating multimodal analysis with sophisticated fusion techniques to enhance the detection of fake property listings, ultimately improving consumer protection and operational efficiency in online real estate platforms.
format Article
id doaj-art-1575cd15286a45b489fa17aa76c17863
institution DOAJ
issn 2376-5992
language English
publishDate 2025-05-01
publisher PeerJ Inc.
record_format Article
series PeerJ Computer Science
spelling doaj-art-1575cd15286a45b489fa17aa76c178632025-08-20T03:22:22ZengPeerJ Inc.PeerJ Computer Science2376-59922025-05-0111e279710.7717/peerj-cs.2797Class-weighted Dempster–Shafer in dual-level fusion for multimodal fake real estate listings detectionMaifuza Mohd AminNor Samsiah SaniMohammad Faidzul NasrudinBackground Detecting fake multimodal property listings is a significant challenge in online real estate platforms due to the increasing sophistication of fraudulent activities. The existing multimodal data fusion methods have several limitations and strengths in identifying fraudulent listings. Single-level fusion models whether at the feature, decision, or intermediate level struggle with balancing the contributions of different modalities leading to suboptimal decision-making. To address these problems, a dual-level fusion from multimodal for fake real estate listings detection is proposed. The dual-level fusion allows the integration of detailed features from text and image data to be performed at an early stage, followed by the metadata fusion at the decision stage in order to obtain a more comprehensive final classification. Furthermore, a new weighting scheme is introduced to optimize Dempster–Shafer in decision fusion to help the model achieve optimal performance and as a result, our method improves the classification. The Dempster–Shafer without class weightage lacks the flexibility to adapt to varying levels of uncertainty or importance across different classes. Methods In Class Weighted Dempster–Shafer in Dual Level Fusion (CWDS-DLF), we employ advanced models (XLNet for text and ResNet101 for images) for feature extraction and use the Dempster–Shafer theory for decision fusion. A new weighting scheme, based on Bayesian optimization, was used to assign optimal weights to the ‘fake’ and ‘not fake’ classes, thereby enhancing the Dempster–Shafer theory in the decision fusion process. Results The CWDS-DLF was evaluated on the property listing website dataset and achieved an F1 score of 96% and an accuracy of 93%. A t-test confirms the significance of these improvements (p < 0.05), demonstrating the effectiveness of our method in detecting fake property listings. Compared to other models, including 2D-convolutional neural network (CNN), XGBoost, and various multimodal approaches, our model consistently outperforms in precision, recall, and F1-score. This underscores the potential of integrating multimodal analysis with sophisticated fusion techniques to enhance the detection of fake property listings, ultimately improving consumer protection and operational efficiency in online real estate platforms.https://peerj.com/articles/cs-2797.pdfFake property listingsFraudulent activitiesUnimodal analysisMultimodal methodFeature fusionDecision fusion
spellingShingle Maifuza Mohd Amin
Nor Samsiah Sani
Mohammad Faidzul Nasrudin
Class-weighted Dempster–Shafer in dual-level fusion for multimodal fake real estate listings detection
PeerJ Computer Science
Fake property listings
Fraudulent activities
Unimodal analysis
Multimodal method
Feature fusion
Decision fusion
title Class-weighted Dempster–Shafer in dual-level fusion for multimodal fake real estate listings detection
title_full Class-weighted Dempster–Shafer in dual-level fusion for multimodal fake real estate listings detection
title_fullStr Class-weighted Dempster–Shafer in dual-level fusion for multimodal fake real estate listings detection
title_full_unstemmed Class-weighted Dempster–Shafer in dual-level fusion for multimodal fake real estate listings detection
title_short Class-weighted Dempster–Shafer in dual-level fusion for multimodal fake real estate listings detection
title_sort class weighted dempster shafer in dual level fusion for multimodal fake real estate listings detection
topic Fake property listings
Fraudulent activities
Unimodal analysis
Multimodal method
Feature fusion
Decision fusion
url https://peerj.com/articles/cs-2797.pdf
work_keys_str_mv AT maifuzamohdamin classweighteddempstershaferinduallevelfusionformultimodalfakerealestatelistingsdetection
AT norsamsiahsani classweighteddempstershaferinduallevelfusionformultimodalfakerealestatelistingsdetection
AT mohammadfaidzulnasrudin classweighteddempstershaferinduallevelfusionformultimodalfakerealestatelistingsdetection