Enhancing Abstractive Multi-Document Summarization with Bert2Bert Model for Indonesian Language

This study investigates the effectiveness of the proposed Bert2Bert and Bert2Bert+Xtreme models in improving abstract multi-document summarization for the Indonesian language. This study uses the transformer model as a basis for developing the proposed Bert2Bert and Bert2Bert+Xtreme models. The resu...

Full description

Saved in:
Bibliographic Details
Main Authors: Aldi Fahluzi Muharam, Yana Aditia Gerhana, Dian Sa'adillah Maylawati, Muhammad Ali Ramdhani, Titik Khawa Abdul Rahman
Format: Article
Language:English
Published: Universitas Islam Negeri Sunan Kalijaga Yogyakarta 2025-01-01
Series:JISKA (Jurnal Informatika Sunan Kalijaga)
Subjects:
Online Access:https://ejournal.uin-suka.ac.id/saintek/JISKA/article/view/4736
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832574153484402688
author Aldi Fahluzi Muharam
Yana Aditia Gerhana
Dian Sa'adillah Maylawati
Muhammad Ali Ramdhani
Titik Khawa Abdul Rahman
author_facet Aldi Fahluzi Muharam
Yana Aditia Gerhana
Dian Sa'adillah Maylawati
Muhammad Ali Ramdhani
Titik Khawa Abdul Rahman
author_sort Aldi Fahluzi Muharam
collection DOAJ
description This study investigates the effectiveness of the proposed Bert2Bert and Bert2Bert+Xtreme models in improving abstract multi-document summarization for the Indonesian language. This study uses the transformer model as a basis for developing the proposed Bert2Bert and Bert2Bert+Xtreme models. The results of the model evaluation with the Liputan6 dataset using ROUGE-1, ROUGE-2, ROUGE-L, and BERTScore show that the proposed models have slight improvements over previous research models with Bert2Bert being better than Bert2Bert+Xtreme. Despite the challenges posed by limited reference summarization for Indonesian documents, content-based analysis using readability metrics, including FKGL, GFI, and Dwiyanto Djoko Pranowo revealed that the summaries generated by Bert2Bert and Bert2Bert+Xtreme are at a moderate readability level, which means they are suitable for adult readers and in line with the target audience of the news portal.
format Article
id doaj-art-33fd67248bc04ecd932cc0665d0ac85e
institution Kabale University
issn 2527-5836
2528-0074
language English
publishDate 2025-01-01
publisher Universitas Islam Negeri Sunan Kalijaga Yogyakarta
record_format Article
series JISKA (Jurnal Informatika Sunan Kalijaga)
spelling doaj-art-33fd67248bc04ecd932cc0665d0ac85e2025-02-02T00:37:10ZengUniversitas Islam Negeri Sunan Kalijaga YogyakartaJISKA (Jurnal Informatika Sunan Kalijaga)2527-58362528-00742025-01-01101Enhancing Abstractive Multi-Document Summarization with Bert2Bert Model for Indonesian LanguageAldi Fahluzi Muharam0Yana Aditia Gerhana1Dian Sa'adillah Maylawati2Muhammad Ali Ramdhani3Titik Khawa Abdul Rahman4Department of Informatics, Faculty of Science and Technology, UIN Sunan Gunung Djati, BandungDepartment of Informatics, Faculty of Science and Technology, UIN Sunan Gunung Djati, Bandung, Indonesia and Information and Communication Technology, Asia e University, SelangorDepartment of Informatics, UIN Sunan Gunung Djati Bandungment of Informatics, Faculty of Science and Technology, UIN Sunan Gunung Djati, BandungInformation and Communication Technology, Asia e University, SelangorThis study investigates the effectiveness of the proposed Bert2Bert and Bert2Bert+Xtreme models in improving abstract multi-document summarization for the Indonesian language. This study uses the transformer model as a basis for developing the proposed Bert2Bert and Bert2Bert+Xtreme models. The results of the model evaluation with the Liputan6 dataset using ROUGE-1, ROUGE-2, ROUGE-L, and BERTScore show that the proposed models have slight improvements over previous research models with Bert2Bert being better than Bert2Bert+Xtreme. Despite the challenges posed by limited reference summarization for Indonesian documents, content-based analysis using readability metrics, including FKGL, GFI, and Dwiyanto Djoko Pranowo revealed that the summaries generated by Bert2Bert and Bert2Bert+Xtreme are at a moderate readability level, which means they are suitable for adult readers and in line with the target audience of the news portal.https://ejournal.uin-suka.ac.id/saintek/JISKA/article/view/4736Bert2BertAbstractiveMulti-documentSummarizationTransformer
spellingShingle Aldi Fahluzi Muharam
Yana Aditia Gerhana
Dian Sa'adillah Maylawati
Muhammad Ali Ramdhani
Titik Khawa Abdul Rahman
Enhancing Abstractive Multi-Document Summarization with Bert2Bert Model for Indonesian Language
JISKA (Jurnal Informatika Sunan Kalijaga)
Bert2Bert
Abstractive
Multi-document
Summarization
Transformer
title Enhancing Abstractive Multi-Document Summarization with Bert2Bert Model for Indonesian Language
title_full Enhancing Abstractive Multi-Document Summarization with Bert2Bert Model for Indonesian Language
title_fullStr Enhancing Abstractive Multi-Document Summarization with Bert2Bert Model for Indonesian Language
title_full_unstemmed Enhancing Abstractive Multi-Document Summarization with Bert2Bert Model for Indonesian Language
title_short Enhancing Abstractive Multi-Document Summarization with Bert2Bert Model for Indonesian Language
title_sort enhancing abstractive multi document summarization with bert2bert model for indonesian language
topic Bert2Bert
Abstractive
Multi-document
Summarization
Transformer
url https://ejournal.uin-suka.ac.id/saintek/JISKA/article/view/4736
work_keys_str_mv AT aldifahluzimuharam enhancingabstractivemultidocumentsummarizationwithbert2bertmodelforindonesianlanguage
AT yanaaditiagerhana enhancingabstractivemultidocumentsummarizationwithbert2bertmodelforindonesianlanguage
AT diansaadillahmaylawati enhancingabstractivemultidocumentsummarizationwithbert2bertmodelforindonesianlanguage
AT muhammadaliramdhani enhancingabstractivemultidocumentsummarizationwithbert2bertmodelforindonesianlanguage
AT titikkhawaabdulrahman enhancingabstractivemultidocumentsummarizationwithbert2bertmodelforindonesianlanguage