Enhancing Software Requirements Classification with Semisupervised GAN-BERT Technique

The field of automatic detection of quality attributes from software requirements’ text stands as one of the most pioneering realms within software requirements research. Such automatic quality attributes aim to aid stakeholders in establishing the system architecture and preemptively circumventing...

Full description

Saved in:
Bibliographic Details
Main Author: Gregorius Airlangga
Format: Article
Language:English
Published: Wiley 2024-01-01
Series:Journal of Electrical and Computer Engineering
Online Access:http://dx.doi.org/10.1155/2024/4955691
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832553220758568960
author Gregorius Airlangga
author_facet Gregorius Airlangga
author_sort Gregorius Airlangga
collection DOAJ
description The field of automatic detection of quality attributes from software requirements’ text stands as one of the most pioneering realms within software requirements research. Such automatic quality attributes aim to aid stakeholders in establishing the system architecture and preemptively circumventing faults. A considerable number of classifier models have been put forward, many of which show encouraging results. However, our analysis has identified substantial gaps in these studies, including (a) a limited dataset volume, (b) the absence of an evaluation study for cross-domain test sets, (c) the problem of real-time prediction scenarios where a vast amount of unlabeled data floods the system each second, and (d) a dearth of comparative studies scrutinizing diverse software requirements datasets and multiple machine learning models, with particular emphasis on in-domain and cross-domain testing. Hence, there is a pressing need to construct an alternative framework to enhance classifier performance under such conditions. Our research is primarily centered on developing a semisupervised methodology that hinges on GAN-BERT, introducing two datasets for the requirements of the engineering community and delivering comparative studies that consider a variety of classifiers and two labeling paradigms, namely, binary and multi. Remarkably, even with fewer data in a multiclassification scenario, our model outperforms other classifiers when assessing data from both identical and different domains.
format Article
id doaj-art-ba1f9c08ba354cf0a647fe1b1ee0aef4
institution Kabale University
issn 2090-0155
language English
publishDate 2024-01-01
publisher Wiley
record_format Article
series Journal of Electrical and Computer Engineering
spelling doaj-art-ba1f9c08ba354cf0a647fe1b1ee0aef42025-02-03T05:54:37ZengWileyJournal of Electrical and Computer Engineering2090-01552024-01-01202410.1155/2024/4955691Enhancing Software Requirements Classification with Semisupervised GAN-BERT TechniqueGregorius Airlangga0Information SystemThe field of automatic detection of quality attributes from software requirements’ text stands as one of the most pioneering realms within software requirements research. Such automatic quality attributes aim to aid stakeholders in establishing the system architecture and preemptively circumventing faults. A considerable number of classifier models have been put forward, many of which show encouraging results. However, our analysis has identified substantial gaps in these studies, including (a) a limited dataset volume, (b) the absence of an evaluation study for cross-domain test sets, (c) the problem of real-time prediction scenarios where a vast amount of unlabeled data floods the system each second, and (d) a dearth of comparative studies scrutinizing diverse software requirements datasets and multiple machine learning models, with particular emphasis on in-domain and cross-domain testing. Hence, there is a pressing need to construct an alternative framework to enhance classifier performance under such conditions. Our research is primarily centered on developing a semisupervised methodology that hinges on GAN-BERT, introducing two datasets for the requirements of the engineering community and delivering comparative studies that consider a variety of classifiers and two labeling paradigms, namely, binary and multi. Remarkably, even with fewer data in a multiclassification scenario, our model outperforms other classifiers when assessing data from both identical and different domains.http://dx.doi.org/10.1155/2024/4955691
spellingShingle Gregorius Airlangga
Enhancing Software Requirements Classification with Semisupervised GAN-BERT Technique
Journal of Electrical and Computer Engineering
title Enhancing Software Requirements Classification with Semisupervised GAN-BERT Technique
title_full Enhancing Software Requirements Classification with Semisupervised GAN-BERT Technique
title_fullStr Enhancing Software Requirements Classification with Semisupervised GAN-BERT Technique
title_full_unstemmed Enhancing Software Requirements Classification with Semisupervised GAN-BERT Technique
title_short Enhancing Software Requirements Classification with Semisupervised GAN-BERT Technique
title_sort enhancing software requirements classification with semisupervised gan bert technique
url http://dx.doi.org/10.1155/2024/4955691
work_keys_str_mv AT gregoriusairlangga enhancingsoftwarerequirementsclassificationwithsemisupervisedganberttechnique