Protocol for development of a checklist and guideline for transparent reporting of cluster analyses (TRoCA)

Introduction Cluster analysis, a machine learning-based and data-driven technique for identifying groups in data, has demonstrated its potential in a wide range of contexts. However, critical appraisal and reproducibility are often limited by insufficient reporting, ultimately hampering the interpre...

Full description

Saved in:
Bibliographic Details
Main Authors: Ana Marusic, Bright I Nwaru, Syed Ahmar Shah, Meredith Wallace, Dat Tran, Daniil Lisik, Rani Basna, Tai Dinh, Ryan P Browne, Jeffrey L Andrews, Absalom Ezugwu, Joaquín Torres-Sospedra, Hieu-Chi Dam, Philippe Fournier-Viger, Christian Hennig, Marieke Timmerman, Matthijs J Warrens, Eva Ceulemans, Tina M Hernandez-Boussard
Format: Article
Language:English
Published: BMJ Publishing Group 2025-08-01
Series:BMJ Open
Online Access:https://bmjopen.bmj.com/content/15/8/e099609.full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849229402750058496
author Ana Marusic
Bright I Nwaru
Syed Ahmar Shah
Meredith Wallace
Dat Tran
Daniil Lisik
Rani Basna
Tai Dinh
Ryan P Browne
Jeffrey L Andrews
Absalom Ezugwu
Joaquín Torres-Sospedra
Hieu-Chi Dam
Philippe Fournier-Viger
Christian Hennig
Marieke Timmerman
Matthijs J Warrens
Eva Ceulemans
Tina M Hernandez-Boussard
author_facet Ana Marusic
Bright I Nwaru
Syed Ahmar Shah
Meredith Wallace
Dat Tran
Daniil Lisik
Rani Basna
Tai Dinh
Ryan P Browne
Jeffrey L Andrews
Absalom Ezugwu
Joaquín Torres-Sospedra
Hieu-Chi Dam
Philippe Fournier-Viger
Christian Hennig
Marieke Timmerman
Matthijs J Warrens
Eva Ceulemans
Tina M Hernandez-Boussard
author_sort Ana Marusic
collection DOAJ
description Introduction Cluster analysis, a machine learning-based and data-driven technique for identifying groups in data, has demonstrated its potential in a wide range of contexts. However, critical appraisal and reproducibility are often limited by insufficient reporting, ultimately hampering the interpretation and trust of key stakeholders. The present paper describes the protocol that will guide the development of a reporting guideline and checklist for studies incorporating cluster analyses—Transparent Reporting of Cluster Analyses.Methods and analysis Following the recommended steps for developing reporting guidelines outlined by the Enhancing the QUAlity and Transparency Of health Research Network, the work will be divided into six stages. Stage 1: literature review to guide development of initial checklist. Stage 2: drafting of the initial checklist. Stage 3: internal revision of checklist. Stage 4: Delphi study in a global sample of researchers from varying fields (n=≈) to derive consensus regarding items in the checklist and piloting of the checklist. Stage 5: consensus meeting to consolidate checklist. Stage 6: production of statement paper and explanation and elaboration paper. Stage 7: dissemination via journals, conferences, social media and a dedicated web platform.Ethics and dissemination Due to local regulations, the planned study is exempt from the requirement of ethical review. The findings will be disseminated through peer-reviewed publications. The checklist with explanations will also be made available freely on a dedicated web platform (troca-statement.org) and in a repository.
format Article
id doaj-art-ba9191ae9e624fab9212decaead70e53
institution Kabale University
issn 2044-6055
language English
publishDate 2025-08-01
publisher BMJ Publishing Group
record_format Article
series BMJ Open
spelling doaj-art-ba9191ae9e624fab9212decaead70e532025-08-22T04:20:13ZengBMJ Publishing GroupBMJ Open2044-60552025-08-0115810.1136/bmjopen-2025-099609Protocol for development of a checklist and guideline for transparent reporting of cluster analyses (TRoCA)Ana Marusic0Bright I Nwaru1Syed Ahmar Shah2Meredith Wallace3Dat Tran4Daniil Lisik5Rani Basna6Tai Dinh7Ryan P Browne8Jeffrey L Andrews9Absalom Ezugwu10Joaquín Torres-Sospedra11Hieu-Chi Dam12Philippe Fournier-Viger13Christian Hennig14Marieke Timmerman15Matthijs J Warrens16Eva Ceulemans17Tina M Hernandez-Boussard18Department of Research in Biomedicine and Health, University of Split School of Medicine, Split, CroatiaKrefting Research Centre, Institute of Medicine, University of Gothenburg Sahlgrenska Academy, Gothenburg, SwedenThe University of Edinburgh Usher Institute, Edinburgh, UKDepartment of Psychiatry, Statistics and Biostatistics, University of Pittsburgh, Pittsburgh, Pennsylvania, USAUniversity of Canberra, Canberra, Australian Capital Territory, AustraliaDepartment of Public Health and Clinical Medicine, Section of Sustainable Health, The OLIN Unit, Umeå Universitet, Umeå, SwedenKrefting Research Centre, Institute of Medicine, University of Gothenburg Sahlgrenska Academy, Gothenburg, SwedenThe Kyoto College of Graduate Studies for Informatics, Kyoto, JapanDepartment of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario, CanadaDepartment of Statistics, The University of British Columbia, Vancouver, British Columbia, CanadaUnit for Data Science and Computing, North-West University, Potchefstroom, South AfricaDepartment of Computer Science, Universitat de València, Valencia, SpainJapan Advanced Institute of Science and Technology, Nomi, JapanBig Data Institute, Shenzhen University College of Computer Science and Software Engineering, Shenzhen, Guangdong, ChinaDepartment of Statistical Sciences ‘Paolo Fortunati’, University of Bologna, Bologna, ItalyDepartment of Psychometrics and Statistics, University of Groningen, Groningen, NetherlandsGION Education/Research, Department of Pedagogical and Educational Sciences, University of Groningen, Groningen, The NetherlandsQuantitative Psychology and Individual Differences, KU Leuven, Leuven, BelgiumDepartment of Biomedical Data Science, Stanford University, Stanford, California, USAIntroduction Cluster analysis, a machine learning-based and data-driven technique for identifying groups in data, has demonstrated its potential in a wide range of contexts. However, critical appraisal and reproducibility are often limited by insufficient reporting, ultimately hampering the interpretation and trust of key stakeholders. The present paper describes the protocol that will guide the development of a reporting guideline and checklist for studies incorporating cluster analyses—Transparent Reporting of Cluster Analyses.Methods and analysis Following the recommended steps for developing reporting guidelines outlined by the Enhancing the QUAlity and Transparency Of health Research Network, the work will be divided into six stages. Stage 1: literature review to guide development of initial checklist. Stage 2: drafting of the initial checklist. Stage 3: internal revision of checklist. Stage 4: Delphi study in a global sample of researchers from varying fields (n=≈) to derive consensus regarding items in the checklist and piloting of the checklist. Stage 5: consensus meeting to consolidate checklist. Stage 6: production of statement paper and explanation and elaboration paper. Stage 7: dissemination via journals, conferences, social media and a dedicated web platform.Ethics and dissemination Due to local regulations, the planned study is exempt from the requirement of ethical review. The findings will be disseminated through peer-reviewed publications. The checklist with explanations will also be made available freely on a dedicated web platform (troca-statement.org) and in a repository.https://bmjopen.bmj.com/content/15/8/e099609.full
spellingShingle Ana Marusic
Bright I Nwaru
Syed Ahmar Shah
Meredith Wallace
Dat Tran
Daniil Lisik
Rani Basna
Tai Dinh
Ryan P Browne
Jeffrey L Andrews
Absalom Ezugwu
Joaquín Torres-Sospedra
Hieu-Chi Dam
Philippe Fournier-Viger
Christian Hennig
Marieke Timmerman
Matthijs J Warrens
Eva Ceulemans
Tina M Hernandez-Boussard
Protocol for development of a checklist and guideline for transparent reporting of cluster analyses (TRoCA)
BMJ Open
title Protocol for development of a checklist and guideline for transparent reporting of cluster analyses (TRoCA)
title_full Protocol for development of a checklist and guideline for transparent reporting of cluster analyses (TRoCA)
title_fullStr Protocol for development of a checklist and guideline for transparent reporting of cluster analyses (TRoCA)
title_full_unstemmed Protocol for development of a checklist and guideline for transparent reporting of cluster analyses (TRoCA)
title_short Protocol for development of a checklist and guideline for transparent reporting of cluster analyses (TRoCA)
title_sort protocol for development of a checklist and guideline for transparent reporting of cluster analyses troca
url https://bmjopen.bmj.com/content/15/8/e099609.full
work_keys_str_mv AT anamarusic protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT brightinwaru protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT syedahmarshah protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT meredithwallace protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT dattran protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT daniillisik protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT ranibasna protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT taidinh protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT ryanpbrowne protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT jeffreylandrews protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT absalomezugwu protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT joaquintorressospedra protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT hieuchidam protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT philippefournierviger protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT christianhennig protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT marieketimmerman protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT matthijsjwarrens protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT evaceulemans protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca
AT tinamhernandezboussard protocolfordevelopmentofachecklistandguidelinefortransparentreportingofclusteranalysestroca