An Analytical Prior Selection Procedure for Empirical Bayesian Analysis Using Resampling Techniques: A Simulation-Based Approach Using the Pancreatic Adenocarcinoma Data from the SEER Database

<b>Introduction:</b> In the field of medical research, empirical Bayesian analysis has emerged as an increasingly applicable approach. This statistical framework offers greater flexibility, enabling researchers to incorporate prior information and rigorously estimate parameters of intere...

Full description

Saved in:
Bibliographic Details
Main Authors: Aditya Chakraborty, Mohan D. Pant
Format: Article
Language:English
Published: MDPI AG 2025-02-01
Series:Computation
Subjects:
Online Access:https://www.mdpi.com/2079-3197/13/2/51
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849722381066567680
author Aditya Chakraborty
Mohan D. Pant
author_facet Aditya Chakraborty
Mohan D. Pant
author_sort Aditya Chakraborty
collection DOAJ
description <b>Introduction:</b> In the field of medical research, empirical Bayesian analysis has emerged as an increasingly applicable approach. This statistical framework offers greater flexibility, enabling researchers to incorporate prior information and rigorously estimate parameters of interest. However, the selection of suitable prior distributions can be a challenging endeavor, with profound implications for the resulting inferences. To address this challenge, this study proposes a new analytical procedure that leverages resampling techniques to guide the choice of priors in Bayesian analysis. <b>Subject and Methods:</b> The study group consisted of patients who had been diagnosed and had died of pancreatic adenocarcinoma (cause-specific death) who had undergone both chemotherapy and radiation at stage IV of cancer. The data were collected from the Surveillance, Epidemiology, and End Results (SEER) database. Initially, the most suitable probabilistic behavior of the survival times of patients was identified parametrically via goodness-of-fit (GOF) tests, and afterward, empirical Bayesian analysis (EBA) was performed using resampling techniques (bootstrapping and the jackknife method). The Hamiltonian Monte Carlo (HMC) method was used to obtain the posterior distribution. <b>Results:</b> The most appropriate data distribution was found to be a two-parameter log-normal via GOF tests. A sensitivity analysis, followed by a simulation study, was performed to validate the analytical method. The performance of bootstrapped and jackknifed empirical Bayesian estimates was compared with maximum likelihood (ML) methods at each simulation stage. The empirical Bayesian estimates were found to be consistent with the ML estimates. Finally, a comparison was made among the parametric, Kaplan–Meier and empirical Bayesian survival estimates at different time points to illustrate the validity of the method. <b>Conclusions:</b> Determining the appropriate prior distribution is one of the crucial components in Bayesian analysis, as it can significantly influence the resulting inferences. The cautious selection of the prior information is essential, as it encapsulates the researcher’s beliefs or external prior knowledge about the parameters of interest. In the Bayesian framework, empirical resampling methods, such as bootstrapping and jackknifing, can offer valuable insights into the significance of prior selection, thus improving the consistency of statistical inferences. However, the analytical procedure is based on the time-to-event data, and the prior selection procedure can be extended to any real data, where Bayesian analysis is needed for decision-making and uncertainty quantification.
format Article
id doaj-art-85b2178cf3cb4a7f998ed64bbdbd4608
institution DOAJ
issn 2079-3197
language English
publishDate 2025-02-01
publisher MDPI AG
record_format Article
series Computation
spelling doaj-art-85b2178cf3cb4a7f998ed64bbdbd46082025-08-20T03:11:21ZengMDPI AGComputation2079-31972025-02-011325110.3390/computation13020051An Analytical Prior Selection Procedure for Empirical Bayesian Analysis Using Resampling Techniques: A Simulation-Based Approach Using the Pancreatic Adenocarcinoma Data from the SEER DatabaseAditya Chakraborty0Mohan D. Pant1Macon & Joan Brock Virginia Health Sciences, Old Dominion University, 5115 Hampton Blvd, Norfolk, VA 23529, USAMacon & Joan Brock Virginia Health Sciences, Old Dominion University, 5115 Hampton Blvd, Norfolk, VA 23529, USA<b>Introduction:</b> In the field of medical research, empirical Bayesian analysis has emerged as an increasingly applicable approach. This statistical framework offers greater flexibility, enabling researchers to incorporate prior information and rigorously estimate parameters of interest. However, the selection of suitable prior distributions can be a challenging endeavor, with profound implications for the resulting inferences. To address this challenge, this study proposes a new analytical procedure that leverages resampling techniques to guide the choice of priors in Bayesian analysis. <b>Subject and Methods:</b> The study group consisted of patients who had been diagnosed and had died of pancreatic adenocarcinoma (cause-specific death) who had undergone both chemotherapy and radiation at stage IV of cancer. The data were collected from the Surveillance, Epidemiology, and End Results (SEER) database. Initially, the most suitable probabilistic behavior of the survival times of patients was identified parametrically via goodness-of-fit (GOF) tests, and afterward, empirical Bayesian analysis (EBA) was performed using resampling techniques (bootstrapping and the jackknife method). The Hamiltonian Monte Carlo (HMC) method was used to obtain the posterior distribution. <b>Results:</b> The most appropriate data distribution was found to be a two-parameter log-normal via GOF tests. A sensitivity analysis, followed by a simulation study, was performed to validate the analytical method. The performance of bootstrapped and jackknifed empirical Bayesian estimates was compared with maximum likelihood (ML) methods at each simulation stage. The empirical Bayesian estimates were found to be consistent with the ML estimates. Finally, a comparison was made among the parametric, Kaplan–Meier and empirical Bayesian survival estimates at different time points to illustrate the validity of the method. <b>Conclusions:</b> Determining the appropriate prior distribution is one of the crucial components in Bayesian analysis, as it can significantly influence the resulting inferences. The cautious selection of the prior information is essential, as it encapsulates the researcher’s beliefs or external prior knowledge about the parameters of interest. In the Bayesian framework, empirical resampling methods, such as bootstrapping and jackknifing, can offer valuable insights into the significance of prior selection, thus improving the consistency of statistical inferences. However, the analytical procedure is based on the time-to-event data, and the prior selection procedure can be extended to any real data, where Bayesian analysis is needed for decision-making and uncertainty quantification.https://www.mdpi.com/2079-3197/13/2/51Hamiltonian Monte Carloempirical Bayesdistributional simulation studysensitivity analysisbootstrapped empirical prior (BEP)jackknifed empirical prior (JEP)
spellingShingle Aditya Chakraborty
Mohan D. Pant
An Analytical Prior Selection Procedure for Empirical Bayesian Analysis Using Resampling Techniques: A Simulation-Based Approach Using the Pancreatic Adenocarcinoma Data from the SEER Database
Computation
Hamiltonian Monte Carlo
empirical Bayes
distributional simulation study
sensitivity analysis
bootstrapped empirical prior (BEP)
jackknifed empirical prior (JEP)
title An Analytical Prior Selection Procedure for Empirical Bayesian Analysis Using Resampling Techniques: A Simulation-Based Approach Using the Pancreatic Adenocarcinoma Data from the SEER Database
title_full An Analytical Prior Selection Procedure for Empirical Bayesian Analysis Using Resampling Techniques: A Simulation-Based Approach Using the Pancreatic Adenocarcinoma Data from the SEER Database
title_fullStr An Analytical Prior Selection Procedure for Empirical Bayesian Analysis Using Resampling Techniques: A Simulation-Based Approach Using the Pancreatic Adenocarcinoma Data from the SEER Database
title_full_unstemmed An Analytical Prior Selection Procedure for Empirical Bayesian Analysis Using Resampling Techniques: A Simulation-Based Approach Using the Pancreatic Adenocarcinoma Data from the SEER Database
title_short An Analytical Prior Selection Procedure for Empirical Bayesian Analysis Using Resampling Techniques: A Simulation-Based Approach Using the Pancreatic Adenocarcinoma Data from the SEER Database
title_sort analytical prior selection procedure for empirical bayesian analysis using resampling techniques a simulation based approach using the pancreatic adenocarcinoma data from the seer database
topic Hamiltonian Monte Carlo
empirical Bayes
distributional simulation study
sensitivity analysis
bootstrapped empirical prior (BEP)
jackknifed empirical prior (JEP)
url https://www.mdpi.com/2079-3197/13/2/51
work_keys_str_mv AT adityachakraborty ananalyticalpriorselectionprocedureforempiricalbayesiananalysisusingresamplingtechniquesasimulationbasedapproachusingthepancreaticadenocarcinomadatafromtheseerdatabase
AT mohandpant ananalyticalpriorselectionprocedureforempiricalbayesiananalysisusingresamplingtechniquesasimulationbasedapproachusingthepancreaticadenocarcinomadatafromtheseerdatabase
AT adityachakraborty analyticalpriorselectionprocedureforempiricalbayesiananalysisusingresamplingtechniquesasimulationbasedapproachusingthepancreaticadenocarcinomadatafromtheseerdatabase
AT mohandpant analyticalpriorselectionprocedureforempiricalbayesiananalysisusingresamplingtechniquesasimulationbasedapproachusingthepancreaticadenocarcinomadatafromtheseerdatabase