An Analytical Prior Selection Procedure for Empirical Bayesian Analysis Using Resampling Techniques: A Simulation-Based Approach Using the Pancreatic Adenocarcinoma Data from the SEER Database
<b>Introduction:</b> In the field of medical research, empirical Bayesian analysis has emerged as an increasingly applicable approach. This statistical framework offers greater flexibility, enabling researchers to incorporate prior information and rigorously estimate parameters of intere...
| Main Authors: | Aditya Chakraborty, Mohan D. Pant |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | MDPI AG, 2025-02-01 |
| Series: | Computation |
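| Subjects: | Hamiltonian Monte Carlo; empirical Bayes; distributional simulation study; sensitivity analysis; bootstrapped empirical prior (BEP); jackknifed empirical prior (JEP) |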
| Online Access: | https://www.mdpi.com/2079-3197/13/2/51 |
| _version_ | 1849722381066567680 |
|---|---|
| author | Aditya Chakraborty; Mohan D. Pant |
| author_facet | Aditya Chakraborty; Mohan D. Pant |
| author_sort | Aditya Chakraborty |
| collection | DOAJ |
| description | <b>Introduction:</b> In medical research, empirical Bayesian analysis has emerged as an increasingly applicable approach. This statistical framework offers greater flexibility, enabling researchers to incorporate prior information and rigorously estimate parameters of interest. However, selecting suitable prior distributions can be challenging, with profound implications for the resulting inferences. To address this challenge, this study proposes a new analytical procedure that leverages resampling techniques to guide the choice of priors in Bayesian analysis. <b>Subject and Methods:</b> The study group consisted of patients who had been diagnosed with pancreatic adenocarcinoma, had undergone both chemotherapy and radiation at stage IV, and had died of the disease (cause-specific death). The data were collected from the Surveillance, Epidemiology, and End Results (SEER) database. First, the probability distribution that best describes the patients’ survival times was identified parametrically via goodness-of-fit (GOF) tests; empirical Bayesian analysis (EBA) was then performed using resampling techniques (bootstrapping and the jackknife method). The Hamiltonian Monte Carlo (HMC) method was used to obtain the posterior distribution. <b>Results:</b> The GOF tests identified the two-parameter log-normal as the most appropriate distribution for the data. A sensitivity analysis, followed by a simulation study, was performed to validate the analytical method. The performance of the bootstrapped and jackknifed empirical Bayesian estimates was compared with that of maximum likelihood (ML) estimates at each simulation stage. The empirical Bayesian estimates were found to be consistent with the ML estimates. Finally, the parametric, Kaplan–Meier, and empirical Bayesian survival estimates were compared at different time points to illustrate the validity of the method. <b>Conclusions:</b> Determining the appropriate prior distribution is one of the crucial components of Bayesian analysis, as it can significantly influence the resulting inferences. Careful selection of the prior information is essential, as it encapsulates the researcher’s beliefs or external prior knowledge about the parameters of interest. In the Bayesian framework, empirical resampling methods, such as bootstrapping and jackknifing, can offer valuable insights into the significance of prior selection, thus improving the consistency of statistical inferences. Although the analytical procedure was developed on time-to-event data, the prior selection procedure can be extended to any real data where Bayesian analysis is needed for decision-making and uncertainty quantification. |
| format | Article |
| id | doaj-art-85b2178cf3cb4a7f998ed64bbdbd4608 |
| institution | DOAJ |
| issn | 2079-3197 |
| language | English |
| publishDate | 2025-02-01 |
| publisher | MDPI AG |
| record_format | Article |
| series | Computation |
| spelling | doaj-art-85b2178cf3cb4a7f998ed64bbdbd4608; 2025-08-20T03:11:21Z; eng; MDPI AG; Computation; ISSN 2079-3197; 2025-02-01; vol. 13, no. 2, art. 51; DOI 10.3390/computation13020051; An Analytical Prior Selection Procedure for Empirical Bayesian Analysis Using Resampling Techniques: A Simulation-Based Approach Using the Pancreatic Adenocarcinoma Data from the SEER Database; Aditya Chakraborty and Mohan D. Pant (both: Macon & Joan Brock Virginia Health Sciences, Old Dominion University, 5115 Hampton Blvd, Norfolk, VA 23529, USA); https://www.mdpi.com/2079-3197/13/2/51; Hamiltonian Monte Carlo; empirical Bayes; distributional simulation study; sensitivity analysis; bootstrapped empirical prior (BEP); jackknifed empirical prior (JEP) |
| title | An Analytical Prior Selection Procedure for Empirical Bayesian Analysis Using Resampling Techniques: A Simulation-Based Approach Using the Pancreatic Adenocarcinoma Data from the SEER Database |
| topic | Hamiltonian Monte Carlo; empirical Bayes; distributional simulation study; sensitivity analysis; bootstrapped empirical prior (BEP); jackknifed empirical prior (JEP) |
| url | https://www.mdpi.com/2079-3197/13/2/51 |
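
The abstract describes using bootstrapping and the jackknife to inform the prior for a two-parameter log-normal survival model. The record does not give the authors' exact construction, so the following is only a minimal sketch of the resampling step under stated assumptions: the survival times are synthetic (not SEER data), the function names are hypothetical, and summarizing the resampled ML estimates by their mean and spread is one plausible way to obtain prior hyperparameters, not necessarily the published BEP/JEP procedure. The HMC posterior sampling step is not shown.

```python
import math
import random
import statistics


def lognormal_mle(times):
    """ML estimates (mu, sigma) of a two-parameter log-normal."""
    logs = [math.log(t) for t in times]
    mu = statistics.fmean(logs)
    sigma = math.sqrt(sum((x - mu) ** 2 for x in logs) / len(logs))
    return mu, sigma


def bootstrap_estimates(times, n_boot, rng):
    """Refit the model on n_boot samples drawn with replacement."""
    n = len(times)
    return [lognormal_mle(rng.choices(times, k=n)) for _ in range(n_boot)]


def jackknife_estimates(times):
    """Refit the model on each leave-one-out subsample."""
    return [lognormal_mle(times[:i] + times[i + 1:]) for i in range(len(times))]


def summarize(estimates):
    """Mean and sample standard deviation of the resampled (mu, sigma) pairs."""
    mus = [m for m, _ in estimates]
    sigmas = [s for _, s in estimates]
    return {
        "mu_mean": statistics.fmean(mus),
        "mu_sd": statistics.stdev(mus),
        "sigma_mean": statistics.fmean(sigmas),
        "sigma_sd": statistics.stdev(sigmas),
    }


# Synthetic survival times: hypothetical values, not the SEER cohort.
rng = random.Random(0)
times = [rng.lognormvariate(2.0, 0.5) for _ in range(200)]

mle = lognormal_mle(times)
boot = summarize(bootstrap_estimates(times, 500, rng))
jack = summarize(jackknife_estimates(times))

print("ML estimate:", mle)
print("Bootstrap summary:", boot)
print("Jackknife summary:", jack)
```

The raw spread of the leave-one-out replicates is much narrower than the sampling variability of the estimator; the usual jackknife standard error rescales their sample standard deviation by (n - 1)/sqrt(n). Any prior built from jackknife output would presumably need that adjustment, which this sketch leaves out.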