Revisiting Pearl's influenza studies by bootstrapping for forward variable selection with a null factor.

In 1919 and 1921 Raymond Pearl published four empirical studies on the Spanish Flu epidemic in which he explored the factors that might explain the explosiveness and destructiveness of the epidemic in America's largest cities. Using partial correlation coefficients he tried to isolate the net e...

Full description

Saved in:
Bibliographic Details
Main Authors: Roselinde Kessels, Chris Gotwalt, Guido Erreygers
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0318685
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850190154377986048
author Roselinde Kessels
Chris Gotwalt
Guido Erreygers
author_facet Roselinde Kessels
Chris Gotwalt
Guido Erreygers
author_sort Roselinde Kessels
collection DOAJ
description In 1919 and 1921 Raymond Pearl published four empirical studies on the Spanish Flu epidemic in which he explored the factors that might explain the explosiveness and destructiveness of the epidemic in America's largest cities. Using partial correlation coefficients he tried to isolate the net effects of the possible explanatory factors, such as general demographic characteristics of the cities and death rates for various diseases, on the variables measuring the severity of the epidemic. Instead of Pearl's correlation analysis, we apply a bootstrap simulation to forward variable selection with a null factor for generalized linear regression with AICc validation. The null factor or pseudo-variable is a random variable that is independent of the response. The number of times it is included in the model selection simulation provides an important metric for deciding which terms should remain in the model. Our results are largely consistent with Pearl's conclusions in that the pre-pandemic death rates from organic heart disease and from all causes are most predictive of pandemic explosiveness or severity. However, our results also contain substantive nuances. Our paper contributes to the literature showing that state-of-the-art methodology for variable selection proves useful for historical epidemiology.
format Article
id doaj-art-6a3644e2d06a474f8a7978c555d9e47f
institution OA Journals
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-6a3644e2d06a474f8a7978c555d9e47f2025-08-20T02:15:23ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01202e031868510.1371/journal.pone.0318685Revisiting Pearl's influenza studies by bootstrapping for forward variable selection with a null factor.Roselinde KesselsChris GotwaltGuido ErreygersIn 1919 and 1921 Raymond Pearl published four empirical studies on the Spanish Flu epidemic in which he explored the factors that might explain the explosiveness and destructiveness of the epidemic in America's largest cities. Using partial correlation coefficients he tried to isolate the net effects of the possible explanatory factors, such as general demographic characteristics of the cities and death rates for various diseases, on the variables measuring the severity of the epidemic. Instead of Pearl's correlation analysis, we apply a bootstrap simulation to forward variable selection with a null factor for generalized linear regression with AICc validation. The null factor or pseudo-variable is a random variable that is independent of the response. The number of times it is included in the model selection simulation provides an important metric for deciding which terms should remain in the model. Our results are largely consistent with Pearl's conclusions in that the pre-pandemic death rates from organic heart disease and from all causes are most predictive of pandemic explosiveness or severity. However, our results also contain substantive nuances. Our paper contributes to the literature showing that state-of-the-art methodology for variable selection proves useful for historical epidemiology.https://doi.org/10.1371/journal.pone.0318685
spellingShingle Roselinde Kessels
Chris Gotwalt
Guido Erreygers
Revisiting Pearl's influenza studies by bootstrapping for forward variable selection with a null factor.
PLoS ONE
title Revisiting Pearl's influenza studies by bootstrapping for forward variable selection with a null factor.
title_full Revisiting Pearl's influenza studies by bootstrapping for forward variable selection with a null factor.
title_fullStr Revisiting Pearl's influenza studies by bootstrapping for forward variable selection with a null factor.
title_full_unstemmed Revisiting Pearl's influenza studies by bootstrapping for forward variable selection with a null factor.
title_short Revisiting Pearl's influenza studies by bootstrapping for forward variable selection with a null factor.
title_sort revisiting pearl s influenza studies by bootstrapping for forward variable selection with a null factor
url https://doi.org/10.1371/journal.pone.0318685
work_keys_str_mv AT roselindekessels revisitingpearlsinfluenzastudiesbybootstrappingforforwardvariableselectionwithanullfactor
AT chrisgotwalt revisitingpearlsinfluenzastudiesbybootstrappingforforwardvariableselectionwithanullfactor
AT guidoerreygers revisitingpearlsinfluenzastudiesbybootstrappingforforwardvariableselectionwithanullfactor