Corrected optimal subsampling for a class of generalized linear measurement error models

Subsampling techniques have been promoted in massive data and can substantially reduce the computing time. However, existing subsampling techniques do not consider the case of dirty data, especially the inaccuracy of covariates due to measurement errors, which will lead to the inconsistent estimator...

Full description

Saved in:
Bibliographic Details
Main Authors: Ruiyuan Chang, Xiuli Wang, Mingqiu Wang
Format: Article
Language:English
Published: AIMS Press 2025-02-01
Series:AIMS Mathematics
Subjects:
Online Access:https://www.aimspress.com/article/doi/10.3934/math.2025203
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Subsampling techniques have been promoted in massive data and can substantially reduce the computing time. However, existing subsampling techniques do not consider the case of dirty data, especially the inaccuracy of covariates due to measurement errors, which will lead to the inconsistent estimators of regression coefficients. Therefore, to eliminate the influence of measurement errors on parameter estimators for massive data, this paper combined the corrected score function with the subsampling technique. The consistency and asymptotic normality of the estimators in the general subsampling are also derived. In addition, optimal subsampling probabilities are obtained based on the general subsampling algorithm using the A-optimality and L-optimality criteria and the truncation method, and then an adaptive two-step algorithm is developed. The effectiveness of the proposed method is demonstrated through numerical simulations and two real data analyses.
ISSN:2473-6988