Communication-Efficient Modeling with Penalized Quantile Regression for Distributed Data

This article develops a novel, communication-efficient approach to penalized quantile regression for sparse, high-dimensional distributed data. In each round, the proposed method only requires the master machine to deal with a sparse penalized...

Bibliographic Details
Main Authors: Aijun Hu, Chujin Li, Jing Wu
Format: Article
Language: English
Published: Wiley 2021-01-01
Series: Complexity
Online Access: http://dx.doi.org/10.1155/2021/6341707
author Aijun Hu
Chujin Li
Jing Wu
collection DOAJ
description This article develops a novel, communication-efficient approach to penalized quantile regression for sparse, high-dimensional distributed data. In each round, the proposed method requires only that the master machine solve a sparse penalized quantile regression, which can be done quickly with the proximal alternating direction method of multipliers (ADMM) algorithm, and that the worker machines compute the subgradient on their local data. The advantage of the proximal ADMM algorithm is that every parameter update in each iteration has a closed-form expression, even in the high-dimensional case, which greatly speeds up computation. As for communication efficiency, the proposed method does not sacrifice statistical accuracy and provably improves on the estimation error obtained by the centralized method, provided the penalty levels are chosen properly. Moreover, the asymptotic properties of the proposed estimator and the convergence of the algorithm are established. Extensive experiments, comprising numerical simulations and an analysis of HIV drug resistance data, confirm through comparative and empirical analysis the efficiency of the proposed method for quantile regression on distributed data.
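The building blocks named in the description can be illustrated with a minimal Python sketch. It is not the authors' implementation. It shows (a) the subgradient of the quantile check loss that a worker could compute on its local data, (b) a sample-size-weighted average of such subgradients, and (c) soft-thresholding, the closed-form proximal step for an L1 penalty. The L1 penalty, the function names, and the weighting scheme are assumptions made for illustration; the record does not state the exact penalty or update formulas.

```python
import numpy as np


def check_loss_subgradient(X, y, beta, tau):
    """Return one subgradient (w.r.t. beta) of the average check loss
    (1/n) * sum_i rho_tau(y_i - x_i' beta), where rho_tau(r) = r * (tau - 1{r < 0}).
    """
    residual = y - X @ beta
    # Derivative of rho_tau in r: tau for r > 0, tau - 1 for r < 0.
    # At r == 0 any value in [tau - 1, tau] is valid; tau - 1 is chosen here.
    psi = tau - (residual <= 0).astype(float)
    # Chain rule: the residual depends on beta through -X @ beta.
    return -(X.T @ psi) / len(y)


def aggregate_subgradients(local_grads, local_sizes):
    """Combine worker subgradients, weighting each by its local sample size
    (an assumed aggregation rule, used only for illustration)."""
    weights = np.asarray(local_sizes, dtype=float)
    weights = weights / weights.sum()
    return sum(w * g for w, g in zip(weights, local_grads))


def soft_threshold(v, threshold):
    """Closed-form proximal operator of threshold * ||v||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - threshold, 0.0)
```

In this sketch each worker sends only a vector of length p (its local subgradient) rather than its raw data, which is the sense in which such schemes save communication. Closed-form steps such as soft-thresholding are also what keep per-iteration cost low in high dimensions.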
format Article
id doaj-art-cb1c0d5acf304d8d8ec777f9d69d4cc0
institution Kabale University
issn 1076-2787
1099-0526
language English
publishDate 2021-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-cb1c0d5acf304d8d8ec777f9d69d4cc0; 2025-02-03T01:04:04Z; eng; Wiley; Complexity; 1076-2787, 1099-0526; 2021-01-01; 2021; 10.1155/2021/6341707; 6341707
Communication-Efficient Modeling with Penalized Quantile Regression for Distributed Data
Aijun Hu: School of Mathematics and Statistics, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
Chujin Li: School of Mathematics and Statistics, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
Jing Wu: Electronic Information School, Wuhan University, Wuhan 430072, Hubei, China
http://dx.doi.org/10.1155/2021/6341707
title Communication-Efficient Modeling with Penalized Quantile Regression for Distributed Data
url http://dx.doi.org/10.1155/2021/6341707