Communication-Efficient Modeling with Penalized Quantile Regression for Distributed Data
In order to deal with high-dimensional distributed data, this article develops a novel and communication-efficient approach for sparse and high-dimensional data with the penalized quantile regression. In each round, the proposed method only requires the master machine to deal with a sparse penalized...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2021-01-01
|
Series: | Complexity |
Online Access: | http://dx.doi.org/10.1155/2021/6341707 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832566507359436800 |
---|---|
author | Aijun Hu Chujin Li Jing Wu |
author_facet | Aijun Hu Chujin Li Jing Wu |
author_sort | Aijun Hu |
collection | DOAJ |
description | In order to deal with high-dimensional distributed data, this article develops a novel and communication-efficient approach for sparse and high-dimensional data with the penalized quantile regression. In each round, the proposed method only requires the master machine to deal with a sparse penalized quantile regression which could be realized fastly by proximal alternating direction method of multipliers (ADMM) algorithm and the other worker machines to compute the subgradient on local data. The advantage of the proximal ADMM algorithm is that it could make every parameter of iteration to have closed formula even in high-dimensional case, which greatly improves the speed of calculation. As for the communication efficiency, the proposed method does not sacrifice any statistical accuracy and provably improves the estimation error obtained by centralized method, provided the penalty levels are chosen properly. Moreover, the asymptotic properties of the proposed estimation and the convergence of the algorithm are convincible. Especially, it presents extensive experiments on both the numerical simulations and the HIV drug resistance data analysis, which all confirm the significant efficiency of our proposed method in quantile regression for distributed data by comparative and empirical analysis. |
format | Article |
id | doaj-art-cb1c0d5acf304d8d8ec777f9d69d4cc0 |
institution | Kabale University |
issn | 1076-2787 1099-0526 |
language | English |
publishDate | 2021-01-01 |
publisher | Wiley |
record_format | Article |
series | Complexity |
spelling | doaj-art-cb1c0d5acf304d8d8ec777f9d69d4cc02025-02-03T01:04:04ZengWileyComplexity1076-27871099-05262021-01-01202110.1155/2021/63417076341707Communication-Efficient Modeling with Penalized Quantile Regression for Distributed DataAijun Hu0Chujin Li1Jing Wu2School of Mathematics and Statistics, Huazhong University of Science and Technology, Wuhan 430074, Hubei, ChinaSchool of Mathematics and Statistics, Huazhong University of Science and Technology, Wuhan 430074, Hubei, ChinaElectronic Information School, Wuhan University, Wuhan 430072, Hubei, ChinaIn order to deal with high-dimensional distributed data, this article develops a novel and communication-efficient approach for sparse and high-dimensional data with the penalized quantile regression. In each round, the proposed method only requires the master machine to deal with a sparse penalized quantile regression which could be realized fastly by proximal alternating direction method of multipliers (ADMM) algorithm and the other worker machines to compute the subgradient on local data. The advantage of the proximal ADMM algorithm is that it could make every parameter of iteration to have closed formula even in high-dimensional case, which greatly improves the speed of calculation. As for the communication efficiency, the proposed method does not sacrifice any statistical accuracy and provably improves the estimation error obtained by centralized method, provided the penalty levels are chosen properly. Moreover, the asymptotic properties of the proposed estimation and the convergence of the algorithm are convincible. Especially, it presents extensive experiments on both the numerical simulations and the HIV drug resistance data analysis, which all confirm the significant efficiency of our proposed method in quantile regression for distributed data by comparative and empirical analysis.http://dx.doi.org/10.1155/2021/6341707 |
spellingShingle | Aijun Hu Chujin Li Jing Wu Communication-Efficient Modeling with Penalized Quantile Regression for Distributed Data Complexity |
title | Communication-Efficient Modeling with Penalized Quantile Regression for Distributed Data |
title_full | Communication-Efficient Modeling with Penalized Quantile Regression for Distributed Data |
title_fullStr | Communication-Efficient Modeling with Penalized Quantile Regression for Distributed Data |
title_full_unstemmed | Communication-Efficient Modeling with Penalized Quantile Regression for Distributed Data |
title_short | Communication-Efficient Modeling with Penalized Quantile Regression for Distributed Data |
title_sort | communication efficient modeling with penalized quantile regression for distributed data |
url | http://dx.doi.org/10.1155/2021/6341707 |
work_keys_str_mv | AT aijunhu communicationefficientmodelingwithpenalizedquantileregressionfordistributeddata AT chujinli communicationefficientmodelingwithpenalizedquantileregressionfordistributeddata AT jingwu communicationefficientmodelingwithpenalizedquantileregressionfordistributeddata |