西北大学主页平台管理系统夏志明--中文主页-- Distributed Feature Screening via Componentwise Debiasing

论文成果

当前位置: 中文主页 >> 论文成果

Distributed Feature Screening via Componentwise Debiasing

点击次数：

影响因子：5.177
发表刊物：Journal of Machine Learning Research
关键字：Feature screening; Big data; Divide-and-conquer; Aggregated correlation; Sure screening property
摘要：Feature screening is a powerful tool in processing high-dimensional data. When the sample size N and the number of features p are both large, the implementation of classic screening methods can be numerically challenging. In this paper, we propose a distributed screening framework for big data setup. In the spirit of "divide-and-conquer", the proposed framework expresses a correlation measure as a function of several component parameters, each of which can be distributively estimated using a natural U-statistic from data segments. With the component estimates aggregated, we obtain a final correlation estimate that can be readily used for screening features. This framework enables distributed storage and parallel computing and thus is computationally attractive. Due to the unbiased distributive estimation of the component parameters, the final aggregated estimate achieves a high accuracy that is insensitive to the number of data segments m. Under mild conditions, we show that the aggregated correlation estimator is as efficient as the centralized estimator in terms of the probability convergence bound and the mean squared error rate; the corresponding screening procedure enjoys sure screening property for a wide range of correlation measures. The promising performances of the new method are supported by extensive numerical examples.
论文类型：期刊论文
卷号：21
ISSN号：1532-4435
是否译文：否
收录刊物：SCI、EI
第一作者：Xingxing Li
合写作者：Runze Li
合写作者：Zhiming Xia

上一条：Statistical Monitoring and Inference of Matrix Time Series Based on 2DPCA

下一条：Parameter estimation and hypothesis testing problems in tensor linear regression model

个人信息

夏志明Zhiming Xia

论文成果

Distributed Feature Screening via Componentwise Debiasing