arXiv Analytics

Sign in

arXiv:1204.5536 [math.ST]AbstractReferencesReviewsResources

Endogeneity in high dimensions

Jianqing Fan, Yuan Liao

Published 2012-04-25, updated 2014-05-27Version 2

Most papers on high-dimensional statistics are based on the assumption that none of the regressors are correlated with the regression error, namely, they are exogenous. Yet, endogeneity can arise incidentally from a large pool of regressors in a high-dimensional regression. This causes the inconsistency of the penalized least-squares method and possible false scientific discoveries. A necessary condition for model selection consistency of a general class of penalized regression methods is given, which allows us to prove formally the inconsistency claim. To cope with the incidental endogeneity, we construct a novel penalized focused generalized method of moments (FGMM) criterion function. The FGMM effectively achieves the dimension reduction and applies the instrumental variable methods. We show that it possesses the oracle property even in the presence of endogenous predictors, and that the solution is also near global minimum under the over-identification assumption. Finally, we also show how the semi-parametric efficiency of estimation can be achieved via a two-step approach.

Comments: Published in at http://dx.doi.org/10.1214/13-AOS1202 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Journal: Annals of Statistics 2014, Vol. 42, No. 3, 872-917
Categories: math.ST, stat.TH
Related articles: Most relevant | Search more
arXiv:1701.05911 [math.ST] (Published 2017-01-20)
Delta Theorem in the Age of High Dimensions
arXiv:1502.01798 [math.ST] (Published 2015-02-06)
Eigenvalue Condition and model selection consistency of lasso
arXiv:1405.5103 [math.ST] (Published 2014-05-20, updated 2014-12-02)
Estimation in high dimensions: a geometric perspective