{ "id": "1504.08031", "version": "v1", "published": "2015-04-29T21:38:34.000Z", "updated": "2015-04-29T21:38:34.000Z", "title": "Selective inference with unknown variance via the square-root LASSO", "authors": [ "Xiaoying Tian", "Joshua R. Loftus", "Jonathan E. Taylor" ], "categories": [ "math.ST", "stat.ME", "stat.TH" ], "abstract": "There has been much recent work on inference after model selection when the noise level is known, for example in forward stepwise model selection or LASSO with an independent estimate of $\\sigma$. In this work we consider the more realistic scenario of an unknown noise level and propose using the square root LASSO (also known as the scaled LASSO) to conduct selective inference without previous knowledge of $\\sigma$. Applying the selective inference framework described in Fithian et al. (2014), we construct selectively valid exact tests of parameters in the selected model. We discuss regression diagnostics including tests for inclusion of variables not selected by the square root LASSO. The selective inference framework also suggests a natural pseudo-likelihood estimate of the noise level that performs better than other estimates of $\\sigma$ from the square root LASSO. Based on this estimate we use a Gaussian approximation to construct confidence intervals. We also consider exact inference when holding out some data for inference in the second stage, noting the same increase in power observed in Fithian et al. (2014). We illustrate our method on an HIV drug resistance dataset and in a multi-scale change point detection problem where the number of regressors is order $n^2$.", "revisions": [ { "version": "v1", "updated": "2015-04-29T21:38:34.000Z" } ], "analyses": { "keywords": [ "square root lasso", "square-root lasso", "unknown variance", "noise level", "selective inference framework" ], "note": { "typesetting": "TeX", "pages": 0, "language": "en", "license": "arXiv", "status": "editable", "adsabs": "2015arXiv150408031T" } } }