Publications

Detailed Information

Fast inference for quantile regression with tens of millions of observations

Cited 0 time in Web of Science Cited 0 time in Scopus
Authors

Lee, Sokbae; Liao, Yuan; Seo, Myung Hwan; Shin, Youngki

Issue Date
2024
Publisher
Elsevier Ltd
Citation
Journal of Econometrics, p. 105673
Abstract
Big data analytics has opened new avenues in economic research, but the challenge of analyzing datasets with tens of millions of observations is substantial. Conventional econometric methods based on extreme estimators require large amounts of computing resources and memory, which are often not readily available. In this paper, we focus on linear quantile regression applied to ultra-large datasets, such as U.S. decennial censuses. A fast inference framework is presented, utilizing stochastic subgradient descent (S-subGD) updates. The inference procedure handles cross-sectional data sequentially: (i) updating the parameter estimate with each incoming new observation, (ii) aggregating it as a Polyak–Ruppert average, and (iii) computing a pivotal statistic for inference using only a solution path. The methodology draws from time-series regression to create an asymptotically pivotal statistic through random scaling. Our proposed test statistic is calculated in a fully online fashion and critical values are calculated without resampling. We conduct extensive numerical studies to showcase the computational merits of our proposed inference. For inference problems as large as (n,d)∼(107,103), where n is the sample size and d is the number of regressors, our method generates new insights, surpassing current inference methods in computation. Our method specifically reveals trends in the gender gap in the U.S. college wage premium using millions of observations, while controlling over 103 covariates to mitigate confounding effects.
ISSN
0304-4076
URI
https://hdl.handle.net/10371/204960
DOI
https://doi.org/10.1016/j.jeconom.2024.105673
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Related Researcher

  • College of Social Sciences
  • Department of Economics
Research Area Econometrics, Economics, Statistics

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share