951
Views
5
CrossRef citations to date
0
Altmetric
Articles

A Note on Distributed Quantile Regression by Pilot Sampling and One-Step Updating

ORCID Icon, , , ORCID Icon, ORCID Icon & ORCID Icon

References

  • Alhamzawi, R., and Ali, H. T. M. (2018), “Bayesian Quantile Regression for Ordinal Longitudinal Data,” Journal of Applied Statistics, 45, 815–828. DOI: 10.1080/02664763.2017.1315059.
  • Battey, H., Fan, J., Liu, H., Lu, J., and Zhu, Z. (2018), “Distributed Testing and Estimation Under Sparse High Dimensional Models,” The Annals of Statistics, 46, 1352–1382. DOI: 10.1214/17-AOS1587.
  • Chang, X., Lin, S.-B., and Zhou, D.-X. (2017), “Distributed Semi-Supervised Learning With Kernel Ridge Regression,” Journal of Machine Learning Research, 18, 1493–1514.
  • Chen, C. W., Dunson, D. B., Reed, C., and Yu, K. (2013), “Bayesian Variable Selection in Quantile Regression,” Statistics and Its Interface, 6, 261–274. DOI: 10.4310/SII.2013.v6.n2.a9.
  • Chen, X., Liu, W., and Zhang, Y. (2019), “Quantile Regression Under Memory Constraint,” The Annals of Statistics, 47, 3244–3273. DOI: 10.1214/18-AOS1777.
  • Chen, X., and Xie, M.-g. (2014), “A Split-and-Conquer Approach for Analysis of Extraordinarily Large Data,” Statistica Sinica, 24, 1655–1684.
  • Dünner, C., Parnell, T., Atasu, K., Sifalakis, M., and Pozidis, H. (2017), “Understanding and Optimizing the Performance of Distributed Machine Learning Applications on Apache Spark,” in 2017 IEEE International Conference on Big Data (Big Data), Boston, MA: IEEE, pp. 331–338. DOI: 10.1109/BigData.2017.8257942.
  • Fan, J. (2000), “Prospects of Nonparametric Modeling,” Journal of the American Statistical Association, 95, 1296–1300. DOI: 10.1080/01621459.2000.10474334.
  • Fan, J., and Marron, J. S. (1992), “Best Possible Constant for Bandwidth Selection,” The Annals of Statistics, 20, 2057–2070. DOI: 10.1214/aos/1176348902.
  • Fan, J., Wang, D., Wang, K., and Zhu, Z. (2019), “Distributed Estimation of Principal Eigenspaces,” The Annals of Statistics, 47, 3009–3031. DOI: 10.1214/18-AOS1713.
  • He, X., and Liang, H. (2000), “Quantile Regression Estimates for a Class of Linear and Partially Linear Errors-in-Variables Models,” Statistica Sinica, 10, 129–140.
  • Jia, D., Bhimani, J., Nguyen, S. N., Sheng, B. and Mi, N. (2019), “Atumm: Auto-Tuning Memory Manager in Apache Spark,” in 2019 IEEE 38th International Performance Computing and Communications Conference (IPCCC), London, UK: IEEE, pp. 1–8. DOI: 10.1109/IPCCC47392.2019.8958724.
  • Jiang, J. (2010), Large Sample Techniques for Statistics, Springer Science & Business Media.
  • Jordan, M. I., Lee, J. D., and Yang, Y. (2019), “Communication-Efficient Distributed Statistical Inference,” Journal of the American Statistical Association, 114, 668–681. DOI: 10.1080/01621459.2018.1429274.
  • Kitagawa, G., and Konishi, S. (2010), “Bias and Variance Reduction Techniques for Bootstrap Information Criteria,” Annals of the Institute of Statistical Mathematics, 62, 209–234. DOI: 10.1007/s10463-009-0237-1.
  • Kleiner, A., Talwalkar, A., Sarkar, P. and Jordan, M. I. (2014), “A Scalable Bootstrap for Massive Data,” Journal of the Royal Statistical Society, Series B, 76, 795–816. DOI: 10.1111/rssb.12050.
  • Koenker, R. (2005), Quantile Regression, Cambridge, MA: Cambridge University Press.
  • Koenker, R., and Bassett Jr, G. (1978), “Regression Quantiles,” Econometrica, 46, 33–50. DOI: 10.2307/1913643.
  • Koenker, R., and Hallock, K. F. (2001), “Quantile Regression,” Journal of Economic Perspectives, 15, 143–156. DOI: 10.1257/jep.15.4.143.
  • Koenker, R., and Park, B. J. (1996), “An Interior Point Algorithm for Nonlinear Quantile Regression,” Journal of Econometrics, 71, 265–283. DOI: 10.1016/0304-4076(96)84507-6.
  • Kostov, P., and Davidova, S. (2013), “A Quantile Regression Analysis of the Effect of Farmers’s Attitudes and Perceptions on Market Participation,” Journal of Agricultural Economics, 64, 112–132. DOI: 10.1111/j.1477-9552.2012.00366.x.
  • Lee, J. D., Liu, Q., Sun, Y., and Taylor, J. E. (2017), “Communication-Efficient Sparse Regression,”’ Journal of Machine Learning Research, 18, 115–144.
  • Li, G., Li, Y., and Tsai, C.-L. (2015), “Quantile Correlations and Quantile Autoregressive Modeling,” Journal of the American Statistical Association, 110, 246–261. DOI: 10.1080/01621459.2014.892007.
  • Li, Q., and Racine, J. S. (2007), Nonparametric Econometrics: Theory and Practice, Princeton University Press.
  • Li, X., Li, R., Xia, Z., and Xu, C. (2020), “Distributed Feature Screening Via Componentwise Debiasing,” Journal of Machine Learning Research, 21, 1–32.
  • Liu, Q., and Ihler, A. T. (2014), “Distributed Estimation, Information Loss and Exponential Families,” Advances in Neural Information Processing Systems, 1098–1106.
  • Marcu, O.-C., Costan, A., Antoniu, G. and Pérez-Hernández, M. S. (2016), “Spark Versus Flink: Understanding Performance in Big Data Analytics Frameworks,” in 2016 IEEE International Conference on Cluster Computing (CLUSTER), Taipei, Taiwan: IEEE, pp. 433–442. DOI: 10.1109/CLUSTER.2016.22.
  • Nguyen, N., Khan, M. M. H. and Wang, K. (2018), “Towards Automatic Tuning of Apache Spark Configuration,” in 2018 IEEE 11th International Conference on Cloud Computing (CLOUD), San Francisco, CA: IEEE, pp. 417–425. DOI: 10.1109/CLOUD.2018.00059.
  • Portnoy, S., Koenker, R. et al. (1997), “The Gaussian Hare and the Laplacian Tortoise: Computability of Squared-Error Versus Absolute-Error Estimators,” Statistical Science, 12, 279–300. DOI: 10.1214/ss/1030037960.
  • Reich, B. J., Fuentes, M., and Dunson, D. B. (2011), “Bayesian Spatial Quantile Regression,” Journal of the American Statistical Association, 106, 6–20. DOI: 10.1198/jasa.2010.ap09237.
  • Shao, J. (2003), Mathematical Statistics, New York: Springer-Verlag.
  • Silverman, B. W. (2018), Density Estimation for Statistics and Data Analysis, Routledge.
  • Simpson, D. G., Ruppert, D., and Carroll, R. J. (1992), “On One-Step GM Estimates and Stability of Inferences in Linear Regression,” Journal of the American Statistical Association, 87, 439–450. DOI: 10.1080/01621459.1992.10475224.
  • Tibshirani, R. (1996), “Regression Shrinkage and Selection Via the Lasso,” Journal of the Royal Statistical Society, Series B, 58, 267–288. DOI: 10.1111/j.2517-6161.1996.tb02080.x.
  • Vohra, D. (2016), “Apache Parquet,” in Practical Hadoop Ecosystem, Berkeley, CA: Springer, pp. 325–335.
  • Wang, H., Li, G., and Jiang, G. (2007), “Robust Regression Shrinkage and Consistent Variable Selection Through the Lad-Lasso,” Journal of Business & Economic Statistics, 25, 347–355.
  • Wang, X., Yang, Z., Chen, X., and Liu, W. (2019), “Distributed Inference for Linear Support Vector Machine,” Journal of Machine Learning Research, 20, 1–41.
  • Xu, G., Sit, T., Wang, L., and Huang, C.-Y. (2017), “Estimation and Inference of Quantile Regression for Survival Data Under Biased Sampling,” Journal of the American Statistical Association, 112, 1571–1586. DOI: 10.1080/01621459.2016.1222286.
  • Zaharia, M., Xin, R. S., Wendell, P., Das, T., Armbrust, M., Dave, A., Meng, X., Rosen, J., Venkataraman, S., Franklin, M. J., Ghoshi, A., Gonzalez, J., Shenker, S., and Ion, S. (2016), ‘‘Apache Spark: A Unified Engine for Big Data Processing,” Communications of the ACM, 59, 56–65. DOI: 10.1145/2934664.
  • Zhang, Y., Duchi, J. C., and Wainwright, M. J. (2013), “Communication-Efficient Algorithms for Statistical Optimization,” Journal of Machine Learning Research, 14, 3321–3363.
  • Zhong, W., Wan, C., and Zhang, W. (2021), “Estimation and Inference for Multi-Kink Quantile Regression,” Journal of Business & Economic Statistics, 1–17.
  • Zhu, X., Li, F., and Wang, H. (2021), “Least-Square Approximation for a Distributed System,” Journal of Computational and Graphical Statistics. DOI: 10.1080/10618600.2021.1923517.
  • Zou, H., and Li, R. (2008), “One-Step Sparse Estimates in Nonconcave Penalized Likelihood Models,” The Annals of Statistics, 36, 1509–1533.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.