Search in:

Journal of Computational and Graphical Statistics Volume 30, 2021 - Issue 1

Submit an article Journal homepage

577

Views

CrossRef citations to date

Altmetric

Articles

Optimal Sampling for Generalized Linear Models Under Measurement Constraints

Tao Zhanga Department of Statistics and Data Science, Cornell University, Ithaca, NYCorrespondence[email protected]
View further author information

Yang Ninga Department of Statistics and Data Science, Cornell University, Ithaca, NYView further author information

David Rupperta Department of Statistics and Data Science, Cornell University, Ithaca, NY;b School of Operations Research and Information Engineering, Cornell University, Ithaca, NYView further author information

Pages 106-114 | Received 18 Jul 2019, Accepted 01 Jun 2020, Published online: 20 Jul 2020

Cite this article
https://doi.org/10.1080/10618600.2020.1778483
CrossMark

Full Article
Figures & data
References
Supplemental
Citations
Metrics
Reprints & Permissions

References

Ai, M., Yu, J., Zhang, H., and Wang, H. (2018), “Optimal Subsampling Algorithms for Big Data Regressions,” arXiv no. 1806.06761.
Google Scholar
Aitkin, M. A., Aitkin, M., Francis, B., and Hinde, J. (2005), Statistical Modelling in GLIM 4 (Vol. 32), Oxford: OUP.
Google Scholar
Banerji, M., Lahav, O., Lintott, C. J., Abdalla, F. B., Schawinski, K., Bamford, S. P., Andreescu, D., Murray, P., Raddick, M. J., Slosar, A., and Szalay, A. (2010), “Galaxy Zoo: Reproducing Galaxy Morphologies via Machine Learning,” Monthly Notices of the Royal Astronomical Society, 406, 342–353. DOI: 10.1111/j.1365-2966.2010.16713.x.
Web of Science ®Google Scholar
Cai, T. T., and Guo, Z. (2018), “Semi-Supervised Inference for Explained Variance in High-Dimensional Linear Regression and Its Applications,” arXiv no. 1806.06179.
Google Scholar
Chakrabortty, A., and Cai, T. (2018), “Efficient and Adaptive Linear Regression in Semi-Supervised Settings,” The Annals of Statistics, 46, 1541–1572. DOI: 10.1214/17-AOS1594.
Web of Science ®Google Scholar
Chapelle, O., Schlkopf, B., and Zien, A. (2010), Semi-Supervised Learning (1st ed.), Cambridge, MA: The MIT Press.
Google Scholar
Drineas, P., Magdon-Ismail, M., Mahoney, M. W., and Woodruff, D. P. (2012), “Fast Approximation of Matrix Coherence and Statistical Leverage,” Journal of Machine Learning Research, 13, 3475–3506.
Web of Science ®Google Scholar
Drineas, P., and Mahoney, M. W. (2016), “RandNLA: Randomized Numerical Linear Algebra,” Communications of the ACM, 59, 80–90. DOI: 10.1145/2842602.
Web of Science ®Google Scholar
Drineas, P., Mahoney, M. W., and Muthukrishnan, S. (2006), “Sampling Algorithms for l2 Regression and Applications,” in Proceedings of the Seventeenth Annual ACM-SIAM Symposium on Discrete Algorithm, SODA ’06, Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, pp. 1127–1136.
Google Scholar
Drineas, P., Mahoney, M. W., Muthukrishnan, S., and Sarlós, T. (2011), “Faster Least Squares Approximation,” Numerische Mathematik, 117, 219–249. DOI: 10.1007/s00211-010-0331-6.
Web of Science ®Google Scholar
Hall, P., and Heyde, C. C. (2014), Martingale Limit Theory and Its Application, New York: Academic Press.
Google Scholar
Hamidieh, K. (2018), “A Data-Driven Statistical Model for Predicting the Critical Temperature of a Superconductor,” Computational Materials Science, 154, 346–354. DOI: 10.1016/j.commatsci.2018.07.052.
Web of Science ®Google Scholar
Huber, P. (2004), Robust Statistics, Wiley Series in Probability and Statistics—Applied Probability and Statistics Section Series, Chichester: Wiley.
Google Scholar
Khuri, A. I., Mukherjee, B., Sinha, B. K., and Ghosh, M. (2006), “Design Issues for Generalized Linear Models: A Review,” Statistical Science, 21, 376–399. DOI: 10.1214/088342306000000105.
Web of Science ®Google Scholar
Kiefer, J. (1959), “Optimum Experimental Designs,” Journal of the Royal Statistical Society, Series B, 21, 272–304. DOI: 10.1111/j.2517-6161.1959.tb00338.x.
Google Scholar
Ma, P., Mahoney, M. W., and Yu, B. (2015), “A Statistical Perspective on Algorithmic Leveraging,” The Journal of Machine Learning Research, 16, 861–911.
Web of Science ®Google Scholar
McCullagh, P., and Nelder, J. (1989), Generalized Linear Models, Chapman and Hall/CRC Monographs on Statistics and Applied Probability Series (2nd ed.), London: Chapman & Hall.
Google Scholar
Pukelsheim, F. (2006), Optimal Design of Experiments, Philadelphia, PA: SIAM.
Google Scholar
Raskutti, G., and Mahoney, M. W. (2016), “A Statistical Perspective on Randomized Sketching for Ordinary Least-Squares,” The Journal of Machine Learning Research, 17, 7508–7538.
Web of Science ®Google Scholar
Reiman, D. M., and Göhre, B. E. (2019), “Deblending Galaxy Superpositions With Branched Generative Adversarial Networks,” Monthly Notices of the Royal Astronomical Society, 485, 2617–2627. DOI: 10.1093/mnras/stz575.
Web of Science ®Google Scholar
Rousseeuw, P. J., and Hubert, M. (2011), “Robust Statistics for Outlier Detection,” Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 1, 73–79. DOI: 10.1002/widm.2.
Web of Science ®Google Scholar
Ting, D., and Brochu, E. (2018), “Optimal Subsampling With Influence Functions,” in Advances in Neural Information Processing Systems, pp. 3650–3659.
Google Scholar
van der Vaart, A. W. (2000), Asymptotic Statistics (Vol. 3), Cambridge: Cambridge University Press.
Google Scholar
Wang, H., Yang, M., and Stufken, J. (2019), “Information-Based Optimal Subdata Selection for Big Data Linear Regression,” Journal of the American Statistical Association, 114, 393–405. DOI: 10.1080/01621459.2017.1408468.
Web of Science ®Google Scholar
Wang, H., Zhu, R., and Ma, P. (2018), “Optimal Subsampling for Large Sample Logistic Regression,” Journal of the American Statistical Association, 113, 829–844. DOI: 10.1080/01621459.2017.1292914.
PubMed Web of Science ®Google Scholar
Wang, Y., Yu, A. W., and Singh, A. (2017), “On Computationally Tractable Selection of Experiments in Measurement-Constrained Regression Models,” The Journal of Machine Learning Research, 18, 5238–5278.
Web of Science ®Google Scholar
Xu, P., Yang, J., Roosta, F., Ré, C., and Mahoney, M. W. (2016), “Sub-Sampled Newton Methods With Non-Uniform Sampling,” in Advances in Neural Information Processing Systems, pp. 3000–3008.
Google Scholar
Zhang, A., Brown, L. D., and Cai, T. T. (2016), “Semi-Supervised Inference: General Theory and Estimation of Means,” arXiv no. 1606.07268.
Google Scholar
Zhu, X. J. (2005), “Semi-Supervised Learning Literature Survey,” Technical Report, University of Wisconsin-Madison Department of Computer Sciences.
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Optimal Sampling for Generalized Linear Models Under Measurement Constraints

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

Optimal Sampling for Generalized Linear Models Under Measurement Constraints

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date