2,621
Views
107
CrossRef citations to date
0
Altmetric
Theory and Methods

Residual Weighted Learning for Estimating Individualized Treatment Rules

, , &
Pages 169-187 | Received 01 Mar 2015, Published online: 03 May 2017

References

  • Allen, G. I. (2013), “Automatic Feature Selection via Weighted Kernels and Regularization,” Journal of Computational and Graphical Statistics, 22, 284–299.
  • An, L. T. H., and Tao, P. D. (1997), “Solving a Class of Linearly Constrained Indefinite Quadratic Problems by D. C. Algorithms,” Journal of Global Optimization, 11, 253–285.
  • Argyriou, A., Hauser, R., Micchelli, C. A., and Pontil, M. (2006), “A DC-Programming Algorithm for Kernel Selection,” in ICML, Vol. 148, ACM International Conference Proceeding Series, eds. W. W. Cohen, and A. Moore, New York: ACM, pp. 41–48.
  • Baron, G., Perrodeau, E., Boutron, I., and Ravaud, P. (2013), “Reporting of Analyses From Randomized Controlled Trials With Multiple Arms: A Systematic Review,” BMC Medicine, 11, 84.
  • Bartlett, P. L., and Mendelson, S. (2002), “Rademacher and Gaussian Complexities: Risk Bounds and Structural Results,” Journal of Machine Learning Research, 3, 463–482.
  • Breiman, L. (2001), “Random Forests,” Machine Learning, 45, 5–32.
  • Byrd, R. H., Lu, P., Nocedal, J., and Zhu, C. (1995), “A Limited Memory Algorithm for Bound Constrained Optimization,” SIAM Journal on Scientific and Statistical Computing, 16, 1190–1208.
  • Cohen, J. (1988), Statistical Power Analysis for the Behavioral Sciences, Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.
  • Collobert, R., Sinz, F., Weston, J., and Bottou, L. (2006), “Trading Convexity for Scalability,” in Proceedings of the 23rd International Conference on Machine Learning, ICML ’06, 201–208. New York: ACM.
  • Devroye, L., Györfi, L., and Lugosi, G. (1996), A Probabilistic Theory of Pattern Recognition, New York: Springer.
  • Dietterich, T. G., and Bakiri, G. (1995), “Solving Multiclass Learning Problems via Error-Correcting Output Codes,” Journal of Artificial Intelligence Research, 2, 263–286.
  • Fleming, T., and Harrington, D. (1991), Counting Processes and Survival Analysis, New York: Wiley.
  • Fung, G. M., and Mangasarian, O. L. (2004), “A Feature Selection Newton Method for Support Vector Machine Classification,” Computational Optimization and Applications, 28, 185–202.
  • Grandvalet, Y., and Canu, S. (2002), “Adaptive Scaling for Feature Selection in SVMs,” in Advances in Neural Information Processing System, eds. S. Becker, S. Thrun, and K. Obermayer, Cambridge, MA: MIT Press, pp. 553–560.
  • Gunter, L., Zhu, J., and Murphy, S. (2011), “Variable Selection for Qualitative Interactions,” Statistical Methodology, 8, 42–55.
  • Hastie, T., Tibshirani, R., and Friedman, J. H. (2001), The Elements of Statistical Learning, New York: Springer.
  • Havlir, D. V., Kendall, M. A., Ive, P., Kumwenda, J., Swindells, S., Qasba, S. S., Luetkemeyer, A. F., Hogg, E., Rooney, J. F., Wu, X., Hosseinipour, M. C., Lalloo, U., Veloso, V. G., Some, F. F., Kumarasamy, N., Padayatchi, N., Santos, B. R., Reid, S., Hakim, J., Mohapi, L., Mugyenyi, P., Sanchez, J., Lama, J. R., Pape, J. W., Sanchez, A., Asmelash, A., Moko, E., Sawe, F., Andersen, J., and Sanne, I. (2011), “Timing of Antiretroviral Therapy for HIV-1 Infection and Tuberculosis,” New England Journal of Medicine, 365, 1482–1491.
  • Hsu, C.-W., Chang, C.-C., and Lin, C.-J. (2003), “A Practical Guide to Support Vector Classification,” Technical Report, Department of Computer Science, National Taiwan University, Taipei 106, Taiwan.
  • Kimeldorf, G., and Wahba, G. (1971), “Some Results on Tchebycheffian Spline Functions,” Journal of Mathematical Analysis and Applications, 33, 82–95.
  • Lafferty, J., and Wasserman, L. (2008), “Rodeo: Sparse, Greedy Nonparametric Regression,” The Annals of Statistics, 36, 28–63.
  • Lee, Y., Lin, Y., and Wahba, G. (2004), “Multicategory Support Vector Machines, Theory, and Application to the Classification of Microarray Data and Satellite Radiance Data,” Journal of the American Statistical Association, 99, 67–81.
  • Lin, Y., and Zhang, H. H. (2006), “Component Selection and Smoothing in Multivariate Nonparametric Regression,” The Annals of Statistics, 34, 2272–2297.
  • Morales, J. L., and Nocedal, J. (2011), “Remark on ‘Algorithm 778: L-BFGS-B: Fortran Subroutines for Large-Scale Bound Constrained Optimization’,” ACM Transactions on Mathematical Software, 38, 7:1–7:4.
  • Murphy, S. A. (2003), “Optimal Dynamic Treatment Regimes,” Journal of the Royal Statistical Society, Series B, 65, 331–355.
  • ——— (2005), “An Experimental Design for the Development of Adaptive Treatment Strategies,” Statistics in Medicine, 24, 1455–1481.
  • Nocedal, J. (1980), “Updating Quasi-Newton Matrices With Limited Storage,” Mathematics of Computation, 35, 773–782.
  • Nocedal, J., and Wright, S. J. (2006), Numerical Optimization (2nd ed.), New York: Springer.
  • Qian, M., and Murphy, S. A. (2011), “Performance Guarantees for Individualized Treatment Rules,” The Annals of Statistics, 39, 1180–1210.
  • Rakotomamonjy, A. (2003), “Variable Selection Using svm Based Criteria,” Journal of Machine Learning Research, 3, 1357–1370.
  • Robins, J. M. (2004), “Optimal Structural Nested Models for Optimal Sequential Decisions,” in Proceedings of the Second Seattle Symposium in Biostatistics, Vol. 179 of Lecture Notes in Statistics, eds. D. Lin, and P. Heagerty, New York: Springer, pp. 189–326.
  • Rudin, W. (1987), Real and Complex Analysis (3rd ed.), New York: McGraw-Hill, Inc.
  • Schmidt, M. (2010), Graphical Model Structure Learning With l1-Regularization, Ph.D. thesis, The University of British Columbia.
  • Steinwart, I., and Christmann, A. (2008), Support Vector Machines, New York: Springer.
  • Steinwart, I., and Scovel, C. (2007), “Fast Rates for Support Vector Machines Using Gaussian Kernels,” The Annals of Statistics, 35, 575–607.
  • Therneau, T. M., Grambsch, P. M., and Fleming, T. R. (1990), “Martingale-Based Residuals for Survival Models,” Biometrika, 77, 147–160.
  • Tibshirani, R. (1994), “Regression Shrinkage and Selection Via the Lasso,” Journal of the Royal Statistical Society, Series B, 58, 267–288.
  • Treggiari, M. M., Retsch-Bogart, G., Mayer-Hamblett, N., Khan, U., Kulich, M., Kronmal, R., Williams, J., Hiatt, P., Gibson, R. L., Spencer, T., Orenstein, D., Chatfield, B. A., Froh, D. K., Burns, J. L., Rosenfeld, M., and Ramsey, B. W. (2011), “Comparative Efficacy and Safety of 4 Randomized Regimens to Treat Early Pseudomonas Aeruginosa Infection in Children With Cystic Fibrosis,” Archives of Pediatrics & Adolescent Medicine, 165, 847–856.
  • Treggiari, M. M., Rosenfeld, M., Mayer-Hamblett, N., Retsch-Bogart, G., Gibson, R. L., Williams, J., Emerson, J., Kronmal, R. A., and Ramsey, B. W. (2009), “Early Anti-Pseudomonal Acquisition in Young Patients With Cystic Fibrosis: Rationale and Design of the EPIC Clinical Trial and Observational Study,” Contemporary Clinical Trials, 30, 256–268.
  • Vapnik, V. N. (1998), Statistical Learning Theory, New York: Wiley.
  • Wang, L., Zhu, J., and Zou, H. (2008), “Hybrid Huberized Support Vector Machines for Microarray Classification and Gene Selection,” Bioinformatics, 24, 412–419.
  • Weston, J., Mukherjee, S., Chapelle, O., Pontil, M., Poggio, T., and Vapnik, V. (2000), “Feature Selection for SVMs,” in Advances in Neural Information Processing System, Cambridge, MA: MIT Press, pp. 668–674.
  • Wu, Y., and Liu, Y. (2007), “Robust Truncated-Hinge-Loss Support Vector Machines,” Journal of the American Statistical Association, 102, 974–983.
  • Yuille, A. L., and Rangarajan, A. (2003), “The Concave–Convex Procedure,” Neural Computation, 15, 915–936.
  • Zhang, B., Tsiatis, A. A., Laber, E. B., and Davidian, M. (2012), “A Robust Method for Estimating Optimal Treatment Regimes,” Biometrics, 68, 1010–1018.
  • Zhao, Y., Kosorok, M. R., and Zeng, D. (2009), “Reinforcement Learning Design for Cancer Clinical Trials,” Statistics in Medicine, 28, 3294–3315.
  • Zhao, Y., Zeng, D., Rush, A. J., and Kosorok, M. R. (2012), “Estimating Individualized Treatment Rules Using Outcome Weighted Learning,” Journal of the American Statistical Association, 107, 1106–1118.
  • Zhao, Y., Zeng, D., Socinski, M. A., and Kosorok, M. R. (2011), “Reinforcement Learning Strategies for Clinical Trials in Nonsmall Cell Lung Cancer,” Biometrics, 67, 1422–1433.
  • Zhao, Y.-Q., Zeng, D., Laber, E. B., and Kosorok, M. R. (2015), “New Statistical Learning Methods for Estimating Optimal Dynamic Treatment Regimes,” Journal of the American Statistical Association, 110, 583–598.
  • Zhu, J., and Hastie, T. (2004), “Classification of Gene Microarrays by Penalized Logistic Regression,” Biostatistics, 5, 427–443.
  • Zhu, J., Rosset, S., Hastie, T., and Tibshirani, R. (2003), “1-Norm Support Vector Machines,” in Advances in Neural Information Processing Systems, eds. S. Thrun, L. K. Saul, and B. Schölkopf (Vol. 16, pp. 49–56), Cambridge, MA: MIT.
  • Zou, H., and Hastie, T. (2005), “Regularization and Variable Selection via the Elastic Net,” Journal of the Royal Statistical Society, Series B, 67, 301–320.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.