Theory and Method

On Measuring and Correcting the Effects of Data Mining and Model Selection

Pages 120-131 | Received 01 Dec 1996, Published online: 17 Feb 2012

Read on this site (30)

Hengrui Luo, Younghyun Cho, James W. Demmel, Xiaoye S. Li & Yang Liu. (2024) Hybrid Parameter Search and Dynamic Model Selection for Mixed-Variable Bayesian Optimization. Journal of Computational and Graphical Statistics 0:0, pages 1-14.
Youhan Lu, Yushen Dong, Juan Hu & Yichao Wu. (2024) A Unified Approach to Variable Selection for Partially Linear Models. Journal of Computational and Graphical Statistics 33:1, pages 250-260.
Akifumi Okuno & Keisuke Yano. (2023) A Generalization Gap Estimation for Overparameterized Models via the Langevin Functional Variance. Journal of Computational and Graphical Statistics 32:4, pages 1287-1295.
Ying Hung, Li-Hsiang Lin & C. F. Jeff Wu. (2023) Optimal Simulator Selection. Journal of the American Statistical Association 118:542, pages 1264-1271.
Jakob A. Dambon, Fabio Sigrist & Reinhard Furrer. (2022) Joint variable selection of both fixed and random effects for Gaussian process-based spatially varying coefficient models. International Journal of Geographical Information Science 36:12, pages 2525-2548.
Hsin-Cheng Huang, Noel Cressie, Andrew Zammit-Mangion & Guowen Huang. (2021) False Discovery Rates to Detect Signals from Incomplete Spatially Aggregated Data. Journal of Computational and Graphical Statistics 30:4, pages 1081-1094.
Yichao Wu. (2021) Can’t Ridge Regression Perform Variable Selection? Technometrics 63:2, pages 263-271.
Zhihuang Yang & Jiahua Chen. (2020) Small area mean estimation after effect clustering. Journal of Applied Statistics 47:4, pages 602-623.
Xiaotong Shen & Hsin-Cheng Huang. (2020) Discussion of “From Fixed-X to Random-X Regression: Bias-Variance Decompositions, Covariance Penalties, and Prediction Error Estimation”. Journal of the American Statistical Association 115:529, pages 152-156.
Ehsan Ahmadi, Andres Garcia-Arce, Dale T. Masel, Eric Reich, Jason Puckey & Rebecca Maff. (2019) A metaheuristic-based stacking model for predicting the risk of patient no-show and late cancellation for neurology appointments. IISE Transactions on Healthcare Systems Engineering 9:3, pages 272-291.
Ryan J. Tibshirani & Saharon Rosset. (2019) Excess Optimism: How Biased is the Apparent Error of an Estimator Tuned by SURE? Journal of the American Statistical Association 114:526, pages 697-712.
Jiming Jiang & Thuan Nguyen. (2019) A discussion of prior-based Bayesian information criterion (PBIC). Statistical Theory and Related Fields 3:1, pages 17-18.
Yu-Mei Chang, Pao-Sheng Shen & Chun-Shu Chen. (2017) Adaptive-Cox model averaging for right-censored data. Communications in Statistics - Theory and Methods 46:19, pages 9364-9376.
Diego Ardila, Dorsa Sanadgol, Peter Cauwels & Didier Sornette. (2017) Identification and critical time forecasting of real estate bubbles in the USA. Quantitative Finance 17:4, pages 613-631.
Yongli Zhang & Xiaotong Shen. (2015) Adaptive Modeling Procedure Selection by Data Perturbation. Journal of Business & Economic Statistics 33:4, pages 541-551.
Sung Won Han, Hua Zhong & Mary Putt. (2015) An Efficient Operator for the Change Point Estimation in Partial Spline Model. Communications in Statistics - Simulation and Computation 44:5, pages 1171-1186.
Chun-Shu Chen, Yun-Huan Lee & Hung-Wei Hsu. (2014) Adaptive order selection for autoregressive models. Journal of Statistical Computation and Simulation 84:9, pages 1963-1974.
Sung Wan Han, Rickson C. Mesquita, Theresa M. Busch & Mary E. Putt. (2014) A method for choosing the smoothing parameter in a semi-parametric model for detecting change-points in blood flow. Journal of Applied Statistics 41:1, pages 26-45.
Jeffrey C. Sklar, Junqing Wu, Wendy Meiring & Yuedong Wang. (2013) Nonparametric Regression With Basis Selection From Multiple Libraries. Technometrics 55:2, pages 189-201.
Xianchao Xie, S.C. Kou & Lawrence D. Brown. (2012) SURE Estimates for a Heteroscedastic Hierarchical Model. Journal of the American Statistical Association 107:500, pages 1465-1479.
(2012) Book Reviews. Technometrics 54:3, pages 319-331.
Yun-Huan Lee & Chun-Shu Chen. (2012) Autoregressive model selection based on a prediction perspective. Journal of Applied Statistics 39:4, pages 913-922.
Marc Aerts, Niel Hens & Jeffrey S. Simonoff. (2010) Model selection in regression based on pre-smoothing. Journal of Applied Statistics 37:9, pages 1455-1472.
A.W. Jayawardena, P.C. Xu, F.L. Tsang & W.K. Li. (2006) Determining the structure of a radial basis function network for prediction of nonlinear hydrological time series. Hydrological Sciences Journal 51:1, pages 21-44.
John H. Kalivas. (2005) Multivariate Calibration, an Overview. Analytical Letters 38:14, pages 2259-2279.
Li Jianlong, Qi Jiaguo, Zhao Dehua, Jiang Ping & Xu Sheng. (2005) Establishing grassland yield models using Projection Pursuit Regression Method. New Zealand Journal of Agricultural Research 48:1, pages 47-55.
Edward I. George. (2000) The Variable Selection Problem. Journal of the American Statistical Association 95:452, pages 1304-1308.
Zheng Wang. (2000) An algorithm for generalized monotonic smoothing. Journal of Applied Statistics 27:4, pages 495-507.
Brent Johnson, Bradley P. Carlin & James S. Hodges. (1999) Cross-Study Hierarchical Modeling of Stratified Clinical Trial Data. Journal of Biopharmaceutical Statistics 9:4, pages 617-640.

Articles from other publishers (239)

Maarten Jansen. (2024) Information criteria for structured parameter selection in high-dimensional tree and graph models. Digital Signal Processing 148, pages 104437.
Hui Chen, Winston Wei Dou & Leonid Kogan. (2024) Measuring “Dark Matter” in Asset Pricing Models. The Journal of Finance 79:2, pages 843-902.
Liya Fu, You-Gan Wang & Jinran Wu. 2024. Modeling and Analysis of Longitudinal Data, pages 173-221.
Sander Greenland. 2019. Handbook of Epidemiology, pages 1-76.
Liyuan Cui, Yongmiao Hong, Yingxing Li & Junhui Wang. (2023) A Regularized High-Dimensional Positive Definite Covariance Estimator with High-Frequency Data. Management Science.
Kazuaki Murayama & Shuichi Kawano. (2023) Sparse Bayesian Learning With Weakly Informative Hyperprior and Extended Predictive Information Criterion. IEEE Transactions on Neural Networks and Learning Systems 34:9, pages 5856-5868.
Mathias Harrer, David Daniel Ebert, Paula Kuper, Sarah Paganini, Sandra Schlicker, Yannik Terhorst, Benedikt Reuter, Lasse B. Sander & Harald Baumeister. (2023) Predicting heterogeneous treatment effects of an Internet-based depression intervention for patients with chronic back pain: Secondary analysis of two randomized controlled trials. Internet Interventions 33, pages 100634.
Wei Liu, Huazhen Lin, Li Liu, Yanyuan Ma, Ying Wei & Yi Li. (2023) Supervised structural learning of semiparametric regression on high‐dimensional correlated covariates with applications to eQTL studies. Statistics in Medicine 42:18, pages 3145-3163.
Xinyang Liu, Anyu Liu, Jason Li Chen & Gang Li. (2023) Impact of decomposition on time series bagging forecasting performance. Tourism Management 97, pages 104725.
Mu Qiao, Yanchun Liang, Adriano Tavares & Xiaohu Shi. (2023) Multilayer Perceptron Network Optimization for Chaotic Time Series Modeling. Entropy 25:7, pages 973.
Chrysoula D. Kappatou, James Odgers, Salvador García-Muñoz & Ruth Misener. (2023) An Optimization Approach Coupling Preprocessing with Model Regression for Enhanced Chemometrics. Industrial & Engineering Chemistry Research.
Qingying Zong & Jonathan R. Bradley. (2022) Criterion constrained Bayesian hierarchical models. TEST 32:1, pages 294-320.
Ayaka Sakata. (2023) Prediction errors for penalized regressions based on generalized approximate message passing. Journal of Physics A: Mathematical and Theoretical 56:4, pages 043001.
Dimitrina S. Dimitrova, Vladimir K. Kaishev, Andrea Lattuada & Richard J. Verrall. (2023) Geometrically designed variable knot splines in generalized (non-)linear models. Applied Mathematics and Computation 436, pages 127493.
Yunquan Song, Yuqi Su & Zhijian Wang. (2022) Variable Selection of Spatial Logistic Autoregressive Model with Linear Constraints. Entropy 24:11, pages 1660.
Bastien Marquis & Maarten Jansen. (2022) Information criteria bias correction for group selection. Statistical Papers 63:5, pages 1387-1414.
Daniele Durante, Tristan Guillot, Luciano Iess, David J. Stevenson, Christopher R. Mankovich, Steve Markham, Eli Galanti, Yohai Kaspi, Marco Zannoni, Luis Gomez Casajus, Giacomo Lari, Marzia Parisi, Dustin R. Buccino, Ryan S. Park & Scott J. Bolton. (2022) Juno spacecraft gravity measurements provide evidence for normal modes of Jupiter. Nature Communications 13:1.
Zachary M. Sparrow, Brian G. Ernst, Trine K. Quady & Robert A. DiStasio Jr. (2022) Uniting Nonempirical and Empirical Density Functional Approximation Strategies Using Constraint-Based Regularization. The Journal of Physical Chemistry Letters 13:30, pages 6896-6904.
Jens Thomas & Mathias Lipka. (2022) A simple data-driven method to optimize the penalty strengths of penalized models and its application to non-parametric smoothing. Monthly Notices of the Royal Astronomical Society 514:4, pages 6203-6214.
Eric Wolsztynski, Finbarr O’Sullivan & Janet F. Eary. (2022) Spatially coherent modeling of 3D FDG-PET data for assessment of intratumoral heterogeneity and uptake gradients. Journal of Medical Imaging 9:04.
Jacob D. Pilawa, Emily R. Liepold, Silvana C. Delgado Andrade, Jonelle L. Walsh, Chung-Pei Ma, Matthew E. Quenneville, Jenny E. Greene & John P. Blakeslee. (2022) The MASSIVE Survey. XVII. A Triaxial Orbit-based Determination of the Black Hole Mass and Intrinsic Shape of Elliptical Galaxy NGC 2693. The Astrophysical Journal 928:2, pages 178.
René-Marcel Kruse, Alexander Silbersdorff & Benjamin Säfken. (2022) Model averaging for linear mixed models via augmented Lagrangian. Computational Statistics & Data Analysis 167, pages 107351.
Stephen J. Richards. (2022) Allowing for shocks in portfolio mortality models. British Actuarial Journal 27.
Gregory C. Reinsel, Raja P. Velu & Kun Chen. 2022. Multivariate Reduced-Rank Regression, pages 311-328.
Masao Ueki & Gen Tamiya. (2021) Smooth-threshold multivariate genetic prediction incorporating gene–environment interactions. G3 Genes|Genomes|Genetics 11:12.
P Moulik, V Lekic, B Romanowicz, Z Ma, A Schaeffer, T Ho, E Beucler, E Debayle, A Deuss, S Durand, G Ekström, S Lebedev, G Masters, K Priestley, J Ritsema, K Sigloch, J Trampert & A M Dziewonski. (2022) Global reference seismological data sets: multimode surface wave dispersion. Geophysical Journal International 228:3, pages 1808-1849.
Zixin Lin. (2021) Degrees of Freedom in Functional Principal Components Analysis. Mathematical Problems in Engineering 2021, pages 1-11.
Yongxin Liu, Peng Zeng & Lu Lin. (2020) Degrees of freedom for regularized regression with Huber loss and linear constraints. Statistical Papers 62:5, pages 2383-2405.
Matthieu Lesnoff, Jean‐Michel Roger & Douglas N. Rutledge. (2021) Monte Carlo methods for estimating Mallows's Cp and AIC criteria for PLSR models. Illustration on agronomic spectroscopic NIR data. Journal of Chemometrics 35:10.
F.J. Martinez-de-Pison, J. Ferreiro, E. Fraile & A. Pernia-Espinoza. (2021) A comparative study of six model complexity metrics to search for parsimonious models with GAparsimony R Package. Neurocomputing 452, pages 317-332.
Peter Hettegger, Klemens Vierlinger & Andreas Weinhaeusel. (2021) Random rotation for identifying differentially expressed genes with linear models following batch effect correction. Bioinformatics 37:15, pages 2142-2149.
Alejandro Rodríguez-Collado & Cristina Rueda. (2021) A simple parametric representation of the Hodgkin-Huxley model. PLOS ONE 16:7, pages e0254152.
Carl Schmertmann. (2021) D-splines: Estimating rate schedules using high-dimensional splines with empirical demographic penalties. Demographic Research 44, pages 1085-1114.
Masao Ueki. (2021) Testing conditional mean through regression model sequence using Yanai’s generalized coefficient of determination. Computational Statistics & Data Analysis 158, pages 107168.
Seung Jun Shin & Yichao Wu. 2014. Wiley StatsRef: Statistics Reference Online, pages 1-7.
Mathias Lipka & Jens Thomas. (2021) A novel approach to optimize the regularization and evaluation of dynamical models using a model selection framework. Monthly Notices of the Royal Astronomical Society 504:3, pages 4599-4625.
María José Lombardía, Esther López-Vizcaíno & Cristina Rueda. (2020) Selection model for domains across time: application to labour force survey by economic activities. TEST 30:1, pages 228-254.
Jiming Jiang & Thuan Nguyen. 2021. Linear and Generalized Linear Mixed Models and Their Applications, pages 63-172.
Paul H.C. Eilers. 2007. Encyclopedia of Analytical Chemistry, pages 1-12.
Stephen C. Newbold & Robert J. Johnston. (2020) Valuing non-market valuation studies using meta-analysis: A demonstration using estimates of willingness-to-pay for water quality improvements. Journal of Environmental Economics and Management 104, pages 102379.
Mykola Dimura, Thomas-Otavio Peulen, Hugo Sanabria, Dmitro Rodnin, Katherina Hemmen, Christian A. Hanke, Claus A. M. Seidel & Holger Gohlke. (2020) Automated and optimally FRET-assisted structural modeling. Nature Communications 11:1.
Benjamin Säfken & Thomas Kneib. (2019) Conditional covariance penalties for mixed models. Scandinavian Journal of Statistics 47:3, pages 990-1010.
Samprit Chatterjee & Jeffrey S. Simonoff. 2020. Handbook of Regression Analysis With Applications in R, pages 337-342.
Zhikun Gao, Yanlin Tang, Huixia Judy Wang, Guangying K. Wu & Jeff Lin. (2020) Automatic identification of curve shapes with applications to ultrasonic vocalization. Computational Statistics & Data Analysis 148, pages 106956.
Xiaoying Tian. (2020) Prediction error after model search. The Annals of Statistics 48:2.
Christopher M. Thomas, Bo Dong & Keith Haines. (2020) Inverse Modeling of Global and Regional Energy and Water Cycle Fluxes using Earth Observation Data. Journal of Climate 33:5, pages 1707-1723.
Yongxin Liu, Peng Zeng & Lu Lin. (2020) Generalized ℓ1-penalized quantile regression with linear constraints. Computational Statistics & Data Analysis 142, pages 106819.
Rahul Mazumder & Haolei Weng. (2020) Computing the degrees of freedom of rank-regularized estimators and cousins. Electronic Journal of Statistics 14:1.
Mineaki Ohishi, Hirokazu Yanagihara & Yasunori Fujikoshi. (2020) A fast algorithm for optimizing ridge parameters in a generalized ridge regression by minimizing a model selection criterion. Journal of Statistical Planning and Inference 204, pages 187-205.
John H. Kalivas & Steven D. Brown. 2020. Comprehensive Chemometrics, pages 213-247.
Paul H.C. Eilers. 2020. Comprehensive Chemometrics, pages 635-648.
Mineaki Ohishi, Hirokazu Yanagihara & Hirofumi Wakaki. 2020. Intelligent Decision Technologies, pages 267-278.
Bastien Marquis & Maarten Jansen. 2020. Nonparametric Statistics, pages 357-365.
Ben Van Calster, Jan Y. Verbakel, Evangelia Christodoulou, Ewout W. Steyerberg & Gary S. Collins. (2019) Statistics versus machine learning: definitions are interesting (but understanding, methodology, and reporting are more important). Journal of Clinical Epidemiology 116, pages 137-138.
Rodrigo F. de Mello, Chaitanya Manapragada & Albert Bifet. (2019) Measuring the Shattering coefficient of Decision Tree models. Expert Systems with Applications 137, pages 443-452.
Alexander Giessing & Xuming He. (2019) On the predictive risk in misspecified quantile regression. Journal of Econometrics 213:1, pages 235-260.
Khaiwal Ravindra, Preety Rattan, Suman Mor & Ashutosh Nath Aggarwal. (2019) Generalized additive models: Building evidence of air pollution, climate change and human health. Environment International 132, pages 104987.
G P Fedorov & A V Ustinov. (2019) Automated analysis of single-tone spectroscopic data for cQED systems. Quantum Science and Technology 4:4, pages 045009.
Chixiang Chen, Biyi Shen, Lijun Zhang, Yuan Xue & Ming Wang. (2019) Empirical-Likelihood-Based Criteria for Model Selection on Marginal Analysis of Longitudinal Data With Dropout Missingness. Biometrics 75:3, pages 950-965.
Juan Liu, Emmanouil Z. Psarakis, Yang Feng & Ioannis Stamos. (2019) A Kronecker Product Model for Repeated Pattern Detection on 2D Urban Images. IEEE Transactions on Pattern Analysis and Machine Intelligence 41:9, pages 2266-2272.
I-Ping Tu, Su-Yun Huang & Dai-Ni Hsieh. (2019) The generalized degrees of freedom of multilinear principal component analysis. Journal of Multivariate Analysis 173, pages 26-37.
Jun Liao, Guohua Zou & Yan Gao. (2019) Spatial Mallows model averaging for geostatistical models. Canadian Journal of Statistics 47:3, pages 336-351.
Fan Wang, Chunfeng Lian, Zhengwang Wu, Han Zhang, Tengfei Li, Yu Meng, Li Wang, Weili Lin, Dinggang Shen & Gang Li. (2019) Developmental topography of cortical thickness during infancy. Proceedings of the National Academy of Sciences 116:32, pages 15855-15860.
Maarten Jansen. 2014. Wiley StatsRef: Statistics Reference Online, pages 1-8.
Gerda Claeskens & Maarten Jansen. (2019) Discussion on “Model Confidence Bounds for Variable Selection” by Yang Li, Yuetian Luo, Davide Ferrari, Xiaonan Hu, and Yichen Qin. Biometrics 75:2, pages 404-406.
Fangyao Li, Christopher M. Triggs, Bogdan Dumitrescu & Ciprian Doru Giurcăneanu. (2019) The matching pursuit algorithm revisited: A variant for big data and new stopping rules. Signal Processing 155, pages 170-181.
Steve Miller & Richard Startz. (2019) Feasible generalized least squares using support vector regression. Economics Letters 175, pages 28-31.
Allison Meisner, Chirag R. Parikh & Kathleen F. Kerr. (2018) Using ordinal outcomes to construct and select biomarker combinations for single-level prediction. Diagnostic and Prognostic Research 2:1.
Angel J. Duran & Angel P. del Pobil. (2018) Predicting the internal model of a robotic system from its morphology. Robotics and Autonomous Systems 110, pages 33-43.
Carsten F. Dormann, Justin M. Calabrese, Gurutzeta Guillera‐Arroita, Eleni Matechou, Volker Bahn, Kamil Bartoń, Colin M. Beale, Simone Ciuti, Jane Elith, Katharina Gerstner, Jérôme Guelat, Petr Keil, José J. Lahoz‐Monfort, Laura J. Pollock, Björn Reineking, David R. Roberts, Boris Schröder, Wilfried Thuiller, David I. Warton, Brendan A. Wintle, Simon N. Wood, Rafael O. Wüest & Florian Hartig. (2018) Model averaging in ecology: a review of Bayesian, information‐theoretic, and tactical approaches for predictive inference. Ecological Monographs 88:4, pages 485-504.
Chun‐Shu Chen & Chung‐Wei Shen. (2018) Model selection based on resampling approaches for cluster longitudinal data with missingness in outcomes. Statistics in Medicine 37:20, pages 2982-2997.
A. Pernía-Espinoza, J. Fernandez-Ceniceros, J. Antonanzas, R. Urraca & F.J. Martinez-de-Pison. (2018) Stacking ensemble with parsimonious base models to improve generalization capability in the characterization of steel bolted components. Applied Soft Computing 70, pages 737-750.
Xiaoli Gao. (2016) A flexible shrinkage operator for fussy grouped variable selection. Statistical Papers 59:3, pages 985-1008.
Paul H. C. Eilers. (2018) The truth about the effective dimension. Statistica Neerlandica 72:3, pages 201-209.
Dazhi Yang, Jan Kleissl, Christian A. Gueymard, Hugo T.C. Pedro & Carlos F.M. Coimbra. (2018) History and trends in solar irradiance and PV power forecasting: A preliminary assessment and review using text mining. Solar Energy 168, pages 60-101.
Henry WJ Reeve & Gavin Brown. (2018) Diversity and degrees of freedom in regression ensembles. Neurocomputing 298, pages 55-68.
Ayaka Sakata. (2018) Estimator of prediction error based on approximate message passing for penalized linear regression. Journal of Statistical Mechanics: Theory and Experiment 2018:6, pages 063404.
Ewout W. Steyerberg, Hajime Uno, John P.A. Ioannidis, Ben van Calster, Chinedu Ukaegbu, Tara Dhingra, Sapna Syngal & Fay Kastrinos. (2018) Poor performance of clinical prediction models: the harm of commonly applied methods. Journal of Clinical Epidemiology 98, pages 133-143.
J. G. Liao, Joseph E. Cavanaugh & Timothy L. McMurry. (2018) Extending AIC to best subset regression. Computational Statistics 33:2, pages 787-806.
Frederik Riis Mikkelsen & Niels Richard Hansen. (2018) Degrees of freedom for piecewise Lipschitz estimators. Annales de l'Institut Henri Poincaré, Probabilités et Statistiques 54:2.
Stephen C. Newbold, Patrick J. Walsh, D. Matthew Massey & Julie Hewitt. (2018) Using structural restrictions to achieve theoretical consistency in benefit transfers. Environmental and Resource Economics 69:3, pages 529-553.
Marvin Höge, Thomas Wöhling & Wolfgang Nowak. (2018) A Primer for Model Selection: The Decisive Role of Model Complexity. Water Resources Research 54:3, pages 1688-1715.
Gary Venter & Şule Şahın. (2017) Parsimonious Parameterization of Age-Period-Cohort Models by Bayesian Shrinkage. ASTIN Bulletin 48:1, pages 89-110.
John Elder. 2018. Handbook of Statistical Analysis and Data Mining Applications, pages 705-718.
David Fletcher. 2018. Model Averaging, pages 57-97.
David Fletcher. 2018. Model Averaging, pages 1-29.
F. J. Martinez-de-Pison, R. Gonzalez-Sendino, J. Ferreiro, E. Fraile & A. Pernia-Espinoza. 2018. Hybrid Artificial Intelligent Systems, pages 62-73.
Dominique Fourdrinier, William E. Strawderman & Martin T. Wells. 2018. Shrinkage Estimation, pages 237-276.
Peng Zeng, Qinqin Hu & Xiaoyu Li. (2017) Geometry and Degrees of Freedom of Linearly Constrained Generalized Lasso. Scandinavian Journal of Statistics 44:4, pages 989-1008.
Philip T. Reiss, Lei Huang, Pei-Shien Wu, Huaihou Chen & Stan Colcombe. (2017) Pointwise Influence Matrices for Functional-Response Regression. Biometrics 73:4, pages 1092-1101.
Ruben Urraca, Andres Sanz-Garcia, Julio Fernandez-Ceniceros, Alpha Pernia-Espinoza & Francisco Javier Martinez-De-Pison. (2017) Improving hotel room demand forecasting with a hybrid GA-SVR methodology based on skewed data transformation, feature selection and parsimony tuning. Logic Journal of the IGPL 25:6, pages 877-889.
Zheng Ning, Youngjo Lee, Peter K. Joshi, James F. Wilson, Yudi Pawitan & Xia Shen. (2017) A Selection Operator for Summary Association Statistics Reveals Allelic Heterogeneity of Complex Traits. The American Journal of Human Genetics 101:6, pages 903-912.
Qiong Zhang & Yongjia Song. (2017) Moment-Matching-Based Conjugacy Approximation for Bayesian Ranking and Selection. ACM Transactions on Modeling and Computer Simulation 27:4, pages 1-23.
María José Lombardía, Esther López-Vizcaíno & Cristina Rueda. (2017) Mixed Generalized Akaike Information Criterion for Small Area Models. Journal of the Royal Statistical Society Series A: Statistics in Society 180:4, pages 1229-1252.
Jun Liao, Guohua Zou & Yan Gao. (2017) Optimal model averaging by data perturbation for spatial noisy data. Environmental and Ecological Statistics 24:3, pages 415-431.
Wenlin Dai, Tiejun Tong & Lixing Zhu. (2017) On the Choice of Difference Sequence in a Unified Framework for Variance Estimation in Nonparametric Regression. Statistical Science 32:3.
Amir Hasan Kakaee, Behrooz Mashadi & Mostafa Ghajar. (2016) A novel volumetric efficiency model for spark ignition engines equipped with variable valve timing and variable valve lift Part 1: model development. Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering 231:2, pages 175-191.
Pan Shang & Lingchen Kong. (2017) On the Degrees of Freedom of Mixed Matrix Regression. Mathematical Problems in Engineering 2017, pages 1-8.
Charles-Alban Deledalle, Nicolas Papadakis, Joseph Salmon & Samuel Vaiter. (2017) CLEAR: Covariant LEAst-Square Refitting with Applications to Image Restoration. SIAM Journal on Imaging Sciences 10:1, pages 243-284.
Gary Venter. 2017. Actuarial Sciences and Quantitative Finance, pages 3-23.
Francisco Javier Martinez-de-Pison, Esteban Fraile-Garcia, Javier Ferreiro-Cabello, Rubén Gonzalez & Alpha Pernia. 2017. International Joint Conference SOCO’16-CISIS’16-ICEUTE’16, pages 201-210.
A Sakata. (2016) Evaluation of generalized degrees of freedom for sparse estimation by replica method. Journal of Statistical Mechanics: Theory and Experiment 2016:12, pages 123302.
Cristina Rueda, Miguel A. Fernández, Sandra Barragán, Kanti V. Mardia & Shyamal D. Peddada. (2016) Circular Piecewise Regression with Applications to Cell-Cycle Data. Biometrics 72:4, pages 1266-1274.
Ming Yuan. (2016) Degrees of freedom in low rank matrix estimation. Science China Mathematics 59:12, pages 2485-2502.
Jin‐Hua Chen, Chun‐Shu Chen, Meng‐Fan Huang & Hung‐Chih Lin. (2016) Estimating the Probability of Rare Events Occurring Using a Local Model Averaging. Risk Analysis 36:10, pages 1855-1870.
Alessio Sancetta. (2016) Greedy algorithms for prediction. Bernoulli 22:2.
Shakir Ali, Adlul Islam, P.K. Mishra & Alok K. Sikka. (2016) Green-Ampt approximations: A comprehensive analysis. Journal of Hydrology 535, pages 340-355.
Masao Ueki & Gen Tamiya. (2016) Smooth-Threshold Multivariate Genetic Prediction with Unbiased Model Selection. Genetic Epidemiology 40:3, pages 233-243.
P. Moulik & G. Ekström. (2016) The relationships between large‐scale variations in shear velocity, density, and compressional velocity in the Earth's mantle. Journal of Geophysical Research: Solid Earth 121:4, pages 2737-2771.
Xinyu Zhang, Hua Liang, Anna Liu, David Ruppert & Guohua Zou. (2015) Selection Strategy for Covariance Structure of Random Effects in Linear Mixed‐effects Models. Scandinavian Journal of Statistics 43:1, pages 275-291.
Parviz Shahbazikhah, John H. Kalivas, Erik Andries & Trevor O'Loughlin. (2016) Using the L1 norm to select basis set vectors for multivariate calibration and calibration updating. Journal of Chemometrics 30:3, pages 109-120.
R. Urraca, J. Antonanzas, M. Alia-Martinez, F.J. Martinez-de-Pison & F. Antonanzas-Torres. (2016) Smart baseline models for solar irradiation forecasting. Energy Conversion and Management 108, pages 539-548.
Chong You, Samuel Müller & John T. Ormerod. (2014) On generalized degrees of freedom with application in linear mixed models selection. Statistics and Computing 26:1-2, pages 199-210.
Xiyang Zhi & Feng Xue. A novel approach to blind deconvolution based on generalized Akaike’s information criterion.
Esin Karahan, Pedro A. Rojas-Lopez, Maria L. Bringas-Vega, Pedro A. Valdes-Hernandez & Pedro A. Valdes-Sosa. (2015) Tensor Analysis and Fusion of Multimodal Brain Images. Proceedings of the IEEE 103:9, pages 1531-1559.
Kei Hirose & Michio Yamamoto. (2014) Sparse estimation via nonconcave penalized likelihood in factor analysis model. Statistics and Computing 25:5, pages 863-875.
Thomas Read, Rouslan V. Olkhov, E. Diane Williamson & Andrew M. Shaw. (2015) Label-free Fab and Fc affinity/avidity profiling of the antibody complex half-life for polyclonal and monoclonal efficacy screening. Analytical and Bioanalytical Chemistry 407:24, pages 7349-7357.
Qinqin Hu, Peng Zeng & Lu Lin. (2015) The dual and degrees of freedom of linearly constrained generalized lasso. Computational Statistics & Data Analysis 86, pages 13-26.
Pierantonio Facco, Filippo Dal Pastro, Natascia Meneghetti, Fabrizio Bezzo & Massimiliano Barolo. (2015) Bracketing the Design Space within the Knowledge Space in Pharmaceutical Product Development. Industrial & Engineering Chemistry Research 54:18, pages 5128-5138.
Nico Didcock, Stefan Jakubek & Hans-Michael Kögeler. (2015) Regularisation methods for neural network model averaging. Engineering Applications of Artificial Intelligence 41, pages 128-138.
Thomas Bäck, Peter Krause & Christophe Foussette. (2015) Automatische Metamodellierung von CAE-Simulationsmodellen. ATZ - Automobiltechnische Zeitschrift 117:5, pages 64-69.
Ying Zhang. (2015) Quantile Regression Based on Laplacian Manifold Regularization.
Maarten Jansen. (2015) Generalized Cross Validation in variable selection with and without shrinkage. Journal of Statistical Planning and Inference 159, pages 90-104.
Keisuke Fukui, Mariko Yamamura & Hirokazu Yanagihara. (2015) Comparison with Residual-Sum-of-Squares-Based Model Selection Criteria for Selecting Growth Functions. FORMATH 14:0, pages 27-39.
Aiyou Chen, Art B. Owen & Minghui Shi. (2015) Data enriched linear regression. Electronic Journal of Statistics 9:1.
Gerda Claeskens & Maarten Jansen. 2015. International Encyclopedia of the Social & Behavioral Sciences, pages 647-652.
Samuel Vaiter, Gabriel Peyré & Jalal Fadili. 2015. Sampling Theory, a Renaissance, pages 103-153.
M. Alia-Martinez, J. Antonanzas, F. Antonanzas-Torres, A. Pernía-Espinoza & R. Urraca. 2015. Hybrid Artificial Intelligent Systems, pages 656-667.
R. Urraca, A. Sanz-Garcia, J. Fernandez-Ceniceros, E. Sodupe-Ortega & F. J. Martinez-de-Pison. 2015. Hybrid Artificial Intelligent Systems, pages 632-643.
Frank E. Harrell Jr. 2015. Regression Modeling Strategies, pages 1-11.
Ming Wang. (2014) Generalized Estimating Equations in Longitudinal Data Analysis: A Review and Recent Developments. Advances in Statistics 2014, pages 1-11.
Aurélie Boisbunon, Stéphane Canu, Dominique Fourdrinier, William Strawderman & Martin T. Wells. (2014) Akaike's Information Criterion, Cp and Estimators of Loss for Elliptically Symmetric Distributions. International Statistical Review 82:3, pages 422-439.
S. Kaufman & S. Rosset. (2014) When does more regularization imply fewer degrees of freedom? Sufficient conditions and counterexamples. Biometrika 101:4, pages 771-784.
Rosanna Overholser & Ronghui Xu. (2014) Effective degrees of freedom and its application to conditional AIC for linear mixed-effects models with correlated error structures. Journal of Multivariate Analysis 132, pages 160-170.
Kei Hirose & Michio Yamamoto. (2014) Estimation of an oblique structure via penalized likelihood factor analysis. Computational Statistics & Data Analysis 79, pages 120-132.
Tom Burr & S. Tobin. (2014) Encyclopedia of Information Science and Technology, Third Edition, pages 1825-1833.
Huihua Lu, Bojan Cukic & Mark Culp. (2014) A Semi-supervised Approach to Software Defect Prediction.
Inna Chervoneva, Boris Freydin, Brian Hipszer, Tatiyana V. Apanasovich & Jeffrey I. Joseph. (2014) Estimation of nonlinear differential equation model for glucose–insulin dynamics in type I diabetic patients using generalized smoothing. The Annals of Applied Statistics 8:2.
Audris Mockus. (2014) Engineering big data solutions.
John H. Kalivas & Jon Palmer. (2013) Characterizing multivariate calibration tradeoffs (bias, variance, selectivity, and sensitivity) to select model tuning parameters. Journal of Chemometrics 28:5, pages 347-357.
Maarten Jansen. (2014) Information criteria for variable selection under sparsity. Biometrika 101:1, pages 37-55.
Charles-Alban Deledalle, Samuel Vaiter, Jalal Fadili & Gabriel Peyré. (2014) Stein Unbiased GrAdient estimator of the Risk (SUGAR) for Multiple Parameter Selection. SIAM Journal on Imaging Sciences 7:4, pages 2448-2487.
Marta Avalos, Yves Grandvalet, Hélène Pouyes, Ludivine Orriols & Emmanuel Lagarde. (2014) Computational Intelligence Methods for Bioinformatics and Biostatistics, pages 109-124.
Sander Greenland. (2014) Handbook of Epidemiology, pages 1087-1159.
Yuqi Chen, Pang Du & Yuedong Wang. (2013) Variable selection in linear models. WIREs Computational Statistics 6:1, pages 1-9.
Tiejun Tong, Yanyuan Ma & Yuedong Wang. (2013) Optimal variance estimation without estimating the mean function. Bernoulli 19:5A.
Samuel Vaiter, Charles-Alban Deledalle, Gabriel Peyré, Charles Dossal & Jalal Fadili. (2013) Local behavior of sparse analysis regularization: Applications to risk estimation. Applied and Computational Harmonic Analysis 35:3, pages 433-451.
Gabriel E. Hoffman. (2013) Correcting for Population Structure and Kinship Using the Linear Mixed Model: Theory and Extensions. PLoS ONE 8:10, pages e75707.
Sunghoon Kwon, Sangmi Han & Sangin Lee. (2013) A small review and further studies on the LASSO. Journal of the Korean Data and Information Science Society 24:5, pages 1077-1088.
Dave Plaehn. (2013) What’s the real penalty in penalty analysis? Food Quality and Preference 28:2, pages 456-469.
C.M. Rubingh, H. Martens, H. van der Voet & A.K. Smilde. (2013) The costs of complex model optimization. Chemometrics and Intelligent Laboratory Systems 125, pages 139-146.
Samuel Müller, J. L. Scealy & A. H. Welsh. (2013) Model Selection in Linear Mixed Models. Statistical Science 28:2.
Isamu Nagai. (2013) Selection of model selection criteria for multivariate ridge regression. Hiroshima Mathematical Journal 43:1.
Kei Hirose, Shohei Tateishi & Sadanori Konishi. (2013) Tuning parameter selection in sparse regression modeling. Computational Statistics & Data Analysis 59, pages 28-40.
Jonathan Jaeger & Philippe Lambert. (2013) Bayesian P-spline estimation in hierarchical models specified by systems of affine differential equations. Statistical Modelling 13:1, pages 3-40.
Pengcheng Xu, A.W. Jayawardena & W.K. Li. (2013) Model selection for RBF network via generalized degree of freedom. Neurocomputing 99, pages 163-171.
Carla Cardinali. (2013) Data Assimilation for Atmospheric, Oceanic and Hydrologic Applications (Vol. II), pages 89-110.
Samprit Chatterjee & Jeffrey S. Simonoff. (2012) Handbook of Regression Analysis, pages 227-230.
Chung‐Wei Shen & Yi‐Hau Chen. (2012) Model Selection for Generalized Estimating Equations Accommodating Dropout Missingness. Biometrics 68:4, pages 1046-1054.
Qinglan Li & Pengcheng Xu. (2012) Estimation of Lyapunov spectrum and model selection for a chaotic time series. Applied Mathematical Modelling 36:12, pages 6090-6099.
Philip T. Reiss, Lei Huang, Joseph E. Cavanaugh & Amy Krain Roy. (2012) Resampling-based information criteria for best-subset regression. Annals of the Institute of Statistical Mathematics 64:6, pages 1161-1186.
J. Paul Ronaldson, Rafidah Zainon, Nicola Jean Agnes Scott, Steven Paul Gieseg, Anthony P. Butler, Philip H. Butler & Nigel G. Anderson. (2012) Toward quantifying the composition of soft tissues by spectral CT with Medipix3. Medical Physics 39:11, pages 6847-6857.
C Deledalle, S Vaiter, G Peyré, J Fadili & C Dossal. (2012) Proximal Splitting Derivatives for Risk Estimation. Journal of Physics: Conference Series 386, pages 012003.
Huihua Lu, Bojan Cukic & Mark Culp. (2012) Software defect prediction using semi-supervised learning with dimension reduction.
Angelika van der Linde. (2012) A Bayesian view of model complexity. Statistica Neerlandica 66:3, pages 253-271.
G. Ambler, S. Seaman & R. Z. Omar. (2011) An evaluation of penalised survival methods for developing prognostic models with rare events. Statistics in Medicine 31:11-12, pages 1150-1161.
Andrey Feuerverger, Yu He & Shashi Khatri. (2012) Statistical Significance of the Netflix Challenge. Statistical Science 27:2.
Ronny Luss, Saharon Rosset & Moni Shahar. (2012) Efficient regularized isotonic regression with application to gene–gene interaction search. The Annals of Applied Statistics 6:1.
Yong Sheng Shi, Jun Jie Yue & Yun Xue Song. (2012) Application of the Regularization Chaos Prediction Model in Aero-Engine Performance Parameters. Advanced Materials Research 424-425, pages 347-351.
Jianqing Fan, Shaojun Guo & Ning Hao. (2012) Variance Estimation Using Refitted Cross-Validation in Ultrahigh Dimensional Regression. Journal of the Royal Statistical Society Series B: Statistical Methodology 74:1, pages 37-65.

Displaying 200 of 269 citing articles.