References
- Anderson, T. W., and Rubin, H. (1956), “Statistical Inference in Factor Analysis,” in Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Volume 5: Contributions to Econometrics, Industrial Research, and Psychometry, University of California Press, Berkeley, CA, pp. 111–150.
- Bai, J. (2009), “Panel Data Models With Interactive Fixed Effects,” Econometrica, 77, 1229–1279.
- Bai, J., and Li, K. (2012), “Statistical Analysis of Factor Models of High Dimension,” The Annals of Statistics, 40, 436–465. DOI: https://doi.org/10.1214/11-AOS966.
- Bai, J., and Ng, S. (2002), “Determining the Number of Factors in Approximate Factor Models,” Econometrica, 70, 191–221. DOI: https://doi.org/10.1111/1468-0262.00273.
- Baumgart, M., Priebe, S., Groth, M., Hartmann, N., Menzel, U., Pandolfini, L., Koch, P., Felder, M., Ristow, M., Englert, C., Guthke, R., Platzer, M. and Cellerino, A. (2016), “Longitudinal RNA-Seq Analysis of Vertebrate Aging Identifies Mitochondrial Complex I as a Small-Molecule-Sensitive Modifier of Lifespan,” Cell Systems, 2, 122–132. DOI: https://doi.org/10.1016/j.cels.2016.01.014.
- Breton, C. V., Marsit, C. J., Faustman, E., Nadeau, K., Goodrich, J. M., Dolinoy, D. C., Herbstman, J., Holland, N., LaSalle, J. M., Schmidt, R., Yousefi, P., Perera, F., Joubert, B. R., Wiemels, J., Taylor, M., Yang, I. V., Chen, R., Hew, K. M., Freeland, D. M. H., Miller, R., and Murphy, S. K. (2017), “Small-Magnitude Effect Sizes in Epigenetic End Points Are Important in Children’s Environmental Health Studies: The Children’s Environmental Health and Disease Prevention Research Center’s Epigenetics Working Group,” Environmental Health Perspectives, 125, 511–526. DOI: https://doi.org/10.1289/EHP595.
- Buja, A., and Eyuboglu, N. (1992), “Remarks on Parallel Analysis,” Multivariate Behavioral Research, 27, 509–540. DOI: https://doi.org/10.1207/s15327906mbr2704_2.
- Chen, L. S., Wang, J., Wang, X., and Wang, P. (2017), “A Mixed-Effects Model for Incomplete Data From Labeling-Based Quantitative Proteomics Experiments,” The Annals of Applied Statistics, 11, 114–138. DOI: https://doi.org/10.1214/16-AOAS994.
- Dobriban, E., and Owen, A. B. (2019), “Deterministic Parallel Analysis: An Improved Method for Selecting Factors and Principal Components,” Journal of the Royal Statistical Society, Series B, 81, 163–183. DOI: https://doi.org/10.1111/rssb.12301.
- Douc, R., Moulines, É., and Rydén, T. (2004), “Asymptotic Properties of the Maximum Likelihood Estimator in Autoregressive Models With Markov Regime,” The Annals of Statistics, 32, 2254–2304. DOI: https://doi.org/10.1214/009053604000000021.
- Fan, J., and Han, X. (2017), “Estimation of the False Discovery Proportion With Unknown Dependence,” Journal of the Royal Statistical Society, Series B, 79, 1143–1164. DOI: https://doi.org/10.1111/rssb.12204.
- Ferguson, T. (1996), A Course in Large Sample Theory, London: Chapman & Hall.
- Gagnon-Bartsch, J. A., Jacob, L., and Speed, T. P. (2013), “Removing Unwanted Variation From High Dimensional Data With Negative Controls,” Technical Report, UC Berkeley.
- Gagnon-Bartsch, J. A., and Speed, T. P. (2012), “Using Control Genes to Correct for Unwanted Variation in Microarray Data,” Biostatistics, 13, 539–552. DOI: https://doi.org/10.1093/biostatistics/kxr034.
- Galanter, J. M., Gignoux, C. R., Oh, S. S., Torgerson, D., Pino-Yanes, M., Thakur, N., Eng, C., Hu, D., Huntsman, S., Farber, H. J., Avila, P. C., Brigino-Buenaventura, E., LeNoir, M. A., Meade, K., Serebrisky, D., Rodríguez-Cintrón, W., Kumar, R., Rodríguez-Santana, J. R., Seibold, M. A., Borrell, L. N., Burchard, E. G., and Zaitlen, N. (2017), “Differential Methylation Between Ethnic Sub-Groups Reflects the Effect of Genetic Ancestry and Environmental Exposures,” eLife, 6, e20532. DOI: https://doi.org/10.7554/eLife.20532.
- Gerard, D., and Stephens, M. (2020), “Empirical Bayes Shrinkage and False Discovery Rate Estimation, Allowing for Unwanted Variation,” Biostatistics, 21, 15–32. DOI: https://doi.org/10.1093/biostatistics/kxy029.
- GTEx Consortium (2017), “Genetic Effects on Gene Expression Across Human Tissues,” Nature, 550, 204–213.
- Hastie, T., Tibshirani, R., and Friedman, J. (2009), The Elements of Statistical Learning: Data Mining, Inference, and Prediction (2nd ed.), New York: Springer.
- Houseman, E. A., Molitor, J., and Marsit, C. J. (2014), “Reference-Free Cell Mixture Adjustments in Analysis of DNA Methylation Data,” Bioinformatics, 30, 1431–1439. DOI: https://doi.org/10.1093/bioinformatics/btu029.
- Jaffe, A. E., and Irizarry, R. A. (2014), “Accounting for Cellular Heterogeneity Is Critical in Epigenome-Wide Association Studies,” Genome Biology, 15, R31. DOI: https://doi.org/10.1186/gb-2014-15-2-r31.
- Knowles, D. A., Burrows, C. K., Blischak, J. D., Patterson, K. M., Serie, D. J., Norton, N., Ober, C., Pritchard, J. K., Gilad, Y., and McVean, G. (2018), “Determining the Genetic Basis of Anthracycline-Cardiotoxicity by Response QTL Mapping in Induced Cardiomyocytes,” eLife, 7, e33480.
- Lee, S., Sun, W., Wright, F. A., and Zou, F. (2017), “An Improved and Explicit Surrogate Variable Analysis Procedure by Coefficient Adjustment,” Biometrika, 104, 303–316. DOI: https://doi.org/10.1093/biomet/asx018.
- Leek, J. T., Scharpf, R. B., Bravo, H. C., Simcha, D., Langmead, B., Johnson, W. E., Geman, D., Baggerly, K., and Irizarry, R. A. (2010), “Tackling the Widespread and Critical Impact of Batch Effects in High-Throughput Data,” Nature Reviews Genetics, 11, DOI: https://doi.org/10.1038/nrg2825.
- Leek, J. T., and Storey, J. D. (2007), “Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis,” PLOS Genetics, 3, 1724–1735. DOI: https://doi.org/10.1371/journal.pgen.0030161.
- Leek, J. T., and Storey, J. D. (2008), “A General Framework for Multiple Testing Dependence,” Proceedings of the National Academy of Sciences of the United States of America, 105, 18718–18723.
- Li, D., Qian, J., and Su, L. (2016), “Panel Data Models With Interactive Fixed Effects and Multiple Structural Breaks,” Journal of the American Statistical Association, 111, 1804–1819. DOI: https://doi.org/10.1080/01621459.2015.1119696.
- Liu, C., Marioni, R. E., Hedman, Å., Pfeiffer, L., Tsai, P.-C., Reynolds, L. M., Just, A. C., Duan, Q., Boer, C. G., Tanaka, T., Elks, C. E., Aslibekyan, S., Brody, J. A., Kühnel, B., Herder, C., Almli, L. M., Zhi, D., Wang, Y., Huan, T., Yao, C., Mendelson, M. M., Joehanes, R., Liang, L., Love, S.-A., Guan, W., Shah, S., McRae, A. F., Kretschmer, A., Prokisch, H., Strauch, K., Peters, A., Visscher, P. M., Wray, N. R., Guo, X., Wiggins, K. L., Smith, A. K., Binder, E. B., Ressler, K. J., Irvin, M. R., Absher, D. M., Hernandez, D., Ferrucci, L., Bandinelli, S., Lohman, K., Ding, J., Trevisi, L., Gustafsson, S., Sandling, J. H., Stolk, L., Uitterlinden, A. G., Yet, I., Castillo-Fernandez, J. E., Spector, T. D., Schwartz, J. D., Vokonas, P., Lind, L., Li, Y., Fornage, M., Arnett, D. K., Wareham, N. J., Sotoodehnia, N., Ong, K. K., van Meurs, J. B. J., Conneely, K. N., Baccarelli, A. A., Deary, I. J., Bell, J. T., North, K. E., Liu, Y., Waldenberger, M., London, S. J., Ingelsson, E., and Levy, D. (2016), “A DNA Methylation Biomarker of Alcohol Consumption,” Molecular Psychiatry, 23, 422–433. DOI: https://doi.org/10.1038/mp.2016.192.
- Lu, X., and Su, L. (2016), “Shrinkage Estimation of Dynamic Panel Data Models With Interactive Fixed Effects,” Journal of Econometrics, 190, 148–175. DOI: https://doi.org/10.1016/j.jeconom.2015.09.005.
- Mahoney, M. W. (2011), Randomized Algorithms for Matrices and Data, Foundations and Trends[textregistered] in Machine Learning, 3, 123–224.
- Maksimovic, J., Gagnon-Bartsch, J. A., Speed, T. P., and Oshlack, A. (2015), “Removing Unwanted Variation in a Differential Methylation Analysis of Illumina HumanMethylation450 Array Data,” Nucleic Acids Research, 43, e106. DOI: https://doi.org/10.1093/nar/gkv526.
- Martin, E. M., and Fry, R. C. (2018), “Environmental Influences on the Epigenome: Exposure-Associated DNA Methylation in Human Populations,” Annual Review of Public Health, 39, 309–333. DOI: https://doi.org/10.1146/annurev-publhealth-040617-014629.
- Martino, D., Loke, Y. J., Gordon, L., Ollikainen, M., Cruickshank, M. N., Saffery, R., and Craig, J. M. (2013), “Longitudinal, Genome-Scale Analysis of DNA Methylation in Twins From Birth to 18 Months of Age Reveals Rapid Epigenetic Change in Early Life and Pair-Specific Effects of Discordance,” Genome Biology, 14, R42. DOI: https://doi.org/10.1186/gb-2013-14-5-r42.
- Maschietto, M., Bastos, L. C., Tahira, A. C., Bastos, E. P., Euclydes, V. L. V., Brentani, A., Fink, G., de Baumont, A., Felipe-Silva, A., Francisco, R. P. V., Gouveia, G., Grisi, S. J. F. E., Escobar, A. M. U., Moreira-Filho, C. A., Polanczyk, G. V., Miguel, E. C., and Brentani, H. (2017), “Sex Differences in DNA Methylation of the Cord Blood Are Related to Sex-Bias Psychiatric Diseases,” Scientific Reports, 7, 44547. DOI: https://doi.org/10.1038/srep44547.
- McKennan, C., Naughton, K., Stanhope, C., Kattan, M., O’Connor, G., Sandel, M., Visness, C., Wood, R., Bacharier, L., Beigelman, A., Lovisky-Desir, S., Togias, A., Gern, J., Nicolae, D., and Ober, C. (2018), “Longitudinal Studies at Birth and Age 7 Reveal Strong Effects of Genetic Variation on Ancestry-Associated DNA Methylation Patterns in Blood Cells From Ethnically Admixed Children,” bioRxiv.
- McKennan, C., and Nicolae, D. (2019), “Accounting for Unobserved Covariates With Varying Degrees of Estimability in High-Dimensional Biological Data,” Biometrika, 106, 823–840. DOI: https://doi.org/10.1093/biomet/asz037.
- Morales, E., Vilahur, N., Salas, L. A., Motta, V., Fernandez, M. F., Murcia, M., Llop, S., Tardon, A., Fernandez-Tardon, G., Santa-Marina, L., Gallastegui, M., Bollati, V., Estivill, X., Olea, N., Sunyer, J., and Bustamante, M. (2016), “Genome-Wide DNA Methylation Study in Human Placenta Identifies Novel Loci Associated With Maternal Smoking During Pregnancy,” International Journal of Epidemiology, 45, 1644–1655. DOI: https://doi.org/10.1093/ije/dyw196.
- Owen, A. B., and Perry, P. O. (2009), “Bi-Cross-Validation of the SVD and the Nonnegative Matrix Factorization,” Annals of Applied Statistics, 3, 564–594. DOI: https://doi.org/10.1214/08-AOAS227.
- Owen, A. B., and Wang, J. (2016), “Bi-Cross-Validation for Factor Analysis,” Statistical Science, 31, 119–139. DOI: https://doi.org/10.1214/15-STS539.
- Peixoto, L., Risso, D., Poplawski, S. G., Wimmer, M. E., Speed, T. P., Wood, M. A., and Abel, T. (2015), “How Data Analysis Affects Power, Reproducibility and Biological Insight of RNA-Seq Studies in Complex Datasets,” Nucleic Acids Research, 43, 7664–7674. DOI: https://doi.org/10.1093/nar/gkv736.
- Storey, J. D. (2001), “A Direct Approach to False Discovery Rates,” Journal of the Royal Statistics Society, Series B, 63, 479–498. DOI: https://doi.org/10.1111/1467-9868.00346.
- Su, L., and Ju, G. (2018), “Identifying Latent Grouped Patterns in Panel Data Models With Interactive Fixed Effects,” Journal of Econometrics, 206, 554–573. DOI: https://doi.org/10.1016/j.jeconom.2018.06.014.
- Sun, Y., Zhang, N. R., and Owen, A. B. (2012), “Multiple Hypothesis Testing Adjusted for Latent Variables, With an Application to the AGEMAP Gene Expression Data,” The Annals of Applied Statistics, 6, 1664–1668. DOI: https://doi.org/10.1214/12-AOAS561.
- Tung, J., Zhou, X., Alberts, S. C., Stephens, M., Gilad, Y., and Dermitzakis, E. T. (2015), “The Genetic Architecture of Gene Expression Levels in Wild Baboons,” eLife, 4, e04729.
- Wald, A. (1949), “Note on the Consistency of the Maximum Likelihood Estimate,” The Annals of Mathematical Statistics, 20, 595–601. DOI: https://doi.org/10.1214/aoms/1177729952.
- Wang, J., Zhao, Q., Hastie, T., and Owen, A. B. (2017), “Confounder Adjustment in Multiple Hypothesis Testing,” The Annals of Statistics, 45, 1863–1894. DOI: https://doi.org/10.1214/16-AOS1511.
- Yang, I. V., Pedersen, B. S., Liu, A. H., O’Connor, G. T., Pillai, D., Kattan, M., Misiak, R. T., Gruchalla, R., Szefler, S. J., Khurana Hershey, G. K., Kercsmar, C., Richards, A., Stevens, A. D., Kolakowski, C. A., Makhija, M., Sorkness, C. A., Krouse, R. Z., Visness, C., Davidson, E. J., Hennessy, C. E., Martin, R. J., Togias, A., Busse, W. W., and Schwartz, D. A. (2017), “The Nasal Methylome and Childhood Atopic Asthma,” Journal of Allergy and Clinical Immunology, 139, 1478–1488. DOI: https://doi.org/10.1016/j.jaci.2016.07.036.
- Yao, C., Li, H., Shen, X., He, Z., He, L., and Guo, Z. (2012), “Reproducibility and Concordance of Differential DNA Methylation and Gene Expression in Cancer,” PLoS One, 7, e29686. DOI: https://doi.org/10.1371/journal.pone.0029686.
- Yousefi, P., Huen, K., Davé, V., Barcellos, L., Eskenazi, B., and Holland, N. (2015), “Sex Differences in DNA Methylation Assessed by 450 K Beadchip in Newborns,” BMC Genomics, 16, 911. DOI: https://doi.org/10.1186/s12864-015-2034-y.
- Zhang, X., Biagini Myers, J. M., Burleson, J., Ulm, A., Bryan, K. S., Chen, X., Weirauch, M. T., Baker, T. A., Butsch Kovacic, M. S., and Ji, H. (2018), “Nasal DNA Methylation Is Associated With Childhood Asthma,” Epigenomics, 10, 629–641. DOI: https://doi.org/10.2217/epi-2017-0127.