Statistical Computing and Graphics

A Set of Efficient Methods to Generate High-Dimensional Binary Data With Specified Correlation Structures

Pages 310-322 | Received 10 Dec 2019, Accepted 22 Aug 2020, Published online: 16 Oct 2020

