Publication Cover
Statistics
A Journal of Theoretical and Applied Statistics
Volume 53, 2019 - Issue 4
92
Views
1
CrossRef citations to date
0
Altmetric
Original Articles

Unified multivariate hypergeometric interpoint distances

&
Pages 921-942 | Received 14 Oct 2018, Accepted 14 Mar 2019, Published online: 27 May 2019

References

  • Janardan KG, Patil GP. A unified approach for a class of multivariate hypergeometric models. Indian J Stat A. 1972;34(4):363–376.
  • Janardan KG. Chance mechanisms for multivariate hypergeometric models. Indian J Stat A. 1973;35(4):465–478.
  • Lukens MW, Examination of statistical outlier structure in high dimension using interpoint distance densities and multivariate rankings [Unpublished PhD dissertation]. George Mason University; 2004.
  • Modarres R. On the interpoint distances of Bernoulli vectors. Stat Probabil Lett. 2014;84:215–222. doi: 10.1016/j.spl.2013.10.012
  • Modarres R. Multivariate poisson interpoint distances. Stat Probabil Lett. 2016;112:113–123. doi: 10.1016/j.spl.2016.01.025
  • Modarres R. Multinomial interpoint distances. Stat Pap 2018. 2018;59:341–360.
  • Shurygin AM. Using interpoint distances for pattern recognition. Pattern Recogn Image Anal. 200616(4):726–729. doi: 10.1134/S1054661806040213
  • Zhang P, Wang X, Song PX-K. Clustering categorical data based on distance vectors. J Am Stat Assoc. 2006;101(473):355–367. doi: 10.1198/016214505000000312
  • Hall P, Titterington DM, Xue JH. Median-based classifiers for high-dimensional data. J Am Stat Assoc. 2009;104(488):1597–1608. doi: 10.1198/jasa.2009.tm08107
  • Guo L, Modarres R. Interpoint distance classification of high dimensional discrete observations. Int Stat Rev. 2018:1–16. DOI:10.1111/instr12281.
  • Maa JF, Pearl DK, Bartoszyński R. Reducing multidimensional two-sample data to one-dimensional interpoint comparisons. Ann Stat. 1996;24(3):1069–1074. doi: 10.1214/aos/1032526956
  • Bai Z, Saranadasa H. Effect of high dimension: by an example of a two sample problem. Stat Sin. 1996;6(2):311-–329.
  • Chen SX, Qin YL, A two-sample test for high-dimensional data with applications to gene-set testing. Ann Stat. 2010;38(2):808–835. doi: 10.1214/09-AOS716
  • Srivastava MS, Katayama S, Kano Y. A two sample test in high dimensional data. J Multivar Anal. 2013;114:349–358. doi: 10.1016/j.jmva.2012.08.014
  • Wei S, Lee C, Wichers L, et al. Direction-projection-permutation for high dimensional hypothesis tests. arXiv:1304.0796; 2013.
  • Ghosh AK, Biswas M. Distribution free high-dimensional two-sample tests based on discriminating hyperplanes. Test. 2016;25:525–547. doi: 10.1007/s11749-015-0467-x
  • Friedman JH, Rafsky LC. Multivariate generalizations of the Wald-Wolfowitz and Smirnov two-sample tests. Ann Stat. 1979;7(4):697–717. doi: 10.1214/aos/1176344722
  • Henze N, Penrose MD. On the multivariate runs test. Ann Stat. 1999;27(1):290–298. doi: 10.1214/aos/1018031112
  • Hall P, Tajvidi N. Permutation tests for equality of distributions in high-dimensional settings. Biometrika. 2002; 89(2):359–374. doi: 10.1093/biomet/89.2.359
  • Baringhaus L, Franz C. On a new multivariate twosample test. J Multivar Anal. 2004;88:190–206. doi: 10.1016/S0047-259X(03)00079-4
  • Liu Z, Modarres R. A triangle test for equality of distribution functions in high dimensions. J Nonparametr Stat. 2011;23(3):605–615. doi: 10.1080/10485252.2010.485644
  • Biswas M, Ghosh AK. A nonparametric two-sample test applicable to high dimensional data. J Multivar Anal. 2014;123:160–171. doi: 10.1016/j.jmva.2013.09.004
  • Székely GJ, Rizzo ML. The energy of data. Annu Rev Stat Appl. 2017;4:447–479. doi: 10.1146/annurev-statistics-060116-054026
  • Rosenblum PR. An exact distribution free test comparing two multivariate distributions based on adjacency. J Royal Stat Soc B. 2005;67:515–530. doi: 10.1111/j.1467-9868.2005.00513.x
  • Jureckova J, Kalina J. Nonparametric multivariate rank tests and their unbiasedness. Bernoulli. 2012;18:229–251. doi: 10.3150/10-BEJ326
  • Marozzi M. Multivariate multidistance tests for high-dimensional low sample size case-control studies. Stat Med. 2015;34(9):1511–1526. doi: 10.1002/sim.6418
  • Marozzi M. Multivariate tests based on interpoint distances with application to magnetic resonance imaging. Stat Methods Med Res. 201625(6):2593–2610. doi: 10.1177/0962280214529104
  • Johnson NL, Kotz S, Balakrishnan N. Discrete multivariate distributions, 165. New York: Wiley; 1997.
  • Kaddi CD, Parry RM, Wang MD. Multivariate hypergeometric similarity measure. IEEE/ACM Trans Comput Biol Bioinform. 2013;10(6):1505–1516. doi: 10.1109/TCBB.2013.28
  • Waller LA, Zelterman D. Log-linear modeling with the negative multinomial distribution. Biometrics. 1997;53(3):971–982. doi: 10.2307/2533557
  • Joag-Dev K, Proschan F. Negative association of random variables with applications. Ann Stat. 1983;11(1):286–295. doi: 10.1214/aos/1176346079
  • Childs A, Balakrishnan N. Some approximations to the multivariate hypergeometric distribution with applications to hypothesis testing. Comput Stat and Data An. 2000;35(2):137–154. doi: 10.1016/S0167-9473(00)00007-4
  • Requena F, Ciudad NM. Characterization of maximum probability points in the multivariate hypergeometric distribution. Stat Probabil Lett. 200050(1):39–47. doi: 10.1016/S0167-7152(00)00079-1
  • Kvam PH, Day D. Multivariate polya distribution in combat modeling. Nav Res Log. 2001;48(1):1–17. doi: 10.1002/1520-6750(200102)48:1<1::AID-NAV1>3.0.CO;2-Y
  • D'Elia A. Modelling ranks using the inverse hypergeometric distribution. Stat Modelling. 2003;3(1):65–78. doi: 10.1191/1471082X03st047oa
  • Tabb DL, Fernando CG, Chambers MC. MyriMatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis. J Proteome Res. 2007;6(2):654–661. doi: 10.1021/pr0604054
  • Corrado CJ. The exact distribution of the maximum and minimum frequencies of multinomial/Dirichlet and multivariate hypergeometric distributions. Stat Comput. 2011;21(3):349–359. doi: 10.1007/s11222-010-9174-3
  • Lebrun R. Efficient time/space algorithm to compute rectangular probabilities of multinomial, multivariate hypergeometric and multivariate Polya distributions. Stat Comput. 2013;23(5):615–623. doi: 10.1007/s11222-012-9334-8
  • Barni M, Cappellini V, Mecocci A. Fast vector median filter based on Euclidean norm approximation. IEEE Signal Process Lett. 1994;1:92–94. doi: 10.1109/97.295343
  • Székely GJ, Rizzo ML. Energy statistics: statistics based on distances. J Stat Plan Inference. 2013;143:1249–1272. doi: 10.1016/j.jspi.2013.03.018
  • Matteson DS, James NA. A nonparametric approach for multiple change point analysis of multivariate data. J Am Stat Assoc. 2013;109(505):334–345. doi: 10.1080/01621459.2013.849605
  • Chen J, Li P. Hypothesis test for normal mixture models: the EM approach. Ann Stat. 2009;37(5A):2523–2542. doi: 10.1214/08-AOS651

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.