418
Views
3
CrossRef citations to date
0
Altmetric
Advances in Sampling and Optimization

Projection Pursuit Based on Gaussian Mixtures and Evolutionary Algorithms

ORCID Icon &
Pages 847-860 | Received 25 Feb 2018, Accepted 19 Mar 2019, Published online: 12 Jun 2019

References

  • Arnold, B. C., and Beaver, R. J. (2000), “The Skew-Cauchy Distribution,” Statistics & Probability Letters, 49, 285–290. DOI: 10.1016/S0167-7152(00)00059-6.
  • Azzalini, A., and Dalla Valle, A. (1996), “The Multivariate Skew-Normal Distribution,” Biometrika, 83, 715–726. DOI: 10.1093/biomet/83.4.715.
  • Banfield, J. D., and Raftery, A. E. (1993), “Model-Based Gaussian and Non-Gaussian Clustering,” Biometrics, 49, 803–821. DOI: 10.2307/2532201.
  • Baragona, R., Battaglia, F., and Poli, I. (2011), Evolutionary Statistical Procedures: An Evolutionary Computation Approach to Statistical Procedures Designs and Applications, Berlin, Heidelberg: Springer Science & Business Media.
  • Biernacki, C., Celeux, G., and Govaert, G. (2000), “Assessing a Mixture Model for Clustering With the Integrated Completed Likelihood,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 22, 719–725. DOI: 10.1109/34.865189.
  • Breiman, L., Friedman, J., Stone, C. J., and Olshen, R. A. (1984), Classification and Regression Trees, Boca Raton, FL: CRC Press.
  • Byrd, R. H., Lu, P., Nocedal, J., and Zhu, C. (1995), “A Limited Memory Algorithm for Bound Constrained Optimization,” SIAM Journal on Scientific Computing, 16, 1190–1208. DOI: 10.1137/0916069.
  • Campbell, N., and Mahon, R. (1974), “A Multivariate Study of Variation in Two Species of Rock Crab of the Genus Leptograpsus,” Australian Journal of Zoology, 22, 417–425. DOI: 10.1071/ZO9740417.
  • Celeux, G., and Govaert, G. (1995), “Gaussian Parsimonious Clustering Models,” Pattern Recognition, 28, 781–793. DOI: 10.1016/0031-3203(94)00125-6.
  • Chang, W. (1983), “On Using Principal Components Before Separating a Mixture of Two Multivariate Normal Distributions,” Applied Statistics, 32, 267–275. DOI: 10.2307/2347949.
  • Comon, P. (1994), “Independent Component Analysis, a New Concept?,” Signal Processing, 36, 287–314. DOI: 10.1016/0165-1684(94)90029-9.
  • Cook, D., Buja, A., and Cabrera, J. (1993), “Projection Pursuit Indexes Based on Orthonormal Function Expansions,” Journal of Computational and Graphical Statistics, 2, 225–250. DOI: 10.2307/1390644.
  • Cook, R. D., and Weisberg, S. (1994), An Introduction to Regression Graphics, New York: Wiley.
  • Cover, T. M., and Thomas, J. A. (2006), Elements of Information Theory (2nd ed.), New York: Wiley.
  • Dempster, A. P., Laird, N. M., and Rubin, D. B. (1977), “Maximum Likelihood From Incomplete Data via the EM Algorithm,” Journal of the Royal Statistical Society, Series B, 39, 1–22. DOI: 10.1111/j.2517-6161.1977.tb01600.x.
  • Diaconis, P., and Freedman, D. (1984), “Asymptotics of Graphical Projection Pursuit,” The Annals of Statistics, 12, 793–815. DOI: 10.1214/aos/1176346703.
  • Dudoit, S., Fridlyand, J., and Speed, T. (2002), “Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data,” Journal of the American Statistical Association, 97, 77–87. DOI: 10.1198/016214502753479248.
  • Dumitrescu, D., Lazzerini, B., Jain, L. C., and Dumitrescu, A. (2000), Evolutionary Computation, Boca Raton, FL: CRC Press.
  • Eiben, A., and Smith, J. (2003), Introduction to Evolutionary Computing, Berlin, Heidelberg: Springer-Verlag.
  • Escobar, M. D., and West, M. (1995), “Bayesian Density Estimation and Inference Using Mixtures,” Journal of the American Statistical Association, 90, 577–588. DOI: 10.1080/01621459.1995.10476550.
  • Friedman, J. H. (1987), “Exploratory Projection Pursuit,” Journal of the American Statistical Association, 82, 249–266. DOI: 10.1080/01621459.1987.10478427.
  • Friedman, J. H., and Tukey, J. W. (1974), “A Projection Pursuit Algorithm for Exploratory Data Analysis,” IEEE Transactions on Computers, 100, 881–890. DOI: 10.1109/T-C.1974.224051.
  • Goldberg, D. E., and Holland, J. H. (1988), “Genetic Algorithms and Machine Learning,” Machine Learning, 3, 95–99. DOI: 10.1023/A:1022602019183.
  • Goldberger, J., and Aronowitz, H. (2005), “A Distance Measure Between GMMs Based on the Unscented Transform and Its Application to Speaker Recognition,” in Ninth European Conference on Speech Communication and Technology, Interspeech, pp. 1985–1988.
  • Golub, T., Slonim, D., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J., Coller, H., Loh, M., Downing, J. R., Caligiuri, M., Bloomfield, C., and Lander, E. (1999), “Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring,” Science, 286, 531–537. DOI: 10.1126/science.286.5439.531.
  • Gower, J. C. (1987), “Discussion of the Paper by Jones, M. C., and Sibson, R. C.,” Journal of the Royal Statistical Society, Series A, 150, 19–21. DOI: 10.2307/2981662.
  • Hall, P. (1989), “On Polynomial-Based Projection Indices for Exploratory Projection Pursuit,” The Annals of Statistics, 17, 589–605. DOI: 10.1214/aos/1176347127.
  • Hastie, T., and Tibshirani, R. (1987), “Discussion of the Paper by Jones, M. C., and Sibson, R. C.,” Journal of the Royal Statistical Society, Series A, 150, 27–28.
  • Hastie, T., and Tibshirani, R. (1996), “Discriminant Analysis by Gaussian Mixtures,” Journal of the Royal Statistical Society, Series B, 58, 155–176. DOI: 10.1111/j.2517-6161.1996.tb02073.x.
  • Hershey, J. R., and Olsen, P. A. (2007), “Approximating the Kullback Leibler Divergence Between Gaussian Mixture Models,” in 2007 IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. 4), IEEE, pp. 317–320.
  • Holland, J. (1975), Adaptation in Natural and Artificial Systems, Ann Arbor, MI: The University of Michigan Press.
  • Huber, M. F., Bailey, T., Durrant-Whyte, H., and Hanebeck, U. D. (2008), “On Entropy Approximation for Gaussian Mixture Random Vectors,” in 2008 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, IEEE, pp. 181–188.
  • Huber, P. J. (1985), “Projection Pursuit,” The Annals of Statistics, 13, 435–475. DOI: 10.1214/aos/1176349519.
  • Hyvarinen, A. (1999), “Fast and Robust Fixed-Point Algorithms for Independent Component Analysis,” IEEE transactions on Neural Networks, 10, 626–634. DOI: 10.1109/72.761722.
  • Hyvarinen, A., Karhunen, J., and Oja, E. (2001), Independent Component Analysis, Toronto: Wiley.
  • Jolliffe, I. T. (2002), Principal Component Analysis, New York: Springer.
  • Jones, M. C., and Sibson, R. (1987), “What Is Projection Pursuit?” (with discussion), Journal of the Royal Statistical Society, Series A, 150, 1–37. DOI: 10.2307/2981662.
  • Julier, S. J., and Uhlmann, J. K. (1996), “A General Method for Approximating Nonlinear Transformations of Probability Distributions,” Technical Report, Robotics Research Group, Department of Engineering Science, University of Oxford.
  • Kruskal, J. B. (1969), “Toward a Practical Method Which Helps Uncover the Structure of a Set of Multivariate Observations by Finding the Linear Transformation Which Optimizes a New Index of Condensation,” in Statistical Computation, eds. R. C. Milton and J. A. Nelder, New York: Academic Press, pp. 427–440.
  • Lee, E.-K., and Cook, D. (2010), “A Projection Pursuit Index for Large p Small n Data,” Statistics and Computing, 20, 381–392. DOI: 10.1007/s11222-009-9131-1.
  • Lee, E.-K., Cook, D., Klinke, S., and Lumley, T. (2005), “Projection Pursuit for Exploratory Supervised Classification,” Journal of Computational and Graphical Statistics, 14, 831–846. DOI: 10.1198/106186005X77702.
  • Li, B., Zha, H., and Chiaromonte, F. (2005), “Contour Regression: A General Approach to Dimension Reduction,” Annals of Statistics, 33, 1580–1616. DOI: 10.1214/009053605000000192.
  • Li, W. (2012), “Volcano Plots in Analyzing Differential Expressions With mRNA Microarrays,” Journal of Bioinformatics and Computational Biology, 10, 1231003. DOI: 10.1142/S0219720012310038.
  • Marchini, J. L., Heaton, C., and Ripley, B. D. (2017), “fastICA: FastICA Algorithms to Perform ICA and Projection Pursuit,” R Package Version 1.2-1.
  • McLachlan, G., and Krishnan, T. (2008), The EM Algorithm and Extensions (2nd ed.), Hoboken, NJ: Wiley.
  • McNicholas, P. D., ElSherbiny, A., McDaid, A. F., and Murphy, T. B. (2015), “pgmm: Parsimonious Gaussian Mixture Models,” R Package Version 1.2.
  • Morris, K., and McNicholas, P. D. (2013), “Dimension Reduction for Model-Based Clustering via Mixtures of Shifted Asymmetric Laplace Distributions,” Statistics & Probability Letters, 83, 2088–2093. DOI: 10.1016/j.spl.2013.04.011.
  • Pollard, K. S., Dudoit, S., and van der Laan, M. J. (2005), “Multiple Testing Procedures: R multtest Package and Applications to Genomics,” in Bioinformatics and Computational Biology Solutions Using R and Bioconductor, New York: Springer.
  • Pollard, K. S., Gilbert, H. N., Ge, Y., Taylor, S., and Dudoit, S. (2018), “multtest: Resampling-Based Multiple Hypothesis Testing,” R Package Version 2.36.0.
  • Posse, C. (1995), “Projection Pursuit Exploratory Data Analysis,” Computational Statistics & Data Analysis, 20, 669–687. DOI: 10.1016/0167-9473(95)00002-8.
  • Ripley, B. (2016), “MASS: Support Functions and Datasets for Venables and Ripley’s MASS,” R Package Version 7.3-45.
  • Roeder, K., and Wasserman, L. (1997), “Practical Bayesian Density Estimation Using Mixtures of Normals,” Journal of the American Statistical Association, 92, 894–902. DOI: 10.1080/01621459.1997.10474044.
  • Schwarz, G. (1978), “Estimating the Dimension of a Model,” The Annals of Statistics, 6, 461–464. DOI: 10.1214/aos/1176344136.
  • Scrucca, L. (2013), “GA: A Package for Genetic Algorithms in R,” Journal of Statistical Software, 53, 1–37.
  • Scrucca, L. (2017), “On Some Extensions to GA Package: Hybrid Optimisation, Parallelisation and Islands Evolution,” The R Journal, 9, 187–206. DOI: 10.32614/RJ-2017-008.
  • Scrucca, L., Fop, M., Murphy, T. B., and Raftery, A. E. (2016), “mclust 5: Clustering, Classification and Density Estimation Using Gaussian Finite Mixture Models,” The R Journal, 8, 205–233.
  • Shannon, C. E. (1948), “A Mathematical Theory of Communication,” Bell System Technical Journal, 27, 379–423, 623–656. DOI: 10.1002/j.1538-7305.1948.tb00917.x.
  • Simon, D. (2013), Evolutionary Optimization Algorithms, New York: Wiley.
  • Streuli, H. (1973), “Der heutige stand der kaffeechemie,” in Association Scientifique International du Cafe, 6th International Colloquium on Coffee Chemistry, Bogota, Columbia (Vol. 61), pp. 61–72.
  • Weisberg, S. (2002), “Dimension Reduction Regression in R,” Journal of Statistical Software, 7, 1–22.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.