References
- Bishop, C. M. (2006). Pattern recognition and machine learning. New York, NY: Springer.
- Brusco, M. J., & Steinley, D. (2007). A comparison of heuristic procedures for minimum within-cluster sums of squares partitioning. Psychometrika, 73, 125–144.
- DeSarbo, W. S., Carroll, J. D., Clark, L. A., & Green, P. E. (1984). Synthesized clustering: A method for amalgamating alternative clustering bases with differential weighting of variables. Psychometrika, 49, 57–78. doi:10.1007/BF02294206
- Dillon, W. R., & Goldstein, M. (1984). Multivariate analysis: Methods and applications. New York, NY: Wiley.
- Eldén, L. (2007). Matrix methods in data mining and pattern recognition. Philadelphia, PA: SIAM. doi:10.1137/1.9780898718867
- Fisher, R. A. (1936). The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7, 179–188. doi:10.1111/j.1469-1809.1936.tb02137.x
- Frey, B. J., & Dueck, D. (2007). Clustering by passing messages between data points. Science, 315, 972–976. doi:10.1126/science.1136800
- Friedman, J. H., & Meulman, J. J. (2004). Clustering objects on subsets of attributes (with discussion). Journal of the Royal Statistical Society: Series B (Statistical Methodology), 66, 815–849. doi:10.1111/rssb.2004.66.issue-4
- Gan, G., Ma, C., & Wu, J. (2007). Data clustering: Theory, algorithms, and applications. Philadelphia, PA: SIAM. doi:10.1137/1.9780898718348
- Härdle, W. K., & Simar, L. (2012). Applied multivariate statistical analysis. New York, NY: Springer. doi:10.1007/978-3-642-17229-8
- Hastie, T., Friedman, J., & Tibshirani, R. (2001). The elements of statistical learning: Data mining, inference, and prediction. New York, NY: Springer. doi:10.1007/978-0-387-21606-5
- Heiser, W. J., & Groenen, P. J. F. (1997). Cluster differences scaling with a within-clusters loss component and a fuzzy successive approximation strategy to avoid local minima. Psychometrika, 62, 63–83. doi:10.1007/BF02294781
- Izenman, A. J. (2008). Modern multivariate statistical techniques. New York, NY: Springer. doi:10.1007/978-0-387-78189-1
- Jöreskog, K. G. (1967). Some contributions to maximum likelihood factor analysis. Psychometrika, 32, 443–482. doi:10.1007/BF02289658
- Jöreskog, K. G. (1977). Factor analysis by least-squares and maximum-likelihood methods. In K. Enslein, A. Ralston, & H. S. Wilf (Eds.), Statistical methods for digital computers (pp. 125–153). New York, NY: John Wiley & Sons.
- Jöreskog, K. G., & Goldberger, A. S. (1972). Factor analysis by generalized least squares. Psychometrika, 37, 243–260. doi:10.1007/BF02306782
- Ladd, G. W. (1966). Linear probability functions and discriminant functions. Econometrica, 34, 873–885. doi:10.2307/1910106
- Lawley, D. N., & Maxwell, A. E. (1971). Factor analysis as a statistical method. New York, NY: American Elsevier.
- Lipovetsky, S. (2009a). Linear regression with special coefficient features attained via parameterization in exponential, logistic, and multinomial–logit forms. Mathematical and Computer Modelling, 49, 1427–1435. doi:10.1016/j.mcm.2008.11.013
- Lipovetsky, S. (2009b). PCA and SVD with nonnegative loadings. Pattern Recognition, 42, 68–76. doi:10.1016/j.patcog.2008.06.025
- Lipovetsky, S. (2012). Total odds and other objectives for clustering via multinomial-logit model. Advances in Adaptive Data Analysis, 4. doi:10.1142/S1793536912500197
- Lipovetsky, S. (2013a). Additive and multiplicative mixed normal distributions and finding cluster centers. International Journal of Machine Learning and Cybernetics, 4(1), 1–11. doi:10.1007/s13042-012-0070-3
- Lipovetsky, S. (2013b). Finding cluster centers and sizes via multinomial parameterization. Applied Mathematics and Computation, 221, 571–580. doi:10.1016/j.amc.2013.06.098
- Lipovetsky, S., & Conklin, M. (2005). Regression by data segments via discriminant analysis. Journal of Modern Applied Statistical Methods, 4, 63–74.
- Lipovetsky, S., Tishler, A., & Conklin, W. M. (2002). Multivariate least squares and its relation to other multivariate techniques. Applied Stochastic Models in Business and Industry, 18, 347–356. doi:10.1002/(ISSN)1526-4025
- Liu, H., & Motoda, H. (Eds.). (2008). Computational methods of feature selection. Boca Raton, FL: Chapman & Hall/CRC.
- Maxwell, A. E. (1983). Factor analysis. In S. Kotz & N. L. Johnson (Eds.), Encyclopedia of statistical sciences (Vol. 3, pp. 2–8). New York, NY: John Wiley & Sons.
- Nowakowska, E., Koronacki, J., & Lipovetsky, S. (2014). Clusterability assessment for Gaussian mixture models. Applied Mathematics and Computation, 256, 591–601. doi:10.1016/j.amc.2014.12.038
- Ripley, B. D. (1996). Pattern recognition and neural networks. Cambridge: Cambridge University Press. doi:10.1017/CBO9780511812651
- S-PLUS’2000. (1999). Seattle, WA: MathSoft.
- Szekely, G. J., & Rizzo, M. L. (2005). Hierarchical clustering via joint between–within distances: Extending Ward’s minimum variance method. Journal of Classification, 22, 151–183. doi:10.1007/s00357-005-0012-9
- Timm, N. H. (1975). Multivariate analysis with applications in education and psychology. Monterey, CA: Brooks/Cole.