Search in:

Advanced search

The Journal of Experimental Education Volume 87, 2019 - Issue 1

Submit an article Journal homepage

366

Views

CrossRef citations to date

Altmetric

Measurement, Statistics, and Research Design

A Comparison of Bias Reduction Methods: Clustering versus Propensity Score Subclassification and Weighting

Ida D'AttomaDepartment of Statistical Sciences, University of Bologna, via Belle Arti, Bologna (Italy)Correspondence[email protected]

http://orcid.org/0000-0002-2305-8454

(Senior Assistants Professor in Statistics for Economics)

Furio CamilloDepartment of Statistical Sciences, University of Bologna, via Belle Arti, Bologna (Italy)

(Associate Professor of Business Statistics and Data Mining) &

M. H. ClarkCollege of Education and Human Performance, University of Central Florida, Orlando, FL

(Associate Lecturer at Department of Educational and Human Sciences)

Pages 33-54 | Published online: 29 Nov 2017

Cite this article
https://doi.org/10.1080/00220973.2017.1391161
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions

References

Aldenderfer, M. S., & Blashfield, R. K. (1984). Cluster analysis. Beverly Hills, CA: Sage.
Google Scholar
Anderberg, M. R. (1973). Cluster Analysis for Applications. , London, Englang: Academic Press.
Google Scholar
Austin, P. C. (2014). A comparison of 12 algorithms for matching on the propensity score. Statistics in Medicine, 1057–1069. doi: 10.1002/sim.2328 doi:10.1002/sim.6004.
PubMed Web of Science ®Google Scholar
Austin, P. C. (2011). An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivariate Behavioral Research, 46, 399–424. doi:10.1080/00273171.2011.568786.
PubMed Web of Science ®Google Scholar
Austin, P. C., & Mamdani, M. M. (2006). A comparison of propensity score methods: A case study estimating the effectiveness of post-AMI statin use. Statistics in Medicine, 25, 2084–2106. doi: 10.1002/sim.2328 doi:10.1002/sim.2328.
PubMed Web of Science ®Google Scholar
Austin, P. C., & Stuart, E. A. (2015). Moving towards best practice when using inverse probability weighting (IPTW) using the propensity score to estimate causal treatment effects in observational studies. Statistics in Medicine, 34, 3661–3679. doi: 10.1002/sim.6607 doi:10.1002/sim.6607.
PubMed Web of Science ®Google Scholar
Bai, H. (2013). A bootstrap procedure of propensity score estimation. Journal of Experimental Education, 81, 157–177. doi: 10.1080/00220973.2012.700497 doi:10.1080/00220973.2012.700497.
Web of Science ®Google Scholar
Benzecri, J. P. (1973). L'analyse des données. Paris, France: Dunod.
Google Scholar
Camillo, F., & D'Attoma, I. (2010). A new data mining approach to estimate causal effects of policy interventions. Expert Systems with Applications, 37, 171–181.
Web of Science ®Google Scholar
Camillo, F., & D'Attoma, I. (2012). %GI : A SAS macro for measuring and testing global imbalance of covariates within subgroups. Journal of Statistical Software, Volume 51, Code Snippet 1, pp. 1–19. doi:10.18637/jss.v051.c01.
PubMedGoogle Scholar
Chiu, T., Fang, D., Chen, J., Wang, Y., & Jeris, C. (2001). A robust and scalable, clustering algorithm for mixed type attribute in large database environment. In: Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 263–268). San Francisco, CA: ACM.
Google Scholar
Christakis, N. A., & Iwashyha, T. I. (2003). The health impact of health care on families: A matched cohort study of hospice use by decedents and mortality outcomes in surviving, widowed spouses. Social Science and Medicine, 57, 465–475. doi:10.1016/S0277-9536(02)00370-2.
PubMed Web of Science ®Google Scholar
Clark, M. H. (2011, November). A comparison of bias reduction methods on educational outcomes. Paper presented at the American Evaluation Association Convention, Anaheim, CA.
Google Scholar
Cochran, W. G. (1968). The effectiveness of adjustment by subclassification in removing bias in observational studies. Biometrics, 24, 205–213. doi:10.2307/2528036.
Web of Science ®Google Scholar
Cohen, A., Gnanadesikan, R., Kettenring, J. R., & Landwehr, J. M. (1977). Methodological developments in some applications of clustering. In P. R. Krishnaiah (ed.), Applications of statistics (pp. 141–162). Amsterdam, The Netherlands: North Holland Publishing.
Google Scholar
D'Attoma, I., & Camillo, F. (2011). A multivariate strategy to measure and test Global Imbalance in observational studies. Expert Systems with Applications, 38, 3451–3460. doi:10.1016/j.eswa.2010.08.132.
Web of Science ®Google Scholar
D'Attoma, I., & Liberati, C. (2011). An optimal cluster-based approach for subgroup analysis using information complexity criterion. International Journal of Business Intelligence and Data Mining, 6, 402–425. doi:10.1504/IJBIDM.2011.044978.
Google Scholar
Dehejia, R. H., & Wahba, S. (1999). Causal effects in nonexperimental studies: Re-evaluating the evaluation of training programs. Journal of the American Statistical Association, 94, 1053–1062. doi:10.1080/01621459.1999.10473858.
Web of Science ®Google Scholar
Diamond, A., & Sekhon, J. S. (2013). Genetic matching for estimating causal effects: A general multivariate matching method for achieving balance in observation studies. Review of Economics and Statistics, 95, 932–945. doi:10.1162/REST_a_00318.
Web of Science ®Google Scholar
Duda, R. O., Hart, P. E., & Stork, D. G. (2001). Pattern classification. Hoboken, NJ: John Wiley & Sons.
Google Scholar
Estadella, J. D., Aluja-Banet, T., & Thio-Henestrosa, S. (2005). Distribution of the inter and intra inertia in conditional MCA. Computational Statistics, 20: 449–463. doi:10.1007/BF02741308.
Web of Science ®Google Scholar
Everitt, B. S. (1993). Cluster analysis. London, UK: Arnold.
Google Scholar
Everitt, B. S., Landan, S., Leese, M., & Stahl, D. (2011). Cluster analysis. Wiley Online Library, 5th edition.
Google Scholar
Fraley, C., & Raftery, A. (1998). How many clusters? Which clustering method? Answer via model-based cluster analysis. Computer Journal, 41, 578–588. doi:10.1093/comjnl/41.8.578.
Web of Science ®Google Scholar
Freedman, D. A., & Berk, R. A. (2008). Weighting regressions by propensity scores. Evaluation Review, 32, 392–409. doi: 10.1177/0193841X08317586 doi:10.1177/0193841X08317586.
PubMed Web of Science ®Google Scholar
Gerfin, M., & Lechner, M. (2002). A microeconometric evaluation of the active labour market policy in Switzerland. Economic Journal, 112, 854–893. doi:10.1111/1468-0297.00072.
Web of Science ®Google Scholar
Garrido, M. M., Kelley, A. S., Paris, J., Rosa, K., Meier, D. E., Morrison, R. S., & Aldridge, M. D. (2014). Methods for constructing and assessing propensity scores. Health Services Research, 49, 1701–1720. doi:10.1111/1475-6773.12182.
PubMed Web of Science ®Google Scholar
Gibson, C. M. (2003). Privileging the participant: The importance of subgroup analysis in social welfare evaluations. American Journal of Evaluation, 24, 443–469. doi:10.1177/109821400302400403.
Web of Science ®Google Scholar
Goder, A., & Filkov, V. (2008). Consensus clustering algorithms: Comparison and refinement. Proceedings of the Ninth Workshop on Algorithm Engineering and Experiments. San Francisco, CA: Society for Industrial and Applied Mathematics.
Google Scholar
Green, P. E., & Krieger, A. M. (1995). Alternative approaches to cluster-based market segmentation. Journal of Market Research Society, 37, 221–229.
Web of Science ®Google Scholar
Greenacre, M. J. (1984). Theory and applications of correspondence analysis. London, UK: Academic Press.
Google Scholar
Hansen, B. B., & Bowers, J. (2008). Covariate balance in simple, stratified and clustered comparative studies. Statistical Science, 23, 219–236. doi:10.1214/08-STS254.
Web of Science ®Google Scholar
Harder, V. S., Stuart, E. A., & Anthony, J. C. (2010). Propensity score techniques and the assessment of measured covariate balance to test causal associations in psychological research. Psychological Methods, 15, 234–249. doi: 10.1037/a0019623 doi:10.1037/a0019623.
PubMed Web of Science ®Google Scholar
Harknett, K. (2006). Does receiving an earnings supplement affect union formation? Estimating effects for program participants using propensity score matching. Evaluation Review, 30, 741–778. doi:10.1177/0193841X06293411.
PubMed Web of Science ®Google Scholar
Hasselblad, V., & Hedges, L. V. (1995). Meta-analysis of screening and diagnostic tests. Psychological Bulletin, 117, 167–178.
PubMed Web of Science ®Google Scholar
Hastie, T., Tibishirani, R., & Friedman, J. (2001). The elements of statistical learning: Data mining, inference and prediction. New York, NY: Springer.
Google Scholar
Ho, D. E., Imai, K., King, G., & Stuart, E. A. (2007). Matching as nonparametric preprocessing for reducing model dependence in parametric causal inference. Political Analysis, 15, 199–236. doi:10.1093/pan/mpl013.
Web of Science ®Google Scholar
Hong, G., & Raudenbush, S. W. (2005). Effects of kindergarten retention policy on children's cognitive growth in reading and mathematics. Educational and Policy Analysis, 27, 205–224. doi:10.3102/01623737027003205.
Web of Science ®Google Scholar
Hong, G. (2007). Marginal mean weighting adjustment for selection bias. Toronto, Canada: Ontario Institute for Studies in Education of the University of Toronto. Unpublished manuscript.
Google Scholar
Horvitz, D. G., & Thompson, D. J. (1952). A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Association, 47, 663–685. doi:10.1080/01621459.1952.10483446.
Web of Science ®Google Scholar
Iacus, S. M., King, G., & Porro, G. (2011). Multivariate matching methods that are monotonic imbalance bounding. Journal of the American Statistical Association, 106, 345–361. doi:10.1198/jasa.2011.tm09599.
Web of Science ®Google Scholar
Imbens, G. W. (2004). Nonparametric estimation of average treatment effects under exogeneity: A review. Review of Economics and Statistics, 86, 4–29. doi:10.1162/003465304323023651.
Web of Science ®Google Scholar
Jobson, J. D. (1992). Applied multivariate data analysis. Volume II: Categorical and multivariate methods. New York, NY: Springer.
Google Scholar
Jolliffe, I. T., Jones, B., & Morgan, B. J. T. (1982). Utilising clusters: A case study involving the elderly. Journal of the Royal Statistical Society, A, 145, 224–236. doi:10.2307/2981536.
Web of Science ®Google Scholar
Kang, J. D. Y., & Schafer, J. L. (2007). Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data. Statistical Science, 22, 523–539. doi:10.1214/07-STS227.
Web of Science ®Google Scholar
Lanehart, R. E., De Gil, P. R., Kim, E. S., Bellara, A. P., Kromrey, J. D., & Lee, R. S. (2012). Propensity score analysis and assessment of propensity score approaches using SAS procedures. SAS Global Forum Paper 314-2012.
Google Scholar
Lebart, L., Morineau, A., & Tabard, N. (1977). Technique de la Description Statistique. Paris, France: Dunod.
Google Scholar
Lebart, L., Morineau, A., & Warwick, K. M. (1984). Multivariate descriptive statistical analysis: Correspondence analysis and related techniques for large matrices. New York, NY: John Wiley & Sons.
Google Scholar
Lebart, L., Morineau, A., & Piron, M. (1997). Statistique exploratoire multidimensionelle. Paris, France: Dunod.
Google Scholar
Li, Q., Maasoumi, E., & Racine, J. S. (2009). A nonparametric test for equality of distributions with mixed categorical and continuous data. Journal of Econometrics, 148, 186–200. doi:10.1016/j.jeconom.2008.10.007.
Web of Science ®Google Scholar
Lunceford, J. K., & Davidian, M. (2004). Stratification and weighting via the propensity score in estimations of causal treatment effects: A comparative study. Statistics in Medicine, 23, 2937–2960. doi:10.1002/sim.1903.
PubMed Web of Science ®Google Scholar
Milligan, G. W. (1980). An examination of the effect of six types of error perturbation of fifteen clustering algorithms. Psychometrika, 45(3), 325–342. doi:10.1007/BF02293907.
Web of Science ®Google Scholar
Milligan, G. W. (1981). A Monte Carlo study of thirty internal criterion measures for cluster analysis. Psychometrika, 46, 187–199. doi:10.1007/BF02293899.
Web of Science ®Google Scholar
Morgan, S. L., & Harding, D. J. (2006). Matching estimators of causal effects: Prospects and pitfalls in theory and practice. Sociological Methods and Research, 35, 3–60. doi:10.1177/0049124106289164.
Web of Science ®Google Scholar
Peck, L. R. (2005). Using cluster analysis in program evaluation. Evaluation Review, 29, 178–196. doi:10.1177/0193841X04266335.
PubMed Web of Science ®Google Scholar
Peck, L. R. (2007). What are the effects of welfare sanction policies? Or, using propensity scores as a subgroup indicator to learn more from social experiments. American Journal of Evaluation, 28, 256–274. doi:10.1177/1098214007304129.
Web of Science ®Google Scholar
Peck, L. R., Camillo, F., & D'Attoma, I. (2010). A promising new approach to eliminating selection bias. Canadian Journal of Program Evaluation, 24, 31–56.
Google Scholar
Peck, L. R., D'Attoma, I., Camillo, F., & Guo, C. (2012). A new strategy for reducing selection bias in non-experimental evaluations, and the case of how public assistance receipt affects charitable giving. Policy Studies Journal, 40, 601–625. doi:10.1111/j.1541-0072.2012.00466.x.
Web of Science ®Google Scholar
Robins, J. M., Hernan, M. A., & Brumback, B. (2000). Marginal structural models and causal inference in epidemiology. Epidemiology, 11, 550–560. doi:10.1097/00001648-200009000-00011.
PubMed Web of Science ®Google Scholar
Rosenbaum, P. R., & Rubin, D. B. (1983). The central role of the propensity score in observational studies for causal effects. Biometrika, 70, 41–55. doi:10.1093/biomet/70.1.41.
Web of Science ®Google Scholar
Rosenbaum, P. R., & Rubin, D. B. (1984). Reducing bias in observational studies using subclassification on the propensity score. Journal of the American Statistical Association, 79, 516–524. doi:10.1080/01621459.1984.10478078.
Web of Science ®Google Scholar
Rosenbaum, P. R., & Rubin, D. B. (1985). The Bias Due to Incomplete Matching. Biometrics, 41, 103–116.
PubMed Web of Science ®Google Scholar
Rubin, D. B. (1973). Matching to remove bias in observational studies. Biometrics, 29, 159–183. doi:10.2307/2529684.
Web of Science ®Google Scholar
Rubin, D. B. (2001). Using propensity scores to help design observational studies: Application to the tobacco litigation. Health Services and Outcomes Research Methodology, 2, 169–188. doi:10.1023/A:1020363010465.
Google Scholar
Salvador, S., & Chan, P. (2004). Determining the number of clusters/segments in hierarchical clustering/segmentation algorithms. In Proceedings of the 16th IEEE International Conference on Tools with AI (ICTAI). Los Angeles, CA: IEEE Computer Society.
Google Scholar
Shadish, W. R., Clark, M. H., & Steiner, P. M. (2008). Can nonrandomized experiments yield accurate answers? A randomized experiment comparing random and nonrandom assignment. Journal of the American Statistical Association, 103, 1334–1356. doi:10.1198/016214508000000733.
Web of Science ®Google Scholar
Shadish, W. R., & Steiner, P. M. (2010). A primer on propensity score analysis. Newborn and Infant Nursing Reviews, 10, 19–26. doi:10.1053/j.nainr.2009.12.010.
Google Scholar
Schafer, J. L., & Kang, J. (2008). Average causal effects from nonrandomized studies: A practical guide and simulated example. Psychological Methods, 13, 279–313. doi:10.1037/a0014268.
PubMed Web of Science ®Google Scholar
Smith, J., & Todd, P. (2005). Does matching overcome LaLonde's critique of nonexperimental estimators? Journal of Econometrics, 125, 305–353. doi:10.1016/j.jeconom.2004.04.011.
Web of Science ®Google Scholar
Stone, C. A., & Tang, Y. (2013). Comparing propensity score methods in balancing covariates and recovering impact in small sample educational program evaluations. Practical Assessment, Research & Evaluation, 18(13), 1–12.
Google Scholar
Strehl, A., & Ghosh, J. (2002). Cluster ensembles: A knowledge reuse framework for combining multiple partitions. Journal of Machine Learning Research, 3, 583–617.
Google Scholar
Tebes, J. K., Feinn, R., Vanderploeg, J. J., Chinman, M. J., Shepard, J., Brabham, T., … , & Connel, C. (2007). Impact of a positive youth development program in urban after-school settings on the prevention of adolescent substance use. Journal of Adolescent health, 41, 239–247. doi:10.1016/j.jadohealth.2007.02.016.
PubMed Web of Science ®Google Scholar
Thoemmes, F. J., & Kim, E. S. (2011). A systematic review of propensity score methods in the social sciences. Multivariate Behavioral Research, 46, 90–118. doi:10.1080/00273171.2011.540475.
PubMed Web of Science ®Google Scholar
Tibshirani, R., Walther, G., & Hastie, T. (2001). Estimating the number of clusters in a data set via the Gap Statistic. Journal of the Royal Statistical Society (Series B), 63, 411–423. doi:10.1111/1467-9868.00293.
Google Scholar
Topchy, A., Jain, A. K., & Punch, W. (2004). Clustering ensembles: Models of consensus and weak partitions. In IEEE International Conference on Data Mining, ICDM 03 & SIAM International Conference on Data Mining. Los Angeles, CA: IEEE Computer Society.
Google Scholar
Yoshikawa, H., Rosman, E. A., & Hsueh, J. (2001). Variation in teenage mothers', experiences of child care and other components of welfare reform: Selection processes and developmental consequences. Child Development, 72, 299–317. doi:10.1111/1467-8624.00280.
PubMed Web of Science ®Google Scholar
Zanutto, E. L. (2006). A comparison of propensity score and linear regression analysis of complex survey data. Journal of Data Science, 4, 67–91.
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

A Comparison of Bias Reduction Methods: Clustering versus Propensity Score Subclassification and Weighting

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

A Comparison of Bias Reduction Methods: Clustering versus Propensity Score Subclassification and Weighting

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date