Original Articles

A Grouping Algorithm for Qualitative Data with A Dichotomous Outcome Variable

Pages 168-174 | Received 01 Jul 1983, Published online: 09 Jul 2007
 

Abstract

This paper presents an algorithm for grouping the values of qualitative predictor variables while minimizing the loss of information about a dichotomous dependent variable. The algorithm is based on Shannon's measure of uncertainty. Subpopulations corresponding to the predictor values are ranked by their conditional Bernoulli parameter. At each iteration, the increase in uncertainty resulting from grouping each pair of adjacent subpopulations is computed, and the pair with the least increase is grouped. Stopping rules based on the number of values remaining, the cumulative loss of information, and the maximum likelihood chi-square statistic are proposed. A numerical example is included.
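
The procedure described in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract gives no formulas, so the sketch assumes that "uncertainty" means the conditional Shannon entropy of the outcome given the predictor, measured in bits and weighted by subpopulation size. The names `binary_entropy`, `merge_cost`, `group_values`, `min_groups`, and `max_loss` are hypothetical, and only two of the three stopping rules (number of values remaining, cumulative loss) are shown; the chi-square rule is omitted.

```python
import math


def binary_entropy(p):
    """Shannon entropy, in bits, of a Bernoulli(p) variable."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def merge_cost(a, b, total):
    """Increase in conditional uncertainty H(Y|X) from pooling groups a and b.

    Each group is a tuple (labels, n, successes). Assumed formula: the
    size-weighted entropy of the pooled group minus that of the two parts.
    """
    (_, na, ka), (_, nb, kb) = a, b
    pooled = (na + nb) * binary_entropy((ka + kb) / (na + nb))
    separate = na * binary_entropy(ka / na) + nb * binary_entropy(kb / nb)
    return (pooled - separate) / total


def group_values(counts, min_groups=2, max_loss=float("inf")):
    """Greedily merge adjacent predictor values.

    counts maps each predictor value to (n, successes).
    Returns (groups, cumulative_loss), where each group is
    (list_of_values, n, successes).
    """
    groups = [([v], n, k) for v, (n, k) in counts.items() if n > 0]
    # Rank subpopulations by their estimated Bernoulli parameter k / n.
    groups.sort(key=lambda g: g[2] / g[1])
    total = sum(n for _, n, _ in groups)
    loss = 0.0
    while len(groups) > min_groups:
        costs = [merge_cost(groups[i], groups[i + 1], total)
                 for i in range(len(groups) - 1)]
        i = min(range(len(costs)), key=costs.__getitem__)
        # Stopping rule: cumulative loss of information (assumed threshold).
        if loss + costs[i] > max_loss:
            break
        (la, na, ka), (lb, nb, kb) = groups[i], groups[i + 1]
        groups[i:i + 2] = [(la + lb, na + nb, ka + kb)]
        loss += costs[i]
    return groups, loss


if __name__ == "__main__":
    # Made-up counts for illustration only: value -> (n, successes).
    example = {"a": (100, 10), "b": (100, 12), "c": (100, 50), "d": (100, 90)}
    merged, lost = group_values(example, min_groups=3)
    for labels, n, k in merged:
        print(labels, n, round(k / n, 3))
    print("cumulative loss (bits):", lost)
```

Because the pooled proportion of two adjacent groups always lies between their individual proportions, the ranking stays valid after each merge, so the sketch sorts only once.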
