Abstract
Scale-contaminated normal distributions have been widely used in numerical studies of robustness requiring distributions that are elongated; that is, stretched relative to Gaussian behavior. The contaminated normal family has much appeal, in part because the mechanism used to generate such random variables seems a realistic model for creating outliers. But this family also has serious deficiencies as a test bed for studying the effects of elongation. Specifically, there are no truly heavy-tailed contaminated normal distributions, and the parameters of the family control elongation in a way that is readily misunderstood. Thus, when sampling from the contaminated normal family, it is difficult to systematically manipulate elongation and easy to misconstrue results. A Symmetrie distribution is elongated to the extent that its quantiles depart from the center of symmetry more rapidly than do those of a Gaussian distribution. It is essential to distinguish between elongation or Stretching and scale or dispersion. We argue that failure to make this distinction leads to misconceptions about the manner in which contaminated normals are non-Gaussian and about the way in which their shape is controlled by the two parameters: contamination rate and contaminant scale. The methods used here are largely graphical in nature and rely heavily on a scale-free diagnostic plot that shows both the magnitude and location of elongation in a Symmetrie distribution. This plot is supplemented by a simple approximation, in terms of Standard normal quantiles, for the quantiles of the contaminated normal family. We also examine some published research based on the contaminated normal family and show that the diagnostic plot can aid understanding trends in such results. Finally, we briefly consider some alternatives to the contaminated normal family, distributions that may be useful in designing numerical studies because they permit a more clear-cut manipulation of elongation with easy generation of random variates.