1,092
Views
2
CrossRef citations to date
0
Altmetric
Articles

My musings on a pioneering work of Erich Lehmann and its rediscoveries on some families of distributions

ORCID Icon
Pages 8066-8073 | Received 06 Sep 2020, Accepted 22 Feb 2021, Published online: 17 Mar 2021

Abstract

The paper of Erich Lehmann (Lehmann Citation1953) is renowned for its ground-breaking contribution to rank tests within the area of nonparametric statistics. This work, in all likelihood, is known to every statistician. But, what is likely not known to many are some of the novel concepts and models that this paper succinctly introduced to the area of distribution theory. This, unfortunately, has led to some of these being rediscovered in the literature and then being referred to under different names. The purpose of this note is, therefore, two-fold: first to explain the key models that are contained in the mentioned work of Erich Lehmann, and second to point out how some of the known models discussed in the distribution theory and stochastic modeling literature are indeed present either explicitly or implicitly in the paper of Lehmann.

1. Introduction

Following the publication of two exhaustive volumes on Continuous Univariate Distributions by Johnson et al. (Johnson, Kotz, and Balakrishnan Citation1994, Citation1995), the area of Statistical Distribution Theory has exploded during the last 25 years, introducing many extensions, generalizations, and “new constructions” of some general families of distributions. These are clearly intended to provide greater flexibility while modeling statistical data. Unfortunately, in the ardor that ensued in this regard, some well-known work either got overlooked, or rediscovered! One such example is the classic paper of Lehmann (Citation1953) in which several novel concepts and models pertaining to distribution theory were introduced, albeit for a study of nonparametric tests!

In this short note, I will first explain the key models that are present in Lehmann (Citation1953) and then point out how some of the known models (that are currently studied and discussed in the distribution theory literature) under different names are contained either explicitly or implicitly in the above mentioned work of Lehmann.

2. Exponentiated distributions

By starting with the one-sided two-sample hypothesis testing problem, Lehmann points out that one could formulate the hypothesis test as H0:F(x)=G(x) vs. the alternative H1:G(x)F(x) for all x. He then states that one could take G(x)=F(xa) for some a > 0 (be reminded that it is a one-sided testing problem), analogous to parametric tests (like the Student-t test), and that the distribution of ranks would then depend not only on the location-shift a but also on the distributional form F(x) itself (see p. 23). Hence, he realizes the need for introducing a “natural nonparametric alternative” against which the distribution of ranks would be constant (see p. 24). With this intent, and keeping in mind that the alternative of interest is G(x)F(x), he proposes the alternative H1:G(x)=(F(x))k for some k1 (see p. 27). He then gives two intuitive reasonings for this particular choice: first is that when k is an integer, G is precisely the distribution function of the maximum of k iid random variables with distribution function F, and second is that with this choice Pr(X<Y)=kk+1, where X is distributed as F and Y is distributed as G.

There are two key aspects to this choice: (i) as k increases, the distribution G would shift more and more to the right of F (recall here the notion of usual stochastic order FstG), and (ii) so ranks of Y-values will tend to become larger in the combined sample when k increases. His Figure 1A–C on pages 26–28 visually demonstrates these two aspects for different choices of k.

The ingenuity in the proposal of the family G(x)=(F(x))k is that it is a natural nonparametric family of distributions which is stochastically ordered with respect to F(x) for k > 1 as mentioned above (and naturally in the reverse ordering when 0<k<1). Moreover, the “distance” between the two distributions is captured by the parameter k no matter what the “baseline” (in today’s jargon) distribution function F is!

It is evident that the use of any specific distributional form for F(x) will result in producing a corresponding family of parametric distributions. Let us take, for example, the two-parameter Weibull distribution with distribution function F(x)=1e(x/σ)α, for x,σ,α>0, where σ is the scale parameter and α is the shape parameter. Then, it simply results in the distribution function G(x)={1e(x/σ)α}k, k > 0, which is what has been proposed as exponentiated-Weibull family by Mudholkar and Srivastava (Citation1993); see also its subsequent use in the analysis of the well-known bus-motor-failure data by Mudholkar, Srivastava, and Freimer (Citation1995).

Another choice for the baseline distribution is the two-parameter exponential distribution with distribution function F(x)=1e(xμ)/σ,x>μ,σ>0, where μ is the location (threshold) parameter and σ is the scale parameter. This leads to the distribution function G(x)={1e(xμ)/σ}k, k > 0, that has been introduced as generalized exponential distribution by Gupta and Kundu (Citation1999).

In neither of these works, the original paper of Lehmann (Citation1953) has been cited which, as mentioned earlier, is not surprising as many are not aware of the key distributional aspects in the paper of Lehmann! However, in the paper by Cordeiro, Ortega, and de Cunha (Citation2013), the connection of Lehmann distributions of both types to exponentiated distributions has been noted. In fact, the author of this paper had a discussion with the first author (Gauss Cordeiro) of the mentioned manuscript about this specific issue!

During the last two decades, many more such “exponentiated distributions” have proliferated the distribution theory literature simply with different choices of F(x). It needs to be stated, of course, that these distributions, though are all part of the Lehmann family, could have their own intricacies in inferential aspects or when being used for modeling purposes.

3. PHR and PRHR distributions

In reliability theory, two useful reliability functions that have been studied extensively are hazard rate and reversed hazard rate functions. They are defined for lifetime distributions as (1) hF(x)=f(x)1F(x) and rF(x)=f(x)F(x),x>0;(1) see, for example, Shaked and Shanthikumar (Citation1994). While the hazard rate function seems to have been well studied in the literature, the reversed hazard rate function concept is more recent and possibly first introduced by Keilson and Sumita (Citation1982). A detailed discussion on reversed hazard rate function and its properties was later provided by Block, Savits, and Singh (Citation1998).

The Lehmann family of distributions G(x) is intrinsically connected with the reversed hazard rate function rF(x) though none of the mentioned papers made this connection. Interestingly, this was duly observed in a simultaneous independent work of Gupta, Gupta, and Gupta (Citation1998), even explicitly mentioning Lehmann alternatives in the title of their paper. With F(x) as the baseline distribution function and rF(x) denoting the corresponding reversed hazard rate function, it is readily evident that (2) F(x)=exp{xrF(t)dt} and f(x)=rF(x)exp{xrF(t)dt},(2) as stated in Finkelstein (Citation2002). In order for the above integral to exist, with the condition that F(0)=0 over the support [0,), we need to have 0rF(t)dt=, as specified by Block, Savits, and Singh (Citation1998) and Finkelstein (Citation2002). The connection between the function rF(x) and the Lehmann family becomes abundantly clear from the fact that (3) rG(x)=g(x)G(x)=k(F(x))k1f(x)(F(x))k=krF(x),(3) resulting in the so-called Proportional Reversed Hazard Rates (PRHR) family, as stated explicitly by Gupta, Gupta, and Gupta (Citation1998). In Equation(3), k will be interpreted as the proportionality parameter and will be linked to the covariates in the data, usually through a log-linear link function; one may see Kalbfleisch and Lawless (Citation1991) wherein this is explicitly presented in their Eq. (20) (on p. 26), but referring to the function rG(x) as “reverse time hazards” and the distribution G(x) as a “power model”. The choices of Pareto distribution with F(x)=1(1+x)α,0<x,α< (see Gupta, Gupta, and Gupta Citation1998), and the Fréchet (or inverse Weibull) distribution with F(x)=exγ,0<x,γ< (see Di Crescenzo Citation2000) serve as good examples for the PRHR family. Unfortunately, some of the subsequently published works on the PRHR family of distributions still did not refer to the work of Lehmann in this regard; see, for example, Gupta, Gupta, and Sankaran (Citation2004), Kundu and Gupta (Citation2004), Badía and Berrade (Citation2008), Sankaran and Sukumaran (Citation2014), and Seo and Kim (Citation2020).

The fundamental idea behind the construction of this family of distributions by Lehmann (Citation1953) is that, in the two-sample case, we have two distributions (F, G), which may be represented, by a single function ϕ, as (F,ϕ(F)), where ϕ is a continuous non-decreasing function with ϕ(0)=0 and ϕ=1; see this explicit comment on pages 25–26 in this regard. This is what ultimately leads to the distribution of ranks being constant, through the use of a probability integral transformation U=F(X). Of course, he used ϕ(F)=Fk as an example, stating its advantage of admitting a simple interpretation of it as an alternative and it being the distribution of the maxima of k iid random variables from F when k is a positive integer (see p. 27); also, on page 28, he goes on to say that a similar interpretation could be provided when k is any rational number. A heedful look at the distribution of ranks that Lehmann (Citation1953) has derived shows its “symmetry” (meaning mirror-image) if the distribution G=Fk is changed to G=1(1F)k; see Lehmann (Citation1953, p. 29). This is quite intuitive in that the distribution of maxima of k iid random variables from F is being switched to that of the minima. One can only wonder whether Lehmann realized this property and so did not use it as another example, or he did not observe this property.

In any case, this has been duly noted by Shorack (Citation1966, Citation1967) who, in fact, refers to the model G(x)=1(1F(x))k also as Lehmann alternative for the reasons explained above. Davies (Citation1971) directly and explicitly refers to it as Lehmann alternative in addition to explaining its importance from a reliability testing perspective. Subsequently, in the nonparametric literature, both distributions, i.e., G=Fk and G=1(1F)k, are commonly referred to as Lehmann families or Lehmann alternatives; for example, see Lin (Citation1990), Lin and Sukhatme (Citation1992), van der Laan and Chakraborti (Citation1999), and Balakrishnan and Ng (Citation2006). However, it is evident that this Lehmann family corresponds to the case wherein (4) hG(x)=g(x)1G(x)=k(1F(x))k1f(x)(1F(x))k=khF(x),(4) which is the well-known Proportional Hazards Model. In Equation(4), k is the proportionality parameter and is commonly linked to the covariates in the data through a log-linear link function. One of the earliest works in this regard, in fact, a prelude to the pioneering work of Cox (Citation1972) on proportional hazards model under a semi-parametric setting, is the paper by Feigl and Zelen (Citation1965) (see also, Cox Citation1964). Specifically, Feigl and Zelen used the model in Equation(4), with baseline exponential distribution, to model the survival times of leukemia patients with white blood cell count as a covariate to estimate the parameter k. Strangely enough, all subsequent works on the model in Equation(4) in the areas of distribution theory and lifetime analysis, whether in a fully parametric or a semi-parametric form, refer to the model as PH distribution, while those in the area of nonparametric statistics refer to it as Lehmann family! Interestingly, in the book Analysis of Survival Data by Cox and Oakes (Citation1984), the authors explicitly refer to this family as Lehmann family; see their Eq. (2.26) on page 23. In the paper by Hall and Wellner (Citation2013), an explicit comment to the fact that the two Lehmann families correspond to proportional hazards and reversed proportional hazards distributions has been made.

One final comment that is worth making is that the two Lehmann families of distributions can be unified in a simple way to a single family form as G*(x)=[1{1F(x)}]k,k,>0, with =1 and k = 1 deducing to the two Lehmann families discussed so far.

4. Transmuted distributions

Shaw and Buckley (Citation2007, Citation2009) used a transformation map to propose a quadratic rank transmuted distribution of the form (5) G(x)=(1+λ)F(x)λ(F(x))2,(5) where |λ|1 and F(x) is the baseline distribution function. Since then, numerous authors have studied this transmuted family of distributions, by making different choices for the baseline distribution function F(x) (too many, in fact, to list here); see, for example, Aryal and Tsokos (Citation2011) and Kozubowski and Podgorski (Citation2016).

Turning our attention again to the paper of Lehmann (Citation1953), in Section 6 of his paper, he introduces yet another family of distributions of the form (6) G(x)=qF(x)+p(F(x))2,0p,q1,(6) where p+q=1; see Eq. (6.1) in Lehmann (Citation1953, p. 34). It is evident that this family is another convenient nonparametric family of distributions from Lehmann’s viewpoint. Curiously, other than introducing this family of distributions and then using it in a subsequent power analysis of rank tests, Lehmann does not discuss any properties or intuitive interpretation for this family even though, in my view, it is as natural and appealing a distribution as his first family. Upon observing that the distribution in (6) is indeed a two-component mixture distribution and that F(x)(F(x))2, we can reparametrize Equation(6) in the form of Equation(5) with an expanded range for the parameter λ.

Another mixture formulation, using Lehmann family, can be provided as follows. Let us suppose Z, Y1, and Y2 are iid random variables from the baseline distribution F(x). Upon observing that the distribution in Equation(6) in indeed a two-component mixture distribution, we first define the random variable (7) X=dZwith probability q=dmax(Y1,Y2)with probability p,(7) where q=1p, with its distribution function as in Equation(6). Next, we use the well-known “triangle rule” for order statistics (see, for example, Arnold, Balakrishnan, and Nagaraja Citation1992) which states that (8) Z=dmin(Y1,Y2) with probability 12=dmax(Y1,Y2) with probability 12.(8) By combining the two mixture forms in Equation(7) and Equation(8), we arrive at (9) X=dmin(Y1,Y2) with probability q2=dmax(Y1,Y2) with probability p+q2.(9)

The distribution of X obtained readily from Equation(9), after a reparametrization, turns out to be identical to Shaw and Buckley’s transmuted distribution in Equation(5). The above mixture representation of the transmuted distribution in Equation(5) has been offered by Granzotto, Louzada, and Balakrishnan (Citation2017). They then proposed a cubic rank transmuted distribution by considering the random variable X with distribution function G as a mixture of the three order statistics obtained from iid random variables Y1,Y2,Y3 from the baseline distribution F(x). Though this differs from the Lehmann family in Equation(6), it just turns out to be an extension with one more mixture component with distribution function (F(x))3!

5. Concluding remarks

In this paper, we have highlighted the novel concepts and statistical models that have been introduced in the pioneering paper of Lehmann (Citation1953). Even though this paper is well known to researchers in the area of nonparametric statistics, it seems that many in the area of distribution theory are not aware of this important aspect of the paper. This, unfortunately, has led to some of these distributions being rediscovered in the literature under different names. For this reason, we have explained the key distributions that are present in Lehmann (Citation1953) and as to how some of the known models in the distribution theory literature are present either explicitly or implicitly in the ground-breaking work of Lehmann!

Acknowledgments

The author thanks the Natural Sciences and Engineering Research Council of Canada for funding this research through an Individual Discovery Grant 2020-06733, and the Associate Editor and the Reviewer for some useful comments and suggestions on an earlier version of this manuscript.

References

  • Arnold, B. C., N. Balakrishnan, and H. N. Nagaraja. 1992. A first course in order statistics. New York: John Wiley & Sons.
  • Aryal, A. R., and C. P. Tsokos. 2011. Transmuted Weibull distribution: A generalization of the Weibull probability distribution. European Journal of Pure and Applied Mathematics 4:89–102.
  • Badía, F. G., and M. D. Berrade. 2008. On the reversed hazard rate and mean inactivity time of mixtures. In Advances in Mathematical Modeling for Reliability, eds. T. Bedford, J. Quigley, and L. Walls, 103–10. Amsterdam, The Netherlands: IOS Press.
  • Balakrishnan, N., and H. K. T. Ng. 2006. Precedence-type tests and applications. Hoboken, NJ: John Wiley & Sons.
  • Block, H. W., T. H. Savits, and H. Singh. 1998. The reversed hazard rate function. Probability in the Engineering and Informational Sciences 12 (1):69–90. doi:10.1017/S0269964800005064.
  • Cordeiro, G. M., E. M. Ortega, and D. C. de Cunha. 2013. The exponentiated generalized class of distributions. Journal of Data Science 11:1–17.
  • Cox, D. R. 1964. Some applications of exponential ordered scores. Journal of the Royal Statistical Society: Series B (Methodological) 26 (1):103–10. doi:10.1111/j.2517-6161.1964.tb00544.x.
  • Cox, D. R. 1972. Regression models and life-tables (with discussion). Journal of the Royal Statistical Society–Series B 34:187–220.
  • Cox, D. R., and D. Oakes. 1984. Analysis of survival data. Boca Raton, FL: Chapman and Hall/CRC Press.
  • Davies, R. B. 1971. Rank tests for Lehmann’s alternative. Journal of the American Statistical Association 66:879–83.
  • Di Crescenzo, A. 2000. Some results on the proportional reversed hazards model. Statistics & Probability Letters 50 (4):313–21. doi:10.1016/S0167-7152(00)00127-9.
  • Feigl, P., and M. Zelen. 1965. Estimation of exponential survival probabilities with concomitant information. Biometrics 21 (4):826–38. doi:10.2307/2528247.
  • Finkelstein, M. 2002. On the reversed hazard rate. Reliability Engineering & System Safety 78 (1):71–5. doi:10.1016/S0951-8320(02)00113-8.
  • Granzotto, D. C. T., F. Louzada, and N. Balakrishnan. 2017. Cubic rank transmuted distributions: Inferential issue and applications. Journal of Statistical Computation and Simulation 87 (14):2760–78. doi:10.1080/00949655.2017.1344239.
  • Gupta, R. C., P. L. Gupta, and R. D. Gupta. 1998. Modeling failure time data by Lehmann alternatives. Communications in Statistics - Theory and Methods 27 (4):887–904. doi:10.1080/03610929808832134.
  • Gupta, R. D., R. C. Gupta, and P. G. Sankaran. 2004. Some characterization results based on the factorization of the (reversed) hazard rate function. Communications in Statistics - Theory and Methods 33 (12):3009–31. doi:10.1081/STA-200038875.
  • Gupta, R. D., and D. Kundu. 1999. Generalized exponential distributions. Australian & New Zealand Journal of Statistics 41 (2):173–88. doi:10.1111/1467-842X.00072.
  • Hall, W. J., and J. A. Wellner. 2013. Efficient testing and estimation in two Lehmann alternatives to symmetry-at-zero models. In From probability to statistics and back: High-dimensional models and processes—A Festschrift in honor of Jon A. Wellner, 197–212. Beachwood, OH: Institute of Mathematical Statistics.
  • Johnson, N. L., S. Kotz, and N. Balakrishnan. 1994. Continuous univariate distributions, vol. 1, 2nd ed., New York: John Wiley & Sons.
  • Johnson, N. L., S. Kotz, and N. Balakrishnan. 1995. Continuous univariate distributions, vol. 2, 2nd ed., New York: John Wiley & Sons.
  • Kalbfleisch, J. D., and J. F. Lawless. 1991. Regression models for right truncated data with applications to AIDS incubation times and reporting lags. Statistica Sinica 1:19–32.
  • Keilson, J., and U. Sumita. 1982. Uniform stochastic ordering and related inequalities. Canadian Journal of Statistics 10 (3):181–98. doi:10.2307/3556181.
  • Kozubowski, T. J., and K. Podgorski. 2016. Transmuted distributions and random extrema. Statistics & Probability Letters 116:6–8. doi:10.1016/j.spl.2016.04.001.
  • Kundu, D., and R. D. Gupta. 2004. Characterizations of the proportional (reversed) hazard model. Communications in Statistics - Theory and Methods 33 (12):3095–102. doi:10.1081/STA-200039053.
  • Lehmann, E. L. 1953. The power of rank tests. The Annals of Mathematical Statistics 24 (1):23–43. doi:10.1214/aoms/1177729080.
  • Lin, C.-H., and S. Sukhatme. 1992. On the choice of precedence tests. Communications in Statistics - Theory and Methods 21 (10):2949–68. doi:10.1080/03610929208830955.
  • Lin, C.-H. 1990. Powers of two-sample rank tests under Lehmann alternatives. PhD Thesis., Iowa State University, Ames, IA.
  • Mudholkar, G. S., and D. K. Srivastava. 1993. Exponentiated Weibull family for analyzing bathtub failure data. IEEE Transactions on Reliability 42 (2):299–302. doi:10.1109/24.229504.
  • Mudholkar, G. S., D. K. Srivastava, and M. Freimer. 1995. The exponentiated Weibull family: A reanalysis of the bus-motor-failure data. Technometrics 37 (4):436–45. doi:10.1080/00401706.1995.10484376.
  • Sankaran, P. G., and A. Sukumaran. 2014. Modeling lifetime data with multiple causes using cause specific reversed hazard rates. Statistica 74:315–34.
  • Seo, J. I., and Y. Kim. 2020. Note on the family of proportional reversed hazard distributions. Communications in Statistics—Simulation and Computation. Advance online publication. doi:10.1080/03610918.2020.1783558.
  • Shaked, M., and J. G. Shanthikumar. 1994. Stochastic orders and their applications. New York: Academic Press.
  • Shaw, W. T., and I. R. C. Buckley. 2007. The alchemy of probability distributions: Beyond Gram-Charlier and Cornish-Fisher expansions, and skew-normal or kurtic-normal distributions. UCL Discovery Repository:1–16.
  • Shaw, W. T., and I. R. C. Buckley. 2009. “The alchemy of probability distributions: Beyond Gram-Charlier expansions, and skew-kurtotic-normal distribution from a rank transmutation map,” https://EconPapers.repec.org/RePEc:arx:papers:0901.0434
  • Shorack, R. A. 1966. Recursive generation of the distribution of the Mann-Whitney-Wilcoxon U-statistic under generalized Lehmann alternatives. The Annals of Mathematical Statistics 37 (1):284–6. doi:10.1214/aoms/1177699621.
  • Shorack, R. A. 1967. Tables of the distribution of the Mann-Whitney-Wilcoxon U-statistic under Lehmann alternatives. Technometrics 9 (4):666–77. doi:10.1080/00401706.1967.10490515.
  • van der Laan, P., and S. Chakraborti. 1999. Best precedence tests against Lehmann alternatives. Bulletin of the International Statistical Institute 58:395–6.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.