Full article: The Gini coefficient and discontinuity

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

This article reveals a discontinuity in the mapping from a Lorenz curve to the associated cumulative distribution function. The problem is of a mathematical nature—based on an analysis of the transformation between the distribution function of a bound random variable and its Lorenz curve. It will be proven that the transformation from a normalized income distribution to its Lorenz curve is a continuous bijection with respect to the $L^{q}$ ([0,1])-metric—for every q ≥ 1. The inverse transformation, however, is not continuous for any q ≥ 1. This implies a more careful attitude when interpreting the value of a Gini coefficient. A further problem is that if you have estimated a Lorenz curve from empirical data,then you cannot trust that the associated distribution is a good estimate of the true income distribution.

Keywords:

PUBLIC INTEREST STATEMENT

This article deals with the relation between the distribution of income and the measurement of economic inequality in a society. The latter is often expressed as the Gini coefficient, G: the expected difference between two randomly drawn household or individual incomes divided by two times the average income. This division was made to be able to compare the magnitude for different societies. Looking at the income distribution, a reasonable degree of equality in the actual society must imply that the difference between the maximal income and the average income is not too large. If we divide this number by the maximal income, we get a quantity, H, comparable between societies. In this article, it will be shown not only that if H is close to zero, then G is close to zero but also that the opposite is not necessarily true. The most direct consequence is that a small G is not enough to ensure relative economic equality in a society.

1. Introduction

Since the 1960s, economists have widely accepted the Lorenz curve as the tool for deriving measures of income inequality in society, among them the Gini coefficient. The traditional method was to group data in a number of intervals and assume all incomes in an interval to be equal to the average income in the actual interval, Morgan (Citation1962). This gives a lower limit of the “true” Gini coefficient.

The ability in our time to collect and centralize precise data about individual income implies that direct methods are now used to compute the Gini coefficient (see, OECD-IDD, Citation2017, p. 8). This actual OECD method does not take its offset in the Lorenz curve of observed data. It is based on the relative mean differences of observed income data.Footnote¹

In education in the economic sciences, however, the Lorenz curve keeps its position in illustrating the Gini coefficient. Still, in this century, scholars find new ways to derive the approximations made in the 1950s to 70s (see Golden (Citation2008) and Farris (Citation2010)).

At least since 1970, there has been a critical attitude towards the Gini coefficient as a precise measure of inequality (see Atkinson (Citation1970)). Moreover, different proposals using the Lorenz curve have been advanced to give a more multi-faceted idea of inequality. Most influential in that respect were Kakwani (Citation1980), Donaldson & Weymark (Citation1983), and Yitzhaki (Citation1983) with their generalized or higher-order Gini coefficients. Their formulas turned out to be equivalent. In the last twenty years, a variety of new inequality measures have been developed, among them the generalized entropy family of indices. All the way through, we alternatively could use the ratio of the top to bottom shares (see, e.g., Liu and Gastwirth (Citation2020)).

The voice of critics is thus rather comprehensive. The aim of the present article is to point out 2 potential problems working with the Gini coefficient, problems that remain even when using the class of generalized Gini coefficients proposed by Kakwani and others. This is not just another demonstration of the fact that 2 different distributions could have the same Gini coefficient. Rather, we will discover that a small Gini coefficient does not necessarily imply a noticeable degree of equality. Furthermore, if you try to obtain the populations' income distribution from an estimated Lorenz curve—that is solving an inverse problem—then your result might be far from the true distribution. The cause to the problems is not solely due to observed data. It is a discontinuity in the relationship between 'the distribution function and the Lorenz curve for a bounded random variable, which brings trouble.

Consequently, we will in section 2 establish the connection between a cumulative distribution function for a bound, non-negative random variable and its Lorenz curve. It will be proved that any non-decreasing, convex function mapping [0,1] on [0,1] with a non-vertical left-hand tangent in (1,1) will be the Lorenz curve for some bound distribution. the correspondence is 1–1 up to scale. In section 3, the set of normalized income distributions and its subset, the Lorenz curves will be conceived of as subsets in the linear $L^{q} ([0, 1])$ -spaces. Thus, for any $q \geq 1$ , a metric is present, and it will be established that the Lorenz curve results from its income cdf through a continuous transformation. Traditional measures of inequality, especially the Gini coefficient, appear as distances in $L^{q} ([0, 1])$ , and in section 4, we shall see that the inverse transformation, mapping the Lorenz curve to its normalized distribution function, is not continuous. In section 5, we will draw some implications from this fact. The results will be derived in a general manner, which means that there will be no restrictions with respect to the type of bound distribution. This implies that the formal language departs somewhat from prevalent presentation in the economic literature.

2. The transformation mapping a cumulative distribution function to its Lorenz function

A Lorenz curveFootnote² is formally a curve in the plane with the property, which for every point belonging to it, $(p, y), p$ , will denote a fraction of a population, while $y$ will denote the relative share of some limited resource or goods, which this fraction possesses. $p$ is explicitly the fraction that has the lowest share of the resource. If we assume that $y$ can never be negative, the curve will contain the points (0,0) and (1,1), and it will be non-decreasing. The associated cumulative distribution function, which has this curve as its graph, will in accordance with the current style also be termed the Lorenz curve. In fact, we have implicitly chosen a statistical model that operates with a large or an indefinite number of members of the population, which is treated as a continuous medium. Furthermore, we will only work with bound and non-negative distributions of the good.

As preliminary results, we have that for any real, non-negative and bound random variable $X$ , with cumulative distribution function $F,$ the expectation exists and could be calculated as

(1)

m = E (X) = \int_{0}^{X_{e . s .}} (1 - F (x)) d x = \int_{0}^{1} F^{- 1} (u) d u .

(1)

The integral used is the Lebesgue integral, and $X_{e . s .}$ is the essential supremum of $X .$ Note that (1) is valid for any mixture of continuous and atomic distribution functions.

$F^{- 1} (u)$ might not exist as a function as it is not required that $F$ is strictly increasing. So, in this text, $F^{- 1} (u)$ simply means the u-fractile of $F$ , formally,

$F^{- 1} (u) = i n f \{x : F (x) \geq u\}$

If you are not used to work with $F^{- 1}$ in this way, the correctness of the last equality sign in (1) can be justified by Figure . The red curve is the graph of $F,$ and the area of the shaded set is both, $E (X)$ and $\int_{0}^{1} F^{- 1} (u) d u,$

Figure 1. The graph of a non-continuous cdf. The area of the shaded set is $\int_{0}^{1} F^{- 1} (u) d u$ .

Figure 1. The graph of a non-continuous cdf. The area of the shaded set is ∫01F−1udu.

(2)

L (p) = \frac{1}{m} \int_{0}^{p} F^{- 1} (u) d u, p \in [0, 1] .

(2)

Expression (2) was used by Gastwirth to define the Lorenz curve (see Gastwirth (Citation1971)). Dorfman (Citation1979) in fact generally proves an equivalent result to (2).Footnote³

Note that there is no problem with this definition. As $F^{- 1} (u)$ is uniquely determined as a measurable function with up to countably many discontinuities, $L (p)$ is given in [0.1]. That the Lorenz curve might not be differentiable is merely a consequence of the model (half of me earns half of my income, and my income ranked neighbor to the right-hand side might earn the same as me or (considerably) more). If $F$ is an empirical cdf, then one could object to its application if the sample is small (see Yitzhaki and Schechtman (Citation2013) p 28–29). Let us assume that this is not the case.

Formula (2) defines a mapping, $L$ , of the class of non-negative, finite distribution functions into itself,

$L : F (x) ↷ L (p)$ .

Furthermore, (2) ensures that the Lorenz curve, $L,$ will always be convex.

As the third assumption, familiar to the reader, we have that the distribution function of $a \cdot X, a > 0$ is given by $F_{a X} (x) = F_{X} (\frac{x}{a}) .$ We therefore conclude that

(3)

L (F_{a X}) = \frac{1}{m} \int_{0}^{p} {F_{X}}^{- 1} (u) d u .

(3)

We will now determine the preimage of an arbitrary Lorenz curve,

L .

Formula (3) means that for every

a > 0,

F_{a X}

will belong to the same preimage as

F_{X} .

EquationEquation (2)(2) $L (p) = \frac{1}{m} \int_{0}^{p} F^{- 1} (u) d u, p \in [0, 1] .$ (2) is equivalent to

m \cdot L (p) = \int_{0}^{p} F^{- 1} (u) d u

(4)

\Leftrightarrow m \cdot l (p) = F^{- 1} (p) .

(4)

Here, $l (p)$ is the density function corresponding to $L (p)$ , uniquely determined almost everywhere.

Note that $l (p)$ is non-decreasing, and because of that, its inverse function will exist in the sense explained above.

In order that $X$ is bound, $F^{- 1} (1)$ must be the finite number $X_{e . s .}$ Consequently, we have

Theorem 2.1

A non-negative random variable is bound if, and only if, the Lorenz curve associated with it has a non-vertical left-hand tangent in the point (1,1). The slope of this tangent is $\frac{X_{e . s .}}{m}$

EquationEquation (4)(4) $\Leftrightarrow m \cdot l (p) = F^{- 1} (p) .$ (4) is equivalent to

(m \cdot l)^{- 1} (x) = F (x)

\Leftrightarrow l^{- 1} (\frac{x}{m}) = F (x) .

So, for any fixed $m$ and any given $L$ —without a vertical left-hand tangent in (1,1)— $F$ as a cumulative distribution function will be uniquely determined for $x \in [0, X_{e . s .}] .$ We can arrive at the below conclusion:

If the cumulative distribution function $F_{X}$ for some non-negative, finite random variable $X$ is in the preimage, with respect to the mapping $L,$ of a Lorenz function $L$ , the preimage will be exactly ${F_{a X} : a > 0}$ . Thus, we have that

Theorem 2.2

Any finite non-negative distribution function—up to scale—is determined by its Lorenz curve. If the expectation or essential supremum is known, the distribution function is uniquely given.

Formula (4) and Theorem 2.2 were proved by Lambert (Citation1990), p 40–41, in the case of $F$ being differentiable and strictly increasing. In the present context, no results depend on the existence of a density function for the actual distribution function.

We realize that, from now on, we only have to look at the normalized random variable,

Y = \frac{1}{X_{e . s .}} \cdot X

when we are working with Lorenz functions for finite random variables. We achieve that $Y \in [0, 1]$ and that the transformation

L : F_{Y} \to \frac{1}{m} \int_{0}^{p} {F_{Y}}^{- 1} (u) d u

is an injection of the class of distribution functions for normalized non-negative random into itself. Remark: Note that for the graph of $L,$ by Theorem 2.1, the left-hand tangent in (1,1) has a slope of $\frac{1}{m}$ , where $m$ is the expectation of $Y .$

Let us illustrate what we found with an example: Suppose that in a given situation,

L (p) = \{\begin{matrix} p^{2} & f o r p \in [0, 0.5] \\ 1.5 (p - 1) + & 1 f o r p \in] 0.5, 1] \end{matrix}

and that $m = 2.$

l (p) = \frac{1}{2} \cdot F^{- 1} (p)

we get

F^{- 1} (p) = \{\begin{matrix} 4 p f o r p \in [0, 0.5] \\ 3 f o r p \in]0.5, 1] \end{matrix} .

Hence,

F (x) = \{\begin{matrix} 0.25 x f o r x \in [0, 2[ \\ 0.5 f o r x \in [2, 3[ \\ 1 f o r x = 3 \end{matrix}

It is the unique solution for $F$ in this situation.

As a random variable, $X,$ with this distribution function $F,$ has the maximal value of 3, and the normalized random variable is $Y = \frac{X}{3} . Y$ has the distribution function,

F_{Y} (x) = \{\begin{matrix} 0 .75 x f o r x \in [0, \frac{2}{3} [ \\ 0 .5 f o r x \in [\frac{2}{3}, 1 [ \\ 1 f o r x = 1 \end{matrix}

We will denote this class of distribution functions for normalized random variables $N C D F,$ Normalized Cumulative Distribution Functions.

From the way we constructed $Y,$ it is essential that any member of $N C D F$ fulfills that

x < 1 \Leftrightarrow F_{Y} (x) < 1.

We know that $L = L (F)$ is a convex function that maps [0,1] on [0,1]. From Theorem 2.1, we further know that $L$ cannot have an infinite left-hand derivative at $x = 1.$ But will any convex function with the domain and range [0,1] be the $L$ -image of some member of $N C D F$ if it only has finite left-hand derivatives?

In the strict sense of convex function, the answer must be no, because the $L$ -image will have to be a distribution function. Consider therefore a convex nondecreasing function $f,$ mapping [0,1] on [0,1], satisfying that $lim_{t \to 1_{-}} \frac{f (1) - f (t)}{1 - t}$ is a finite number. We denote this class of functions $C C D F,$ Convex Cumulative Distribution Functions. Every member of $C C D F$ must be continuous.

If $c \in]0, 1[$ then,

α (t) = \frac{f (c) - f (t)}{c - t}, t \in [0, 1], t \neq c

will be a non-decreasing and non-negative function,Footnote⁴ and hence, $α (c -) = lim_{t \to c -} α (t)$ and $α (c +) = lim_{t \to c +} α (t)$ exist and

α (c -) \leq α (c +) .

So, $f$ is differentiable from both the left and the right in any point in [0,1] and $f {^{'}}_{-} (c) \leq f {^{'}}_{+} (c) .$ $f$ must be differentiable almost everywhere in [0,1] for the following reason: The set of points fulfilling

f {^{'}}_{-} (c) < f {^{'}}_{+} (c)

is at most of numerable cardinality, because if we define

g (x) = \{\begin{matrix} f^{'} (x) i f f i s d i f f e r e n t i a b l e i n x \\ f^{'}_(x) i f f i s n o t d i f f e r e n t i a b l e i n x \end{matrix}

$g$ will be non-decreasing in [0,1] and therefore continuous almost everywhere. So, in this way, we found that $f (t) = \int_{0}^{t} g (x) d x$ on [0,1].

We can identify $g$ with $k \cdot F^{- 1}$ , the inverse function to a member of $N C D F,$ multiplied by a constant of value $g (1)$ , which, in fact, equals $\frac{1}{m}$ , with $m$ being the expectation associated with $F .$ Thus, any member of $C C D F$ is the $L$ -image of a member of $N C D F$ .

So far, our investigation has shown the following:

Theorem 2.3

The mapping

L : F \to \frac{1}{m} \cdot \int_{0}^{p} F^{- 1} (u) d u

is a bijection of the class $N C D F$ on the class $C C D F .$

Thus, any member of $C C D F$ will be the Lorenz curve for some finite cdf.

3. Convergence of sequences in NCDF and CCDF

$N C D F$ is a subset of the Banach spaces $L^{q} ([0, 1], λ)$ Footnote⁵ for every $q \in [1, \infty],$ with $λ$ being the Lebesgue measure. $N C D F$ and $C C D F$ can now be conceived of as metric spaces—the metric of course induced by $L^{q} ([0, 1], λ)$ . Neither of them is complete, which can be seen in the following example. Let

F_{n} (x) = {\begin{matrix} 1 - \frac{1}{n} f o r x \in [0, 1[ \\ 1 f o r x = 1 \end{matrix} .

Then, $\{F_{n}\}$ is a Cauchy sequence in the space $N C D F$ for any of the metrics in $L^{q} ([0, 1], λ), q \in [1, \infty] .$

Since

{∥1 - F_{n}∥}_{q} = {(\int_{0}^{1} {|1 - F_{n} (x)|}^{q} d x)}^{\frac{1}{q}} = \frac{1}{n} \to 0 f o r n \to \infty,

$\{F_{n}\}$ must converge to 1 in the $L^{q}$ -metric, $q < \infty$ . In the $L^{\infty}$ -metric—the supremum norm—the convergence is obvious. Function 1 on [0,1] is certainly not in $N C D F .$

The $L$ -image of $\{F_{n}\}$ is the sequence $\{n (p - (1 - \frac{1}{n})) \cdot 1_{[1 - \frac{1}{n}, 1]} (p)\} .$ It is a Cauchy sequence in the $L^{q}$ -metric for any real $q \geq 1,$ because the distance between numbers $n$ and $m$ is less than ${|\frac{m - n}{2 m n}|}^{\frac{1}{q}}$ , which shrinks to zero with increasing $n$ and $m$ $.$ The limit of the sequence will be

F (p) = \{\begin{matrix} 0 f o r p \in [0, 1[ \\ 1 f o r p = 1 \end{matrix}

Although $F$ is a member of $N C D F,$ and although it is convex in $[0, 1]$ , it cannot be in $C C D F$ , because this set contains exclusively continuous functions. In the $L^{\infty}$ -metric, the $L$ -image of $\{F_{n}\}$ is not even a Cauchy sequence.

We will now examine to which extent convergence of a sequence in $N C D F$ implies convergence in $C C D F$ of its $L$ -image.

Lemma 3.1

Given that $F, G \in N C D F$ , if we name the expected values connected with $F$ and $G$ , respectively, $m_{F}$ and $m_{G}$ , then

| m_{G} - m_{F} | \leq {∥F - G∥}_{1}

Proof:

|m_{G} - m_{F}| = |\int_{0}^{1} (1 - G (x)) d x - \int_{0}^{1} (1 - F (x)) d x|

= |\int_{0}^{1} (F (x) - G (x)) d x| \leq \int_{0}^{1} |F (x) - G (x)| d x .

Now, let $\{F_{n}\}$ be a sequence in $N C D F .$

At first, we demand that $\{F_{n}\}$ converges to $F$ belonging to $N C D F$ in the $L^{\infty} ([0, 1], λ)$ -metric. Let $m_{n}$ and $m$ be the expected values connected with $F_{n}$ and $F,$ respectively.

Now,

{||L (F_{n}) - L (F)||}_{\infty} = sup_{p \in [0, 1]} |\frac{1}{m_{n}} \int_{0}^{p} F_{n}^{- 1} (u) d u - \frac{1}{m} \int_{0}^{p} F^{- 1} (u) d u .|

We see that

{||L (F_{n}) - L (F)||}_{\infty} \leq \frac{|m - m_{n}|}{m \cdot m_{n}} \int_{0}^{1} F_{n}^{- 1} (u) d u + \frac{1}{m} \int_{0}^{1} |F_{n}^{- 1} (u) - F^{- 1} (u)| d u .

As a consequence of Lemma 3.1, $m_{n} \to m f o r n \to \infty,$ which means that the first term shrinks to 0 as $n$ increases.

For 2 members of $N C D F,$ $G$ and $H$ , we consider

{∥G - H∥}_{1} = \int_{0}^{1} |G (x) - H (x)| d x .

But this is exactly identical to $\int_{0}^{1} |G^{- 1} (u) - H^{- 1} (u)| d u,$ as visualized in . As $\int_{0}^{1} |F_{n} (x) - F (x)| d x$ $< \underset{x \in [0, 1]}{s u p |F_{n} (x) - F (x)|,}$ we conclude $\frac{1}{m} \int_{0}^{1} |{F_{n}}^{- 1} (u) - F^{- 1} (u)| d u \to 0$ for $n \to \infty .$

Figure 2. The graph of 2 members of $N C D F,$ named $G$ and $H .$ The area of the shaded set is ${||G - H||}_{1} .$

So, ${||L (F_{n}) - L (F)||}_{\infty} \to 0$

whenever ${||F_{n} - F||}_{\infty} \to 0.$

Next, we let $\{F_{n}\}$ converge to $F$ belonging to $N C D F$ in the $L^{1} ([0, 1], λ)$ -metric and look at

∥ L (F_{n}) - L (F) ∥_{1} = \int_{0}^{1} |\frac{1}{m_{n}} \int_{0}^{p} F_{n}^{- 1} (u) d u - \frac{1}{m} \int_{0}^{p} F^{- 1} (u) d u| d p

With an argument similar to the above one, we get that

∥ L (F_{n}) - L (F) ∥_{1} \leq \int_{0}^{1} \frac{|m - m_{n}|}{m \cdot m_{n}} \int_{0}^{1} F_{n}^{- 1} (u) d u d p + \int_{0}^{1} |\frac{1}{m} \int_{0}^{p} (F_{n}^{- 1} (u) - F^{- 1} (u)) d u| d p,

Again, the first term will shrink to zero as $n$ increases. The second term will be equal to or lesser than

\frac{1}{m} \int_{0}^{1} \int_{0}^{p} |F_{n}^{- 1} (u) - F^{- 1} (u)| d u d p = \frac{1}{m} \int_{0}^{1} \int_{u}^{1} |F_{n}^{- 1} (u) - F^{- 1} (u)| d p d u,

where we switched the order of integration. The last expression will be less than

\int_{0}^{1} |F_{n}^{- 1} (u) - F^{- 1} (u)| d u \to 0 f o r n \to \infty .

So,

{||F_{n} - F||}_{1} \to 0 \Rightarrow {||L (F_{n}) - L (F)||}_{1} \to 0.

We now face the case where $\{F_{n}\}$ converges to $F$ belonging to $N C D F$ in the $L^{q} ([0, 1], λ)$ -metric for a $q > 1.$

If ${||F_{n} - F||}_{q} \to 0$ , then ${||F_{n} - F||}_{1} \to 0$ according to Jensen’s inequality. We just saw that this implies that $| |L (F_{n}) - L (F)| |_{1} \to 0 f o r n \to \infty .$

As $x^{\frac{1}{q}} \to 0 f o r x \to 0_{+}$ for any $q > 1,$ we have that $∥ L (F_{n}) - L (F) ∥_{1}^{\frac{1}{q}} \to 0 f o r n \to \infty .$

Furthermore, $|L (F_{n}) (p) - L (F) (p)| < 1$ for every $p \in [0, 1],$ which means that for every $p \in [0, 1],$

| L (F_{n}) (p) - L (F) (p) |^{q} < | L (F_{n}) (p) - L (F) (p) | .

We can conclude that

∥ L (F_{n}) - L (F) ∥_{q} \to 0 f o r n \to \infty .

This finishes the proof of the following:

Theorem 3.2

For any sequence $\{F_{n}\}$ belonging to $N C D F$ and any $q \in [1, \infty],$

lim_{n \to \infty} ∥ F_{n} - F_{q} ∥= 0 \Rightarrow lim_{n \to \infty} ∥ L (F_{n}) - L (F) ∥_{q} = 0.

The result could also be stated this way: The transformation $L$ that maps any cdf for a normalized random variable 1–1 to its Lorenz curve is continuous with respect to the $L^{q} ([0, 1], λ)$ -metric for every $q \in [1, \infty] .$

4. The $L^{1} ([0, 1])$ -metric and generalized Gini coefficients

With the $L^{1}$ -metric in $N C D F,$ we have introduced a way of measuring distances between bound distribution functions. If we name the completely equal distribution of the resource under observation $I$ , we have

I (x) = \{\begin{matrix} 0 f o r x \in [0, 1[ \\ 1 f o r x = 1 \end{matrix} .

Given that an $F \in N C D F, {||F - I||}_{1}$ will be a measure of the distance between $F$ and a complete equality with respect to the actual resource. We see that

(7)

{∥F - I∥}_{1} = 1 - m .

(7)

$w i t h m$ being the expectation associated with $F .$

Note that this distance should not be confused with Ebert’s distance between income distributions (Ebert, Citation1984). Every member of Ebert’s class,

$d^{r} (X, Y) = {(\int_{0}^{1} |F_{X}^{- 1} (v) - F_{Y}^{- 1} (v)|^{r} d v)}^{\frac{1}{r}}, r \geq 1$

is an absolute measure, because the income distributions are meant for absolute income. In contrast, (7) is strictly relative: If you add the same amount to every individual share, the distance will decrease—this also happens for the distance between 2 arbitrary members of $N C D F .$

Replacing $(7)$ with ${||L (F) - L (I)||}_{1}$ gives

{||L (F) - L (I)||}_{1} = \int_{0}^{1} (p - L (p)) d p =∥ p - L ∥_{1},

where we have named $L (F) L$ —as usual—and calculated $L (I)$ to be $p,$ the identical mapping.

The value of this integral will be in [0, 0.5], since $L$ , as we know, is convex. If we normalize it, i.e., multiply it with 2, we of course get the Gini coefficient for the distribution function $F$ ,

G = 2 ∥ p - L ∥_{1} .

This is the most popular way to explain the Gini coefficient, because it is illustrated as the size of an area. If $∥ F - I ∥_{1}$ is a quantity near zero, then the Gini coefficient will also be near zero—this is a consequence of theorem 3.2. But the opposite conclusion can generally not be drawn. In other words, we could have a small Gini coefficient in a rather polarized population. E.g., if 96.7 % of the population each earns 37.9% of the maximal income and while 3.3% each earns the maximal income, then $G = 0.05$ , while $∥ F - I ∥_{1} = 0.6.$ This is a symptom of the following:

Theorem 4.1

The inverse mapping to $L, L^{- 1},$ which maps $C C D F,$ the set of Lorenz curves, 1–1 on $N C D F,$ the set of distribution functions for normalized random variables, is not continuous with respect to the $L^{q} ([0, 1])$ -metric for any $q \in [1, \infty] .$

Proof:

If we can construct a sequence in $C C D F$ with the property that it converges to the identical mapping—and that at the same time its $L^{- 1}$ -image will not converge to $I,$ which is the $L^{- 1}$ -image of the identical mapping, then we are through with the proof.

In fact, we are able to choose the sequence in $C C D F$ in the following two-parameter-class of linear combination of power functionsFootnote⁶,

(8)

L (p) = a p + (1 - a) p^{b}, a \in [0, 1], b > 1.

(8)

|p - L (p)| = |p - a p - (1 - a) p^{b}| = (1 - a) (p - p^{b}),

we have that

a \to 1_{-} \Rightarrow sup_{p \in [0, 1]} |p - L (p)| \to 0.

So, if $L$ is given by (8), for every $q \in [1, \infty],$

∥ p - L ∥_{q} \to 0 f o r a \to 1_{-} .

According to theorem 2.1, $L^{'} (1)$ equals $\frac{1}{m}, w i t h m$ being the expectation of a normalized random variable with Lorenz curve $L .$ $m$ can be chosen as any value in $[0, 1] .$

Following formula (8), $L^{'} (1) = a + (1 - a) b .$ So, in $C C D F$ , we choose a sequence $\{L_{n}\}$ of type (8) fulfilling that for every $n \in N,$

(9)

a = 1 - \frac{1}{n} a n d b = \frac{\frac{1}{m} - a}{1 - a} .

(9)

We regard now,

{∥L^{- 1} (L_{n}) - L^{- 1} (p)∥}_{q} = {∥L^{- 1} (L_{n}) - I∥}_{q} .

For $q = 1$ , we have ${||L^{- 1} (L_{n}) - I||}_{1} = \int_{0}^{1} (L^{- 1} (L_{n}) - I) d x = \int_{0}^{1} L^{- 1} (L_{n}) (x) d x = 1 - m .$

As $∥ L^{- 1} (L_{n}) - I ∥_{q} \geq∥ L^{- 1} (L_{n}) - I ∥_{1} f o r q \in [1, \infty],$ we conclude that $\{L^{- 1} (L_{n})\}$ does not converge to $I$ for any $q \in [1, \infty] .$ This finishes the proof.

Note that we also showed that you could have a situation where the Gini coefficient shrinks to zero for a sequence of Lorenz curves, while at the same time, every one of the associated distribution functions has an arbitrarily great difference between the mean and maximal income!

This pattern in fact repeats for every higher-order Gini coefficient for the sequence ${\{L_{n}\}}_{n \in N} .$

Corollary 4.2

For the sequence of Lorenz curves given by (8) and (9), any generalized Gini coefficient will shrink to zero with increasing $n .$

Proof:

Using the formula of Kakwani (Citation1980), we have

G_{k} = k (k - 1) \int_{0}^{1} (p - a p - (1 - a) p^{b}) {(1 - p)}^{k - 2} d p, k \in \{2, 3 \dots\}

\Leftrightarrow G_{k} = 1 - k (k - 1) \int_{0}^{1} (a p + (1 - a) p^{b}) {(1 - p)}^{k - 2} d p, k \in {2, 3 \dots} .

For $k = 2, G_{k}$ is the ordinary Gini coefficient.

We achieve an estimate of $G_{k}$ using partial integration. Set

L^{(i)} (p) = \frac{a}{(i + 1)!} p^{i + 1} + \frac{1 - a}{(b + 1) \cdot \dots \cdot (b + i)} p^{b + i}, i \in {0, 1 \dots k - 1}

which is the $i$ th integral of the Lorenz function (8), then

G_{k} = 1 - (k (k - 1) ({[L^{(1)} (p) {(1 - p)}^{k - 2}]}_{0}^{1} + \int_{0}^{1} L^{(1)} (p) (k - 2) {(1 - p)}^{k - 3} d p))

= 1 - k (k - 1) (k - 2) \int_{0}^{1} L^{(1)} (p) {(1 - p)}^{k - 3} d p .

Iterating this process, we get

G_{k} = 1 - k! \int_{0}^{1} L^{(k - 2)} (p) d p = 1 - k! {[L^{(k - 1)} (p)]}_{0}^{1}

\Leftrightarrow G_{k} = 1 - k! (\frac{a}{k!} + \frac{1 - a}{(b + 1) \dots (b + k - 1)})

\Leftrightarrow G_{k} = (1 - a) (1 - \frac{1}{\prod_{j = 2}^{k} \frac{b + j - 1}{j}}) .

Inserting the values of $a$ and $b$ given by (9), it is easy to see that for every $k,$ $G_{k}$ will shrink to zero as $n \to \infty .$

In principle, the transformation $L$ creates a unique connection between any bound, non-negative probability distribution and its Lorenz curve. The mean value is intrinsic when calculating one of the objects from the other. Although the transformation proves to be continuous, the inverse transformation does not possess this feature. The very example that points out the discontinuity shows that the Gini coefficient of a population income can be very small, while in the same population, the income obtained by the majority can be far below the maximal income. This repeats for higher-order Gini coefficients although they were meant to weight poverty higher.

The specific property in our model, which creates this weakness, is the fact that the expected value of the individual share of the good in question determines the slope of the left-hand tangent of the Lorenz curve in the point (1,1).

5. Some conclusions related to the discontinuity of the inverse mapping

The results from section 4 rise at least 2 problems which our examples can illustrate.

First, we already saw that there is an obvious inequality in the non-continuous distribution example mentioned just before theorem 4.1. One can construct a continuous case almost parallel to it with a Lorenz curve of the type (8) choosing $a = 0.95$ and $b = 31$ . This example has $m = 0.4$ and a Gini coefficient value of 0.04688. In both examples, there is a majority with homogeneous and low income. The minority though is big enough to create a feeling of inequality. Following the advice in Liu and Gastwirth (Citation2020) about supplying the Gini coefficient with other measures, one finds that the series of generalized Gini coefficients gives only slightly different values. The so-called generalized entropy family of indices gives only smaller values. Even Gastwirth’s more promising modified Gini coefficient multiplying the Gini coefficient with the ratio of the mean value to the median gives only a value near 0.05. These measures of inequality are presented in Liu and Gastwirth (Citation2020). In this situation, one should turn to the relative deviation of the income distribution. This means the square root of the variance divided by the double mean value.Footnote⁷ Yitzhaki and Schechtman (Citation2013, p 22–25) gives thorough analysis and discussion on the relationship between the Gini coefficient and variance. So, if you accept that 5% of a population is not an extremely small part and if the Gini coefficient is suspiciously low, or lower than 0.1, then supply it with a computation of the relative deviation. In our examples, it is about 0.096. You could state it like this: A low Gini coefficient is necessary for relative equality in a society, but it is not sufficient.

Second, the fact that the continuous mapping of a cdf for a normalized random variable to its Lorenz curve has an inverse mapping, which is discontinuous, is in fact just another example of inverse problems in econometry. Horowitz (Citation2014) gives a survey of the problem—all his examples are with respect to the supremum norm—in economics and also some rather different fields. It seems that the phenomenon has a certain prevalence in the empirical sciences. Trying to estimate a distribution following the discontinuous mapping, one is faced with an ill-posed inverse problem. Horowitz shows in his examples how to deal with the problem in some specific cases through regularization.

In our case, one could ask: Is it possible to estimate the income distribution in the society if we have information related to the Lorenz curve? Kleiber and Kotz (Citation2002) point out that a finite, non-negative cdf always could be found exactly as all the moments of it are known. Alternatively knowing the mean of the minimum of $n$ independent random variables sharing the cdf for every $n \in N$ gives the same possibility. From there, they conclude that if the sequence ${\{G_{k}\}}_{k = 2}^{\infty}$ of generalized Gini coefficients is known, then the cdf can be determined. They refined the result somewhat proving that you could do with a subsequence ${\{G_{k_{j}}\}}_{j = 1}^{\infty}$ fulfilling that $\sum_{j = 1}^{\infty} \frac{1}{k_{j}} = \infty .$

Farris (Citation2010) states an idea to make it less labor-intensive: Suppose that you take a sample of incomes. You compute $G_{2}, G_{3}$ , and $G_{4}$ from the empirical distribution function. Then, calculate a Lorenz curve of the type (8) directly from the values of $G_{2}$ and $G_{3},$ which means that you have estimates of $a$ and $b$ in (8). Finally, you compute the 4th order Gini coefficient from the Lorenz curve you found, $\tilde{G_{4}}$ . . If it fits well to $G_{4}$ , then you have good model. But if you from this stage conclude that you have a well-estimated income distribution function based on $a$ and $b,$ then you are facing an ill-posed inverse problem, and you cannot be sure that your estimated cdf is useful.

6. Epilogue

The widespread idea of illustrating the Gini coefficient as the area between the segment from (0,0) to (1,1) and the Lorenz curve of empirical data or some approximation to them is sound because this area can be conceived of as a distance—in $L^{1} ([0, 1]) .$ Still, a small Gini coefficient is not enough to ensure a high degree of income equality in a society.

This conclusion is not the same as a removal of the Gini coefficient or its generalizations. Corrado Gini’s own introduction, and especially the moderate rewriting of it made by Dorfman (Citation1979), gives this interpretation: In the population, pick 2 individual shares of the good in question, $X_{1}$ and $X_{2}$ . Let $Y = min (X_{1}, X_{2})$ . Then,

G = 1 - \frac{E (Y)}{E (X_{1})} .

Therefore, if you make a repeated experiment choosing a sample of 2 values, note the first and the least, then in the long run, the ratio between the average of the latter and of the former subtracted from 1 will approximate the Gini coefficient. So, if you take a stroll somewhere in your town and ask a random and honest pedestrian about her income, then on average, the answer would be close to your own income—if the Gini coefficient is low.

Disclosure statement

It is a pleasure to thank the editor and the anonymous reviewers for helping to improve this paper

No potential conflict of interest was reported by the author(s).

Additional information

Funding

The author received no direct funding for this research.

Notes on contributors

Jens Peter Kristensen

Being a math teacher, I often took an interest in courses of teaching in applied mathematics. In the 2 latest decades, this was frequently interdisciplinary courses with social science and economics. After some courses about economic inequality, I wrote an article (in Danish) to the magazine of the Danish Association of Mathematics Teachers, LMFK-bladet 4/2015 p 10 - 15. In a comment, a colleague referred to Farris’ article in AMM 12/2010, leading me to some of the comprehensive economics literature on measuring inequality, Lorenz curve, and Gini coefficient. My prime interest was to analyze the transformation from which you derive the Lorenz curve from a given income distribution. If you demand the distributions to be normalized, this mapping is 1-1 of a set of distributions into itself. The set is contained in a normed space. So, mathematically, you can ask if it is continuous and if the inverse is. Having answered these questions, it remains drawing consequences in economics methodology – which could be further refined. Have among other high schools worked at Hasseris Gymnasium, Denmark.

Notes

1. Furthermore, the current OECD formula weights income higher, the more numerous the household in which the individual lives is.

2. Named after M. O. Lorenz, the American economist who developed the concept in his pioneering research on income inequality. Se Lorenz, M O. (1905) Methods of measuring the concentration of wealth. Journal of American statistical Association. p 209–219.

3. Dorfman does not use the concept of

F^{- 1} (u)

. Furthermore, he uses Stieltjes integrals. Other writers I found only prove the case with a differentiable distribution function. We will at present be content with this, although it is not difficult to prove (2) for any kind of distribution using the Lebesgue measure on [0,1].

4. More details in the proof of these claims about convex functions could be found in Rudin (Citation1974) p 62–63.

5. The term is chosen because here – in accordance with the current literature – is used as argument for Lorenz curves.

6. I found this class of functions in Farris (2010) p 863. He calls them Pareto functions which they obviously not are. They can only be Lorenz curves for finite random variables. One could utter that for b > 2 the associated cdf has a certain resemblance to Pareto distributions.

7. In Liu & Gastwirth’s (Citation2020) terminology this is “one half of the coefficient of variation”.

References

Atkinson, A. B. (1970). On the measurement of inequality. Journal of Economic Theory, 2(3), 244–16. https://doi.org/10.1016/0022-0531(70)90039-6
Web of Science ®Google Scholar
Donaldson, D., & Weymark, J. A. (1983). Ethically flexible gini indices for income distributions in the continuum. Journal of Economic Theory, 29(2), 353–358. https://doi.org/10.1016/0022-0531(83)90053-4
Web of Science ®Google Scholar
Dorfman, R. (1979). A formula for the gini coefficient. Review of Economics and Statistics, 61(1), 146–149. https://doi.org/10.2307/1924845
Web of Science ®Google Scholar
Ebert, U. (1984). Measures of distance between income distributions. Journal of Economic Theory, 32(2), 266–274. https://doi.org/10.1016/0022-0531(84)90054-1
Web of Science ®Google Scholar
Farris, F. A. (2010). The gini index and measures of inequality. American Mathematical Monthly, 12(10), 851–864. https://doi.org/10.4169/000298910x523344
Web of Science ®Google Scholar
Gastwirth, J. (1971). A general definition of the Lorenz Curve. Econometrica, 39(6), 1037–1039. https://doi.org/10.2307/1909675
Web of Science ®Google Scholar
Golden, J. (2008). A simple geometric approach to approximating the Gini coefficient. Journal of Economic Education, 39(1), 68–77. https://doi.org/10.3200/JECE.39.1.68-77
Web of Science ®Google Scholar
Horowitz, J. L. (2014). Ill-posed inverse problems in economics. Annual Review of Economics, 6(1), 21–51. https://doi.org/10.1146/annurev-economics-080213-041213
Web of Science ®Google Scholar
Kakwani, N. (1980). On a class of poverty measures. Econometrica, 48(2), 437–446. https://doi.org/10.2307/1911106
Web of Science ®Google Scholar
Kleiber, C., & Kotz, S. (2002). A characterization of income distributions in terms of generalized gini coefficients. Social Choice and Welfare, 19(4), 789–794. https://doi.org/10.1007/s003550200154
Web of Science ®Google Scholar
Lambert, P. (1990). The distribution and redistribution of income. Basil Blackwell.
Google Scholar
Liu, Y., & Gastwirth, J. (2020). On the capacity of the Gini index to represent income distribution. Metron, 78(1), 61–69. https://doi.org/10.1007/s40300-020-00164-8
Web of Science ®Google Scholar
Morgan, J. (1962). The anatomy of income distribution. Review of Economics and Statistics, 44(3), 270–282. https://doi.org/10.2307/1926398
Web of Science ®Google Scholar
OECD: Income Distribution Database. (2017). http://www.oecd.org/els/soc/IDD-ToR.pdf
Google Scholar
Rudin, W. (1974). Real and complex analysis. McGraw-Hill.
Google Scholar
Yitzhaki, S. (1983). On an extension of the Gini inequality index. International Economics Review, 24(3), 617–628. https://doi.org/10.2307/2648789
Web of Science ®Google Scholar
Yitzhaki, S., & Schechtman, E. (2013). Chapter 2. In The Gini methodology.Springer Series in Statistics 272 (New York: Springer).
Google Scholar

The Gini coefficient and discontinuity

Abstract

PUBLIC INTEREST STATEMENT

1. Introduction

2. The transformation mapping a cumulative distribution function to its Lorenz function

3. Convergence of sequences in NCDF and CCDF

4. The $L^{1} ([0, 1])$ -metric and generalized Gini coefficients

5. Some conclusions related to the discontinuity of the inverse mapping

6. Epilogue

Disclosure statement

Notes on contributors

Jens Peter Kristensen

References

Information for

Open access

Opportunities

Help and information

The Gini coefficient and discontinuity

Abstract

PUBLIC INTEREST STATEMENT

1. Introduction

2. The transformation mapping a cumulative distribution function to its Lorenz function

3. Convergence of sequences in NCDF and CCDF

4. The L10,1-metric and generalized Gini coefficients

5. Some conclusions related to the discontinuity of the inverse mapping

6. Epilogue

Disclosure statement

Additional information

Funding

Notes on contributors

Jens Peter Kristensen

Notes

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

4. The $L^{1} ([0, 1])$ -metric and generalized Gini coefficients