Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract.

This study considers excess distribution estimation in iid settings. There are two ways for the estimation; the fitting to the generalized Pareto distribution and the fully non parametric estimation. The fitting estimator is justified by the approximation proven in the extreme value theory; however, the accuracy depends on how extremely large the target is. The non parametric estimator does not need an approximation and has the advantage of wide applicability. This study conducts both theoretical and numerical comparative study on excess distribution estimation. Asymptotic convergence rates of two estimators are obtained, and the mean integrated squared errors are numerically surveyed by simulation study. An illustrative example of Abisko rainfall amount is presented.

Keywords:

Mathematics Subject Classification (2010):

1. Introduction

Let $X_{1}, X_{2}, \dots X_{n}$ be independent and identically distributed random variables with a continuous distribution function F. Suppose that n is sufficiently large. Here, we consider estimating the excess distribution (ED) given by $\begin{array}{l} F_{u} (x) := P (X_{1} - u \leq x | X_{1} > u) = \frac{F (x + u) - F (u)}{1 - F (u)} . \end{array}$

Shimokihara and Maesono (Citation2018) studied asymptotic properties of a non parametric estimator. The non parametric estimator (NE) is the plug-in type of the kernel distribution estimator $\begin{array}{l} {\hat{F}}_{u} (x) := \frac{\hat{F} (x + u) - \hat{F} (u)}{1 - \hat{F} (u)}, \end{array}$ where $\hat{F} (x)$ is the kernel distribution estimator given by $\begin{array}{l} \hat{F} (x) = \frac{1}{n} \sum_{i = 1}^{n} W (\frac{x - X_{i}}{h}), \end{array}$ where W is the cumulative distribution function of a symmetric density w. The bandwidth h is supposed to satisfy h→0.

Under some regularity conditions, the asymptotic mean squared error (MSE) of NE ${\hat{F}}_{u} (x)$ asymptotically equals $\begin{array}{l} \frac{h^{4}}{4} {(\frac{1}{1 - F (u)})}^{2} {(f^{'} (x + u) - f^{'} (u) + F_{u} (x) f^{'} (u))}^{2} {(\int z^{2} w (z) d z)}^{2} \\ + n^{- 1} F_{u} (x) \frac{1 - F (x + u)}{(1 - F (u))^{2}} \end{array}$

(see Theorem 1.1 in Shimokihara and Maesono Citation2018), where the integral range being $(- \infty, \infty)$ is omitted in this paper. Both x and u are implicitly assumed to be fixed in Shimokihara and Maesono (Citation2018).

If we want to know the ED F_u(x) on a tail, the Pickands-Balkema-De Haan theorem in the extreme value theory is applicable. The theorem states that F_u(x) converges to the generalized Pareto distribution (GPD) H_{γ, c}(x) as $u ↑ x^{*} := sup (supp (f))$ , where $\begin{array}{l} H_{γ, c} (x) := & H_{γ} (\frac{x}{c}) for 1 + γ \frac{x}{c} > 0, \\ H_{γ} (x) := & 1 - {\begin{cases} (1 + γ x)^{- 1 / γ} & for 1 + γ x > 0 and γ \in R ∖ {0} \\ \exp (- x) & for x \in R and γ = 0. \end{cases} \end{array}$

Thus, parametrically fitting GPD to ED is justified, and $\begin{array}{l} H_{{\hat{γ}}_{u}} (x) := H_{\hat{γ}} (\frac{x}{{\hat{c}}_{u}}) for 1 + \hat{γ} \frac{x}{{\hat{c}}_{u}} > 0 \end{array}$ provides good estimates for a sufficiently large u. We will call the parametric estimator PE.

In short, there are mainly two ways to estimate ED. NE ${\hat{F}}_{u}$ is supposed to be used for fixed, that is, not large u. On the other hand, the extreme value approach requires large u. Then, the following question arise: How large u should be for the extreme value approach ?. Preceding researches also states “How far can we extrapolate into the tails?” (Smith Citation1987, p.1194), “That is, an approximation to probabilities of extreme deviation is supposed, which is assumed to become increasingly accurate as one moves further from the range of the data, but whose concise accuracy is unknown” (in the abstract of Hall and Weissman Citation1997). Smith (Citation1987) gave a response in terms of the convergence rate, which is a function of x (Remark of Theorem 8.1).

This study aims at clarifying how large the fitting estimator requires on u by comparing the two ways of ED estimation. Moriyama (Citation2021) conducted a comparative study on the estimation of sample maximum distribution F^m between the extreme-value-based approach and non parametric approach and investigated both theoretical and numerical accuracy, depending on m. Estimators of extreme quantiles are numerically compared in Banfi, Cazzaniga, and De Michele (Citation2022).

To the best of our knowledge, this is the first comparative study between the extreme-value-based approach and the non parametric approach in the distribution tail. This study assumes the tail of the underlying distribution to obtain the explicit form of asymptotic errors. Throughout this study, suppose that F belongs to either one of (i) the so-called Hall class of distributions (see Hall and Welsh Citation1984), (ii) the following Weibull class of distributions, and (iii) the bounded class of distributions (see, e.g., Stupfler Citation2016), which satisfy (i) $\exists (α, β, A, B)$ s.t. $α > 0, β \geq 2^{- 1}, A > 0, B \neq 0$ and $\begin{array}{l} x^{α + β} {1 - F (x) - A x^{- α} (1 + B x^{- β})} \to 0 as x \to \infty, \end{array}$

(ii) $\exists (κ, C)$ s.t. $κ > 0, C > 0$ and $\begin{array}{l} \exp (C x^{κ}) {1 - F (x) - \exp (- C x^{κ})} \to 0 as x \to \infty, \end{array}$

(iii) $\exists (x^{*}, μ, σ, D, E)$ s.t. $x^{*} \in R$ , $μ < - 2$ , $σ \leq - 2^{- 1}$ , D > 0, E≠0 and $\begin{array}{l} (x^{*} - x)^{μ + σ} {1 - F (x) - (x^{*} - x)^{- μ} (D + E (x^{*} - x)^{- σ})} \to 0 as x ↑ x^{*}, \end{array}$ respectively. Then, the limiting GPD is a Fréchet, Gumbel, and Weibull type under the supposition (see Beirlant et al. Citation2004), where $\begin{array}{l} γ := {\begin{cases} α^{- 1} & for the Hall class \\ 0 & for the Weibull class \\ μ^{- 1} & for the bounded class, \end{cases} c_{u} := {\begin{cases} γ u & for the Hall class \\ C^{- 1} κ^{- 1} u^{1 - κ} & for the Weibull class \\ - γ (x^{*} - u) & for the bounded class . \end{cases} \end{array}$ under κ≤1.

Section 2 and 3 give the asymptotic properties of NE, the Kernel-type estimator, and PE, the fitting estimator to GPD, respectively, under the supposition $x := x_{n} \to x_{\infty}$ and $u := u_{n} \to x^{*}$ as n→∞, where x_∞: = ∞ for the Hall class or the Weibull class and x_∞: = 0 for the bounded class. Results of the numerically comparative study are shown in Section 4, and the asymptotic convergence rates of the two estimators are provided in some cases. The proofs of theoretical results are in Appendix.

2. Kernel-type estimation

The following theorem on the MSE of NE is a consequence of Theorem 1.1 in Shimokihara and Maesono (Citation2018), where all asymptotic notations in this article refer to n→∞.

Theorem 1.

Suppose F is continuously twice differentiable at x. If $\int z^{2} w (z) d z < \infty$ and $\begin{array}{l} {\begin{cases} h x^{κ - 1} \to 0 & for the Weibull class \\ h (x^{*} - x)^{- 1} \to 0 and h (x^{*} - x)^{- 1} \to 0 & for the bounded class, \end{cases} \end{array}$ $\begin{array}{l} F_{u}^{- 2} (x) E [(F_{u} (x) - {\hat{F}}_{u} (x))^{2}] \sim (U_{n} h^{2} \frac{ξ_{n}}{2} \int z^{2} w (z) d z)^{2} + \frac{U_{n} (1 - U_{n}) v_{n}}{n}, \end{array}$

where $\begin{array}{l} U_{n} := & {\begin{cases} {(1 + \frac{x}{u})}^{- α} & for the Hall class \\ \exp (- C {(x + u)^{κ} - u^{κ}}) & for the Weibull class \\ {(1 - \frac{x}{x^{*} - u})}^{- μ} & for the bounded class, \end{cases} \\ ξ_{n} := & {\begin{cases} α (α + 1) u^{- 2} {{(\frac{x}{u} + 1)}^{- 2} + 1 - 2 {(\frac{x}{u} + 1)}^{α}} & for the Hall class \\ κ^{2} C^{2} {(x + u)^{2 κ - 2} - u^{2 κ - 2}} & for the Weibull class \\ μ (μ + 1) (x^{*} - u)^{- 2} \\ \times {{(1 - \frac{x}{x^{*} - u})}^{- 2} + 1 - 2 {(1 - \frac{x}{x^{*} - u})}^{μ}} & for the bounded class, \end{cases} \\ v_{n} := & {\begin{cases} A^{- 1} u^{α} & for the Hall class \\ \exp (C u^{κ}) & for the Weibull class \\ D^{- 1} (x^{*} - u)^{μ} & for the bounded class . \end{cases} \end{array}$

The following Corollary 1 on the bandwidth minimizing the MSE follows from Theorem 1.

Corollary 1.

Suppose $| \int z W (z) w (z) d z | < \infty$ and $U_{n}^{- 2} ξ_{n}^{- 2} ω_{n} n^{- 1} \to 0$ , where $\begin{array}{l} ω_{n} := {\begin{cases} A^{- 1} α u^{α - 1} ((1 - U_{n})^{2} + U_{n}^{1 + γ}) & for the Hall class \\ κ C \exp (C u^{κ}) (u^{κ - 1} (1 - U_{n})^{2} + (x + u)^{κ - 1} U_{n}) & for the Weibull class \\ D^{- 1} μ (x^{*} - u)^{μ - 1} ((1 - U_{n})^{2} + U_{n}^{1 + γ}) & for the bounded class . \end{cases} \end{array}$

Under the assumptions of Theorem 1, the optimal bandwidth in the sense of the MSE is $\begin{array}{l} h = {(2 U_{n}^{- 2} ξ_{n}^{- 2} ω_{n} n^{- 1} \frac{\int z W (z) w (z) d z}{{(\int z^{2} w (z) d z)}^{2}})}^{1 / 3} . \end{array}$

$F_{u} (x) - {\hat{F}}_{u} (x)$ with the optimal bandwidth is asymptotically non degenerate normal with the asymptotic mean $\begin{array}{l} ν_{0} n^{- 2 / 3} U_{n}^{- 1 / 3} ξ_{n}^{- 1 / 3} ω_{n}^{2 / 3}, \end{array}$

where $ν_{0} := {(2 \int z^{2} w (z) d z)}^{- 1 / 3} {(\int z W (z) w (z) d z)}^{2 / 3}$ .

The following Corollary 2 states the special case U_n = O(1) of Corollary 1.

Corollary 2.

Suppose $^{\exists} δ > 0$ s.t. U_n→δ. Under the assumptions of Corollary 1, the asymptotically optimal bandwidth in the sense of the MSE is $\begin{array}{l} h = & {(2 δ^{- 2} {(1 - δ)^{2} + δ^{1 + γ}} n^{- 1} \frac{\int z W (z) w (z) d z}{{(\int z^{2} w (z) d z)}^{2}})}^{1 / 3} \\ \times & {\begin{cases} A^{- 1 / 3} α^{1 / 3} (α + 1)^{- 2 / 3} u^{(α + 3) / 3} (δ^{2 γ} + 1 - 2 δ)^{- 2 / 3} & for the Hall class \\ κ^{- 1} C^{- 1 / 3} \exp (C u^{κ} / 3) (- \ln δ)^{- 2 / 3} u^{1 - (κ / 3)} & for the Weibull class \\ D^{- 1 / 3} μ^{1 / 3} (μ + 1)^{- 2 / 3} (x^{*} - u)^{(μ + 3) / 3} (δ^{2 γ} + 1 - 2 δ)^{- 2 / 3} & for the bounded class . \end{cases} \end{array}$

${\hat{F}}_{u} (x)$ with the optimal bandwidth has the asymptotic bias $\begin{array}{l} ν_{0} δ^{- 1 / 3} {((1 - δ)^{2} + δ^{1 + γ})}^{2 / 3} n^{- 2 / 3} \\ \times & {\begin{cases} A^{- 2 / 3} α^{1 / 3} (α + 1)^{- 1 / 3} u^{2 α / 3} (δ^{2 γ} + 1 - 2 δ)^{- 1 / 3} & for the Hall class \\ C^{1 / 3} (- \ln δ)^{- 1 / 3} \exp (2 C u^{κ} / 3) u^{κ / 3} & for the Weibull class \\ D^{- 2 / 3} μ^{1 / 3} (μ + 1)^{- 1 / 3} (x^{*} - u)^{2 μ / 3} (δ^{2 γ} + 1 - 2 δ)^{- 1 / 3} & for the bounded class . \end{cases} \end{array}$

The twice differentiability required in Theorem 1 is the usual regularity condition in smooth distribution estimation or density estimation (see, e.g., Wand and Jones Citation1995).

Remark 1.

x satisfying $lim_{n \to \infty} h (x^{*} - x)^{- 1} > 0$ is called a boundary point in the naive kernel distribution estimation and the convergence rate of ${\hat{F}}_{u} (x)$ changes (see the proof of Theorem 1). Theorem 1 requires that μ is an integer or $μ < - 2$ .

3. Fitting estimator to GPD

We employ the maximum likelihood estimation (MLE) based on the peak-over-threthold (POT) for fitting to the GPD, which was developed by Pickands (Citation1975). Let t: = t_n be the threshold of the POT and N be the number of $X_{1}, X_{2}, \dots X_{n}$ exceeding t. Let Y_j be the jth number of $X_{1}, X_{2}, \dots X_{n}$ exceeding t $(j = 1, \dots, N)$ . It holds that $(N / N^{*}) \overset{p}{\to} 1$ , where $N^{*} := n (1 - F (t))$ . $\overset{p}{\to}$ means the probability convergence. Set $γ_{t} := (γ_{t}, c_{t})^{T}$ and $\begin{array}{l} {\hat{γ}}_{t} := \underset{γ_{t}}{\arg \max} \sum_{j = 1}^{N} \ln h_{γ_{t}} (Y_{j}), \end{array}$ where h_𝜸 is the density function of H_𝜸. Then, t needs to satisfy the following assumption.

Assumption 1.

Either (i) $(U_{n} \lor T_{n}) \to 0$ or (ii) $^{\exists} δ > 0$ s.t. both U_n→δ and T_n→δ holds, where $\begin{array}{l} T_{n} := {\begin{cases} {(1 + \frac{x}{t})}^{- α} & for the Hall class \\ \exp (- C κ t^{κ - 1} x) & for the Weibull class \\ {(1 - \frac{x}{x^{*} - t})}^{- μ} & for the bounded class . \end{cases} \end{array}$

PE, the fitting estimator, fundamentally depends on the approximation based on the Pickands-Balkema-De Haan theorem shown in the following Proposition 1, whose convergence is ensured by Assumption 1.

Proposition 1.

Under Assumption 1 $\begin{array}{l} τ_{n} := F_{u} (x) - H_{γ_{t}} (x) \to 0. \end{array}$

Remark 2.

The condition (i) in Assumption 1 $\begin{array}{l} U_{n} \to 0 ⟺ {\begin{cases} u = o (x) & for the Hall class \\ u = o (x^{(1 - κ)^{- 1}}) or κ \geq 1 & for the Weibull class \\ (x^{*} - u)^{- 1} = o (x^{- 1}) & for the bounded class \end{cases} \end{array}$ restricts the threshold t to being asymptotically same as u in the following sense $\begin{array}{l} {\begin{cases} t = o (x) & for the Hall class \\ t = o (x^{(1 - κ)^{- 1}}) or κ \geq 1 & for the Weibull class \\ (x^{*} - t)^{- 1} = o (x^{- 1}) & for the bounded class . \end{cases} \end{array}$ $\begin{array}{l} U_{n} \to δ ⟺ {\begin{cases} u = (δ^{- γ} - 1)^{- 1} x + o (1) & for the Hall class \\ u = [- (C x)^{- 1} \ln δ]^{(κ - 1)^{- 1}} + o (x^{(1 - κ)^{- 1}}) & for the Weibull class \\ u = x^{*} + (δ^{- γ} - 1)^{- 1} x + o (x) & for the bounded class, \end{cases} \end{array}$ where the Weibull class additionally needs both x = o(u) and κ < 1. (ii) being true requires t to be asymptotically same as u in a similar sense, as (i).

MLE is asymptotically efficient; however, various approaches are proposed and compared (Zhang Citation2007; Del Castillo and Serra Citation2015; Kang and Song Citation2017). Smith (Citation1987) gave the conditions that show the following scaled version of the MLE $\begin{array}{l} {\hat{γ}}_{t}^{*} = (\begin{array}{c} 1 \\ c_{t}^{- 1} \end{array}) {\hat{γ}}_{t} \end{array}$ is asymptotically normal with a non trivial bias and $\sqrt{N^{*}}$ -consistent under the following Assumption 2.

Assumption 2.

$^{\exists} λ \in R$ s.t. λ_n→λ, where $\begin{array}{l} λ_{n} := \sqrt{n} \times {\begin{cases} A^{1 / 2} t^{- α / 2} & for the Hall class \\ - t^{2} \exp (- C t^{κ}) & for the Weibull class \\ D^{1 / 2} (x^{*} - t)^{- μ / 2} & for the bounded class \end{cases} \end{array}$

We have the following proposition on the accuracy of PE, which is a consequence of Smith (Citation1987).

Proposition 2.

Under Assumptions 1–2 $\begin{array}{l} E [(F_{u} (x) - H_{{\hat{γ}}_{t}} (x))^{2}] \sim (τ_{n} + {N^{*}}^{- 1 / 2} λ_{n} η_{n}^{T} Σ_{0}^{- 1} μ)^{2} + {N^{*}}^{- 1} (η_{n}^{T} Σ_{0}^{- 1} η_{n}), \end{array}$

where 𝛍 and the Fisher information matrix Σ₀ are given in Smith (Citation1987), and $\begin{array}{l} η_{n} := {(T_{n} (- T_{n}^{γ} + 1 + γ \ln T_{n}), \frac{1 - T_{n}^{- γ}}{γ} T_{n}^{1 + γ})}^{⊤} . \end{array}$

The following corollary on the convergence rate of PE immediately follows from Proposition 2.

Corollary 3.

Under the assumptions of Proposition 2, $(F_{u} (x) - H_{{\hat{γ}}_{t}} (x))$ converges with the rate larger of τ_n and ${N^{*}}^{- 1 / 2} η_{n}^{⊤} 1$ .

4. Comparative study

Suppose $t = u^{*}$ and $^{\exists} δ > 0$ s.t. U_n≡δ throughout in this section, where $\begin{array}{l} u^{*} := {\begin{cases} u & for the Hall class or the Weibull class \\ (x^{*} - u)^{- 1} & for the bounded class . \end{cases} \end{array}$

Set the threshold $u^{*} = n^{1 / 8}$ , $u^{*} = n^{1 / 4}$ , $u^{*} = n^{1 / 2}$ , or $u^{*} = n^{3 / 4}$ . Since $τ_{n} \sim T_{n} - U_{n} + O ((u^{*})^{- β})$ for the Hall class, the MSE of PE converges with the rate $n^{- 1} (u^{*})^{α} + (u^{*})^{- 2 β}$ . The MSE for the bounded class is of order $n^{- 1} (u^{*})^{- μ} + (u^{*})^{2 σ}$ . The minimum of the MSE of NE is of order $n^{- 1} (u^{*})^{α}$ or $n^{- 1} (u^{*})^{- μ}$ for the Hall class or the bounded class if the minimizing bandwidth converges, that is, $n^{- 1} (u^{*})^{α + 3} \to 0$ or $n^{- 1} (u^{*})^{- μ + 3} \to 0$ , respectively. For the Weibull class, the MSE of PE does not converge to zero when u is of the polynomial order of n, and the MSE of NE tends to infinity. If $u = (\ln n)^{1 / κ}$ , the MSE of PE converges order slower than any polynomial, the asymptotic variance n^C−1 converges. The order of the MSE of NE $n^{(4 / 3) (C - 1)} (\ln n)^{2 / 3}$ in the setting.

To sum up, when F belongs to the Hall class or the bounded class, NE converges with the same or faster rate than PE if the optimal bandwidth converges. Specifically, whether PE should be used depends on $n^{- 1} (u^{*})^{(1 / | γ |) + 3}$ converges or not. For the Weibull class, the two estimators are not consistent if the threshold u is a polynomial order of n.

Next, the underlying distributions F were supposed to be Burr distributions defined as $1 - F (x) = (1 + x^{c})^{ℓ}$ , where α = cℓ and β = c, Weibull distributions, and inverse Burr distributions defined as $1 - F (x) = (1 + (- x)^{ℓ})^{c} x < 0$ , where μ = cℓ and $σ = 1 / c$ . The parameters of the underlying distributions and the convergence rates of MSE without terms slower than any polynomial are summarized in , where the tail index γ is α⁻¹, zero and μ⁻¹, respectively. The hyphen means the distribution breaks the assumption of this study of the estimator. The Weibull class with γ = 0 breaks both the assumptions of PE and NE. For $u^{*} = n^{1 / 8}$ (small relative to n) and γ far from zero, the convergence rate of MSE of NE is fast and especially close to n⁻¹ for α or μ being close to zero while that of PE is quite slow. The relation to the convergence rate of PE is complicated, unlike that of NE. As u^* gets relatively large, the convergence rate of PE becomes faster, but the requirement becomes restrictive in general. Particularly, the assumption is broken for γ being close to zero. NE loses its consistency completely if $n^{1 / 2} (u^{*})^{- 1}$ converges to some constant, including zero.

Table 1. The polynomial convergence rates of the MSE of the estimators and the lengths.

Display Table

By simulating the following, the mean integrated squared error (MISE) of PE $\begin{array}{l} L_{u}^{- 1} \int_{Q_{u} (0.1)}^{Q_{u} (0.9)} {(H_{{\hat{γ}}_{t}} (x) - F_{u} (x))}^{2} d x . \end{array}$ and that of NE ${\hat{F}}_{u}$ , we studied the numerical accuracy in finite-sample cases. $L_{u} := Q_{u} (0.9) - Q_{u} (0.1)$ , and Q_u(q) denotes the qth quantile of the ED. We suppose $Q_{u} (q) (0.1 \leq q \leq 0.9)$ , which is intended that U_n = O(1), that is, $F_{u} (x) \sim \exp (- U_{n}) = O (1)$ . This numerical study employs $\hat{h} := n^{- 1 / 3} u^{1 + ({\hat{γ}}^{- 1} / 3)}$ as the bandwidth estimator following the result in Corollary 2, where $\hat{γ}$ is the MLE. The kernel functions were the Epanechnikov for the inverse Burr distributions and the Gaussian for the other distributions. We simulated the MISE values 10,000 times, where show the mean values and their standard deviation (sd), where the hyphens mean $1 - F (u^{*})$ numerically equals zero, and so we cannot derive the MISE value. The sample sizes were (n =) 2⁸ or 2¹². u^* is n^1∕8, n^1∕4 or n^1∕2.

shows the simulated results on the MISE values for the Burr cases. On the whole, NE surpasses PE for relatively small u^* for example, $u^{*} = n^{1 / 8}$ and conversely, PE is better for $u^{*} = n^{1 / 2}$ . The MISE values of NE are especially large for both α and u^* being large. For u^* being around n^1∕4 they are comparable. For $u^{*} ≳ n^{1 / 2}$ (e.g. $u^{*} = n$ ) it is thought that PE far outperforms NE and NE is of no use.

shows the MISE values for the inverse Burr cases. NE gets inaccurate as u^* becomes large relatively to n; however, the performances of PE and NE heavily depend on not only the size of u^* but also the tail index γ. This numerical property is slightly different from that of the Burr cases. For $γ = - 1 / 6$ (i.e. $c = 3, ℓ = 2$ ) which is closest to zero in this study, NE is always more accurate than PE. Conversely, even though u^* is small, PE outperforms NE for some cases γ being around −1 to −3.

shows the MISE values for the Weibull cases. Due to the light-tailness, 1−F(u) numerically equals zero in many cases. The remaining cases shows PE and NE are comparable, while PE is more numerically stable. In order to continue the comparative study on the Weibull cases, we chose relatively smaller (ln⁡n)^1∕κ as the threshold u^*. shows the simulated results on the MISE values. In this setting, $1 - F (u^{*}) \sim \exp (C) n^{- 1}$ . We chose 1, 1∕2, and 1∕5 as the parameter C, where the tail gets lighter as C becomes small. For κ = 3 and C = 1, NE is quite accurate, but PE is much better than NE for $κ = 1 / 2$ and C = 1 and κ = 10 and C = 1. For the other cases, they are comparable, and so we cannot conclude which one is better for the Weibull cases. For the light-tailed distribution, ED is considered to be quite sensitive to the distribution parameters. This study concludes PE and NE are comparable for the Weibull cases; however, more detailed numerical study is an important future work.

5. Real data study

This section considers a real-data study. The data is on Abisko rainfall provided by Abisko Scientific Research Station. It is available in mev package in the R software environment. The data includes the rainfall amount (in mm) and the dates from 1/1/1913 to 1/1/2015, which is given in . The time series trend was analyzed by the annual maximums and found to be not statistically significant (Rudvik Citation2012). Kiriliouk et al. (Citation2019) applied the GPD fitting with the threshold u = 12 and showed $\hat{γ} ≒ 0$ .

Figure 1. Abisko rainfall amount (in mm) from 1/1/1913 to 1/1/2015.

Extreme rainfall causes a landslide, and so a probability estimation is required. shows the estimated ED functions of Abisko rainfall amount (in mm) by the non parametric approach (solid line) and by fitting to the GPD (dashed line). The difference between the two approaches is found to be small. is the magnified at the area [7, 14] and shows the little difference; however, the difference is less than around 4%. In this area, the non parametric approach tends to return larger values, which means a pessimistic prospect.

Figure 2. The estimated ED functions of Abisko rainfall amount (in mm) from 1/1/1913 to 1/1/2015 data by the non parametric approach (solid line) and by the fitting to the GPD (dashed line).

Figure 3. The estimated ED functions in [7, 14] of the Abisko rainfall amount (in mm) by the nonparametric approach (solid line) and by the fitting to the GPD (dashed line).

6. Conclusion and discussion

This study investigates the two estimators, PE and NE, of the ED above the threshold u and compares their accuracy. Asymptotic MSE of the estimators are derived and numerical study is conducted. Theoretical investigation reveals the followings. The threshold as the hyperparameter of PE denoted by t needs to be asymptotically same as u (see Assumption 1). The MSE of NE and the minimizing hyperparameter (bandwidth h) are presented. For the Weibull class, the two estimators of the ED of a polynomial order are not consistent. For the Hall class or bounded class, the accuracy of the two estimators depends on both u and the parameter γ. As u becomes larger relative to n, the two estimators tend to lose consistency. When u is small relative to n, NE is theoretically superior to PE in general. When u is large relative to n, PE excels NE. If γ > 0, the heavier the tail is, the better NE works. If γ < 0, NE outperforms PE, especially for γ being close to zero. Simulation study mostly demonstrates the asymptotic supremacy of each of the estimators. In the real data study, the difference between the two estimators are surveyed. It is found that the difference is slight, but the non parametric approach returns an estimated probability slightly larger.

The obtained result of the comparative study is different from that of distribution estimation of sample maximum. By comparing the fitting estimator to the generalized extreme value distribution and the non parametric kernel type estimator, Moriyama (Citation2021) demonstrated that the non parametric estimator is good in the case γ≒0, where the fitting estimator loses consistency. That means the performance of non parametric estimation in extreme value analysis depends on at least the target being related to the generalized Pareto distribution or the generalized extreme value distribution. This fact suggests the properties of other non parametric estimators in extreme value analysis. In order to improve the accuracy of extreme value inference, we need to continue to clarify the properties of the non parametric estimators.

Data availability

The dataset analyzed during the current study is available in the mev package in the R software environment.

Acknowledgments

The author appreciates the editor’s and referees’ valuable comments that helped us improve this manuscript.

Disclosure statement

The author declares that there are no conflicts of interest.

Additional information

Funding

This work was supported by JSPS KAKENHI Grant Number JP23K16850.

References

Banfi, F., G. Cazzaniga, and C. De Michele. 2022. Nonparametric extrapolation of extreme quantiles: a comparison study. Stochastic Environmental Research and Risk Assessment 36 (6):1579–96. doi:10.1007/s00477-021-02102-0.
Web of Science ®Google Scholar
Beirlant, J., Y. Goegebeur, J. Teugels, and J. Segers. 2004. Statistics of extremes: theory and applications. Chichester: John Wiley & Sons, Ltd.
Google Scholar
Del Castillo, J., and I. Serra. 2015. Likelihood inference for generalized Pareto distribution. Computational Statistics & Data Analysis 83:116–28. doi:10.1016/j.csda.2014.10.014.
Web of Science ®Google Scholar
Hall, P., and I. Weissman. 1997. On the estimation of extreme tail probabilities. The Annals of Statistics25:1311–26.
Google Scholar
Hall, P., and A. H. Welsh. 1984. Best attainable rates of convergence for estimates of parameters of regular variation. The Annals of Statistics 12 (3):1079–84.
Web of Science ®Google Scholar
Kang, S., and J. Song. 2017. Parameter and quantile estimation for the generalized Pareto distribution in peaks over threshold framework. Journal of the Korean Statistical Society 46 (4):487–501. doi:10.1016/j.jkss.2017.02.003.
Web of Science ®Google Scholar
Kiriliouk, A., H. Rootzén, J. Segers, and J. L. Wadsworth. 2019. peaks over thresholds modeling with multivariate generalized pareto distributions. Technometrics 61 (1):123–35. doi:10.1080/00401706.2018.1462738.
Web of Science ®Google Scholar
Moriyama, T. 2021. Parametric and nonparametric probability distribution estimators of sample maximum, arXiv preprint, arXiv:2111.03765.
Google Scholar
Pickands, J. 1975. Statistical inference using extreme order statistics. The Annals of Statistics 3 (1):119–31.
Web of Science ®Google Scholar
Rudvik, A. 2012. Dependence structures in stable mixture models with an application to extreme precipitation. Licentiate thesis, Chalmers University of Technology, Gothenburg, Sweden.
Google Scholar
Shimokihara, A., and Y. Maesono. 2018. Asymptotic mean squared error of kernel estimator of excess distribution function. Bulletin of Informatics and Cybernetics 50:51–64. doi:10.5109/2233859.
Google Scholar
Smith, R. L. 1987. Estimating tails of probability distributions. The Annals of Statistics 15 (3):1174–1207.
Web of Science ®Google Scholar
Stupfler, G. 2016. Estimating the conditional extreme-value index under random right-censoring. Journal of Multivariate Analysis 144:1–24. doi:10.1016/j.jmva.2015.10.015.
Web of Science ®Google Scholar
Wand, M. P., and M. C. Jones. 1995. Kernel smoothing. London: Chapman & Hall.
Google Scholar
Zhang, Jin. 2007. Likelihood moment estimation for the generalized pareto distribution. Australian & New Zealand Journal of Statistics 49 (1):69–77. doi:10.1111/j.1467-842X.2006.00464.x.
Web of Science ®Google Scholar

Appendix

Proof of Proposition 1.

For the Hall class, it follows from

γ = α^{- 1}

and c_t = γt that

\begin{array}{l} H_{γ_{t}} (x) = 1 - {(1 + \frac{x}{t})}^{- α} = 1 - T_{n} . \end{array}

Since $\begin{array}{l} F_{u} (x) = 1 - {(1 + \frac{x}{u})}^{- α} + O (u^{- β} {(1 + \frac{x}{u})}^{- α}), \end{array}$ we see $τ_{n} := F_{u} (x) - H_{γ_{t}} (x) \to 0$ if either (i) $(U_{n} \lor T_{n}) \to 0$ or (ii) $^{\exists} δ > 0$ s.t. U_n→δ and T_n→δ holds, which means $t = u = O (x)$ . Then, $\begin{array}{l} τ_{n} \sim T_{n} - U_{n} + O (u^{- β}) . \end{array}$

For the Weibull class, it follows from γ = 0 and $c_{t} = κ^{- 1} t^{1 - κ}$ that $\begin{array}{l} H_{γ_{t}} (x) = 1 - \exp (- C κ t^{κ - 1} x) = 1 - T_{n} . \end{array}$

Since $\begin{array}{l} F_{u} (x) = 1 - \exp (- C {(x + u)^{κ} - u^{κ}}) = 1 - U_{n}, \end{array}$

U_n→0 when x is same as or larger order than u. We also see $\begin{array}{l} U_{n} = \exp (- C {κ u^{κ - 1} x + 2^{- 1} κ (κ - 1) u^{κ - 2} x^{2} + \dots}) \sim \exp (- C κ u^{κ - 1} x) \end{array}$ if x = o(u). Then, considering whether $u^{κ - 1} x \to \infty$ or not we see τ_n→0 under Assumption 1.

For the bounded class, it holds that $\begin{array}{l} H_{γ_{t}} (x) = 1 - {(1 - \frac{x}{x^{*} - t})}^{- μ} = 1 - T_{n} . \end{array}$

Since $\begin{array}{l} F_{u} (x) = 1 - {(1 - \frac{x}{x^{*} - u})}^{- μ} + O ((x^{*} - u)^{- σ} {(1 - \frac{x}{x^{*} - u})}^{- μ}), \end{array}$ we see $τ_{n} \sim T_{n} - U_{n} + O ((x^{*} - u)^{μ - σ} (x^{*} - x)^{- μ})$ if t = u.

Combining the results, Proposition 1 has been proved. ▪

Proof of Proposition 2 for the Hall class or the bounded class.

First, we decompose the difference as follows: $\begin{array}{l} F_{u} (x) - H_{{\hat{γ}}_{t}} (x) = & [F_{u} (x) - H_{γ_{t}} (x)] - [H_{γ_{t}} (x) - H_{{\hat{γ}}_{t}} (x)] =: τ_{n} + ζ_{n} (say) . \end{array}$

It holds that $\begin{array}{l} ζ_{n} = H_{γ_{t}} (x) - H_{{\hat{γ}}_{t}} (x) = - \frac{\partial}{\partial γ} H_{γ} (x) |_{γ = {\tilde{γ}}_{t}} ({\hat{γ}}_{t} - γ_{t}) \end{array}$ where $\begin{array}{l} H_{γ} (x) := 1 - (1 + (α c_{t})^{- 1} x)^{- α} \end{array}$ and ${\tilde{γ}}_{t} = ({\tilde{γ}}_{t}, {\tilde{c}}_{t})^{T}$ is between ${\hat{γ}}_{t}$ and 𝜸_t with probability 1.

By calculating the derivative, we have $\begin{array}{l} \frac{\partial}{\partial c_{t}} H_{γ} (x) |_{γ = {\tilde{γ}}_{t}} = & - \frac{x}{{\tilde{c}}_{t}^{2}} {(1 + \frac{x}{{\tilde{α}}_{t} {\tilde{c}}_{t}})}^{- {\tilde{α}}_{t} - 1} \end{array}$ where ${\tilde{α}}_{t} := {\tilde{γ}}_{t}^{- 1}$ . It follows from $\begin{array}{l} \frac{x}{{\hat{α}}_{t} {\hat{c}}_{t}} - \frac{x}{α_{t} c_{t}} = \frac{x}{{\hat{α}}_{t} c_{t}} (\frac{c_{t}}{{\hat{c}}_{t}} - 1) + \frac{x}{α_{t} c_{t}} (\frac{α_{t}}{{\hat{α}}_{t}} - 1) \overset{p}{\to} 0 \end{array}$ that $\begin{array}{l} \frac{\partial}{\partial c_{t}} H_{{\tilde{γ}}_{k}} (x) - t_{n}^{*} \overset{p}{\to} 0, \end{array}$ where $\begin{array}{l} t_{n}^{*} := c_{t}^{- 1} \frac{1 - T_{n}^{- γ}}{γ} T_{n}^{1 + γ} . \end{array}$

Similarly, it holds that $\begin{array}{l} \frac{\partial}{\partial α} H_{γ} (x) |_{γ = {\tilde{γ}}_{t}} - s_{n} \overset{p}{\to} 0, \end{array}$ where $s_{n} := T_{n} (- T_{n}^{γ} + 1 + γ \ln T_{n})$ .

Thus, we see ζ_n is asymptotically equivalent in distribution to $- {N^{*}}^{- 1 / 2} η_{n}^{T} N^{*}$ , where $η_{n} = (s_{n}, t_{n})^{T}$ and $t_{n} := c_{t} t_{n}^{*}$ . Combining the results, Proposition 2 has been proved. Proposition 2 for the bounded class is proved in the same manner.

▪

Proof of Proposition 2 for the Weibull class.

$\begin{array}{l} ζ_{n} := H_{γ_{t}} (x) - H_{{\hat{γ}}_{t}} (x) = - \frac{\partial}{\partial γ} H_{γ} (x) |_{γ = {\tilde{γ}}_{t}} ({\hat{γ}}_{t} - γ_{t}) \end{array}$ holds, where ${\tilde{γ}}_{t}$ is between ${\hat{γ}}_{t}$ and 𝜸_t with probability 1. We have $\begin{array}{l} \frac{\partial}{\partial α} H_{γ} (x) |_{γ = γ_{t}} \overset{p}{\to} 0. \end{array}$

It holds that $\begin{array}{l} \frac{\partial}{\partial c_{t}} H_{γ} (x) |_{γ = γ_{t}} = - \frac{x}{c_{t}^{2}} \exp (- \frac{x}{c_{t}}) \end{array}$ $\begin{array}{l} \frac{\partial}{\partial c_{t}} H_{γ} (x) |_{γ = {\tilde{γ}}_{t}} - c_{t}^{- 1} T_{n} \ln T_{n} \overset{p}{\to} 0. \end{array}$

In the same manner as the Proof of Proposition 2, we have ζ_n is asymptotically equivalent in distribution to $- {N^{*}}^{- 1 / 2} η_{n}^{T} N^{*}$ , where $η_{n} = (s_{n}, t_{n})^{T}$ , s_n≡0 and $t_{n} := c_{t} t_{n}^{*}$ . Proposition 2 for the Weibull class has now been proved. ▪

Proof of Theorem 1.

𝔹 denotes the asymptotic bias of NE ${\hat{F}}_{u} (x)$ , and 𝕍 denotes the asymptotic variance later. Shimokihara and Maesono (Citation2018) proved $\begin{array}{l} B & := {(\frac{1}{1 - F (u)})}^{2} {(f^{'} (x + u) - f^{'} (u) + F_{u} (x) f^{'} (u))}^{2} {(\int z^{2} w (z) d z)}^{2} \\ V & := F_{u} (x) \frac{1 - F (x + u)}{(1 - F (u))^{2}} . \end{array}$

This is seen from ${\hat{F}}_{u} (x) - F_{u} (x)$ is asymptotically $\begin{array}{l} \frac{\hat{F} (x + u) - \hat{F} (u)}{1 - F (u)} + \frac{F (x + u) - F (u)}{(1 - F (u))^{2}} {\hat{F} (u) - F (u)} - F_{u} (x) \\ = & \frac{1}{1 - F (u)} {\hat{F} (x + u) - F (x + u)} - \frac{1 - F (x + u)}{(1 - F (u))^{2}} {\hat{F} (u) - F (u)}, \end{array}$ which holds when ${1 - F (u)}^{- 1} {\hat{F} (u) - F (u)} = o_{P} (1)$ . It is true if either F is the Hall class or $\begin{array}{l} {\begin{cases} h x^{κ - 1} \to 0 & for the Weibull class \\ h (x^{*} - x)^{- 1} \to 0 and h (x^{*} - x)^{- 1} \to 0 & for the bounded class, \end{cases} \end{array}$ holds. The expansion gives $\begin{array}{l} V & \sim F_{u} (x) \frac{1 - F (x + u)}{(1 - F (u))^{2}} + \frac{2 h}{(1 - F (u))^{2}} (F_{u}^{2} (x) f (u) + f (x + u)) \int z^{2} W (z) w (z) d z . \end{array}$

For the Hall class, $\begin{array}{l} B & := α^{2} (α + 1)^{2} u^{- 4} {(1 - {(\frac{x}{u} + 1)}^{- α - 2} + 1 - {(\frac{x}{u} + 1)}^{- α})}^{2} {(\int z^{2} w (z) d z)}^{2} \\ V & := A^{- 1} {1 - {(\frac{x}{u} + 1)}^{- α}} u^{α} {(\frac{x}{u} + 1)}^{- α} \\ + 2 h A^{- 1} α u^{α - 1} ({1 - {(\frac{x}{u} + 1)}^{- α}}^{2} + {(\frac{x}{u} + 1)}^{- α - 1}) \int z^{2} W (z) w (z) d z . \end{array}$

For the Weibull class, $\begin{array}{l} B & := (κ C)^{2} ((x + u)^{κ - 2} (- κ C (x + u)^{κ} + κ - 1) \exp (- C {(x + u)^{κ} - u^{κ}}) \\ - u^{κ - 2} (- κ C u^{κ} + κ - 1) \\ + [1 - \exp (- C {(x + u)^{κ} - u^{κ}})] u^{κ - 2} (- κ C u^{κ} + κ - 1))^{2} {(\int z^{2} w (z) d z)}^{2} \\ \sim (κ C)^{4} U_{n}^{2} {- (x + u)^{2 κ - 2} + u^{2 κ - 2}}^{2} {(\int z^{2} w (z) d z)}^{2} \\ V & := [1 - \exp (- C {(x + u)^{κ} - u^{κ}})] \exp (C {2 u^{κ} - (x + u)^{κ}}) \\ + 2 h κ C \exp (C u^{κ}) ([1 - \exp (- C {(x + u)^{κ} - u^{κ}})]^{2} u^{κ - 1} \\ + (x + u)^{κ - 1} \exp (- C {(x + u)^{κ} - u^{κ}})) \int z^{2} W (z) w (z) d z . \end{array}$

For the bounded class, $\begin{array}{l} B & := μ^{2} (μ + 1)^{2} (x^{*} - u)^{- 4} {(1 - (1 - \frac{x}{x^{*} - u})^{- μ - 2} + 1 - (1 - \frac{x}{x^{*} - u})^{- μ})}^{2} {(\int z^{2} w (z) d z)}^{2} \\ V & := D^{- 1} {1 - (1 - \frac{x}{x^{*} - u})^{- μ}} (x^{*} - u)^{μ} (1 - \frac{x}{x^{*} - u})^{- μ} \\ + 2 h D^{- 1} μ (x^{*} - u)^{μ - 1} ({1 - (1 - \frac{x}{x^{*} - u})^{- μ}}^{2} + {(1 - \frac{x}{x^{*} - u})}^{- μ - 1}) \int z^{2} W (z) w (z) d z . \end{array}$

By combining the results, Theorem 1 is proved. ▪

Proof of Corollary 2.

It follows from Theorem 1 that $\begin{array}{l} E [(F_{u} (x) - {\hat{F}}_{u} (x))^{2}] \sim U_{n}^{2} h^{4} \frac{ξ_{n}^{2}}{4} {(\int z^{2} w (z) d z)}^{2} + \frac{1}{n} (U_{n} (1 - U_{n}) v_{n} - 2 h ω_{n} \int z W (z) w (z) d z), \end{array}$ where $\begin{array}{l} ξ_{n} := & {\begin{cases} α (α + 1) u^{- 2} {{(\frac{x}{u} + 1)}^{- 2} + 1 - 2 {(\frac{x}{u} + 1)}^{α}} \\ κ^{2} C^{2} {(x + u)^{2 κ - 2} - u^{2 κ - 2}} \\ μ (μ + 1) (x^{*} - u)^{- 2} {{(1 - \frac{x}{x^{*} - u})}^{- 2} + 1 - 2 {(1 - \frac{x}{x^{*} - u})}^{μ}} \end{cases} \\ v_{n} := & {\begin{cases} A^{- 1} u^{α} \\ \exp (C u^{κ}) \\ D^{- 1} (x^{*} - u)^{μ} \end{cases} \\ ω_{n} := & {\begin{cases} A^{- 1} α u^{α - 1} ({1 - {(\frac{x}{u} + 1)}^{- α}}^{2} + {(\frac{x}{u} + 1)}^{- α - 1}) \\ κ C \exp (C u^{κ}) ([1 - \exp (- C {(x + u)^{κ} - u^{κ}})]^{2} u^{κ - 1} + (x + u)^{κ - 1} \exp (- C {(x + u)^{κ} - u^{κ}})) \\ D^{- 1} μ (x^{*} - u)^{μ - 1} ({1 - (1 - \frac{x}{x^{*} - u})^{- μ}}^{2} + {(1 - \frac{x}{x^{*} - u})}^{- μ - 1}) . \end{cases} \end{array}$

Each of the first cases is the Hall class, the second case is the Weibull class, and the last case is the bounded class of distribution. By differencing MSE with respect to h, we see that the bandwidth minimizing the MSE is given by $\begin{array}{l} h = {(2 U_{n}^{- 2} ξ_{n}^{- 2} ω_{n} n^{- 1} \frac{\int z W (z) w (z) d z}{{(\int z^{2} w (z) d z)}^{2}})}^{1 / 3} . \end{array}$

Suppose $^{\exists} δ > 0$ s.t. U_n→δ. Then, minimizing bandwidth is $\begin{array}{l} h = & {(2 δ^{- 2} n^{- 1} \frac{\int z W (z) w (z) d z}{{(\int z^{2} w (z) d z)}^{2}})}^{1 / 3} \\ \times {\begin{cases} A^{- 1 / 3} α^{1 / 3} (α + 1)^{- 2 / 3} u^{(α - 5) / 3} {δ^{2 γ} + 1 - 2 δ}^{- 2 / 3} {(1 - δ)^{2} + δ^{1 + γ}}^{1 / 3} \\ (κ C)^{- 1} \exp (C u^{κ} / 3) {(u^{κ} - C^{- 1} \ln δ)^{2 - (2 / κ)} - u^{2 κ - 2}}^{- 2 / 3} \\ (u^{κ - 1} (1 - δ)^{2} + (u^{κ} - C^{- 1} \ln δ)^{1 - (1 / κ)} δ)^{1 / 3} \\ D^{- 1 / 3} μ^{1 / 3} (μ + 1)^{- 2 / 3} (x^{*} - u)^{(μ - 5) / 3} {δ^{2 γ} + 1 - 2 δ}^{- 2 / 3} {(1 - δ)^{2} + δ^{1 + γ}}^{1 / 3} . \end{cases} \end{array}$

${\hat{F}}_{u} (x)$ with the optimal bandwidth has the asymptotic bias $\begin{array}{l} ν_{0} δ^{- 1 / 3} n^{- 2 / 3} \times {\begin{cases} A^{- 2 / 3} α^{1 / 3} (α + 1)^{- 1 / 3} u^{2 α / 3} {δ^{2 γ} + 1 - 2 δ}^{- 1 / 3} {((1 - δ)^{2} + δ^{1 + γ})}^{2 / 3} \\ \exp (2 C u^{κ} / 3) {(u^{κ} - C^{- 1} \ln δ)^{2 - (2 / κ)} - u^{2 κ - 2}}^{- 1 / 3} \\ (u^{κ - 1} (1 - δ)^{2} + (u^{κ} - C^{- 1} \ln δ)^{1 - (1 / κ)} δ)^{2 / 3} \\ D^{- 2 / 3} μ^{1 / 3} (μ + 1)^{- 1 / 3} (x^{*} - u)^{2 μ / 3} {δ^{2 γ} + 1 - 2 δ}^{- 1 / 3} {((1 - δ)^{2} + δ^{1 + γ})}^{2 / 3} . \end{cases} \end{array}$

Corollary 2 has been proved. ▪

Comparative study on excess distribution estimation in iid settings

Abstract.

1. Introduction

2. Kernel-type estimation

3. Fitting estimator to GPD

4. Comparative study

Table 1. The polynomial convergence rates of the MSE of the estimators and the lengths.

Table 2. Scaled MISE values (×100) and sd values (×100) for the estimators.

Table 3. Scaled MISE values (×100) and sd values (×100) for the estimators.

Table 4. Scaled MISE values (×100) and sd values (×100) for the estimators.

Table 5. Scaled MISE values (×100) and sd values (×100) for the estimators.

5. Real data study

6. Conclusion and discussion

Data availability

Acknowledgments

Disclosure statement

References

Appendix

Proof of Proposition 1.

Proof of Proposition 2 for the Hall class or the bounded class.

Proof of Proposition 2 for the Weibull class.

Proof of Theorem 1.

Proof of Corollary 2.

Information for

Open access

Opportunities

Help and information

Comparative study on excess distribution estimation in iid settings

Abstract.

1. Introduction

2. Kernel-type estimation

3. Fitting estimator to GPD

4. Comparative study

Table 1. The polynomial convergence rates of the MSE of the estimators and the lengths.

Table 2. Scaled MISE values (×100) and sd values (×100) for the estimators.

Table 3. Scaled MISE values (×100) and sd values (×100) for the estimators.

Table 4. Scaled MISE values (×100) and sd values (×100) for the estimators.

Table 5. Scaled MISE values (×100) and sd values (×100) for the estimators.

5. Real data study

6. Conclusion and discussion

Data availability

Acknowledgments

Disclosure statement

Additional information

Funding

References

Appendix

Proof of Proposition 1.

Proof of Proposition 2 for the Hall class or the bounded class.

Proof of Proposition 2 for the Weibull class.

Proof of Theorem 1.

Proof of Corollary 2.

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date