1,451

Views

CrossRef citations to date

Altmetric

Articles

New extreme value theory for maxima of maxima

Wenzhi CaoDepartment of Statistics, University of Wisconsin-Madison, Madison, WI, USAView further author information

Zhengjun ZhangDepartment of Statistics, University of Wisconsin-Madison, Madison, WI, USACorrespondence[email protected]
View further author information

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

Although advanced statistical models have been proposed to fit complex data better, the advances of science and technology have generated more complex data, e.g., Big Data, in which existing probability theory and statistical models find their limitations. This work establishes probability foundations for studying extreme values of data generated from a mixture process with the mixture pattern depending on the sample length and data generating sources. In particular, we show that the limit distribution, termed as the accelerated max-stable distribution, of the maxima of maxima of sequences of random variables with the above mixture pattern is a product of three types of extreme value distributions. As a result, our theoretical results are more general than the classical extreme value theory and can be applicable to research problems related to Big Data. Examples are provided to give intuitions of the new distribution family. We also establish mixing conditions for a sequence of random variables to have the limit distributions. The results for the associated independent sequence and the maxima over arbitrary intervals are also developed. We use simulations to demonstrate the advantages of our newly established maxima of maxima extreme value theory.

Keywords:

1. Introduction

Rigorous risk analysis helps to make better decisions and prevent great failures. Extreme value theory has been a powerful tool in risk analysis and is widely applied to risk analysis in finance, insurance, health, climate, and environmental studies. In classical extreme value theory, the sequence of data is assumed to have the same marginal distribution, and the limit distribution of the maxima is in one of the extreme value types if it exists. Galambos (Citation1978), de Haan (Citation1993), Beirlant et al. (Citation2004), de Haan and Ferreira (Citation2006), Leadbetter et al. (Citation2012) and Resnick (Citation2013) amongst many monographs are good literatures introducing the theoretical results in the classical extreme value theory. Mikosch et al. (Citation1997), Embrechts et al. (Citation1999), McNeil and Frey (Citation2000), Coles (Citation2001), Finkenstädt and Rootzén (Citation2004), Castillo et al. (Citation2005), Salvadori et al. (Citation2007) and Dey and Yan (Citation2016) introduce many applications of extreme value method to the areas of science, engineering, nature, finance, insurance and climate. For example, in financial applications, extreme value theory is one of the tools to calculate the Value-at-Risk (VaR) and Expected Shortfall (ES) (e.g., Rocco, Citation2014; Tsay, Citation2005). Chavez-Demoulin et al. (Citation2016) offer an extreme value theory (EVT)-based statistical approach for modelling operational risk and losses, by taking into account dependence of the parameters on covariates and time. Zhang and Smith (Citation2010) propose the multivariate maxima of moving maxima (M4) processes and apply the method to model jumps in returns in multivariate financial time series and predict the extreme co-movements in price returns. Daouia et al. (Citation2018) use the extreme expectiles to measure VaR and marginal expected shortfall. In the statistical inference of maximum likelihood estimation (MLE), a discussion on the properties of maximum likelihood estimators of the parameters in generalised extreme value (GEV) distribution was given by Smith (Citation1985). In the paper, it is shown that the classical properties of the MLE hold when the shape parameter $ξ > - 1 / 2$ , but not when $ξ \leq - 1 / 2$ . Bücher and Segers (Citation2017) give a general result on the asymptotic normality of the maximum likelihood estimator for parametric models whose support may depend on the parameters.

In the age of Big Data, the advances of science and technology have been changing data generating processes in a more complex way. As a result, the data structures and dependence structures accompanied by the collected data can be very different from the existed assumptions in many commonly used models. In the literature, advanced statistical models and machine learning approaches have been proposed to fit such complex data or learn the underlying structures better. For example, the support vector machine, the deep learning method, and the random forest method have now been very well recognised and wildly used in data analysis. In extreme value analysis for more complex data, the same marginal distribution assumption and its derived extreme value distributions can be very restrictive and lack of data fitting power. Although statistical models, e.g., Heffernan et al. (Citation2007), Naveau et al. (Citation2011), Tang et al. (Citation2013), Malinowski et al. (Citation2015), Zhang and Zhu (Citation2016) and Idowu and Zhang (Citation2017), have been proposed to model extreme values observed from different data sources with different populations and max-domains of attraction, their probability foundations have not been established.

The definition of the classical maximum domain of attraction cannot be applied directly to the extreme values of data drawn from different populations mixed together. Note that we are not dealing with mixtures of distributions that may belong to a maximum domain of attraction of classical extreme value distribution. In this study, we are dealing with maxima of maxima in which the maxima resulted from each population has its limit extreme value distribution and norming and centering constants and convergence rate. For example, in many real-world applications, the risks one is exposed to usually come from different resources, and the risk at a given time is decided by the dominant one, i.e., not the added risk of all risks. Let us consider a specific example: Suppose a patient suffers two severe diseases. The risk of that the patient will die over a certain time may be best described by the maximum, not the sum, of two risk variables.

This work extends the definition of the maximum domain of attraction to maxima of maxima of sequences of random variables in which the mixing patterns change along with the sample size. The accelerated max-stable distribution (accelerated extreme value distribution) is expressed as a product of the classical extreme value distributions for the maxima of maxima resulted from different distributions. Some basic properties and theoretical results are provided. It can be seen that the classical extreme value distributions are special cases of our newly established family of accelerated max-stable distributions. The results obtained can be applied to more complex data, e.g., Big Data. The new results also establish the probability foundation of previously proposed statistical models in extreme time series modeling. Those models include Heffernan et al. (Citation2007) that introduces one scheme where the maxima are taken over random variables with different distributions, and Zhang and Zhu (Citation2016) that models intra-daily maxima of high-frequency financial data.

The structure of this paper is as follows. In Section 2, (1) we give a brief review of the classical extreme value theory; (2) we define our maxima of maxima of sequences of random variables; (3) we use examples to demonstrate the characteristics of the maxima of maxima; (4) we establish the convergence of maxima of maxima to the accelerated max-stable distributions; (5) we illustrate density functions of the new family of accelerated max-stable distributions and evaluate moments and tail equivalence. Simulations are used to demonstrate the advantages of the accelerated max-stable distribution family in terms of the estimation accuracy of high quantiles at different levels. We also apply this new accelerated max-stable distribution to the high quantiles of the daily maxima of 330 stock returns of S&P 500 companies. In Section 3, the convergence of joint probability for general thresholds and approximation errors are developed. In Section 4, theoretical results for weakly dependent sequences are derived. Section 6 concludes. Additional figures and technical proofs are included in the Appendix.

2. Accelerated max-stable distribution for independent sequences

2.1. A brief review of classical univariate extreme value theory

In classical extreme value theory, the central result is the Fisher-Tippett theorem which specifies the form of the limit distribution for centered and normalised maxima. Let $X_{1}, X_{2}, \dots, X_{n}$ be a sequence of independent and identically distributed (i.i.d.) non-degenerate random variables (rvs) with common distribution function F and $M_{n} = max (X_{1}, \dots, X_{n})$ be the sample maxima. The Fisher-Tippett theorem states that: If for some norming constants $a_{n} > 0$ and centering constants $b_{n}$ we have (1) $P (a_{n} (M_{n} - b_{n}) \leq x) \overset{w}{\to} H (x)$ (1) for some nondegenerate H, where $\overset{w}{\to}$ stands for convergence in distribution, then H belongs to one type of the following three cumulative distribution functions (cdf's): (2) $\begin{aligned} F r' e c h e t : Φ_{α} (x) = \{\begin{cases} 0, & x \leq 0 \\ \exp {- x^{- α}}, & x > 0, \end{cases} α > 0. \\ W e i b u l l : Ψ_{α} (x) = \{\begin{cases} \exp {- (- x)^{α}}, & x \leq 0 \\ 1, & x > 0, \end{cases} α > 0. \\ G u m b e l : Λ (x) = \exp {- e^{- x}}, x \in R . \end{aligned}$ (2) Conversely, every extreme value distribution in (Equation2(2) $\begin{aligned} F r' e c h e t : Φ_{α} (x) = \{\begin{cases} 0, & x \leq 0 \\ \exp {- x^{- α}}, & x > 0, \end{cases} α > 0. \\ W e i b u l l : Ψ_{α} (x) = \{\begin{cases} \exp {- (- x)^{α}}, & x \leq 0 \\ 1, & x > 0, \end{cases} α > 0. \\ G u m b e l : Λ (x) = \exp {- e^{- x}}, x \in R . \end{aligned}$ (2) ) can be a limit in (Equation1(1) $P (a_{n} (M_{n} - b_{n}) \leq x) \overset{w}{\to} H (x)$ (1) ), and in particular, when H itself is the cdf of each $X_{i}$ , the limit is itself. We say that F belongs to the maximum domain of attraction of the extreme value distribution of H, and denote as $F \in M D A (H)$ when (Equation1(1) $P (a_{n} (M_{n} - b_{n}) \leq x) \overset{w}{\to} H (x)$ (1) ) holds. H is also called the max-stable distribution since for any $n = 2, 3, \dots$ , there are constants $a_{n} > 0$ and $b_{n}$ such that $H^{n} (a_{n} x + b_{n}) = H (x)$ . Due to this property, the equivalence of extreme value distribution or max-stable distribution in practice is mutually implied.

2.2. Maxima of maxima

Suppose that the independent mixed sequence of random variables ${X_{i}}_{i = 1}^{n}$ is composed of k subsequences ${X_{j, i}}_{i = 1}^{n_{j}}, j = 1, 2, \dots, k$ ; ${X_{j, i}}_{i = 1}^{n_{j}} \overset{i . i . d .}{\sim} F_{j} (x)$ , $n_{j} \to \infty$ as $n \to \infty$ and $n = n_{1} + \dots + n_{k}$ . Denote $M_{j, n_{j}} = max (X_{j, i}, i = 1, \dots, n_{j})$ as the maximum of the jth subsequence, $j = 1, 2, \dots, k$ . Suppose $F_{j} \in M D A (H_{j})$ , where $H_{j}$ is one of the three types of extreme value distributions, i.e., $M_{j, n_{j}}$ has the following limit distribution with some norming constants $a_{j, n_{j}} > 0$ and centering constants $b_{j, n_{j}},$ (3) $lim_{n \to \infty} P (a_{j, n_{j}} (M_{j, n_{j}} - b_{j, n_{j}}) \leq x) = H_{j} (x) .$ (3) Define $M_{n} = max (M_{1, n_{1}}, M_{2, n_{2}}, \dots, M_{k, n_{k}})$ , i.e., $M_{n}$ is the maxima of k maxima of $M_{j, n_{j}}$ s. Throughout the paper, $M_{n}$ is termed as the maxima of maxima. Questions can be asked: (1) whether or not Equation (Equation1(1) $P (a_{n} (M_{n} - b_{n}) \leq x) \overset{w}{\to} H (x)$ (1) ) holds with appropriately chosen norming constants $a_{n} > 0, b_{n}$ ; (2) if Equation (1) holds, whether or not $a_{n} > 0, b_{n}$ are equivalent to any of $a_{j, n_{j}} > 0, b_{j, n_{j}}$ ; (3) whether or not $H (x)$ is a function of $H_{j} (x)$ ; (4) if all Equations (1)–(3) hold, which one is the best method to be used in practice. This paper intends to answer these four questions.

Practical examples related to the above defined process can be numerous. For example, (1) the maximum temperature of the US in a day can be described by the maximum of maxima of regional maximum temperatures. In each region, the maximum temperature is the maximum temperature recordings among all weather stations in the region. Considering the regions' spatial and geographical patterns, the regional maxima certainly follow different extreme value distributions from one region to another region. The US temperature maxima are the maxima of regional maxima, and should be modelled by a distribution function that is a function of the regional extreme value distribution functions. (2) Considering the daily risk of high-frequency trading in a stock market, one can partition the data into hourly data (from 9:00 am to 4:00 pm). Suppose each hourly maxima $M_{j, n_{j}}$ of negative returns can be approximately modelled by an extreme value distribution of $H_{j} (x)$ . It is clear that $M_{n}$ is better modelled by a function of $H_{j} (x), j = 1, \dots, 7,$ i.e., not a single $H_{j} (x)$ . We use the following simple example with k = 2 to illustrate the idea.

Example 2.1

The sequence ${X_{i}}_{i = 1}^{n}$ is generated by $X_{i} = max (Y_{i}, Z_{i})$ , where ${Y_{i}}_{i = 1}^{n} \overset{i . i . d .}{\sim} F_{1} (x)$ , ${Z_{i}}_{i = 1}^{n} \overset{i . i . d .}{\sim} F_{2} (x)$ , and $F_{1} (x)$ and $F_{2} (x)$ are two distribution functions. Assume $Y_{i}$ and $Z_{i}$ are independent. Then ${X_{i}}_{i = 1}^{n} \overset{i . i . d .}{\sim} F (x) = F_{1} (x) F_{2} (x)$ .

Remark

The form $X_{i} = max (Y_{i}, Z_{i})$ is the simplest case in the general mixture models introduced in Zhao and Zhang (Citation2018). It is also the simplest case in the copula structured M4 models studied by Zhang and Zhu (Citation2016).

For illustrative purpose of Example 2.1, let's consider two scenarios. Suppose ${Y_{i}^{[k]}}_{i = 1}^{n} \overset{i . i . d .}{\sim} N (0, 1)$ and ${Z_{i}^{[k]}}_{i = 1}^{n} \overset{i . i . d .}{\sim} U [a, b]$ for $k = 1, \dots, m$ . Here $U [a, b]$ represents the uniform distribution on the interval $[a, b]$ . The superscript $[k]$ stands for the kth sample sequence. In Scenario 1, Figure illustrates two different simulated sequences of ${X_{i}^{[k]}}_{i = 1}^{n}$ , where $X_{i}^{[k]} = max (Y_{i}^{[k]}, Z_{i}^{[k]})$ , and the maxima of $M_{n}^{[k]} = max (X_{1}^{[k]}, \dots, X_{n}^{[k]})$ for n = 100 and a particular k, e.g., k = 1. Next, we repeatedly generate $m = 10,000$ such sequences ${X_{i}^{[k]}}_{i = 1}^{n}, k = 1, \dots, m$ . By taking the maxima $M_{n}^{[k]} = max (X_{1}^{[k]}, \dots, X_{n}^{[k]}), k = 1, \dots, m$ , the histogram of ${M_{n}^{[k]}}_{k = 1}^{m}$ is displayed in Figure (a) with a = −2.2 and b = 2.2. In Scenario 2, by replacing the marginal distribution of $Z_{i}^{[k]}$ with $U [- 2.8, 2.8]$ , the histogram of $M_{n}^{[k]}$ is shown in Figure (b). It is clear that although ${X_{i}^{[k]}}_{i = 1}^{n}$ is independent and identically distributed (i.i.d.), one can see that the distribution of $M_{n}$ looks quite different from the three types of extreme value distributions.

Figure 1. Simulated mixed sequences from normal and uniform distributions and their maxima (marked with black dots). In (a), the maximum is from the uniform distribution; in (b), the maximum is from $N (0, 1)$ .

Figure 2. (a) Histogram of $M_{n}$ from $N (0, 1)$ and $U [- 2.2, 2.2]$ . (b) Histogram of $M_{n}$ from $N (0, 1)$ and $U [- 2.8, 2.8]$ .

In Example 2.1, the larger values of two paired underlying subsequences are observed while the smaller values are covered up by larger ones and are never observed. The sample sizes from the two subsequences are the same. However, in general mixed sequences the ratios of sample sizes from two subsequences $n_{1} / n_{2}$ can be any value between 0 and infinity and can vary as the total sample size grows. As a result, we can see many kinds of different patterns different from Figure .

In practice, data generating processes are naturally formed spatially and temporarily from underlying physical processes of studies. Here we provide two data generating processes in simulation.

For a given sample size n, we set the numbers $n_{1}$ and $n_{2}$ satisfying $n_{1} + n_{2} = n$ and assume that $lim_{n \to \infty} n_{1} / (n_{1} + n_{2}) \to r$ , $r \in (0, 1)$ . Then we generate the specified $n_{1}$ and $n_{2}$ observations from two populations respectively, stack them in a sequence, and perform a random permutation of the combined sequence. In a physical process, the procedure can be designed as: we mix $n_{1}$ yellow balls and $n_{2}$ white balls in a bag. Then we draw balls sequentially. If a yellow ball is drawn, generate a number from the first population, otherwise from the second population.
Alternatively, suppose $a_{1, n_{1}}$ , $a_{2, n_{2}}$ are norming constants defined in Equation (Equation3(3) $lim_{n \to \infty} P (a_{j, n_{j}} (M_{j, n_{j}} - b_{j, n_{j}}) \leq x) = H_{j} (x) .$ (3) ) with known population distributions in simulation, we can set $n_{1} + n_{2} = n$ and let $lim_{n \to \infty} a_{1, n_{1}} / a_{2, n_{2}} = r$ , and then solve $n_{1}$ and $n_{2}$ to generate the observations as the last step.

Example 2.2

Using the sampling scheme designed above. Suppose there are two sequences ${X_{1, i}}_{i = 1}^{n_{1}} \overset{i . i . d .}{\sim} N (0, 0.9)$ and ${X_{2, i}}_{i = 1}^{n_{2}} \overset{i . i . d .}{\sim} U [- 2, 2]$ with $n_{1} = 100$ and $n_{2} = 200$ . ${X_{k}}_{k = 1}^{300}$ is mixed with these two sequences. Let $M_{n}^{[j]}$ be the maxima of the jth realisation of the sequence, $j = 1, \dots, m$ , n = 300. With $m = 10,000$ , ${M_{n}^{[j]}}_{j = 1}^{m}$ are calculated and the histogram is shown in Figure (a). The case of $n_{1} = 200$ and $n_{2} = 100$ is shown in Figure (b).

Figure 3. Histograms of combinations of $M_{n}$ from $N (0, 0.9)$ and $U [- 2, 2]$ . (a) $n_{1} = 100$ , $n_{2} = 200$ . (b) $n_{1} = 200$ , $n_{2} = 100$ .

The histograms in Figure look different from any of the three types of extreme value distributions discussed in (Equation2(2) $\begin{aligned} F r' e c h e t : Φ_{α} (x) = \{\begin{cases} 0, & x \leq 0 \\ \exp {- x^{- α}}, & x > 0, \end{cases} α > 0. \\ W e i b u l l : Ψ_{α} (x) = \{\begin{cases} \exp {- (- x)^{α}}, & x \leq 0 \\ 1, & x > 0, \end{cases} α > 0. \\ G u m b e l : Λ (x) = \exp {- e^{- x}}, x \in R . \end{aligned}$ (2) ). One feature is that they can be bimodal. On the other hand, the classical GEV distributions are all unimodal. Figure shows two specific examples of choices of $n_{1}$ and $n_{2}$ . In more general situations, the ratios of $n_{1}$ and $n_{2}$ can be any values in $(0, \infty)$ . The ratio $n_{1} / n_{2}$ may also change as n increases. In Figures and , the left parts of the distributions are dominated by the Weibull type induced by the uniform distribution, and the right parts resemble the Gumbel type induced by the normal distribution. The reason is that when we look at the maxima of ${X_{i}}_{i = 1}^{n}$ , there are two populations competing with each other. Taking (b) in Figure as an example, the winners from $U [- 2, 2]$ form the steep peak on the left; and the winners from $N (0, 0.9)$ form the smoother peak on the right.

Figure (a) shows the distribution of $M_{n}$ for the sequence which is mixed with $N (0, 1)$ and a Fréchet distribution. In (b), (c) and (d), they show the combinations of one Fréchet distribution and one Weibull distribution. Notice that in panel (b), the distribution looks left-skewed and is very similar to a Weibull distribution. However, with the effect of the Fréchet distribution, it actually has an infinite right endpoint.

Figure 4. Histograms of $M_{n}$ . (a) $N (0, 1)$ and Fréchet combination. (b)–(d) Some combinations of Fréchet and Weibull.

In Figure , histograms of $M_{n}$ are created such that the independent sequences of random variables ${X_{i}}_{i = 1}^{n}$ are generated by comparing the pairs of observations from normal and Weibull distribution. They can be unimodal or bimodel, left-skewed or right-skewed. If we use the GEV family to characterise the distributions of $M_{n}$ in these examples, it may not capture the shape of the distribution properly. For example, if we look at the left part of the distribution in Figure (d), it resembles a Weibull distribution that has a finite right endpoint. However, because of the effect of the normal distribution on the right tail, the shape changes suddenly to be similar to a Gumbel distribution with infinite right endpoint. If we fit a GEV distribution to $M_{n}$ , the left part with more sample data may have a large effect on the fitted distribution and we may underestimate the long tail on the right.

Figure 5. Histograms of $M_{n}$ , with combinations of normal distribution and Weibull distribution.

2.3. Convergence to the accelerated max-stable distribution

Throughout the paper, $x_{F} = sup {x; F (x) < 1}$ is the right endpoint of a cdf F and let $\bar{F} (x) = 1 - F (x)$ ; $M_{n} = max (M_{1, n_{1}}, \dots, M_{k, n_{k}})$ is restricted to k = 2. For k>2, relative results can be derived with additional notations. The following theorem shows that under certain conditions on the norming constants $a_{j, n_{j}}$ and $b_{j, n_{j}}$ , we can choose one set of the norming constants for the global maximum $M_{n} = max (M_{1, n_{1}}, M_{2, n_{2}})$ to derive its limit distribution. Theorem 2.1 can be directly derived from Khintchine's theorem.

Theorem 2.1

If $M_{1, n_{1}}$ and $M_{2, n_{2}}$ satisfy (Equation3(3) $lim_{n \to \infty} P (a_{j, n_{j}} (M_{j, n_{j}} - b_{j, n_{j}}) \leq x) = H_{j} (x) .$ (3) ) for j = 1, 2, the limit distribution of $M_{n}$ as $n \to \infty$ can be determined in the following cases:

Case 1.	If $\frac{a_{1, n_{1}}}{a_{2, n_{2}}} \to a > 0$ , $a_{1, n_{1}} (b_{2, n_{2}} - b_{1, n_{1}}) \to b < + \infty$ , for some constants a and b, then (4) $P (a_{2, n_{2}} (M_{n} - b_{2, n_{2}}) \leq x) \to H_{1} (a x + b) H_{2} (x) .$ (4)
Case 2.	If $\frac{a_{1, n_{1}}}{a_{2, n_{2}}} \to 0$ , $a_{1, n_{1}} (b_{2, n_{2}} - b_{1, n_{1}}) \to + \infty$ , then (5) $P (a_{2, n_{2}} (M_{n} - b_{2, n_{2}}) \leq x) \to H_{2} (x) .$ (5)

Notice that the limit in Case 1 is the product of two extreme value distributions, $H_{1} (a x + b) H_{2} (x)$ . Although it is in the product form, sometimes it can still be reduced to the three classical extreme value distributions. For example, $\exp {- x^{- α}} \exp {- (\frac{x}{2})^{- α}}$ is still a Fréchet type. However, in some situations, when the conditions in Case 1 are satisfied, the limit product form cannot be reduced to any one of the three extreme value distributions. We next present several examples to illustrate these possibilities.

Example 2.3

Fréchet and Gumbel

Suppose $F_{1} (x) = Φ_{α} (x)$ is a Fréchet distribution function, and $F_{2} (x) = Λ (x)$ is the standard Gumbel distribution function. By choosing $a_{1, n_{1}} = n_{1}^{- 1 / α}, b_{1, n_{1}} = 0$ , $a_{2, n_{2}} = 1, b_{2, n_{2}} = \log n_{2}$ we have (6) $P (M_{2, n_{2}} - \log n_{2} \leq x) \to Λ (x) .$ (6) Then when $n_{1}^{1 / α} / \log n_{2} \to \infty$ , we have $\begin{aligned} P (n_{1}^{- 1 / α} M_{n} \leq x) \\ = P (n_{1}^{- 1 / α} M_{1, n_{1}} \leq x, M_{2, n_{2}} - \log n_{2} \\ \leq n_{1}^{1 / α} x - \log n_{2}) \\ \to Φ_{α} (x) . \end{aligned}$

Example 2.4

Fréchet and Fréchet

Suppose $F_{1} (x) = Φ_{α_{1}} (x)$ and $F_{2} (x) = Φ_{α_{2}} (x)$ are two Fréchet distribution functions such that $α_{1} > α_{2}$ , which means that the tail of $F_{2} (x)$ is heavier than the tail of $F_{1} (x)$ . By choosing norming constants $a_{1, n_{1}} = n_{1}^{- 1 / α_{1}}$ , $b_{1, n_{1}} = 0$ and $a_{2, n_{2}} = n_{2}^{- 1 / α_{2}}$ , $b_{2, n_{2}} = 0$ we have (7) $P (n_{j}^{- 1 / α_{j}} M_{j, n_{j}} \leq x) = Φ_{α_{j}} (x), j = 1, 2,$ (7) and (8) $\begin{aligned} P (n_{2}^{- 1 / α_{2}} M_{n} \leq x) \\ = P (n_{1}^{- 1 / α_{1}} M_{1, n_{1}} \leq \frac{n_{2}^{1 / α_{2}}}{n_{1}^{1 / α_{1}}} x, n_{2}^{- 1 / α_{2}} M_{2, n_{2}} \leq x) . \end{aligned}$ (8) If $n_{2}^{1 / α_{2}} / n_{1}^{1 / α_{1}} \to a > 0$ , then (9) $P (n_{2}^{- 1 / α_{2}} M_{n} \leq x) \to Φ_{α_{1}} (a x) Φ_{α_{2}} (x) .$ (9) If $n_{2}^{1 / α_{2}} / n_{1}^{1 / α_{1}} \to + \infty$ , then (10) $P (n_{2}^{- 1 / α_{2}} M_{n} \leq x) \to Φ_{α_{2}} (x) .$ (10)

In Example 2.4, the sequence is mixed with two Fréchet distributions with different shape parameters. The limit distribution of $M_{n}$ for this mixed sequence is the product of two Fréchet distributions, which is different from any of the three types of extreme value distributions.

Example 2.5

Uniform and normal

Suppose $F_{1} (x)$ is the function of the uniform distribution $U [0, 1]$ , $F_{2} (x)$ is the distribution function of $N (0, 1)$ . By choosing (11) $a_{1, n_{1}} = n_{1}, b_{1, n_{1}} = 1,$ (11) and $\begin{aligned} a_{2, n_{2}} & = (2 \log n_{2})^{1 / 2}, \\ b_{2, n_{2}} & = (2 \log n_{2})^{1 / 2} \\ - \frac{1}{2} (2 \log n_{2})^{- 1 / 2} (\log \log n_{2} + \log 4 π), \end{aligned}$ we have (12) $P (n_{1} (M_{1, n_{1}} - 1) \leq x) \to e^{x}$ (12) for x<0, and (13) $P (a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \to Λ (x) .$ (13) Then $\begin{aligned} P (a_{2, n_{2}} (M_{n} - b_{2, n_{2}}) \leq x) \\ = P (n_{1} (M_{1, n_{1}} - 1) \leq n_{1} (\frac{x}{a_{2, n_{2}}} + b_{2, n_{2}} - 1), \\ a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) . \end{aligned}$ Since $n_{1} (\frac{x}{a_{2, n_{2}}} + b_{2, n_{2}} - 1) \to + \infty$ for any x, we have (14) $P (a_{2, n_{2}} (M_{n} - b_{2, n_{2}}) \leq x) \to Λ (x) .$ (14)

Example 2.6

Weibull and Weibull

Suppose $x_{F} < \infty$ and $K_{1} > 0, K_{2} > 0$ , (15) $\begin{aligned} F_{1} (x) = 1 - K_{1} (x_{F} - x)^{α_{1}}, x_{F} - K_{1}^{- 1 / α_{1}} \leq x \leq x_{F}, \end{aligned}$ (15) (16) $\begin{aligned} F_{2} (x) = 1 - K_{2} (x_{F} - x)^{α_{2}}, x_{F} - K_{2}^{- 1 / α_{2}} \leq x \leq x_{F}, \end{aligned}$ (16) are two polynomial functions with common finite endpoint $x_{F}$ , $α_{1} > α_{2}$ . We can choose $a_{1, n_{1}} = (n_{1} K_{1})^{1 / α_{1}}$ , $b_{1, n_{1}} = x_{F}$ , $a_{2, n_{2}} = (n_{2} K_{2})^{1 / α_{2}}$ , $b_{2, n_{2}} = x_{F}$ , and (17) $\begin{aligned} P ((n_{1} K_{1})^{1 / α_{1}} (M_{1, n_{1}} - x_{F}) \leq x) & \to Ψ_{α_{1}} (x), \end{aligned}$ (17) (18) $\begin{aligned} P ((n_{2} K_{2})^{1 / α_{2}} (M_{2, n_{2}} - x_{F}) \leq x) & \to Ψ_{α_{2}} (x) . \end{aligned}$ (18) If $\frac{(n_{2} K_{2})^{1 / α_{2}}}{(n_{1} K_{1})^{1 / α_{1}}} \to a > 0$ , then $\begin{aligned} P ((n_{1} K_{1})^{1 / α_{1}} (M_{n} - x_{F}) \leq x) \\ = P ((n_{1} K_{1})^{1 / α_{1}} (M_{1, n_{1}} - x_{F}), (n_{2} K_{2})^{1 / α_{2}} \\ \times (M_{2, n_{2}} - x_{F}) \leq \frac{(n_{2} K_{2})^{1 / α_{2}}}{(n_{1} K_{1})^{1 / α_{1}}} x) \\ \to Ψ_{α_{1}} (x) Ψ_{α_{2}} (a x) . \end{aligned}$

Example 2.7

Normal and Pareto

Suppose $F_{1} (x)$ is the standard normal distribution function of $N (0, 1)$ , $F_{2} (x) = 1 - K x^{- α}$ , $α > 0$ , K>0 is a Pareto distribution function. Let $\begin{aligned} a_{1, n_{1}} & = (2 \log n_{1})^{1 / 2}, \\ b_{1, n_{1}} & = (2 \log n_{1})^{1 / 2} \\ - \frac{1}{2} (2 \log n_{1})^{- 1 / 2} (\log \log n_{1} + \log 4 π), \\ a_{2, n_{2}} & = (K n_{2})^{- 1 / α}, b_{2, n_{2}} = 0. \end{aligned}$ Then $\begin{aligned} P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \leq x, a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \\ \to \{\begin{cases} 0, & x < 0, \\ \exp (- e^{- x} - x^{- α}), & x \geq 0. \end{cases} \end{aligned}$ Furthermore, if $a_{2, n_{2}}, b_{1, n_{1}} \to \infty$ , then $P (a_{1, n_{1}} (M_{n} - b_{1, n_{1}}) \leq x) \to \exp (- e^{- x}) .$

Example 2.8

Cauchy and uniform distribution

$F_{1} (x) = \frac{1}{2} + \frac{1}{π} \tan^{- 1} x$ is the standard Cauchy distribution function, and $F_{2} (x) = x, 0 \leq x \leq 1$ , let $\begin{aligned} \begin{array}{ll} a_{1, n_{1}} = \tan (\frac{π}{n_{1}}) \sim \frac{π}{n_{1}}, & b_{1, n_{1}} = 0, \\ a_{2, n_{2}} = n_{2}, & b_{2, n_{2}} = 1. \end{array} \end{aligned}$ Then $\begin{aligned} P (\frac{π}{n_{1}} M_{1, n_{1}} \leq x, n_{2} (M_{2, n_{2}} - 1) \leq x) \\ \to \{\begin{cases} 0, & x < 0, \\ \exp (- x^{- 1}), & x \geq 0, \end{cases} \end{aligned}$ and $P (a_{1, n_{1}} (M_{n} - b_{1, n_{1}}) \leq x) \to \{\begin{cases} 0, & x < 0, \\ \exp (- x^{- 1}), & x \geq 0. \end{cases}$

In Example 2.8, the limit distribution for the normalised $M_{1, n_{1}}$ is 0 when x<0, and the limit distribution for the normalised $M_{2, n_{2}}$ is 1 when x>0. Thus, the product is the same as the former one.

In Examples 2.3 and 2.5, we showed that when n is sufficiently large (goes to infinity), the distribution of $M_{n}$ will be dominated by the subsequence whose marginal distribution has a heavier tail. In Examples 2.4 and 2.6, if the ratio $n_{2}^{1 / α_{2}} / n_{1}^{1 / α_{1}}$ converges to a constant, then one subsequence is never dominated by another, and the limit is of the product form that cannot be reduced to a classical extreme value distribution if $α_{1} \neq α_{2}$ .

We now introduce the accelerated max-stable distribution (AMSD) or the accelerated extreme value distribution (AEVD). We consider the convergence of the probability related to the normalised maxima $M_{1, n_{1}}$ and $M_{2, n_{2}}$ of two subsequences separately. By the relationship $M_{n} = max (M_{1, n_{1}}, M_{2, n_{2}})$ , we can use the accelerated max-stable distribution to approximate the distribution of $M_{n}$ . The classical extreme value distributions will be special cases in the accelerated max-stable distribution family.

Definition 2.1

Let $H_{1} (x)$ and $H_{2} (x)$ be two max-stable distribution functions, we call $H (x) = H_{1} (x) H_{2} (x)$ the accelerated max-stable distribution (AMSD/AEVD) function, which is the product of two max-stable distribution functions. More generally, we also say that $H (x)$ belongs to the accelerated max-stable distribution family if it is the product of k max-stable distribution functions, $k \geq 2$ .

Remark

If Z follows an accelerated max-stable distribution $H (x)$ , then Z can be expressed as $Z = max (Z_{1}, \dots, Z_{k})$ , where each $Z_{i}$ follows a max-stable distribution. By taking maxima of $(Z_{1}, \dots, Z_{k})$ , $Z_{i}$ values are accelerated by other components $Z_{j}$ s to get observed Z values. On the other hand, we have $\begin{aligned} H_{1} (x) H_{2} (x) \dots H_{k} (x) \\ \leq H_{1} (x) H_{2} (x) \dots H_{k - 1} (x) \\ \leq H_{1} (x) H_{2} (x) \dots H_{k - 2} (x) \\ \leq \dots \leq H_{1} (x) \end{aligned}$ and $\begin{aligned} \bar{H_{1} (x) H_{2} (x) \dots H_{k} (x)} & \geq \bar{H_{1} (x) H_{2} (x) \dots H_{k - 1} (x)} \\ \geq \bar{H_{1} (x) H_{2} (x) \dots H_{k - 2} (x)} \\ \geq \dots \geq \bar{H_{1} (x)}, \end{aligned}$ where $\bar{H} (x)$ stands for the survival function, i.e., $\bar{H_{1} (x) H_{2} (x) \dots H_{k} (x)} = 1 - H_{1} (x) H_{2} (x) \dots H_{k} (x) .$ The above inequalities may be regarded as accelerated survival rates. This observation motivates us to call the new distribution as the accelerated max-stable (extreme value) distribution. In the view of risk analysis, the systemic risk of Z is accelerated from individual risks of $Z_{j}$ s given a fixed confidence level.

For the independent sequence of random variables ${X_{i}}_{i = 1}^{n}$ with two subsequences ${X_{1, i}}_{i = 1}^{n_{1}}$ and ${X_{2, i}}_{i = 2}^{n_{2}}$ defined as above, suppose (Equation3(3) $lim_{n \to \infty} P (a_{j, n_{j}} (M_{j, n_{j}} - b_{j, n_{j}}) \leq x) = H_{j} (x) .$ (3) ) is satisfied with j = 1, 2 and norming constants $a_{j, n_{j}} > 0, b_{j, n_{j}}$ , i.e., (19) $lim_{n \to \infty} P (a_{j, n_{j}} (M_{j, n_{j}} - b_{j, n}) \leq x) = H_{j} (x), j = 1, 2,$ (19) then (20) $\begin{aligned} P (max (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}), a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}})) \leq x) \\ \to H (x) = H_{1} (x) H_{2} (x) . \end{aligned}$ (20)

Definition 2.2

Suppose an independent sequence of random variables ${X_{i}}_{i = 1}^{n}$ satisfies (Equation19(19) $lim_{n \to \infty} P (a_{j, n_{j}} (M_{j, n_{j}} - b_{j, n}) \leq x) = H_{j} (x), j = 1, 2,$ (19) ) and (Equation20(20) $\begin{aligned} P (max (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}), a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}})) \leq x) \\ \to H (x) = H_{1} (x) H_{2} (x) . \end{aligned}$ (20) ). We call the underlying distribution, $F_{n i}$ , of $X_{i}$ belongs to the competing-maximum domain of attractions of $H_{1}$ and $H_{2}$ , and denote as $F_{n i} \in C M D A (H_{1}, H_{2})$ .

We note that a max-stable distribution may also be decomposed into a product of two max-stable distributions. As a result, the max-stable distribution family can be thought as a family that is embedded in the accelerated max-stable distribution family. This observation can be seen in Theorem 2.1 that the limits of $P (a_{2, n_{2}} (M_{n} - b_{2, n_{2}}))$ under two different conditions belong to the accelerated max-stable distribution family. In other words, the accelerated max-stable distributions form an expanded family of distributions that can describe the limiting distribution of the normalised maxima for more general sequences.

For k = 2 and $F_{n i} \in C M D A (H_{1}, H_{2})$ , AMSDs/AEVDs can have the following six possible combinations:

Case 1.	$F_{j} \in M D A (Λ),$ j = 1, 2, $\begin{aligned} P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \\ \leq x, a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \\ \overset{w}{\to} Λ (\frac{x - b_{1}}{a_{1}}) Λ (\frac{x - b_{2}}{a_{2}}) . \end{aligned}$
Case 2.	$F_{1} \in M D A (Φ_{α_{1}})$ and $F_{2} \in M D A (Φ_{α_{2}})$ , $\begin{aligned} P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \\ \leq x, a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \\ \overset{w}{\to} Φ_{α_{1}} (\frac{x - b_{1}}{a_{1}}) Φ_{α_{2}} (\frac{x - b_{2}}{a_{2}}) . \end{aligned}$
Case 3.	$F_{1} \in M D A (Ψ_{α_{1}})$ and $F_{2} \in M D A (Ψ_{α_{2}})$ , $\begin{aligned} P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \\ \leq x, a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \\ \overset{w}{\to} Ψ_{α_{1}} (\frac{x - b_{1}}{a_{1}}) Ψ_{α_{2}} (\frac{x - b_{2}}{a_{2}}) . \end{aligned}$
Case 4.	$F_{1} \in M D A (Λ)$ and $F_{2} \in M D A (Φ_{α})$ , $\begin{aligned} P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \\ \leq x, a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \\ \overset{w}{\to} Λ (\frac{x - b_{1}}{a_{1}}) Φ_{α} (\frac{x - b_{2}}{a_{2}}) . \end{aligned}$
Case 5.	$F_{1} \in M D A (Λ)$ and $F_{2} \in M D A (Ψ_{α})$ , $\begin{aligned} P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \\ \leq x, a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \\ \overset{w}{\to} Λ (\frac{x - b_{1}}{a_{1}}) Ψ_{α} (\frac{x - b_{2}}{a_{2}}) . \end{aligned}$
Case 6.	$F_{1} \in M D A (Φ_{α_{1}})$ and $F_{2} \in M D A (Ψ_{α_{2}})$ , $\begin{aligned} P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \\ \leq x, a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \\ \overset{w}{\to} Φ_{α_{1}} (\frac{x - b_{1}}{a_{1}}) Ψ_{α_{2}} (\frac{x - b_{2}}{a_{2}}) . \end{aligned}$

It is easy to see that the classical extreme value distributions are special cases of the AMSD family. For any a>0, b>0 satisfying $\frac{1}{a} + \frac{1}{b} = 1,$ we have $\begin{aligned} Λ (x) & = \exp {- e^{- x}} = \exp {- e^{- (\frac{x}{a} + \frac{x}{b})}} \\ = \exp {(- e^{- x - \log a} - e^{- x - \log b})} \\ = Λ (x + \log a) Λ (x + \log b) . \\ Φ_{α} (x) & = \exp {- x^{- α}} \\ = \exp \{- {(\frac{x}{a^{- \frac{1}{α}}})}^{- α} - {(\frac{x}{b^{- \frac{1}{α}}})}^{- α}\} \\ = Φ (\frac{x}{a^{- \frac{1}{α}}}) Φ (\frac{x}{b^{- \frac{1}{α}}}) . \\ Ψ_{α} (x) & = \exp {- (- x)^{α}} \\ = \exp \{- {(\frac{- x}{a^{\frac{1}{α}}})}^{α} - {(\frac{- x}{b^{\frac{1}{α}}})}^{α}\} \\ = Ψ (\frac{x}{a^{\frac{1}{α}}}) Ψ (\frac{x}{b^{\frac{1}{α}}}) . \end{aligned}$ Since $H_{1} (x)$ and $H_{2} (x)$ are max-stable distributions, for any $n_{1} = 2, 3, \dots$ and $n_{2} = 2, 3, \dots$ , there are constants $a_{1, n_{1}} > 0$ , $b_{1, n_{1}}$ , $a_{2, n_{2}} > 0$ , $b_{2, n_{2}}$ such that $H_{1} (x) H_{2} (x) = H_{1}^{n_{1}} (a_{1, n_{1}} x + b_{1, n_{1}}) H_{2}^{n_{2}} (a_{2, n_{2}} x + b_{2, n_{2}})$ .

In Equation (Equation20(20) $\begin{aligned} P (max (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}), a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}})) \leq x) \\ \to H (x) = H_{1} (x) H_{2} (x) . \end{aligned}$ (20) ), we considered the convergence of $P (max (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}), a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}})) \leq x),$ instead of the traditional $P (a_{n} (M_{n} - b_{n}) \leq x)$ . If $n_{1}$ and $n_{2}$ are sufficiently large, by (Equation19(19) $lim_{n \to \infty} P (a_{j, n_{j}} (M_{j, n_{j}} - b_{j, n}) \leq x) = H_{j} (x), j = 1, 2,$ (19) ) we have $P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \leq x) \approx G_{1} (x)$ and $P (a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \approx G_{2} (x)$ , then (21) $\begin{aligned} P (M_{n} \leq x) & = P (max (M_{1, n_{1}}, M_{2, n_{2}}) \leq x) \\ = P (M_{1, n_{1}} \leq x) P (M_{2, n_{2}} \leq x) \\ \approx G_{1} (a_{1, n_{1}} (x - b_{1, n_{1}})) G_{2} (a_{2, n_{2}} (x - b_{2, n_{2}})) \\ = G_{1}^{*} (x) G_{2}^{*} (x), \end{aligned}$ (21) where $G_{j}^{*}$ is of the same type as $G_{j}$ , j = 1, 2.

To close this section, we remark that (Equation21(21) $\begin{aligned} P (M_{n} \leq x) & = P (max (M_{1, n_{1}}, M_{2, n_{2}}) \leq x) \\ = P (M_{1, n_{1}} \leq x) P (M_{2, n_{2}} \leq x) \\ \approx G_{1} (a_{1, n_{1}} (x - b_{1, n_{1}})) G_{2} (a_{2, n_{2}} (x - b_{2, n_{2}})) \\ = G_{1}^{*} (x) G_{2}^{*} (x), \end{aligned}$ (21) ) is the basis of applying the newly introduced AMSD/AEVD family to real data. Based on (Equation21(21) $\begin{aligned} P (M_{n} \leq x) & = P (max (M_{1, n_{1}}, M_{2, n_{2}}) \leq x) \\ = P (M_{1, n_{1}} \leq x) P (M_{2, n_{2}} \leq x) \\ \approx G_{1} (a_{1, n_{1}} (x - b_{1, n_{1}})) G_{2} (a_{2, n_{2}} (x - b_{2, n_{2}})) \\ = G_{1}^{*} (x) G_{2}^{*} (x), \end{aligned}$ (21) ), in practice, we don't need to worry about the values of $n_{1}$ , $n_{2}$ , $a_{1, n_{1}}$ , $b_{1, n_{1}}$ , $a_{2, n_{2}}$ , $b_{2, n_{2}}$ , as they are absorbed in $G_{1}^{*} (x)$ and $G_{2}^{*} (x)$ , see also Coles (Citation2001). In our examples, we have used some fixed numbers for $n_{1}$ and $n_{2}$ . They are just for simulation convenience. When n tends to infinity, the values of $n_{1}$ and $n_{2}$ will depend on n.

The next section presents density functions and shapes from which one can see the flexibility of applying the new distribution to real data modelling.

2.4. Density functions and density plots

The density function of the accelerated max-stable distribution requires some discussion of the support region of the cumulative distribution function. We can express the two terms in the product using the form of the generalised extreme value distribution, (22) $\begin{aligned} F (x) & = H_{ξ_{1}; μ_{1}, σ_{1}} (x) H_{ξ_{2}; μ_{2}, σ_{2}} (x) \\ = \exp \{- {[1 + ξ_{1} \frac{x - μ_{1}}{σ_{1}}]}^{- 1 / ξ_{1}} \\ - {[1 + ξ_{2} \frac{x - μ_{2}}{σ_{2}}]}^{- 1 / ξ_{2}}\}, \end{aligned}$ (22) where $1 + ξ_{1} \frac{x - μ_{1}}{σ_{1}} > 0$ and $1 + ξ_{2} \frac{x - μ_{2}}{σ_{2}} > 0$ . We include the special case $H_{0; μ_{i}, σ_{2}}$ as the limit of $H_{ξ_{i}; μ_{i}, σ_{i}}$ for $ξ_{i} \to 0, i = 1, 2$ . Denote the density function as $f (x)$ and let $\begin{aligned} h (x) & = \exp \{- {[1 + ξ_{1} \frac{x - μ_{1}}{σ_{1}}]}^{- 1 / ξ_{1}} \\ - {[1 + ξ_{2} \frac{x - μ_{2}}{σ_{2}}]}^{- 1 / ξ_{2}}\} \\ \times [\frac{1}{σ_{1}} {(1 + ξ_{1} \frac{x - μ_{1}}{σ_{1}})}^{- 1 / ξ_{1} - 1} \\ + \frac{1}{σ_{2}} {(1 + ξ_{2} \frac{x - μ_{2}}{σ_{2}})}^{- 1 / ξ_{2} - 1}] . \end{aligned}$ Since $ξ_{1}$ and $ξ_{2}$ are symmetric, we only present one of them. We have the following six cases for the density functions.

Case 1.	$ξ_{1} = 0, ξ_{2} = 0$ . $\begin{aligned} f (x) & = \exp {- e^{- \frac{x - μ_{1}}{σ_{1}}} - e^{- \frac{x - μ_{2}}{σ_{2}}}} \\ \times [\frac{1}{σ_{1}} e^{- \frac{x - μ_{1}}{σ_{1}}} + \frac{1}{σ_{2}} e^{- \frac{x - μ_{2}}{σ_{2}}}], x \in R . \end{aligned}$
Case 2.	$ξ_{1} > 0$ , $ξ_{2} > 0,$ assuming $μ_{1} - \frac{σ_{1}}{ξ_{1}} \geq μ_{2} - \frac{σ_{2}}{ξ_{2}}$ , then $f (x) = \{\begin{cases} h (x), & i f x > μ_{1} - \frac{σ_{1}}{ξ_{1}}, \\ 0, & i f x \leq μ_{1} - \frac{σ_{1}}{ξ_{1}} . \end{cases}$
Case 3.	$ξ_{1} < 0$ , $ξ_{2} < 0$ , assuming $μ_{1} - \frac{σ_{1}}{ξ_{1}} \geq μ_{2} - \frac{σ_{2}}{ξ_{2}}$ , then $\begin{aligned} f (x) = \{\begin{cases} h (x), & i f x < μ_{2} - \frac{σ_{2}}{ξ_{2}}, \\ \exp \{- {[1 + ξ_{1} \frac{x - μ_{1}}{σ_{1}}]}^{- 1 / ξ_{1}}\} \\ \times [\frac{1}{σ_{1}} {(1 + ξ_{1} \frac{x - μ_{1}}{σ_{1}})}^{- 1 / ξ_{1} - 1}], & i f μ_{2} - \frac{σ_{2}}{ξ_{2}} \leq x \leq μ_{1} - \frac{σ_{1}}{ξ_{1}}, \\ 0, & i f x > μ_{1} - \frac{σ_{1}}{ξ_{1}} . \end{cases} \end{aligned}$
Case 4.	$ξ_{1} = 0$ , $ξ_{2} > 0$ . $\begin{aligned} f (x) = \{\begin{cases} \exp \{- e^{- \frac{x - μ_{1}}{σ_{1}}} - {[1 + ξ_{2} \frac{x - μ_{2}}{σ_{2}}]}^{- 1 / ξ_{2}}\}, \\ \times [\frac{1}{σ_{1}} e^{- \frac{x - μ_{1}}{σ_{1}}} + \frac{1}{σ_{2}} (1 + ξ_{2} \frac{x - μ_{2}}{σ_{2}})^{- 1 / ξ_{2} - 1}], & i f x > μ_{2} - \frac{σ_{2}}{ξ_{2}}, \\ 0, & i f x \leq μ_{2} - \frac{σ_{2}}{ξ_{2}} . \end{cases} \end{aligned}$
Case 5.	$ξ_{1} = 0$ , $ξ_{2} < 0$ . $\begin{aligned} f (x) = \{\begin{cases} \exp \{- e^{- \frac{x - μ_{1}}{σ_{1}}} - {[1 + ξ_{2} \frac{x - μ_{2}}{σ_{2}}]}^{- 1 / ξ_{2}}\} \\ \times [\frac{1}{σ_{1}} e^{- \frac{x - μ_{1}}{σ_{1}}} + \frac{1}{σ_{2}} (1 + ξ_{2} \frac{x - μ_{2}}{σ_{2}})^{- 1 / ξ_{2} - 1}], & i f x < μ_{2} - \frac{σ_{2}}{ξ_{2}}, \\ \exp {- e^{- \frac{x - μ_{1}}{σ_{1}}}} \times \frac{1}{σ_{1}} e^{- \frac{x - μ_{1}}{σ_{1}}}, & i f x \geq μ_{2} - \frac{σ_{2}}{ξ_{2}} . \end{cases} \end{aligned}$
Case 6.	$ξ_{1} > 0$ , $ξ_{2} < 0$ . If $μ_{1} - \frac{σ_{1}}{ξ_{1}} \geq μ_{2} - \frac{σ_{2}}{ξ_{2}}$ , $\begin{aligned} f (x) = \{\begin{cases} \exp \{- {[1 + ξ_{1} \frac{x - μ_{1}}{σ_{1}}]}^{- 1 / ξ_{1}}\} \\ \times [\frac{1}{σ_{1}} {[1 + ξ_{1} \frac{x - μ_{1}}{σ_{1}}]}^{- 1 / ξ_{1} - 1}], & i f x > μ_{1} - \frac{σ_{1}}{ξ_{1}}, \\ 0, & i f x \leq μ_{1} - \frac{σ_{1}}{ξ_{1}} . \end{cases} \end{aligned}$ If $μ_{1} - \frac{σ_{1}}{ξ_{1}} < μ_{2} - \frac{σ_{2}}{ξ_{2}}$ , $\begin{aligned} f (x) = \{\begin{cases} \exp \{- {[1 + ξ_{1} \frac{x - μ_{1}}{σ_{1}}]}^{- 1 / ξ_{1}}\} \\ \times [\frac{1}{σ_{1}} {[1 + ξ_{1} \frac{x - μ_{1}}{σ_{1}}]}^{- 1 / ξ_{1} - 1}], & i f x > μ_{2} - \frac{σ_{2}}{ξ_{2}}, \\ h (x), & i f μ_{1} - \frac{σ_{1}}{ξ_{1}} \leq x \leq μ_{2} - \frac{σ_{2}}{ξ_{2}}, \\ 0, & i f x < μ_{1} - \frac{σ_{1}}{ξ_{1}} . \end{cases} \end{aligned}$

In Figures and , four density plots of Weibull-Gumbel type are shown. In Figure , panel (a) is the density plot of Fréchet-Fréchet type; and panel (b) is the density plot of Fréchet-Gumbel type. We can observe that they capture the shapes of the histograms shown in Figures and .

Figure 6. Density plots of the accelerated max-stable distributions with Weibull-Gumbel combinations. (a) $ξ_{1} = 0$ , $μ_{1} = 0.5$ , $σ_{1} = 1$ , $ξ_{2} = - 1$ , $μ_{2} = - 1$ , $σ_{2} = 1$ . (b) $ξ_{1} = 0$ , $μ_{1} = 0.5$ , $σ_{1} = 1$ , $ξ_{2} = - 1$ , $μ_{2} = 0.5$ , $σ_{2} = 1$ .

Figure 7. Density plots of the accelerated max-stable distributions with Weibull-Gumbel combinations. (a) $ξ_{1} = - 1$ , $μ_{1} = - 1$ , $σ_{1} = 1$ , $ξ_{2} = 0$ , $μ_{2} = 0.5$ , $σ_{2} = 0.7$ . (b) $ξ_{1} = - 0.5$ , $μ_{1} = - 2$ , $σ_{1} = 1$ , $ξ_{2} = 0$ , $μ_{2} = - 1$ , $σ_{2} = 0.7$ .

Figure 8. (a) Density plot of the accelerated max-stable distribution with Fréchet-Fréchet combinition. $ξ_{1} = 0.5$ , $μ_{1} = 0$ , $σ_{1} = 1$ , $ξ_{2} = 0.9$ , $μ_{2} = 0$ , $σ_{2} = 1$ . (b) Density plot of the accelerated max-stable distributions with Fréchet-Gumbel combinition. $ξ_{1} = 0$ , $μ_{1} = 0$ , $σ_{1} = 3$ , $ξ_{2} = 1$ , $μ_{2} = - 3$ , $σ_{2} = 0.2$ .

In Figure (b), it is for $ξ_{1} = ξ_{2} = 0$ , i.e., the combination of two Gumbel distributions. In this case, the density plot is bimodal, which is different from that of a Gumbel distribution. Suppose that $X_{1, i} \sim N (μ_{1}, σ_{1})$ and $X_{2, j} \sim N (μ_{2}, σ_{2})$ , $1 \leq i \leq n_{1}$ and $1 \leq j \leq n_{2}$ , then we have some norming constants $a_{1, n_{1}} > 0, b_{1, n_{1}}$ and $a_{2, n_{2}} > 0, b_{2, n_{2}}$ such that (23) $\begin{aligned} P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \leq x, a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \\ \to Λ (\frac{x - μ_{1}}{σ_{1}}) Λ (\frac{x - μ_{2}}{σ_{2}}) \\ = \exp {- e^{- \frac{x - μ_{1}}{σ_{1}}} - e^{- \frac{x - μ_{2}}{σ_{2}}}} . \end{aligned}$ (23) Here the limit product form requires that the two scale parameters $σ_{1} \neq σ_{2}$ . Otherwise, the product $\exp {- e^{- \frac{x - μ_{1}}{σ}} - e^{- \frac{x - μ_{2}}{σ}}}$ reduces to the Gumbel type.

Figure 9. (a) Density plot of the accelerated max-stable distribution with Weibull-Fréchet combination. $ξ_{1} = - 0.5$ , $μ_{1} = 0$ , $σ_{1} = 1$ , $ξ_{2} = 0.3$ , $μ_{2} = - 1$ , $σ_{2} = 0.1$ . (b) Density plot of the accelerated max-stable distribution with Gumbel-Gumbel combinition. $ξ_{1} = 0$ , $μ_{1} = 0$ , $σ_{1} = 3$ , $ξ_{2} = 0$ , $μ_{2} = - 3$ , $σ_{2} = 0.3$ .

2.5. Tail equivalence and the existence of moments

In this section, we discuss some results of tail-equivalence, and which moments are finite for certain AMSDs/AEVDs.

Definition 2.3

Two cdf's F and H are called tail-equivalent if they have the same right endpoint, i.e., if $x_{F} = x_{H}$ , and (24) $lim_{x \to x_{F}} \bar{F} (x) / \bar{H} (x) = c$ (24) for some constant $0 < c < \infty$ .

We have the following facts.

Fact 2.1

It is clear that the product distribution of a Weibull distribution and another type of extreme value distribution $H (x)$ is tail equivalent to $H (x)$ .

Fact 2.2

Suppose $X \sim Φ_{α_{1}} Φ_{α_{2}}$ , let $μ_{k} = E (X^{k})$ be the kth moment of X, then $μ_{k}$ is finite only if $k < min (α_{1}, α_{2})$ .

Suppose $α_{1} < α_{2}$ , then $Φ_{α_{1}}$ has a heavier tail than $Φ_{α_{2}}$ . Let $μ_{k}^{(1)}$ be the kth moment of $X \sim Φ_{α_{1}}$ . We know that $μ_{k}^{(1)} < \infty$ only if $k < α_{1}$ . This implies that $Φ_{α_{1}} Φ_{α_{2}}$ has the same right-tail heaviness as $Φ_{α_{1}}$ .

Fact 2.3

If $0 < α_{1} < α_{2}$ , then $Φ_{α_{1}} Φ_{α_{2}}$ and $Φ_{α_{1}}$ are tail-equivalent.

Fact 2.4

Suppose $X \sim Λ (x) Φ_{α} (x)$ . Let $μ_{k} = E (X^{k})$ be the kth moment of X. Then $μ_{k}$ is finite only if $k < α$ .

Fact 2.5

$Λ (x) Φ_{α} (x)$ and $Φ_{α} (x)$ are tail-equivalent.

Fact 2.6

If $H_{1} (x)$ has a heavier tail than $H_{2} (x)$ , then the accelerated max-stable distribution $H_{1} (x) H_{2} (x)$ is tail-equivalent to $H_{1} (x)$ .

3. Joint convergence and approximation errors

3.1. Convergence of joint probability for general thresholds

It may also be interesting to consider the limits of $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}})$ for some sequences $u_{1, n_{1}}$ and $u_{2, n_{2}}$ not necessarily of the form $x / a_{i, n_{i}} + b_{i, n_{i}}$ or even not dependent on x. Here $n_{1}$ and $n_{2}$ are the lengths of the two subsequences, we may write them specifically as $n_{1} (n)$ and $n_{2} (n)$ since they vary with the total length n. When choosing $u_{j, n_{j}} = x / a_{j, n_{j}} + b_{j, n_{j}}$ for j = 1, 2, it becomes the problem we discussed before. The question is:

Which conditions on $F_{1}$ and $F_{2}$ ensure that the limit of $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}})$ for $n \to \infty$ exists for appropriate constants $u_{1, n_{1}}$ and $u_{2, n_{2}}$ ?

Some conditions on tails ${\bar{F}}_{1}$ and ${\bar{F}}_{2}$ are required to ensure that $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}})$ converges to a non-trivial limit, i.e., a number in $(0, 1)$ .

Theorem 3.1

Suppose ${X_{i}}_{i = 1}^{n}$ is an independent sequence of random variables which is mixed with two subsequences ${X_{1, i}}_{i = 1}^{n_{1}}$ and ${X_{2, i}}_{i = 1}^{n_{2}}$ with underlying distributions $F_{1} (x)$ and $F_{2} (x)$ , $n_{1} \to \infty$ and $n_{2} \to \infty$ as $n \to \infty$ . Let $0 \leq τ < \infty$ and ${u_{1, i}}_{i = 1}^{n_{1}}$ and ${u_{2, i}}_{i = 1}^{n_{2}}$ are two sequences of real numbers such that (25) $\begin{aligned} n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \\ \to τ a s n \to \infty . \end{aligned}$ (25) Then (26) $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \to e^{- τ} a s n \to \infty .$ (26) Conversely, if (Equation26(26) $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \to e^{- τ} a s n \to \infty .$ (26) ) holds for some $0 \leq τ < \infty$ , then so does (Equation25(25) $\begin{aligned} n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \\ \to τ a s n \to \infty . \end{aligned}$ (25) ).

Remark

Since $1 - F (u_{j, n_{j}})$ is the probability that $X_{j, i}$ exceeds level $u_{j, n_{j}}$ , Equation (Equation25(25) $\begin{aligned} n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \\ \to τ a s n \to \infty . \end{aligned}$ (25) ) means that the expected number of exceedences of $u_{1, n_{1}}$ by ${X_{1, i}}_{i = 1}^{n_{1}}$ and $u_{2, n_{2}}$ by ${X_{2, i}}_{i = 1}^{n_{2}}$ in total converges to τ. When the sequence is generated from one distribution $F (x)$ , Theorem 3.1 can be reduced to the classical result by choosing $u_{1, n_{1}} = u_{2, n_{2}} = u_{n}$ . That is (27) $n (1 - F (u_{n})) \to τ,$ (27) if and only if (28) $P (M_{n} \leq u_{n}) \to e^{- τ}$ (28) as $n \to \infty$ .

The following corollary gives the conditions such that we can choose one of ${u_{1, i}}_{i = 1}^{n_{1}}$ and ${u_{2, i}}_{i = 1}^{n_{2}}$ to be applied to $M_{n}$ , and derive a similar limit of $P (M_{n} \leq u_{n})$ . The condition involves both the ratio of two tail probabilities $\frac{1 - F_{1} (u_{1, n_{1}})}{1 - F_{2} (u_{2, n_{2}})}$ and $\frac{n_{1}}{n_{2}}$ .

Corollary 3.1

Let $0 \leq τ_{1} < \infty$ , $0 \leq τ_{2} < \infty$ . Suppose that there exist two sequences $u_{1, n_{1}}$ and $u_{2, n_{2}}$ such that (29) $\begin{aligned} n_{1} (1 - F_{1} (u_{1, n_{1}})) & \to τ_{1}, \\ n_{2} (1 - F_{2} (u_{2, n_{2}})) & \to τ_{2} . \end{aligned}$ (29) Then (30) $P (M_{1, n_{1}} \leq u_{1, n}, M_{2, n_{2}} \leq u_{2, n}) \to e^{- τ_{1} - τ_{2}} .$ (30) Moreover, if $\frac{n_{2} (1 - F_{2} (u_{1, n_{1}}))}{n_{1} (1 - F_{1} (u_{1, n_{1}}))} \to t$ , where $0 \leq t < \infty$ , then (31) $P (M_{n} \leq u_{1, n_{1}}) \to e^{- τ_{1} (1 + t)} .$ (31)

Specifically, if we choose $u_{1, n_{1}} = \frac{x}{a_{1, n_{1}}} + b_{1, n_{1}}$ , $u_{2, n_{2}} = \frac{x}{a_{2, n_{2}}} + b_{2, n_{2}}$ , and suppose that (32) $\begin{aligned} P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \leq x) & \to G_{1} (x), \end{aligned}$ (32) (33) $\begin{aligned} P (a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) & \to G_{2} (x), \end{aligned}$ (33) then $G_{1}$ and $G_{2}$ belong to the GEV distribution family and the limit in (Equation31(31) $P (M_{n} \leq u_{1, n_{1}}) \to e^{- τ_{1} (1 + t)} .$ (31) ) becomes $G_{1} (x) G_{2} (x)$ .

The following is an example of mixed sequence and the limit properties of the maxima of subsequences and the global maxima.

Example 3.1

Suppose ${X_{i}}_{i = 1}^{n}$ is a sequence of random variables combining two subsequences ${X_{1, i}}_{i = 1}^{n_{1}}$ and ${X_{2, i}}_{i = 1}^{n_{2}}$ . Suppose $\frac{n_{1}}{n} \to p$ , where $0 \leq p \leq 1$ , ${X_{1, i}}_{i = 1}^{n_{1}}$ and ${X_{2, i}}_{i = 1}^{n_{2}}$ are i.i.d. from a Pareto distribution with $F_{1} (x) = 1 - K x^{- α_{1}}, α_{1} > 0, K > 0, x > 0$ and a Fréchet distribution with $F_{2} (x) = \exp (- x^{- α_{2}}), α_{2} > 0, x > 0,$ respectively.

Since $(1 - F_{1} (t x)) / (1 - F_{1} (t)) = x^{- α_{1}}$ for each $x > 0,$ so that Type II (Fréchet) limit applies. For $u_{1, n_{1}} = (K n_{1, n_{1}} / τ)^{1 / α_{1}}$ we have $1 - F_{1} (u_{1, n_{1}}) = τ / n_{1}$ , so that (34) $P (M_{1, n_{1}} \leq {(\frac{K n_{1}}{τ})}^{1 / α_{1}}) \to e^{- τ} .$ (34) Putting $τ = x^{- α_{1}}$ for $x \geq 0$ , (35) $P ((K n_{1})^{- 1 / α_{1}} M_{1, n_{1}} \leq x) \to \exp (- x^{- α_{1}}) .$ (35) On the other hand, $F_{2}^{n_{2}} (n_{2}^{1 / α_{2}} x) = F_{2} (x)$ , i.e., $P (n_{2}^{- 1 / α_{2}} M_{2, n_{2}} \leq x) = F_{2} (x)$ .

Then we have for $x \geq 0$ , (36) $\begin{aligned} P ((K n_{1})^{- 1 / α_{1}} M_{1, n_{1}} \\ \leq x, n_{2}^{- 1 / α_{2}} M_{2, n_{2}} \leq x) \to \exp (- x^{- α_{1}} - x^{- α_{2}}) . \end{aligned}$ (36) Since $\begin{aligned} lim_{x \to \infty} \frac{{\bar{F}}_{1} (x)}{{\bar{F}}_{2} (x)} & = lim_{x \to \infty} \frac{K x^{- α_{1}}}{1 - \exp (- x^{- α_{2}})} \\ = lim_{x \to \infty} \frac{K x^{- α_{1}}}{x^{- α_{2}} + O (x^{- 2 α_{2}})} \\ \to \{\begin{cases} 0, & α_{1} > α_{2}, \\ K, & α_{1} = α_{2}, \\ \infty, & α_{1} < α_{2} . \end{cases} \end{aligned}$ When $α_{1} = α_{2}$ , the condition $\frac{n_{2} (1 - F_{2} (u_{1, n_{1}}))}{n_{1} (1 - F_{1} (u_{1, n_{1}}))} \to \frac{1 - p}{p K}$ in Corollary 3.1 is satisfied, hence (37) $P (M_{n} \leq n_{1}^{- 1 / α_{1}} x) \to \exp (- x^{- α_{1}} (1 + \frac{1 - p}{p k})) .$ (37) Since $n_{1} \sim n p,$ we also have (38) $P (M_{n} \leq (n p)^{- 1 / α_{1}} x) \to \exp (- x^{- α_{1}} (1 + \frac{1 - p}{p k})) .$ (38)

3.2. Approximation error

The convergence results are usually accompanied by the question of the approximation error. Suppose $n_{1} (1 - F_{1} (u_{1, n_{1}})) \to τ_{1}$ and $n_{2} (1 - F_{2} (u_{2, n_{2}})) \to τ_{2}$ , writing $τ_{1, n_{1}} = n_{1} (1 - F_{1} (u_{1, n_{1}}))$ and $τ_{2, n_{2}} = n_{2} (1 - F_{2} (u_{2, n_{2}}))$ , then by Theorem 3.1 we have (39) $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \to e^{- τ_{1} - τ_{2}} .$ (39) The approximation can be decomposed into several parts. We have ${(1 - \frac{τ_{1, n_{1}}}{n_{1}})}^{n_{1}} \approx e^{- τ_{1, n_{1}}}, {(1 - \frac{τ_{2, n_{2}}}{n_{2}})}^{n_{2}} \approx e^{- τ_{2, n_{2}}},$ and $e^{- τ_{1, n_{1}}} \approx e^{- τ_{1}}, e^{- τ_{2, n_{2}}} \approx e^{- τ_{2}} .$ We denote $\begin{aligned} Δ_{1, n_{1}} & = {(1 - \frac{τ_{1, n_{1}}}{n_{1}})}^{n_{1}} - e^{- τ_{1, n_{1}}}, \\ Δ_{1, n_{1}}^{'} & = e^{- τ_{1, n_{1}}} - e^{- τ_{1}}, \\ Δ_{2, n_{2}} & = {(1 - \frac{τ_{2, n_{2}}}{n_{2}})}^{n_{2}} - e^{- τ_{2, n_{2}}}, \\ Δ_{2, n_{2}}^{'} & = e^{- τ_{2, n_{2}}} - e^{- τ_{2}} . \end{aligned}$ Then $\begin{aligned} P (M_{1, n_{1}} \leq u_{1, n_{1}}) - e^{- τ_{1}} & = Δ_{1, n_{1}} + Δ_{1, n_{1}}^{'}, \\ P (M_{2, n_{2}} \leq u_{2, n_{2}}) - e^{- τ_{2}} & = Δ_{2, n_{2}} + Δ_{2, n_{2}}^{'} . \end{aligned}$ The following result gives the bound for the approximation error.

Theorem 3.2

Let ${X_{i}}_{i = 1}^{n}$ be an independent sequence of random variables mixed with two subsequences ${X_{1, i}}_{i = 1}^{n_{1}}$ and ${X_{2, i}}_{i = 1}^{n_{2}}$ , which satisfies $n_{1} (1 - F_{1} (u_{1, n_{1}})) \to τ_{1}$ and $n_{2} (1 - F_{2} (u_{2, n_{2}})) \to τ_{2}$ , $Δ_{1, n_{1}}$ , $Δ_{1, n_{1}}^{'},$ $Δ_{2, n_{2}},$ $Δ_{2, n_{2}}^{'}$ are defined as above, then $\begin{aligned} P (M_{1, n_{1}} & \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) - e^{- τ_{1} - τ_{2}} \\ \leq Δ_{1, n_{1}} + Δ_{1, n_{1}}^{'} + Δ_{2, n_{2}} + Δ_{2, n_{2}}^{'} \end{aligned}$ with $\begin{aligned} 0 \leq - Δ_{j, n_{j}} & \leq \frac{τ_{j, n_{j}} e^{- τ_{j, n_{j}}}}{2} \cdot \frac{1}{n_{j} - 1} \\ \leq 0.3 \cdot \frac{1}{n_{j} - 1}, f o r j = 1, 2, \end{aligned}$ where the first bound is asymptotically sharp, in the sense that if $τ_{j, n_{j}} \to τ_{j}$ then $Δ_{j, n_{j}} \sim - (\frac{τ_{j} e^{- τ_{j}}}{2}) / n_{j}$ . Furthermore, for $τ_{j} - τ_{j, n_{j}} \leq \log 2$ , $Δ_{j, n_{j}}^{'} = e^{τ_{j}} {(τ_{j} - τ_{j, n_{j}}) + θ_{j} (τ_{j} - τ_{j, n_{j}})^{2}},$ with $0 < θ_{j} < 1$ .

If $τ_{j, n_{j}} \to τ_{j}$ for $u_{j, n_{j}} = x / a_{j, n_{j}} + b_{j, n_{j}}$ , then (Equation39(39) $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \to e^{- τ_{1} - τ_{2}} .$ (39) ) holds. By Lemma A.1, (Equation39(39) $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \to e^{- τ_{1} - τ_{2}} .$ (39) ) holds also if $a_{j, n_{j}}$ and $b_{j, n_{j}}$ are replaced by different constants $α_{j, n_{j}}$ and $β_{j, n_{j}}$ , satisfying $α_{j, n_{j}} / a_{j, n_{j}} \to 1$ and $(β_{j, n_{j}} - b_{j, n_{j}}) / a_{j, n_{j}} \to 0$ . However, the speed of convergence to zero of $Δ_{j, n_{j}}^{'}$ (thus the speed of $P (M_{j, n_{j}} \leq u_{j, n_{j}})$ to $e^{- τ_{j}}$ ) can be very different for different choices of norming constants.

4. Weakly dependent sequences

In this section, we extend the independent sequences to weakly dependent sequences. For a sequence of random variables ${X_{i}}_{i = 1}^{n}$ with identical distribution, it is stationary if ${X_{j_{1}}, \dots, X_{j_{n}}}$ and ${X_{j_{1} + m}, \dots, X_{j_{n} + m}}$ have the same joint distribution for any choice of n, $j_{1}, \dots, j_{n}$ , and m. For the mixed sequence, we will provide some alternatives so that the desired results still hold. We assume that the dependence between $X_{i, k}$ and $X_{i, j}$ falls off in some specific way as $| k - j |$ increases.

4.1. Review of some weakly dependent conditions

Some weakly dependent conditions in the literature can be generalised to the scenarios of mixed sequences. For m-dependent sequence ${X_{i}}_{i = 1}^{n}$ , $X_{i}$ and $X_{j}$ are independent if $| i - j | > m$ . Another commonly used condition is the strong mixing condition first introduced by Rosenblatt (Citation1956). A sequence of random variables ${X_{i}}_{i = 1}^{n}$ is said to satisfy the strong mixing condition if for some $A \in F (X_{1}, \dots, X_{p})$ and $B \in F (X_{p + k + 1}, X_{p + k + 2}, \dots)$ $| P (A \cap B) - P (A) P (B) | < g (k)$ for any p and k, where $g (k) \to 0$ as $k \to \infty$ ; $F (\cdot)$ is the σ-field generated by the indicated random variables. The function $g (k)$ does not depend on the sets A and B, so the strong mixing condition is uniform.

For normal sequences, the correlation between $X_{k}$ and $X_{j}$ may be a better measure of dependence. We can also use the dependence restriction $| C o r r (X_{k}, X_{j}) | \leq g (| k - j |),$ where $g (k) \to 0$ as $k \to \infty$ .

Since the event ${M_{n} \leq u}$ is the same as ${X_{1} \leq u, X_{2} \leq u, \dots, X_{n} \leq u}$ . We may restric the events on this type of event. Following Leadbetter et al. (Citation2012), we use $F_{i_{1}, \dots, i_{n}} (u)$ to denote $P (X_{i_{1}} \leq u, X_{i_{2}} \leq u, \dots, X_{i_{n}} \leq u)$ . The following condition D is a weakened condition of strong mixing.

The condition $D$ will be said to hold if for any integers $i_{1} < \dots < i_{p}$ and $j_{1} < \dots < j_{p^{'}}$ for which $j_{1} - i_{p} \geq l$ , and any real u, (40) $| F_{i_{1}, \dots, i_{p}, j_{1}, \dots, j_{p^{'}}} (u) - F_{i_{1}, \dots, i_{p}} (u) F_{j_{1}, \dots, j_{p^{'}}} (u) | \leq g (l)$ (40) where $g (l) \to 0$ as $l \to \infty$ .

Under the condition D, the Extremal Types Theorem also holds. Since we usually deal with the event ${M_{n} \leq u_{n}}$ for some levels ${u_{n}}$ , the condition can still be weakened. The condition $D (u_{n})$ is defined as follows.

The condition $D (u_{n})$ will be said to hold if for any integers (41) $1 \leq i_{1} < \dots < i_{p} < j_{1} < \dots < j_{p^{'}} \leq n$ (41) for which $j_{1} - i_{p} \geq l$ , we have (42) $| F_{i_{1}, \dots, i_{p}, j_{1}, \dots, j_{p^{'}}} (u_{n}) - F_{i_{1}, \dots, i_{p}} (u_{n}) F_{j_{1}, \dots, j_{p^{'}}} (u_{n}) | \leq α_{n, l}$ (42) where $α_{n, l_{n}} \to 0$ as $n \to \infty$ for some sequence $l_{n} = o (n)$ .

The condition $D (u_{n})$ guarantees that $lim inf P (M_{n} \leq u_{n}) \geq e^{- τ}$ . We still need a further assumption to have the opposite inequality for the upper limit. Here we present the $D^{'} (u_{n})$ condition used in Watson (Citation1954) and Loynes (Citation1965). This condition bounds the probability of more than one exceedance among $X_{1}, \dots, X_{[n / k]}$ , therefore no multiple points in the point process of exceedances.

The condition $D^{'} (u_{n})$ will be said to hold for the sequence of random variables ${X_{i}}_{i = 1}^{n}$ , if (43) $\underset{n \to \infty}{lim sup} n \sum_{j = 2}^{[n / k]} P {X_{1} > u_{n}, X_{j} > u_{n}} \to 0$ (43) as $k \to \infty$ , (where [ ] denotes the interger part).

If both conditions $D (u_{n})$ and $D^{'} (u_{n})$ are satisfied, we have $P (M_{n} \leq u_{n}) \to e^{- τ}$ is equivalent to $n (1 - F (u_{n})) \to τ a s n \to \infty$ for $0 \leq τ < \infty$ .

4.2. Weakly dependent mixed sequences

To generalise the results from non-mixed sequences to mixed sequences, we need to modify the conditions of $D (u_{n})$ and $D^{'} (u_{n})$ . We use $u_{n}$ to denote the vector of levels $(u_{1, n_{1}}, u_{2, n_{2}})$ when the sequence ${X_{i}}_{i = 1}^{n}$ is composed of two subsequences ${X_{1, i}}_{i = 1}^{n_{1}}$ and ${X_{2, i}}_{i = 2}^{n_{2}}$ , $n_{1} + n_{2} = n$ . We further assume that $\frac{n_{1}}{n} \to p$ as $n \to \infty$ , $0 \leq p \leq 1$ , so that $\frac{n_{2}}{n} \to 1 - p$ .

Before introducing the more general $D (u_{n})$ condition, we introduce some new notations. Let $u_{n}^{(i)} = u_{1, n_{1}} I (X_{i} \in {X_{1, i}}_{i = 1}^{n_{1}}) + u_{2, n_{2}} I (X_{i} \in {X_{2, i}}_{i = 1}^{n_{2}})$ . Here $I (A) = 1$ indicates that the event A is true, otherwise $I (A) = 0$ . The notation $u_{n}^{(i)}$ represents the threshold for $X_{i}$ , which depends on the subsequence that $X_{i}$ belongs to. For example, if $X_{1} = X_{1, 1}$ and $X_{2} = X_{2, 1}$ , then $P (X_{1} \leq u_{n}^{(1)}, X_{2} \leq u_{n}^{(2)})$ represents $P (X_{1, 1} \leq u_{1, n_{1}}, X_{2, 1} \leq u_{2, n_{2}})$ . After introducing this notation, we can state the condition $D (u_{n})$ as follows.

The condition $D (u_{n})$ will be said to hold for the mixed sequence of random variables ${X_{i}}_{i = 1}^{n}$ with two subsequences ${X_{1, i}}_{i = 1}^{n_{1}}$ and ${X_{2, i}}_{i = 1}^{n_{2}}$ if for any integers (44) $1 \leq i_{1} < \dots < i_{p} < j_{1} < \dots < j_{p^{'}} \leq n$ (44) for which $j_{1} - i_{p} \geq l$ , we have $\begin{aligned} | P (X_{i_{1}} \leq u_{n}^{(i_{1})}, \dots, X_{i_{p}} \\ \leq u_{n}^{(i_{p})}, X_{j_{1}} \leq u_{n}^{(j_{1})}, \dots, X_{j_{p^{'}}} \leq u_{n}^{(j_{p^{'}})}) \\ - P (X_{i_{1}} \leq u_{n}^{(i_{1})}, \dots, X_{i_{p}} \leq u_{n}^{(i_{p})}) \\ P (X_{j_{1}} \leq u_{n}^{(j_{1})}, \dots, X_{j_{p^{'}}} \leq u_{n}^{(j_{p^{'}})}) | < α_{n, l}, \end{aligned}$ where $α_{n, l_{n}} \to 0$ as $n \to \infty$ for some sequence $l_{n} = o (n)$ .

Similarly, we can also extend the condition $D^{'} (u_{n})$ for mixed sequences, which is denoted as $D^{'} (u_{n})$ .

The condition $D^{'} (u_{n})$ will be said to hold for the mixed sequence of random variables ${X_{i}}_{i = 1}^{n}$ and levels $u_{n} = (u_{1, n_{1}}, u_{2, n_{2}})$ if (45) $\begin{aligned} \underset{n \to \infty}{lim sup} k \sum_{1 \leq i < j \leq [n / k]} P (X_{i} > u_{n}^{(i)}, X_{j} > u_{n}^{(j)}) \to 0 \\ a s k \to \infty, \end{aligned}$ (45) where $u_{n}^{(i)} = u_{1, n_{1}} I (X_{i} \in {X_{1, i}}_{i = 1}^{n_{1}}) + u_{2, n_{2}} I (X_{i} \in {X_{2, i}}_{i = 2}^{n_{2}}),$ and $[]$ denotes the integer part.

Equation (Equation45(45) $\begin{aligned} \underset{n \to \infty}{lim sup} k \sum_{1 \leq i < j \leq [n / k]} P (X_{i} > u_{n}^{(i)}, X_{j} > u_{n}^{(j)}) \to 0 \\ a s k \to \infty, \end{aligned}$ (45) ) means that $\underset{n \to \infty}{lim sup} \sum_{1 \leq i < j \leq [n / k]} P (X_{i} > u_{n}^{(i)}, X_{j} > u_{n}^{(j)}) = o (1 / k)$ . It can be observed that if $D (u_{n})$ holds for the mixed sequence ${X_{i}}_{i = 1}^{n}$ , then $D (u_{j, n_{j}})$ also holds for the subsequence ${X_{j, i}}_{i = 1}^{n_{j}}$ , for j = 1, 2. The same conclusion is also true for the condition $D^{'} (u_{n})$ .

After introducing the conditions $D (u_{n})$ and $D^{'} (u_{n})$ , we have the extended results for mixed sequences. We assume that the two subsequences ${X_{1, i}}_{i = 1}^{n_{1}}$ and ${X_{2, i}}_{i = 2}^{n_{2}}$ are independent with each other. Also, for any interval $I_{n}$ with $l_{n}$ members, there are $a_{n}$ members from ${X_{1, i}}_{i = 1}^{n_{1}}$ and $b_{n}$ members from ${X_{2, i}}_{i = 2}^{n_{2}}$ . We assume that the proportion of each subsequence $\frac{a_{n}}{l_{n}} \to p$ and $\frac{b_{n}}{l_{n}} \to 1 - p$ , where $0 \leq p \leq 1$ .

Theorem 4.1

Let ${X_{i}}_{i = 1}^{n}$ be a weakly dependent mixed sequence of random variables with two subsequences ${X_{1, i}}_{i = 1}^{n_{1}}$ and ${X_{2, i}}_{i = 1}^{n_{2}}$ , with sample size proportions $\frac{n_{1}}{n} \to p$ and $\frac{n_{2}}{n} \to 1 - p$ as $n \to \infty$ , $0 \leq p \leq 1$ . Suppose that $D (u_{n})$ and $D^{'} (u_{n})$ hold for ${X_{i}}_{i = 1}^{n}$ , then for $0 \leq τ < \infty,$ (46) $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \to e^{- τ}$ (46) if and only if (47) $n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \to τ .$ (47)

Based on Theorem 4.1, we have the following corollary.

Corollary 4.1

The same conclusions hold with $τ = \infty$ (i.e., $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \to 0$ if and only if $n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \to \infty)$ if the requirements that $D (u_{n})$ , $D^{'} (u_{n})$ hold are replaced by the condition that, for arbitrarily large $τ (< \infty)$ , there exists a vector of levels $v_{n} = (v_{1, n_{1}}, v_{2, n_{2}})$ such that $v_{1, n_{1}} > u_{1, n_{1}}, v_{2, n_{2}} > u_{2, n_{2}}$ , which satisfy $n_{1} (1 - F_{1} (v_{1, n_{1}})) + n_{2} (1 - F_{2} (v_{2, n_{2}})) \to τ$ with $D (v_{n})$ and $D^{'} (v_{n})$ hold.

Theorem 4.1 tells us the property of the joint probability $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}})$ given the tail properties of $F_{1}$ and $F_{2}$ . $n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}}))$ is the mean exceedances of the two thresholds by the corresponding subsequences in total. Theorem 4.1 is the generalisation of Theorem 3.1 under the condition that the mixed sequence is weakly dependent within each subsequence.

4.3. Associated independent sequences

The ‘independent sequence associated with ${X_{i}}_{i = 1}^{n}$ ’ can be used to study the maxima of dependent sequence. It was first introduced by Loynes (Citation1965). For a weakly dependent sequence of random variables ${X_{i}}_{i = 1}^{n}$ , the notation ${{\hat{X}}_{i}}_{i = 1}^{n}$ is used to be the independent sequence with the same marginal distribution as ${X_{i}}_{i = 1}^{n}$ , and write ${\hat{M}}_{n} = max ({\hat{X}}_{1}, \dots, {\hat{X}}_{n})$ . When ${X_{i}}_{i = 1}^{n}$ is mixed with two subsequences ${X_{1, i}}_{i = 1}^{n_{1}}$ and ${X_{2, i}}_{i = 1}^{n_{2}}$ with different marginal distributions, we still have the associated independent subsequences ${{\hat{X}}_{1, i}}_{i = 1}^{n_{1}}$ and ${{\hat{X}}_{2, i}}_{i = 1}^{n_{1}}$ , and we write ${\hat{M}}_{i, n_{i}} = max ({\hat{X}}_{i, 1}, \dots, {\hat{X}}_{i, n_{i}})$ , for i = 1, 2.

The following Theorem 4.2 tells us that, under the weakly dependent conditions, $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}})$ and $P ({\hat{M}}_{1, n_{1}} \leq u_{1, n_{1}}, {\hat{M}}_{2, n_{2}} \leq u_{2, n_{2}})$ have the same limit if it exists. By Theorem 4.3, we can choose the same norming constant as the independent sequence to derive the same limit of $P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \leq x, a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x)$ and $P (a_{1, n_{1}} ({\hat{M}}_{1, n_{1}} - b_{1, n_{1}}) \leq x, a_{2, n_{2}} ({\hat{M}}_{2, n_{2}} - b_{2, n_{2}}) \leq x)$ .

Theorem 4.2

Let ${X_{i}}_{i = 1}^{n}$ be a mixed sequence of random variables with two subsequences ${X_{1, i}}_{i = 1}^{n_{1}}$ and ${X_{2, i}}_{i = 1}^{n_{2}}$ , independent with each other. Suppose $D (u_{n})$ and $D^{'} (u_{n})$ hold for a vector of levels $u_{n} = (u_{1, n_{1}}, u_{2, n_{2}})$ . Then $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \to θ > 0$ if and only if $P ({\hat{M}}_{1, n_{1}} \leq u_{1, n_{1}}, {\hat{M}}_{2, n_{2}} \leq u_{2, n_{2}}) \to θ$ . The same holds with $θ = 0$ if the condition $D (u_{n})$ and $D^{'} (u_{n})$ are replaced by the requirement that for arbitrarily large $τ < \infty$ there exists $v_{n} = (v_{1, n_{1}}, v_{2, n_{2}})$ such that $v_{1, n_{1}} > u_{1, n_{1}}, v_{2, n_{2}} > u_{2, n_{2}}$ , which satisfy $n_{1} (1 - F_{1} (v_{1, n_{1}})) + n_{2} (1 - F_{2} (v_{2, n_{2}})) \to τ$ with $D (v_{n})$ and $D^{'} (v_{n})$ hold.

Theorem 4.3

Suppose that $D (u_{n})$ and $D^{'} (u_{n})$ hold for the mixed sequence of random variables ${X_{i}}_{i = 1}^{n}$ , with $u_{1, n_{1}} = x / a_{1, n_{1}} + b_{1, n_{1}}, u_{2, n_{2}} = x / a_{2, n_{2}} + b_{2, n_{2}}$ for each real x. Then (48) $\begin{aligned} P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \\ \leq x, a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \to G (x) \end{aligned}$ (48) if and only if (49) $\begin{aligned} P (a_{1, n_{1}} ({\hat{M}}_{1, n_{1}} - b_{1, n_{1}}) \\ \leq x, a_{2, n_{2}} ({\hat{M}}_{2, n_{2}} - b_{2, n_{2}}) \leq x) \to G (x) \end{aligned}$ (49) for some non-degenerate continuous distribution function $G (x)$ .

With the results in this section, for weakly dependent sequences with conditions $D (u_{n})$ and $D^{'} (u_{n})$ being satisfied, we can treat them as independent sequences when studying the limit distribution of the maxima. In the next section, some numerical experiments and estimation results are presented.

5. Numerical experiments

5.1. Simulation

We study the accuracy of the accelerated max-stable distributions in estimating the high quantiles of the simulated data. They are compared to the results using the classical GEV distribution alone. To simulate the data, we first generate two sequences from two different GEV distributions with parameters $ξ_{1}, μ_{1}, σ_{1}$ and $ξ_{2}, μ_{2}, σ_{2}$ , denoting them as ${X_{i}}_{i = 1}^{n}$ and ${Y_{i}}_{i = 1}^{n}$ , here n = 2000. We pair them and find their maxima, $Z_{i} = max (X_{i}, Y_{i})$ , then fit the accelerated max-stable distribution and GEV distribution separately to the sequence ${Z_{i}}_{i = 1}^{n}$ using maximum likelihood method. Using each fitted distribution, we generate a new sequence ${Z_{i}^{*}}_{i = 1}^{n}$ and calculate the proportion of ${Z_{i}^{*}}_{i = 1}^{n}$ that exceeds the 90th, 95th and 99th percentiles of the original sequence ${Z_{i}}_{i = 1}^{n}$ . The simulation scenarios cover all the possible combinations of three types of extreme value distributions. For each combination scenario, the process is repeated 100 times and the standard deviations of the estimated proportions are shown in the parentheses. The results are in Table .

Table 1. The proportions of the simulated data based on the fitted accelerated max-stable distributions and GEV distributions that exceeds the 90th, 95th and 99th percentiles of the original data $Z_{i}$ .

Display Table

From Table , for the 90th percentile, we can observe that accelerated max-stable distributions perform better than the GEV alone, and the exceeding proportion is closer to the theoretical value 0.1. The same is true for the 95th percentiles. For both of these two percentiles, the proportions are larger than the theoretical value 0.1 and 0.05 in general, with the GEV distribution deviating more. This observation implies that both estimations overestimate the true values. For the 99th percentiles, we observe that the differences are not large overall. With a few cases (2nd and 3rd), the accelerated max-stable distribution outperforms the GEV distribution. Also, the proportions for accelerated max-stable distributions are all larger than 0.01 and those for GEV distributions are mostly smaller than 0.01. This phenomenon implies that the accelerated max-stable distribution may overestimate the 99th percentiles. On the other hand, the GEV distribution may underestimate the 99th percentiles.

5.2. Real data

In this section, we apply both AMSD/AEVD and GEV fitting to stock data. The data contains the daily closing prices of 330 S&P 500 companies. Based on the closing prices, we calculate the daily negative log returns using the formula $r_{i} = - \log (\frac{p_{i}}{p_{i - 1}})$ . Here $p_{i}$ represents the stock's closing price of one company on day i. For each day i, we obtain the 330 negative log returns and calculate the maximal value of them, denoting it as $m_{i}$ . The time range is from 3 January 2000 to 30 December 2016, which contain 4277 trading days in the data. The histogram showing the distribution of ${m_{i}}_{i = 1}^{4277}$ is in Figure .

Figure 10. The histogram of the daily maxima of negative log returns of 330 stocks in the S&P 500 companies list.

We find the 90th, 95th and 99th sample percentiles of ${m_{i}}_{i = 1}^{4277}$ , which are 0.1545, 0.2 and 0.3229, respectively. Here the daily maximal negative log returns have some time dependency. However, for the purpose of demonstration, we treat them as independent and fit the AMSD/AEVD and the GEV distribution to ${m_{i}}_{i = 1}^{4277}$ . Based on the fitted distributions, we generate random samples with the same size and find the proportions of the samples that exceed the three percentiles. The proportions are shown in Table .

Table 2. The proportions of the simulated samples generated from the fitted distributions that exceed the 90th, 95th, and 99th sample percentiles of the maximal daily negative log returns.

Download CSV Display Table

Table clearly reveals that the AMSD/AEVD performs better than the GEV alone. The modelling performance may be further improved if time series dependence is implemented in the model fitting, e.g., the AcF model proposed by Zhao et al. (Citation2018) and Mao and Zhang (Citation2018). We will leave this task as a future project.

6. Conclusions

This paper extends the classical extreme value theory to maxima of maxima of time series with mixture patterns depending on the sample size. It has been shown that the classical extreme value distributions are special cases of the accelerated max-stable (extreme value) distributions (AMSDs/AEVDs). Some basic probabilistic properties are presented in the paper. These properties can be used as the probability foundation of recently proposed statistical models for extreme observations. The AMSDs may shed the light of extreme value studies and inferences. Many of existing theories in classical extreme value literature can be renovated in a much more general setting. Many real applications, e.g., risk analysis and portfolio management, systemic risk, etc. can be reanalysed and better results can be expected. Under the newly introduced framework, many new statistical models can be introduced and explored.

Acknowledgments

The authors thank Editor Jun Shao and two referees for their valuable comments. The work by Cao was partially supported by NSF-DMS-1505367 and Wisconsin Alumni Research Foundation #MSN215758. The work by Zhang was partially supported by NSF-DMS-1505367 and NSF-DMS-2012298.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

The work by Cao was partially supported by NSF-DMS-1505367 and Wisconsin Alumni Research Foundation #MSN215758. The work by Zhang was partially supported by National Science Foundation NSF-DMS-1505367 and NSF-DMS-2012298.

Notes on contributors

Wenzhi Cao

Wenzhi Cao is a PhD student from the statistics department at the University of Wisconsin-Madison. Cao received his bachelor degree in Mathematics from Nankai University. Cao's research areas include extreme value theory and machine learning.

Zhengjun Zhang

Zhengjun Zhang is Professor of Statistics at the University of Wisconsin. Zhang's main research areas of expertise are in financial time series and rare event modelling, virtual standard cryptocurrency, risk management, nonlinear dependence, asymmetric dependence, asymmetric and directed causal inference, gene-gene relationship in rare diseases.

References

Beirlant, J., Goegebeur, Y., Segers, J., & Teugels, J. (2004). Statistics of extremes: Theory and applications. Wiley Series in Probability and Statistics. Wiley.
Google Scholar
Bücher, A., & Segers, J. (2017). On the maximum likelihood estimator for the generalized extreme-value distribution. Extremes, 20(4), 839–872. https://doi.org/https://doi.org/10.1007/s10687-017-0292-6
Web of Science ®Google Scholar
Castillo, E., Hadi, A. S., Balakrishnan, N., & Sarabia, J. M. (2005). Extreme value and related models with applications in engineering and science. Wiley Series in Probability and Statistics. Wiley.
Google Scholar
Chavez-Demoulin, V., Embrechts, P., & Hofert, M. (2016). An extreme value approach for modeling operational risk losses depending on covariates. Journal of Risk and Insurance, 83(3), 735–776. https://doi.org/https://doi.org/10.1111/jori.v83.3
Web of Science ®Google Scholar
Coles, S. (2001). An introduction to statistical modeling of extreme values. Springer.
Google Scholar
Daouia, A., Girard, S., & Stupfler, G. (2018). Estimation of tail risk based on extreme expectiles. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 80(2), 263–292. https://doi.org/https://doi.org/10.1111/rssb.12254
Web of Science ®Google Scholar
de Haan, L. (1993). Extreme value statistics. In J. Galambos, J. Lechner, & E. Simiu (Eds.), Extreme value theory and applications (pp. 93–122). Kluwer Academic Publisher.
Google Scholar
de Haan, L., & Ferreira, A. (2006). Extreme value theory: An introduction. Springer Verlag.
Google Scholar
Dey, D. K., & Yan, J. (2016). Extreme value modeling and risk analysis: Methods and applications. Chapman & Hall/CRC.
Google Scholar
Embrechts, P., Resnick, S. I., & Samorodnitsky, G. (1999). Extreme value theory as a risk management tool. North American Actuarial Journal, 3(2), 30–41. https://doi.org/https://doi.org/10.1080/10920277.1999.10595797
Google Scholar
Finkenstädt, B., & Rootzén, H. E. (2004). Extreme values in finance, telecommunications, and the environment. Chapman & Hall/CRC.
Google Scholar
Galambos, J. (1978). The asymptotic theory of extreme order statistics. Technical Report.
Google Scholar
Heffernan, J. E., Tawn, J. A., & Zhang, Z. (2007). Asymptotically (in)dependent multivariate maxima of moving maxima processes. Extremes, 10(1–2), 57–82. https://doi.org/https://doi.org/10.1007/s10687-007-0035-1
Google Scholar
Idowu, T., & Zhang, Z. (2017). An extended sparse max-linear moving model with application to high-frequency financial data. Statistical Theory and Related Fields, 1(1), 92–111. https://doi.org/https://doi.org/10.1080/24754269.2017.1346852
Google Scholar
Leadbetter, M. R., Lindgren, G., & Rootzén, H. (2012). Extremes and related properties of random sequences and processes. Springer Science & Business Media.
Google Scholar
Loynes, R. M. (1965). Extreme values in uniformly mixing stationary stochastic processes. The Annals of Mathematical Statistics, 36(3), 993–999. https://doi.org/https://doi.org/10.1214/aoms/1177700071
Google Scholar
Malinowski, A., Schlather, M., & Zhang, Z. (2015). Marked point process adjusted tail dependence analysis for high-frequency financial data. Statistics and Its Interface, 8(1), 109–122. https://doi.org/https://doi.org/10.4310/SII.2015.v8.n1.a10
Web of Science ®Google Scholar
Mao, G., & Zhang, Z. (2018). Stochastic tail index model for high frequency financial data with Bayesian analysis. Journal of Econometrics, 205(2), 470–487. https://doi.org/https://doi.org/10.1016/j.jeconom.2018.03.019
Web of Science ®Google Scholar
McNeil, A. J., & Frey, R. (2000). Estimation of tail-related risk measures for heteroscedastic financial time series: An extreme value approach. Journal of Empirical Finance, 7(3–4), 271–300. https://doi.org/https://doi.org/10.1016/S0927-5398(00)00012-8
Google Scholar
Mikosch, T., Embrechts, P., & Klüppelberg, C. (1997). Modelling extremal events for insurance and finance. Springer Verlag.
Google Scholar
Naveau, P., Zhang, Z., & Zhu, B. (2011). An extension of max autoregressive models. Statistics and Its Interface, 4(2), 253–266. https://doi.org/https://doi.org/10.4310/SII.2011.v4.n2.a19
Web of Science ®Google Scholar
Resnick, S. I. (2013). Extreme values, regular variation and point processes. Springer.
Google Scholar
Rocco, M. (2014). Extreme value theory in finance: A survey. Journal of Economic Surveys, 28(1), 82–108. https://doi.org/https://doi.org/10.1111/joes.2014.28.issue-1
Web of Science ®Google Scholar
Rosenblatt, M. (1956). A central limit theorem and a strong mixing condition. Proceedings of the National Academy of Sciences USA, 42(1), 43–47. https://doi.org/https://doi.org/10.1073/pnas.42.1.43
PubMed Web of Science ®Google Scholar
Salvadori, G., De Michele, C., Kottegoda, N. T., & Rosso, R. (2007). Extremes in nature: An approach using copulas. Springer(Complexity).
Google Scholar
Smith, R. L. (1985). Maximum likelihood estimation in a class of nonregular cases. Biometrika, 72(1), 67–90. https://doi.org/https://doi.org/10.1093/biomet/72.1.67
Web of Science ®Google Scholar
Tang, R., Shao, J., & Zhang, Z. (2013). Sparse moving maxima models for tail dependence in multivariate financial time series. Journal of Statistical Planning and Inference, 143(5), 882–895. https://doi.org/https://doi.org/10.1016/j.jspi.2012.11.008
Web of Science ®Google Scholar
Tsay, R. S. (2005). Analysis of financial time series (Vol. 543). John Wiley & Sons.
Google Scholar
Watson, G. S. (1954). Extreme values in samples from m-dependent stationary stochastic processes. The Annals of Mathematical Statistics, 25(4), 798–800. https://doi.org/https://doi.org/10.1214/aoms/1177728670
Google Scholar
Zhang, Z., & Smith, R. L. (2010). On the estimation and application of max-stable processes. Journal of Statistical Planning and Inference, 140(5), 1135–1153. https://doi.org/https://doi.org/10.1016/j.jspi.2009.10.014
Web of Science ®Google Scholar
Zhang, Z., & Zhu, B. (2016). Copula structured m4 processes with application to high-frequency financial data. Journal of Econometrics, 194(2), 231–241. https://doi.org/https://doi.org/10.1016/j.jeconom.2016.05.004
Web of Science ®Google Scholar
Zhao, Z., & Zhang, Z. (2018). Semi-parametric dynamic max-copula model for multivariate time series. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 80(2), 409–432. https://doi.org/https://doi.org/10.1111/rssb.12256
Web of Science ®Google Scholar
Zhao, Z., Zhang, Z., & Chen, R. (2018). Modeling maxima with autoregressive conditional Fréchet model. Journal of Econometrics, 207(2), 325–351. https://doi.org/https://doi.org/10.1016/j.jeconom.2018.07.004
Web of Science ®Google Scholar

Appendix

A.1. Proofs of Theorems and Propositions

A.1.1. Proof of Theorem 2.1

For Equation (Equation4),

\begin{aligned} P (a_{2, n_{2}} (M_{n} - b_{2, n_{2}}) \leq x) \\ = P (max (a_{2, n_{2}} (M_{1, n_{1}} - b_{2, n_{2}}), \\ a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}})) \leq x) \\ = P (a_{2, n_{2}} (M_{1, n_{1}} - b_{2, n_{2}}) \\ \leq x, a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \\ = P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \\ \leq a_{1, n_{1}} (\frac{x}{a_{2, n_{2}}} + b_{2, n_{2}} - b_{1, n_{1}}), \\ a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \\ \to H_{1} (a x + b) H_{2} (x) . \end{aligned}

For Equation (Equation5),

\begin{aligned} P (a_{2, n_{2}} (M_{n} - b_{2, n_{2}}) \leq x) \\ = P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \\ \leq a_{1, n_{1}} (\frac{x}{a_{2, n_{2}}} + b_{2, n_{2}} - b_{1, n_{1}}), \\ a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \\ \to H_{2} (x) . \end{aligned}

A.1.2. Proof of Fact 2.2

The density of $Φ_{α_{1}} Φ_{α_{2}}$ is (A1) $f (x) = \{\begin{cases} e^{- x^{- α_{1}} - x^{- α_{2}}} (α_{1} x^{- α_{1} - 1} + α_{2} x^{- α_{2} - 1}), & x \geq 0, \\ 0, & x < 0. \end{cases}$ (A1) Thus (A2) $\begin{aligned} μ_{k} & = \int_{0}^{\infty} x^{k} f (x) d (x) \\ = \int_{0}^{\infty} x^{k} e^{- x^{- α_{1}} - x^{- α_{2}}} (α_{1} x^{- α_{1} - 1} + α_{2} x^{- α_{2} - 1}) d x . \end{aligned}$ (A2) Dividing the integral into two parts, we get (A3) $μ_{k} = \int_{0}^{1} x^{k} f (x) d x + \int_{1}^{\infty} x^{k} f (x) d x .$ (A3) First, let us consider $\int_{0}^{1} x^{k} f (x) d x$ . Since $lim_{x \to 0} x^{k} f (x) = 0$ and $x^{k} f (x)$ is continuous on $[0, 1]$ , it is bounded on $[0, 1]$ . This implies that (A4) $\int_{0}^{1} x^{k} f (x) d x < \infty .$ (A4) Next, let us consider $\int_{1}^{\infty} x^{k} f (x) d x$ . We have $\begin{aligned} \int_{1}^{\infty} x^{k} f (x) d x \\ = \int_{1}^{\infty} e^{- x^{- α_{1}} - x^{- α_{2}}} (α_{1} x^{k - α_{1} - 1} + α_{2} x^{k - α_{2} - 1}) d x . \end{aligned}$ Notice that (A5) $lim_{x \to \infty} e^{- x^{- α_{1}} - x^{- α_{2}}} = 1.$ (A5) Therefore $\int_{1}^{\infty} e^{- x^{- α_{1}} - x^{- α_{2}}} (α_{1} x^{k - α_{1} - 1} + α_{2} x^{k - α_{2} - 1}) d x < \infty$ only if $k < α_{1}$ and $k < α_{2}$ , i.e., $k < min (α_{1}, α_{2})$ .

A.1.3. Proof of Fact 2.3

We need to consider $lim_{x \to \infty} \frac{1 - e^{- x^{- α_{1}}} e^{- x^{- α_{2}}}}{1 - e^{- x^{- α_{1}}}}$ .

Since $x^{- α_{1}} \to 0$ and $x^{- α_{2}} \to 0$ as $x \to \infty$ , we have the Taylor expansions $\begin{aligned} e^{- x^{- α_{1}}} & = 1 - x^{- α_{1}} + o (x^{- α_{1}}), \\ e^{- x^{- α_{1}} - x^{- α_{2}}} & = 1 - (x^{- α_{1}} + x^{- α_{2}}) + o (x^{- α_{1}} + x^{- α_{2}}) . \end{aligned}$ Therefore (A6) $\begin{aligned} lim_{x \to \infty} \frac{1 - e^{- x^{- α_{1}}} e^{- x^{- α_{2}}}}{1 - e^{- x^{- α_{1}}}} \\ = lim_{x \to \infty} \frac{(x^{- α_{1}} + x^{- α_{2}}) + o (x^{- α_{1}} + x^{- α_{2}})}{x^{- α_{1}} + o (x^{- α_{1}})} \\ = lim_{x \to \infty} (1 + x^{α_{1} - α_{2}}) = 1. \end{aligned}$ (A6) This proves that $Φ_{α_{1}} Φ_{α_{2}}$ and $Φ_{α_{1}}$ are tail-equivalent.

A.1.4. Proof of Fact 2.4

The density of $Λ (x) Φ_{α} (x)$ is $f (x) = \{\begin{cases} e^{- e^{- x} - x^{- α}} (e^{- x} + α x^{- α - 1}), & x \geq 0, \\ 0, & x < 0. \end{cases}$ Thus (A7) $\begin{aligned} μ_{k} & = \int_{0}^{\infty} x^{k} f (x) d x \\ = \int_{0}^{\infty} x^{k} e^{- e^{- x} - x^{- α}} (e^{- x} + α x^{- α - 1}) d x . \end{aligned}$ (A7) Dividing the above equation into two parts, we get (A8) $μ_{k} = \int_{0}^{1} x^{k} f (x) d x + \int_{1}^{\infty} x^{k} f (x) d x .$ (A8) For the first part, since $lim_{x \to 0} x^{k} f (x) = 0$ and $x^{k} f (x)$ is continuous on $[0, 1]$ , it is bounded on $[0, 1]$ . Thus $\int_{0}^{1} x^{k} f (x) < \infty$ .

For the second part, (A9) $\int_{1}^{\infty} x^{k} f (x) = \int_{1}^{\infty} e^{- e^{- x} - x^{- α}} (e^{- x} x^{k} + α x^{k - α - 1}) d x .$ (A9) Since (A10) $lim_{x \to \infty} e^{- e^{- x} - x^{- α}} = 1, a n d \int_{1}^{\infty} e^{- x} x^{k} d x < \infty f o r \forall k,$ (A10) we have $\int_{1}^{\infty} e^{- e^{- x} - x^{- α}} (e^{- x} x^{k} + α x^{k - α - 1}) d x < \infty$ if and only if $k < α$ .

A.1.5. Proof of Fact 2.5

We need to consider $lim_{x \to \infty} \frac{1 - e^{- e^{- x}} e^{- x^{- α}}}{1 - e^{- x^{- α}}}$ .

Since $lim_{x \to \infty} e^{- x} \to 0$ and $lim_{x \to \infty} x^{- α} \to 0$ , we have the Taylor expansions $\begin{aligned} e^{- e^{- x} - x^{- α}} & = 1 - e^{- x} - x^{- α} + o (e^{- x} + x^{- α}), \\ e^{- x^{- α}} & = 1 - x^{- α} + o (x^{- α}) . \end{aligned}$ Thus (A11) $\begin{aligned} lim_{x \to \infty} \frac{1 - e^{- e^{- x}} e^{- x^{- α}}}{1 - e^{- x^{- α}}} & = lim_{x \to \infty} \frac{e^{- x} + x^{- α} + o (e^{- x} + x^{- α})}{x^{- α} + o (x^{- α})} \\ = 1. \end{aligned}$ (A11) This implies that $Λ (x) Φ_{α} (x)$ and $Φ_{α} (x)$ are tail-equivalent.

A.1.6. Proof of Theorem 3.1

If (Equation25(25) $\begin{aligned} n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \\ \to τ a s n \to \infty . \end{aligned}$ (25) ) holds, we must have $\begin{aligned} 1 - F_{1} (u_{1, n_{1}}) \to 0, \\ 1 - F_{2} (u_{2, n_{2}}) \to 0. \end{aligned}$ Then (A12) $\begin{aligned} n_{1} \log (1 - (1 - F_{1} (u_{1, n_{1}}))) + n_{2} \log (1 - (1 - F_{2} (u_{2, n_{2}}))) \\ = - n_{1} (1 - F_{1} (u_{1, n_{1}})) (1 + o (1)) \\ - n_{2} (1 - F_{2} (u_{2, n_{2}})) (1 + o (1)) \\ \to - τ, \end{aligned}$ (A12) which is equivalent to $\begin{aligned} P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \\ = (1 - (1 - F_{1} (u_{1, n_{1}})))^{n_{1}} (1 - (1 - F_{2} (u_{2, n_{2}})))^{n_{2}} \\ = \exp {n_{1} \log (1 - (1 - F_{1} (u_{1, n_{1}}))) \\ + n_{2} \log (1 - (1 - F_{2} (u_{2, n_{2}})))} \\ \to e^{- τ} . \end{aligned}$ Conversely, if (Equation26(26) $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \to e^{- τ} a s n \to \infty .$ (26) ) holds, which is equivalent to (A13) $\begin{aligned} n_{1} \log (1 - (1 - F_{1} (u_{1, n_{1}}))) \\ + n_{2} \log (1 - (1 - F_{2} (u_{2, n_{2}}))) \to - τ, \end{aligned}$ (A13) we must have $1 - F_{1} (u_{1, n_{1}}) \to 0$ and $1 - F_{2} (u_{2, n_{2}}) \to 0$ . Otherwise, suppose $1 - F_{1} (u_{1, n_{1}}) ↛ 0$ , then there is a sequence of indexes $m_{1}, m_{2}, \dots$ and $ϵ > 0$ such that $1 - F_{1} (u_{1, m_{k}}) > ϵ$ for $\forall k$ . This means that $\begin{aligned} n_{1} \log (1 - (1 - F_{1} (u_{1, m_{i}}))) + n_{2} \log (1 - (1 - F_{2} (u_{2, n - m_{i}}))) \\ < n_{1} \log (1 - (1 - F_{1} (u_{1, m_{i}}))) \\ < n_{1} \log (1 - ϵ) \to - \infty, \end{aligned}$ which is contradictory to (EquationA13(A13) $\begin{aligned} n_{1} \log (1 - (1 - F_{1} (u_{1, n_{1}}))) \\ + n_{2} \log (1 - (1 - F_{2} (u_{2, n_{2}}))) \to - τ, \end{aligned}$ (A13) ). We have (A14) $\begin{aligned} n_{1} [(1 - F_{1} (u_{1, n_{1}})) + o (1 - F_{1} (u_{1, n_{1}}))] \\ + n_{2} [(1 - F_{2} (u_{2, n_{2}})) + o (1 - F_{2} (u_{2, n_{2}}))] \to τ \end{aligned}$ (A14) and Equation (Equation25(25) $\begin{aligned} n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \\ \to τ a s n \to \infty . \end{aligned}$ (25) ) holds.

A.1.7. Proof of Corollary 3.1

Since (A15) $n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \to τ_{1} + τ_{2},$ (A15) (Equation30(30) $P (M_{1, n_{1}} \leq u_{1, n}, M_{2, n_{2}} \leq u_{2, n}) \to e^{- τ_{1} - τ_{2}} .$ (30) ) is a direct result of Theorem 3.1.

If $\frac{n_{2} (1 - F_{2} (u_{1, n_{1}}))}{n_{1} (1 - F_{1} (u_{1, n_{1}}))} \to t$ , then $n_{2} (1 - F_{2} (u_{1, n_{1}})) \to t τ_{1}$ , where $0 \leq t τ_{1} < \infty$ . Therefore, $\begin{aligned} P (M_{n} \leq u_{1, n_{1}}) & = P (M_{1, n_{1}} \leq u_{1, n_{1}}) P (M_{2, n_{2}} \leq u_{1, n_{1}}) \\ \to e^{- τ_{1} (1 + t)} . \end{aligned}$

A.1.8. Proof of Theorem 3.2

Since $P (M_{i, n_{i}} \leq u_{i, n_{i}}) = (1 - τ_{i, n_{i}} / n_{i})^{n_{i}}$ and $0 \leq τ_{i, n_{i}} = n_{1} (1 - F_{i} (u_{i, n_{i}})) \leq n$ , the result follows from Lemma A.2.

A.1.9. Proof of Theorem 4.1

For fixed k, write $n^{'} = [n / k]$ , suppose that there are $n_{1}^{'}$ members from $F_{1}$ and $n_{2}^{'}$ members from $F_{2}$ among ${X_{1}, \dots, X_{n^{'}}},$ $n^{'} = n_{1}^{'} + n_{2}^{'}$ . If (Equation47(47) $n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \to τ .$ (47) ) holds, by assumption we have $n_{1}^{'} \sim p n^{'} \sim \frac{n_{1}}{k}$ and $n_{2}^{'} \sim (1 - p) n^{'} \sim \frac{n_{2}}{k},$ thus (A16) $n_{1}^{'} (1 - F_{1} (u_{1, n_{1}})) + n_{2}^{'} (1 - F_{2} (u_{2, n_{2}})) \to \frac{τ}{k} .$ (A16) Since $\begin{aligned} P ({M_{1, n_{1}^{'}} \leq u_{1, n_{1}}, M_{2, n_{2}^{'}} \leq u_{2, n_{2}}}) \\ = 1 - P ({M_{1, n_{1}^{'}} > u_{1, n_{1}}} \cup {M_{2, n_{2}^{'}} > u_{2, n_{2}}}) \\ = 1 - P ((⋃_{i = 1}^{n_{1}^{'}} {X_{1, i} > u_{1, n_{1}}}) \cup (⋃_{j = 1}^{n_{2}^{'}} {X_{2, j} > u_{2, n_{2}}})), \end{aligned}$ we have (A17) $\begin{aligned} 1 - n_{1}^{'} (1 - F_{1} (u_{1, n_{1}})) - n_{2}^{'} (1 - F_{2} (u_{2, n_{2}})) \\ \leq P (M_{1, n_{1}^{'}} \leq u_{1, n_{1}}, M_{2, n_{2}^{'}} \leq u_{2, n_{2}}) \\ \leq 1 - n_{1}^{'} (1 - F_{1} (u_{1, n_{1}})) - n_{2}^{'} (1 - F_{2} (u_{2, n_{2}})) + S_{n}, \end{aligned}$ (A17) where $S_{n} = \sum_{1 \leq i < j \leq n^{'}} P (X_{i} > u_{n}^{(i)}, X_{j} > u_{n}^{(j)})$ .

Condition $D^{'} (u_{n})$ implies that $\underset{n \to \infty}{lim sup} S_{n} = o (\frac{1}{k})$ as $k \to \infty$ . By (EquationA16(A16) $n_{1}^{'} (1 - F_{1} (u_{1, n_{1}})) + n_{2}^{'} (1 - F_{2} (u_{2, n_{2}})) \to \frac{τ}{k} .$ (A16) ) and (EquationA17(A17) $\begin{aligned} 1 - n_{1}^{'} (1 - F_{1} (u_{1, n_{1}})) - n_{2}^{'} (1 - F_{2} (u_{2, n_{2}})) \\ \leq P (M_{1, n_{1}^{'}} \leq u_{1, n_{1}}, M_{2, n_{2}^{'}} \leq u_{2, n_{2}}) \\ \leq 1 - n_{1}^{'} (1 - F_{1} (u_{1, n_{1}})) - n_{2}^{'} (1 - F_{2} (u_{2, n_{2}})) + S_{n}, \end{aligned}$ (A17) ), we have $\begin{aligned} 1 - \frac{τ}{k} & \leq \underset{n \to \infty}{lim inf} P (M_{1, n_{1}^{'}} \leq u_{1, n_{1}}, M_{2, n_{2}^{'}} \leq u_{2, n_{2}}) \\ \leq \underset{n \to \infty}{lim sup} P (M_{1, n_{1}^{'}} \leq u_{1, n_{1}}, M_{2, n_{2}^{'}} \leq u_{2, n_{2}}) \\ \leq 1 - \frac{τ}{k} + o (\frac{1}{k}) . \end{aligned}$ Since $D (u_{n})$ implies $D (u_{1, n_{1}})$ and $D (u_{2, n_{2}})$ , Lemma A.3 holds for each subsequence. We have $\begin{aligned} {(1 - \frac{τ}{k})}^{k} & \leq \underset{n \to \infty}{lim inf} P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \\ \leq \underset{n \to \infty}{lim sup} P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \\ \leq {(1 - \frac{τ}{k} + o (\frac{1}{k}))}^{k} . \end{aligned}$ Letting $k \to \infty,$ we have $lim_{n \to \infty} P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \to e^{- τ}$ .

Conversely, if (Equation46(46) $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \to e^{- τ}$ (46) ) holds, (A18) $\begin{aligned} 1 - P (M_{1, n_{1}^{'}} \leq u_{1, n_{1}}, M_{2, n_{2}^{'}} \leq u_{2, n_{2}}) \\ \leq n_{1}^{'} (1 - F_{1} (u_{1, n_{1}})) + n_{2}^{'} (1 - F_{2} (u_{2, n_{2}})) \\ \leq 1 - P (M_{1, n_{1}^{'}} \leq u_{1, n_{1}}, M_{2, n_{2}^{'}} \leq u_{2, n_{2}}) + S_{n} . \end{aligned}$ (A18) Since $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \to e^{- τ},$ we have $P (M_{1, n_{1}^{'}} \leq u_{1, n_{1}}, M_{2, n_{2}^{'}} \leq u_{2, n_{2}}) \to e^{- τ / k}$ . By letting $n \to \infty$ in (EquationA18(A18) $\begin{aligned} 1 - P (M_{1, n_{1}^{'}} \leq u_{1, n_{1}}, M_{2, n_{2}^{'}} \leq u_{2, n_{2}}) \\ \leq n_{1}^{'} (1 - F_{1} (u_{1, n_{1}})) + n_{2}^{'} (1 - F_{2} (u_{2, n_{2}})) \\ \leq 1 - P (M_{1, n_{1}^{'}} \leq u_{1, n_{1}}, M_{2, n_{2}^{'}} \leq u_{2, n_{2}}) + S_{n} . \end{aligned}$ (A18) ), $\begin{aligned} 1 - e^{- τ / k} \\ \leq \frac{1}{k} \underset{n \to \infty}{lim inf} n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \\ \leq \frac{1}{k} \underset{n \to \infty}{lim sup} n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \\ \leq 1 - e^{- τ / k} + o (\frac{1}{k}) \end{aligned}$ from which (multiplying k on all sides and let $k \to \infty$ ) we have $n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \to τ$ .

A.1.10. Proof of Corollary 4.1

Suppose $n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \to \infty,$ by $u_{1, n_{1}} < v_{1, n_{1}}$ and $u_{2, n_{2}} < v_{2, n_{2}}$ , we have $\begin{aligned} P (M_{1, n_{1}} & \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \\ \leq P (M_{1, n_{1}} \leq v_{1, n_{1}}, M_{2, n_{2}} \leq v_{2, n_{2}}) . \end{aligned}$ By Theorem 4.1, $P (M_{1, n_{1}} \leq v_{1, n_{1}}, M_{2, n_{2}} \leq v_{2, n_{2}}) \to e^{- τ}$ . Then $\underset{n \to \infty}{lim sup} P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) \leq e^{- τ} .$ By letting $τ \to \infty$ , we have $lim_{n \to \infty} P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}}) = 0.$ Conversely, we still have $\begin{aligned} n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \\ \geq n_{1} (1 - F_{1} (v_{1, n_{1}}) + n_{2} (1 - F_{2} (v_{2, n_{2}})) \to τ . \end{aligned}$ Since the above inequality holds for arbitrary large $τ > 0,$ we must have $n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \to \infty$ .

A.1.11. Proof of Theorem 4.2

For $θ > 0$ , the condition $P ({\hat{M}}_{1, n_{1}} \leq u_{1, n_{1}}, {\hat{M}}_{2, n_{2}} \leq u_{2, n_{2}}) \to θ$ may be rewritten as $P ({\hat{M}}_{1, n_{1}} \leq u_{1, n_{1}}, {\hat{M}}_{2, n_{2}} \leq u_{2, n_{2}}) \to e^{- τ}$ with $τ = - \log θ$ , this holds if and only if $n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \to τ$ . The same is true for $P (M_{1, n_{1}} \leq u_{1, n_{1}}, M_{2, n_{2}} \leq u_{2, n_{2}})$ by condition $D (u_{n})$ and $D^{'} (u_{n})$ . When $θ = 0$ , the result follows from Corollary 4.1.

A.1.12. Proof of Theorem 4.3

If $G (x) > 0$ , the equivalence follows from Theorem 4.2, with $θ = G (x)$ .

If $G (x) = 0$ , the continuity of G shows that, if $0 < τ < \infty$ ,there exists $x_{0}$ such that $G (x_{0}) = e^{- τ}$ . $D (v_{i, n_{i}})$ and $D^{'} (v_{i, n_{i}})$ hold for $v_{1, n_{1}} = x_{0} / a_{1, n_{1}} + b_{1, n_{1}}, v_{2, n_{2}} = x_{0} / a_{2, n_{2}} + b_{2, n_{2}}$ and $P (M_{1, n_{1}} \leq v_{1, n_{1}}, M_{2, n_{2}} \leq v_{2, n_{2}}) \to e^{- τ}$ or $P ({\hat{M}}_{1, n_{1}} \leq v_{1, n_{1}}, {\hat{M}}_{2, n_{2}} \leq v_{2, n_{2}}) \to e^{- τ}$ depending on the assumption made, so that $n_{1} (1 - F_{1} (v_{1, n_{1}})) + n_{2} (1 - F_{2} (v_{2, n_{2}})) \to τ$ . If (Equation49(49) $\begin{aligned} P (a_{1, n_{1}} ({\hat{M}}_{1, n_{1}} - b_{1, n_{1}}) \\ \leq x, a_{2, n_{2}} ({\hat{M}}_{2, n_{2}} - b_{2, n_{2}}) \leq x) \to G (x) \end{aligned}$ (49) ) holds, then we have $n_{1} (1 - F_{1} (u_{1, n_{1}})) + n_{2} (1 - F_{2} (u_{2, n_{2}})) \to \infty,$ thus $u_{1, n_{1}} < v_{1, n_{1}}$ and $u_{2, n_{2}} < v_{2, n_{2}}$ (since one of the inequalities must hold and also implies another). By Theorem 4.2, (Equation48(48) $\begin{aligned} P (a_{1, n_{1}} (M_{1, n_{1}} - b_{1, n_{1}}) \\ \leq x, a_{2, n_{2}} (M_{2, n_{2}} - b_{2, n_{2}}) \leq x) \to G (x) \end{aligned}$ (48) ) holds. The converse direction can be proved similarly.

A.2. Lemmas

Lemma A.1

Khintchine, Theorem 1.2.3 in Leadbetter et al. (Citation2012)

Let ${F_{n}}$ be a sequence of cdf's and H a nondegenerate cdf. Let $a_{n} > 0$ and $b_{n}$ be constants such that (A19) $F_{n} (a_{n} x + b_{n}) \overset{w}{\to} H (x) .$ (A19) Then for some nondegenerate cdf $H_{*}$ and constants $α_{n} > 0$ , $β_{n}$ , (A20) $F_{n} (α_{n} x + β_{n}) \overset{w}{\to} H_{*} (x)$ (A20) if and only if (A21) $a_{n}^{- 1} α_{n} \to a a n d a_{n}^{- 1} (β_{n} - b_{n}) \to b$ (A21) for some a>0 and b, and then (A22) $H_{*} (x) = H (a x + b) .$ (A22)

Lemma A.2

Lemma 2.4.1 in Leadbetter et al. (Citation2012)

If $0 \leq x \leq n$ then (A23) $\begin{aligned} 0 \leq e^{- x} - {(1 - \frac{x}{n})}^{n} & \leq \frac{x^{2} e^{- x}}{2} \cdot \frac{1}{n - 1} \\ \leq 2 e^{- 2} \cdot \frac{1}{n - 1} \\ \leq 0.3 \cdot \frac{1}{n - 1} f o r n = 1, 2, \dots, \end{aligned}$ (A23) and further (A24) $\begin{aligned} e^{- x} - {(1 - \frac{x}{n})}^{n} \\ = \frac{x^{2} e^{- x}}{2} \frac{1}{n} (1 + O (\frac{1}{n})) a s n \to \infty, \end{aligned}$ (A24) uniformly for x in bounded intervals.
If $x - y \leq \log 2$ then (A25) $e^{- y} - e^{- x} = e^{- x} {(x - y) + θ (x - y)^{2}},$ (A25) with $0 < θ < 1$ .

Lemma A.3

Lemma 3.3.2 in Leadbetter et al. (Citation2012)

If $D (u_{n})$ holds, for a fixed integer k, we have (A26) $P (M_{n} \leq u_{n}) - P^{k} (M_{[n / k]} \leq u_{n}) \to 0 a s n \to \infty .$ (A26)

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

New extreme value theory for maxima of maxima

Abstract

1. Introduction

2. Accelerated max-stable distribution for independent sequences

2.1. A brief review of classical univariate extreme value theory

2.2. Maxima of maxima

2.3. Convergence to the accelerated max-stable distribution

Fréchet and Gumbel

Fréchet and Fréchet

Uniform and normal

Weibull and Weibull

Normal and Pareto

Cauchy and uniform distribution

2.4. Density functions and density plots

2.5. Tail equivalence and the existence of moments

3. Joint convergence and approximation errors

3.1. Convergence of joint probability for general thresholds

3.2. Approximation error

4. Weakly dependent sequences

4.1. Review of some weakly dependent conditions

4.2. Weakly dependent mixed sequences

4.3. Associated independent sequences

5. Numerical experiments

5.1. Simulation

Table 1. The proportions of the simulated data based on the fitted accelerated max-stable distributions and GEV distributions that exceeds the 90th, 95th and 99th percentiles of the original data Zi.

5.2. Real data

Table 2. The proportions of the simulated samples generated from the fitted distributions that exceed the 90th, 95th, and 99th sample percentiles of the maximal daily negative log returns.

6. Conclusions

Acknowledgments

Disclosure statement

Additional information

Funding

Notes on contributors

Wenzhi Cao

Zhengjun Zhang

References

Appendix

A.1. Proofs of Theorems and Propositions

A.1.1. Proof of Theorem 2.1

A.1.2. Proof of Fact 2.2

A.1.3. Proof of Fact 2.3

A.1.4. Proof of Fact 2.4

A.1.5. Proof of Fact 2.5

A.1.6. Proof of Theorem 3.1

A.1.7. Proof of Corollary 3.1

A.1.8. Proof of Theorem 3.2

A.1.9. Proof of Theorem 4.1

A.1.10. Proof of Corollary 4.1

A.1.11. Proof of Theorem 4.2

A.1.12. Proof of Theorem 4.3

A.2. Lemmas

Khintchine, Theorem 1.2.3 in Leadbetter et al. (Citation2012)

Lemma 2.4.1 in Leadbetter et al. (Citation2012)

Lemma 3.3.2 in Leadbetter et al. (Citation2012)

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

Table 1. The proportions of the simulated data based on the fitted accelerated max-stable distributions and GEV distributions that exceeds the 90th, 95th and 99th percentiles of the original data $Z_{i}$ .