
Robust autoregressive modeling and its diagnostic analytics with a COVID-19 related application

Pages 1318-1343 | Received 03 Oct 2022, Accepted 28 Mar 2023, Published online: 19 Apr 2023

Abstract

Autoregressive models in time series are useful in various areas. In this article, we propose a skew-t autoregressive model. We estimate its parameters using the expectation-maximization (EM) method and develop an influence methodology based on local perturbations for its validation. We obtain the normal curvatures for four perturbation strategies to identify influential observations and assess their performance through Monte Carlo simulations. An example of financial data analysis is presented to study daily log-returns for Brent crude futures and to investigate the possible impact of the COVID-19 pandemic.

1. Introduction

Autoregressive (AR) modeling is an essential technique in time series data analysis and is widely applied in biology, economics, finance, health and other areas. Several AR models and their statistical inference have been well established [Citation29,Citation47,Citation48]. Furthermore, influence diagnostics for statistical modeling is equally important nowadays [Citation13,Citation22,Citation28,Citation35].

The local influence technique [Citation9] examines how a minor perturbation affects the model fitting and is a powerful tool for statistical diagnostics when identifying potentially influential observations. Influence diagnostics has been conducted in regression models [Citation45,Citation46] and time-series analysis [Citation31,Citation32]. Among others, [Citation7,Citation13,Citation20,Citation27,Citation40,Citation49] investigated the sensitivity of estimates of regression parameters with AR disturbances or similar assumptions by employing influence diagnostics. A number of authors, such as [Citation17,Citation24,Citation25,Citation30,Citation33,Citation36,Citation51,Citation52], studied estimation and its validation with diagnostic methods for time series models and related structures.

Note that the standard assumption in many settings for regression and time-series analysis is that all errors are mutually independent and follow normal or Student-t (simply t from now on) distributions. For example, [Citation26,Citation34] investigated maximum likelihood (ML) inference and its stability by influence diagnostic methods for a vector AR model under normal and t distributions. However, certain economic, financial and other data are known to exhibit errors following skewed distributions. To study such characteristics, distributions proposed by [Citation2–4], related to the skew-normal (SN) and skew-t (ST) models, have been well received by researchers in recent years. They are appealing alternatives to the Student-t and Gaussian models and can therefore be adopted. As a result, they are becoming increasingly popular; see, for example, [Citation4,Citation5,Citation21]. Moreover, [Citation6–8,Citation13,Citation14,Citation50] studied SN partially linear and nonlinear regression models and/or score test statistics. Robust mixture structures under an ST model have been analyzed by [Citation15,Citation23].

For financial applications, [Citation47,Citation48] discussed ST distributions for their generalized autoregressive conditional heteroscedastic (GARCH) models, and [Citation12] advocated and compared SN and ST distributions. Liu et al. [Citation31] focused on diagnostic analytics for an AR model with SN errors (SNAR model), while [Citation32] studied estimation and other statistical aspects of an SNAR model and, in particular, presented a real-world application to financial data affected by the COVID-19 pandemic. However, we are not aware of any studies that have reported results on influence diagnostic measures in an AR model with ST errors (STAR model).

In the present article, we conduct inference and validation of the STAR model with a financial data analysis related to the COVID-19 pandemic. Our main contributions are threefold:

  1. we propose the STAR model with a systematic methodology for estimation and diagnostics. We rely on likelihood methods to fit the STAR model using the EM algorithm and conduct its influence diagnostics based on four perturbation schemes, including a new one based on skewness. The STAR model complements the ST innovation-based GARCH models discussed in [Citation47,Citation48] and the AR model under the SN distribution studied in [Citation31,Citation32].

  2. we establish our mathematical results for the STAR model using standard matrix differential calculus and examine their statistical implementation using simulations. We compare the ST with the normal, t and skew-normal distributions, especially for the diagnostics. Our findings demonstrate that the ST model is preferable to the alternatives previously considered in [Citation26,Citation31,Citation32,Citation34].

  3. we conduct an empirical study of real-world financial data to illustrate both the STAR model and our methodology to be effective in practical applications and data analytics.

We proceed as follows. In Section 2, we explain the STAR model and develop an efficient algorithm for calculating the ML estimates, whereas in Section 3 we present our curvature diagnostics under the four perturbation schemes using the local influence technique. Section 4 reports our simulation studies, which examine the influence diagnostics and compare the ST model with the normal, t, and SN distributions. Section 5 provides an empirical example involving an STAR model to show potential applications of the results. Our concluding comments are provided in Section 6. Finally, the matrix results for the curvature diagnostics are derived in the appendix.

2. Formulation and estimation

In this section, we propose our STAR model, obtain its parameters' ML estimates, and establish the corresponding Hessian matrix.

2.1. STAR(p) model

We assume an STAR(p) time series model formulated as $y_t = u_t + \beta_1 y_{t-1} + \cdots + \beta_p y_{t-p}$, with $y_t$ being the response observed at time $t$, for $t\in\{1,\dots,R\}$, and $p$ previous values denoted by $y_1,\dots,y_p$; $\beta_i$ is the $i$-th regression coefficient, for $i\in\{1,\dots,p\}$; and $u_t$ is the $t$-th innovation following an ST distribution denoted as $u_t\sim\mathrm{ST}(0,\sigma^2,\lambda,\varsigma)$, with $\sigma^2>0$ being the scale, $\lambda$ the skewness, and $\varsigma$ the degrees of freedom. Conveniently, the model is rewritten as
(1) $y_t = x_t^\top\beta + u_t,$
where $x_t=(y_{t-1},\dots,y_{t-p})^\top$ and $\beta=(\beta_1,\dots,\beta_p)^\top$ are $p\times1$ vectors. The parameters are collected in a $(p+3)\times1$ vector $\Theta=(\beta^\top,\sigma^2,\lambda,\varsigma)^\top$.
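For illustration only (our computations are carried out in Matlab, as noted in Section 4), a minimal Python sketch of the recursion defining the STAR(p) model is given below; the function and argument names are assumptions for exposition, and the generation of the ST innovations themselves is sketched after Lemma 2.

```python
import numpy as np

def simulate_star_path(beta, innovations, y_init):
    """Build y_t = beta_1 y_{t-1} + ... + beta_p y_{t-p} + u_t recursively.

    `innovations` holds the u_t draws (their skew-t generation is sketched
    after Lemma 2); `y_init` holds the p starting values y_1, ..., y_p.
    """
    p = len(beta)
    y = list(y_init)
    for u_t in innovations:
        x_t = y[-1:-p - 1:-1]                      # (y_{t-1}, ..., y_{t-p})
        y.append(float(np.dot(beta, x_t)) + u_t)
    return np.array(y)
```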

Lemma 1

[Citation47]

Let $\beta(z)=1-\beta_1z-\cdots-\beta_pz^p$ be the characteristic polynomial of the AR structure and suppose that all of its roots have modulus greater than one. Then, the associated AR process is stationary.

Lemma 1 is used in the numerical analysis of Sections 4 and 5, and it offers a necessary and sufficient condition for testing whether our data are stationary. Note that, if $Y$ follows an ST model with $\mu$, $\sigma^2$, $\lambda$, and $\varsigma$ being the location, scale, skewness, and degrees-of-freedom parameters, we denote it as $Y\sim\mathrm{ST}(\mu,\sigma^2,\lambda,\varsigma)$. Thus, if $Y\sim\mathrm{ST}(\mu,\sigma^2,\lambda,\varsigma)$, its probability density function (PDF) is established by
(2) $f_Y(y;\varsigma)=\dfrac{2}{\sigma}\,t_\varsigma(\eta)\,T_{\varsigma+1}\!\Big(\lambda\eta\Big(\dfrac{\varsigma+1}{\eta^2+\varsigma}\Big)^{1/2}\Big),\quad y\in\mathbb{R},\quad \eta=\dfrac{y-\mu}{\sigma},$
with $T_\varsigma$ and $t_\varsigma$ being the cumulative distribution function (CDF) and PDF of the Student-t model with $\varsigma$ degrees of freedom. If $\lambda=0$, the PDF of $Y$ stated in (2) reduces to the t PDF; if $\varsigma\to\infty$, it becomes the SN PDF; and if $\lambda=0$ and $\varsigma\to\infty$, it is the normal PDF. Observe that the mean and variance of $Y$ are given by
$E(Y)=\mu+\sigma\delta_\lambda\Big(\dfrac{\varsigma}{\pi}\Big)^{1/2}\dfrac{\Gamma((\varsigma-1)/2)}{\Gamma(\varsigma/2)},\qquad \mathrm{Var}(Y)=\sigma^2\Bigg(\dfrac{\varsigma}{\varsigma-2}-\dfrac{\varsigma\delta_\lambda^2}{\pi}\Big(\dfrac{\Gamma((\varsigma-1)/2)}{\Gamma(\varsigma/2)}\Big)^2\Bigg),$
with $\delta_\lambda=\lambda/(1+\lambda^2)^{1/2}$.
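Both the stationarity condition of Lemma 1 and the PDF in (2) are straightforward to evaluate numerically. The following illustrative Python sketch (not part of our Matlab implementation; the function names are assumptions) checks that all roots of $\beta(z)$ lie outside the unit circle and evaluates the ST density via the Student-t PDF and CDF.

```python
import numpy as np
from scipy import stats

def is_stationary(beta):
    """Lemma 1: all roots of beta(z) = 1 - beta_1 z - ... - beta_p z^p
    must have modulus greater than one."""
    coeffs = np.r_[[-b for b in beta[::-1]], 1.0]   # highest degree first
    return bool(np.all(np.abs(np.roots(coeffs)) > 1.0))

def st_pdf(y, mu, sigma2, lam, nu):
    """Skew-t PDF in (2): (2/sigma) t_nu(eta) T_{nu+1}(lam*eta*((nu+1)/(eta^2+nu))^(1/2))."""
    sigma = np.sqrt(sigma2)
    eta = (y - mu) / sigma
    return (2.0 / sigma) * stats.t.pdf(eta, df=nu) * stats.t.cdf(
        lam * eta * np.sqrt((nu + 1.0) / (eta**2 + nu)), df=nu + 1)

print(is_stationary([0.15, 0.2]))   # AR(2) coefficients used in Section 4.2: True
```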

Lemma 2

[Citation23]

Let $Y\sim\mathrm{ST}(\mu,\sigma^2,\lambda,\varsigma)$. Then, we get
$Y\mid\gamma,\tau\sim\mathrm{N}\big(\mu+\delta_\lambda\gamma,\ (1-\delta_\lambda^2)\sigma^2/\tau\big),\qquad \gamma\mid\tau\sim\mathrm{TN}\big(0,\ \sigma^2/\tau;\ (0,\infty)\big),\qquad \tau\sim\mathrm{Gamma}\Big(\dfrac{\varsigma}{2},\dfrac{\varsigma}{2}\Big),$
where $\mathrm{TN}(\mu,\sigma^2;(a,b))$ represents the $\mathrm{N}(\mu,\sigma^2)$ distribution truncated to the interval $(a,b)$.
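Lemma 2 also provides a direct way to draw ST innovations, which is convenient for simulation studies such as those in Section 4. The sketch below (illustrative Python, with assumed function names; not the authors' Matlab code) samples $\tau$, then the truncated normal $\gamma$, and then $u$, following the hierarchy above.

```python
import numpy as np

def rvs_skew_t(n, sigma2, lam, nu, rng=None):
    """Draw u ~ ST(0, sigma^2, lambda, nu) via the hierarchy of Lemma 2:
    tau ~ Gamma(nu/2, rate nu/2), gamma | tau ~ TN(0, sigma^2/tau; (0, inf)),
    u | gamma, tau ~ N(delta*gamma, (1 - delta^2) sigma^2 / tau)."""
    rng = np.random.default_rng() if rng is None else rng
    delta = lam / np.sqrt(1.0 + lam**2)
    tau = rng.gamma(shape=nu / 2.0, scale=2.0 / nu, size=n)    # rate nu/2
    gamma = np.abs(rng.normal(0.0, np.sqrt(sigma2 / tau)))     # half-normal = TN(0,.;(0,inf))
    return rng.normal(delta * gamma, np.sqrt((1.0 - delta**2) * sigma2 / tau))
```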

2.2. EM based ML estimation

In practice, directly maximizing the logarithmic likelihood function by the ML method to find the estimate of $\Theta$ is not an easy task. Instead, we implement the ML method based on the EM algorithm for incomplete data proposed by [Citation11]. Here, $y_{\mathrm{complete}}=(y_{\mathrm{observed}},y_{\mathrm{missing}})$ denotes the set of complete data, where $y_{\mathrm{missing}}$ stands for the set of missing data and $y_{\mathrm{observed}}$ for the set of observed data. Given a starting estimate $\widehat\Theta^{(0)}$, which can be taken from the fit under the normal distribution, we obtain $\widehat\Theta^{(r)}$, for $r\in\{1,2,\dots\}$, by iterating between the E and M steps until reaching convergence, as in [Citation13,Citation32]. Convergence is declared when $\|\widehat\Theta^{(r+1)}-\widehat\Theta^{(r)}\|<\epsilon=10^{-5}$, the same tolerance as used in the simulation study.

According to Lemma 2, the model defined as $y_t=\beta_1y_{t-1}+\cdots+\beta_py_{t-p}+u_t$ can be represented hierarchically as $u_t\mid\gamma_t,\tau_t\sim\mathrm{N}(\delta_\lambda\gamma_t,(1-\delta_\lambda^2)\sigma^2/\tau_t)$, $\gamma_t\mid\tau_t\sim\mathrm{TN}(0,\sigma^2/\tau_t;(0,\infty))$, and $\tau_t\sim\mathrm{Gamma}(\varsigma/2,\varsigma/2)$. The observed data are $y_{\mathrm{observed}}=\{u_{p+1},\dots,u_R\}$ and the missing (unobserved) data are $y_{\mathrm{missing}}=\{c_{p+1},\dots,c_R\}$, with $c_i=(\gamma_i,\tau_i)$ and $y_{\mathrm{complete}}=(y_{\mathrm{observed}},y_{\mathrm{missing}})$ being the complete observations. In such conditions, the logarithmic likelihood function of the complete data for $\Theta=(\beta^\top,\sigma^2,\delta_\lambda,\varsigma)^\top$ is given by
$$\ell_{\mathrm{complete}}(\Theta;Y_{\mathrm{complete}})=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\tau_i-\frac{u_i^2\tau_i}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda u_i\gamma_i\tau_i}{(1-\delta_\lambda^2)\sigma^2}-\frac{\tau_i\gamma_i^2}{2(1-\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\ln(\tau_i)\Big),$$
where $\delta_\lambda=\lambda/(1+\lambda^2)^{1/2}$. For the E step, we obtain, given $\widehat\Theta^{(r)}$, the Q-function defined as
(3) $$Q_\Theta=E\big(\ell_{\mathrm{complete}}(\Theta;Y_{\mathrm{complete}})\mid y_{\mathrm{observed}},\widehat\Theta^{(r)}\big)=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\hat{s}^{(r)}_{1i}-\frac{u_i^2\hat{s}^{(r)}_{1i}}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda u_i\hat{s}^{(r)}_{2i}}{(1-\delta_\lambda^2)\sigma^2}-\frac{\hat{s}^{(r)}_{3i}}{2(1-\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\hat{s}^{(r)}_{4i}\Big),$$
with
$$\hat{s}^{(r)}_{1i}=E\big(\tau_i\mid y_{\mathrm{observed}},\widehat\Theta^{(r)}\big)=\Big(\frac{\hat\varsigma^{(r)}+1}{\hat\varsigma^{(r)}+\hat\eta_i^{2(r)}}\Big)\frac{R_{\hat\varsigma^{(r)}+3}\big(\hat{T}_i^{(r)}\big(\tfrac{\hat\varsigma^{(r)}+3}{\hat\varsigma^{(r)}+1}\big)^{1/2}\big)}{R_{\hat\varsigma^{(r)}+1}\big(\hat{T}_i^{(r)}\big)},$$
$$\hat{s}^{(r)}_{2i}=E\big(\gamma_i\tau_i\mid y_{\mathrm{observed}},\widehat\Theta^{(r)}\big)=\hat\delta_\lambda^{(r)}\hat{u}_i^{(r)}\hat{s}^{(r)}_{1i}+\frac{\big(1-\hat\delta_\lambda^{2(r)}\big)^{1/2}}{\pi\hat{f}_i^{(r)}\big(\hat{u}_i^{(r)}\big)}\Big(\frac{\hat\eta_i^{2(r)}}{\hat\varsigma^{(r)}\big(1-\hat\delta_\lambda^{2(r)}\big)}+1\Big)^{-\frac{\hat\varsigma^{(r)}+2}{2}},$$
$$\hat{s}^{(r)}_{3i}=E\big(\gamma_i^2\tau_i\mid y_{\mathrm{observed}},\widehat\Theta^{(r)}\big)=\hat\delta_\lambda^{2(r)}\hat{u}_i^{2(r)}\hat{s}^{(r)}_{1i}+\big(1-\hat\delta_\lambda^{2(r)}\big)\hat\sigma^{2(r)}+\hat\delta_\lambda^{(r)}\hat{u}_i^{(r)}\frac{\big(1-\hat\delta_\lambda^{2(r)}\big)^{1/2}}{\pi\hat{f}_i^{(r)}\big(\hat{u}_i^{(r)}\big)}\Big(\frac{\hat\eta_i^{2(r)}}{\hat\varsigma^{(r)}\big(1-\hat\delta_\lambda^{2(r)}\big)}+1\Big)^{-\frac{\hat\varsigma^{(r)}+2}{2}},$$
$$\hat{s}^{(r)}_{4i}=E\big(\ln(\tau_i)\mid y_{\mathrm{observed}},\widehat\Theta^{(r)}\big)=G\Big(\frac{\hat\varsigma^{(r)}+1}{2}\Big)+\frac{\hat\varsigma^{(r)}+1}{\hat\varsigma^{(r)}+\hat\eta_i^{2(r)}}\Bigg(\frac{R_{\hat\varsigma^{(r)}+3}\big(\hat{T}_i^{(r)}\big(\tfrac{\hat\varsigma^{(r)}+3}{\hat\varsigma^{(r)}+1}\big)^{1/2}\big)}{R_{\hat\varsigma^{(r)}+1}\big(\hat{T}_i^{(r)}\big)}-1\Bigg)-\ln\Big(\frac{\hat\eta_i^{2(r)}+\hat\varsigma^{(r)}}{2}\Big)+\frac{\hat\delta_\lambda^{(r)}\hat\eta_i^{(r)}\big(\hat\eta_i^{2(r)}-1\big)}{\big((\hat\varsigma^{(r)}+1)(\hat\varsigma^{(r)}+\hat\eta_i^{2(r)})^3\big)^{1/2}}\,\frac{t_{\hat\varsigma^{(r)}+1}\big(\hat{T}_i^{(r)}\big)}{R_{\hat\varsigma^{(r)}+1}\big(\hat{T}_i^{(r)}\big)}+\frac{1}{R_{\hat\varsigma^{(r)}+1}\big(\hat{T}_i^{(r)}\big)}\int_{-\infty}^{\hat{T}_i^{(r)}}\hat{g}_{\hat\varsigma^{(r)}}(x)\,t_{\hat\varsigma^{(r)}+1}(x)\,\mathrm{d}x,$$
where
$$\hat{u}_i^{(r)}=y_i-x_i^\top\hat\beta^{(r)},\quad \hat\eta_i^{(r)}=\frac{\hat{u}_i^{(r)}}{\hat\sigma^{(r)}},\quad \hat\delta_\lambda^{(r)}=\frac{\hat\lambda^{(r)}}{(1+\hat\lambda^{2(r)})^{1/2}},\quad \hat{T}_i^{(r)}=\hat\lambda^{(r)}\hat\eta_i^{(r)}\Big(\frac{\hat\varsigma^{(r)}+1}{\hat\varsigma^{(r)}+\hat\eta_i^{2(r)}}\Big)^{1/2},$$
$$\hat{f}_i^{(r)}\big(\hat{u}_i^{(r)}\big)=\frac{2}{\hat\sigma^{(r)}}\,t_{\hat\varsigma^{(r)}}\big(\hat\eta_i^{(r)}\big)\,R_{\hat\varsigma^{(r)}+1}\big(\hat{T}_i^{(r)}\big),\quad G(x)=\frac{\Gamma'(x)}{\Gamma(x)},\quad \hat{g}_{\hat\varsigma^{(r)}}(x)=G\Big(\frac{\hat\varsigma^{(r)}+2}{2}\Big)-G\Big(\frac{\hat\varsigma^{(r)}+1}{2}\Big)-\ln\Big(1+\frac{x^2}{\hat\varsigma^{(r)}+1}\Big)+\frac{x^2-1}{\hat\varsigma^{(r)}+1+x^2},$$
with $R_\varsigma$ denoting the Student-t CDF with $\varsigma$ degrees of freedom (written $T_\varsigma$ in (2)). For the M step, we update $\widehat\Theta^{(r)}$ by solving $\dot{Q}_{\widehat\Theta^{(r+1)}}=0$ with an iterative Newton-Raphson algorithm, where $\ddot{Q}$ and $\dot{Q}$ denote the Hessian matrix and gradient vector of the Q-function. Then, the update is $\widehat\Theta^{(r+1)}=\widehat\Theta^{(r)}-\ddot{Q}_{\widehat\Theta^{(r)}}^{-1}\dot{Q}_{\widehat\Theta^{(r)}}$. Under mild conditions, and with an appropriate starting value $\widehat\Theta^{(0)}$, which as mentioned can be taken from the fit under the normal distribution, $\widehat\Theta^{(r)}$ converges to the ML estimate $\widehat\Theta$.
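The EM iteration just described can be organized as in the following illustrative Python sketch, where the E-step expectations $\hat{s}_{1i},\dots,\hat{s}_{4i}$ and the M-step Newton update are left as user-supplied callables because their formulas are exactly those displayed above; all names are assumptions for exposition and do not correspond to the Matlab implementation used in Sections 4 and 5.

```python
import numpy as np

def em_star(y, p, e_step, m_step, theta0, tol=1e-5, max_iter=500):
    """Generic EM loop for the STAR(p) model.

    e_step(theta, y, p)    -> conditional expectations s1_i, ..., s4_i
    m_step(s, theta, y, p) -> updated parameters (e.g., one Newton step on Q)
    theta0                 -> starting value, e.g. the fit under normal errors
    """
    theta = np.asarray(theta0, dtype=float)
    for _ in range(max_iter):
        s = e_step(theta, y, p)             # E step
        theta_new = m_step(s, theta, y, p)  # M step
        if np.linalg.norm(theta_new - theta) < tol:
            return theta_new
        theta = theta_new
    return theta
```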

Note that there is an alternative approach based on reparametrizing and estimating the parameters in regression models; see [Citation42]. Thus, a reparametrization could also be used to find closed-form expressions for the estimators of the parameters of the STAR(p) model.

2.3. Observed information matrix

We calculate the observed information matrix $\ddot{Q}_\Theta$ starting from
$$\ell_{\mathrm{complete}}(\Theta;Y_{\mathrm{complete}})=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\tau_i-\frac{u_i^2\tau_i}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda u_i\gamma_i\tau_i}{(1-\delta_\lambda^2)\sigma^2}-\frac{\tau_i\gamma_i^2}{2(1-\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\ln(\tau_i)\Big)$$
and
(4) $$Q_\Theta=E\big(\ell_{\mathrm{complete}}(\Theta;Y_{\mathrm{complete}})\mid y_{\mathrm{observed}},\widehat\Theta\big)=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\hat{s}_{1i}-\frac{u_i^2\hat{s}_{1i}}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda u_i\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}-\frac{\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\hat{s}_{4i}\Big),$$
where $u_i=y_i-x_i^\top\beta$ and
(5) $\hat{s}_{1i}=E(\tau_i\mid y_{\mathrm{observed}},\widehat\Theta)=\Big(\dfrac{\hat\varsigma+1}{\hat\varsigma+\hat\eta_i^2}\Big)\dfrac{R_{\hat\varsigma+3}\big(\hat{T}_i\big(\tfrac{\hat\varsigma+3}{\hat\varsigma+1}\big)^{1/2}\big)}{R_{\hat\varsigma+1}(\hat{T}_i)},$
(6) $\hat{s}_{2i}=E(\gamma_i\tau_i\mid y_{\mathrm{observed}},\widehat\Theta)=\hat\delta_\lambda\hat{u}_i\hat{s}_{1i}+\dfrac{(1-\hat\delta_\lambda^2)^{1/2}}{\pi\hat{f}_i(\hat{u}_i)}\Big(\dfrac{\hat\eta_i^2}{\hat\varsigma(1-\hat\delta_\lambda^2)}+1\Big)^{-\frac{\hat\varsigma+2}{2}},$
(7) $\hat{s}_{3i}=E(\gamma_i^2\tau_i\mid y_{\mathrm{observed}},\widehat\Theta)=\hat\delta_\lambda^2\hat{u}_i^2\hat{s}_{1i}+(1-\hat\delta_\lambda^2)\hat\sigma^2+\hat\delta_\lambda\hat{u}_i\dfrac{(1-\hat\delta_\lambda^2)^{1/2}}{\pi\hat{f}_i(\hat{u}_i)}\Big(\dfrac{\hat\eta_i^2}{\hat\varsigma(1-\hat\delta_\lambda^2)}+1\Big)^{-\frac{\hat\varsigma+2}{2}},$
(8) $\hat{s}_{4i}=E(\ln(\tau_i)\mid y_{\mathrm{observed}},\widehat\Theta)=G\Big(\dfrac{\hat\varsigma+1}{2}\Big)+\dfrac{\hat\varsigma+1}{\hat\varsigma+\hat\eta_i^2}\Bigg(\dfrac{R_{\hat\varsigma+3}\big(\hat{T}_i\big(\tfrac{\hat\varsigma+3}{\hat\varsigma+1}\big)^{1/2}\big)}{R_{\hat\varsigma+1}(\hat{T}_i)}-1\Bigg)-\ln\Big(\dfrac{\hat\eta_i^2+\hat\varsigma}{2}\Big)+\dfrac{\hat\delta_\lambda\hat\eta_i(\hat\eta_i^2-1)}{\big((\hat\varsigma+1)(\hat\varsigma+\hat\eta_i^2)^3\big)^{1/2}}\,\dfrac{t_{\hat\varsigma+1}(\hat{T}_i)}{R_{\hat\varsigma+1}(\hat{T}_i)}+\dfrac{1}{R_{\hat\varsigma+1}(\hat{T}_i)}\displaystyle\int_{-\infty}^{\hat{T}_i}\hat{g}_{\hat\varsigma}(x)\,t_{\hat\varsigma+1}(x)\,\mathrm{d}x,$
with $\hat{u}_i=y_i-x_i^\top\hat\beta$, $\hat\eta_i=\hat{u}_i/\hat\sigma$, $\hat\delta_\lambda=\hat\lambda/(1+\hat\lambda^2)^{1/2}$, $\hat{T}_i=\hat\lambda\hat\eta_i\big(\tfrac{\hat\varsigma+1}{\hat\varsigma+\hat\eta_i^2}\big)^{1/2}$, $\hat{f}_i(\hat{u}_i)=\tfrac{2}{\hat\sigma}t_{\hat\varsigma}(\hat\eta_i)R_{\hat\varsigma+1}(\hat{T}_i)$, and $\hat{g}_{\hat\varsigma}(x)=G\big(\tfrac{\hat\varsigma+2}{2}\big)-G\big(\tfrac{\hat\varsigma+1}{2}\big)-\ln\big(1+\tfrac{x^2}{\hat\varsigma+1}\big)+\tfrac{x^2-1}{\hat\varsigma+1+x^2}$.

Theorem 1

For the STAR model, the $(p+3)\times(p+3)$ observed Fisher information matrix $\ddot{Q}_{\widehat\Theta}$, with $\widehat\Theta=(\hat\beta^\top,\hat\sigma^2,\hat\delta_\lambda,\hat\varsigma)^\top$, is obtained; its diagonal and off-diagonal blocks are provided in the appendix.

3. Diagnostic analysis

We obtain our normal curvatures for the influence diagnostic analysis considering the following perturbation strategies: case-weights, data, variance and skewness.

3.1. Influence diagnostics

For the STAR model postulated in (1), we use the logarithmic likelihood function of the complete data, $\ell(\Theta;Y_{\mathrm{complete}})$, where $\Theta$ is a $(p+3)\times1$ parameter vector. We consider a minor modification represented by a $q\times1$ perturbation vector $\omega=(\omega_1,\dots,\omega_q)^\top$ belonging to $\Omega\subseteq\mathbb{R}^q$, and $\ell(\Theta;\omega;Y_{\mathrm{complete}})$ is the logarithmic likelihood function of the complete data under $\omega$. Consider $\omega_0$ as the $q\times1$ non-perturbation vector satisfying $\ell(\Theta;Y_{\mathrm{complete}})=\ell(\Theta;\omega_0;Y_{\mathrm{complete}})$; this vector can be $\omega_0=(0,\dots,0)^\top$, $\omega_0=(1,\dots,1)^\top$ or another properly chosen alternative, and the dimension $q$ depends on the perturbation strategy adopted. Denote by $\widehat\Theta$ and $\widehat\Theta_\omega$ the ML estimates for the postulated and perturbed models, respectively. Then, as in [Citation13,Citation14], we compare $\widehat\Theta$ and $\widehat\Theta_\omega$ using the influence measure named Q-displacement and derive the normal curvature at a $q\times1$ vector $l$ (with $\|l\|=1$) stated as
(9) $C_l=2\,\big|l^\top\big(\Delta^\top\ddot{Q}^{-1}\Delta\big)l\big|,$
with the $(p+3)\times(p+3)$ matrix $\ddot{Q}=\partial^2Q_\Theta/\partial\Theta\,\partial\Theta^\top$, evaluated at $\Theta=\widehat\Theta$, and the $(p+3)\times q$ matrix $\Delta=\partial^2Q_{\Theta;\omega}/\partial\Theta\,\partial\omega^\top$, evaluated at $\Theta=\widehat\Theta$ and $\omega=\omega_0$. We use the expression given in (9) for each of the proposed perturbation strategies by calculating $\ddot{Q}$ and $\Delta$, whose derivatives are presented in the appendix. Poon and Poon [Citation41] proposed the conformal normal curvature to classify an observation as potentially influential. Note that, for linear regression models but not restricted to them, [Citation9] assessed the influence of $\omega$ using the influence measure named likelihood displacement (LD); a high value of LD indicates that the ML estimates $\widehat\Theta$ and $\widehat\Theta_\omega$ differ considerably. For details, see [Citation26,Citation27,Citation31,Citation32,Citation34,Citation41].
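Given $\ddot{Q}$ and $\Delta$ evaluated at $\widehat\Theta$ and $\omega_0$, the curvature in (9) and the direction of maximum curvature reduce to a few lines of linear algebra; the sketch below is an illustrative Python rendering (not the Matlab implementation used for the numerical results) with assumed function names.

```python
import numpy as np

def normal_curvature(Q_hess, Delta, direction):
    """C_l = 2 | l' (Delta' Q_hess^{-1} Delta) l |, as in (9)."""
    l = np.asarray(direction, dtype=float)
    l = l / np.linalg.norm(l)                       # enforce ||l|| = 1
    B = Delta.T @ np.linalg.solve(Q_hess, Delta)    # Delta' Q_hess^{-1} Delta
    return 2.0 * abs(l @ B @ l)

def max_curvature_direction(Q_hess, Delta):
    """Eigenvector linked to the largest eigenvalue of -Delta' Q_hess^{-1} Delta;
    the position of its largest absolute entry flags the case treated as
    potentially influential (cf. step [S4] in Section 4.2)."""
    B = -Delta.T @ np.linalg.solve(Q_hess, Delta)
    _, eigvec = np.linalg.eigh(B)                   # eigenvalues in ascending order
    return eigvec[:, -1]
```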

3.2. Perturbation strategies

3.2.1. Perturbation of case-weights

As in [Citation32], we make a minor perturbation on the residual of the STAR model by using $\omega_iu_i=\omega_i(y_i-x_i^\top\beta)$ instead of $u_i=y_i-x_i^\top\beta$, where $\omega_i$ is the modification. We consider the $(R-p)\times1$ vectors $\omega=(\omega_{p+1},\dots,\omega_R)^\top$ and $\omega_0=(1,\dots,1)^\top$. Then, the expression for $\ell_{\mathrm{complete}}(\Theta;\omega;Y_{\mathrm{complete}})$ is given by
$$\ell_{\mathrm{complete}}(\Theta;\omega;Y_{\mathrm{complete}})=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\tau_i-\frac{\omega_i^2u_i^2\tau_i}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda\omega_iu_i\gamma_i\tau_i}{(1-\delta_\lambda^2)\sigma^2}-\frac{\tau_i\gamma_i^2}{2(1-\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\ln(\tau_i)\Big).$$
Thus, we express the perturbed Q-function by means of
(10) $$Q_{\Theta;\omega}=E\big(\ell_{\mathrm{complete}}(\Theta;\omega;Y_{\mathrm{complete}})\mid y_{\mathrm{observed}},\widehat\Theta\big)=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\hat{s}_{1i}-\frac{\omega_i^2u_i^2\hat{s}_{1i}}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda\omega_iu_i\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}-\frac{\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\hat{s}_{4i}\Big),$$
where $u_i=y_i-x_i^\top\beta$ and $\hat{s}_{1i},\hat{s}_{2i},\hat{s}_{3i},\hat{s}_{4i}$ are as given in (5)-(8).

Theorem 2

For the case-weights perturbation strategy, we get the $(p+3)\times(R-p)$ matrix whose $i$-th column, for $i\in\{p+1,\dots,R\}$, is given by
(11) $$\Delta=\frac{\partial^2Q_{\Theta;\omega}}{\partial\Theta\,\partial\omega^\top}=\begin{pmatrix}\dfrac{2(y_i-x_i^\top\hat\beta)\hat{s}_{1i}-\hat\delta_\lambda\hat{s}_{2i}}{(1-\hat\delta_\lambda^2)\hat\sigma^2}\,x_i\\[2ex]\dfrac{(y_i-x_i^\top\hat\beta)^2\hat{s}_{1i}-\hat\delta_\lambda(y_i-x_i^\top\hat\beta)\hat{s}_{2i}}{(1-\hat\delta_\lambda^2)\hat\sigma^4}\\[2ex]\dfrac{-2\hat\delta_\lambda(y_i-x_i^\top\hat\beta)^2\hat{s}_{1i}+(\hat\delta_\lambda^2+1)(y_i-x_i^\top\hat\beta)\hat{s}_{2i}}{(1-\hat\delta_\lambda^2)^2\hat\sigma^2}\\[2ex]0\end{pmatrix},$$
evaluated at $\Theta=\widehat\Theta$ and $\omega=\omega_0$, where $\hat{s}_{1i},\hat{s}_{2i},\hat{s}_{3i},\hat{s}_{4i}$ are as given in (5)-(8).

3.2.2. Perturbation of data

As in [Citation32], we perturb the data by replacing $y_i$ with $y_i+\omega_i$. Let $\omega=(\omega_{p+1},\dots,\omega_R)^\top$ and $\omega_0=(0,\dots,0)^\top$ be $(R-p)\times1$ vectors. For the response perturbation, $y_i+\omega_i=\beta_1(y_{i-1}+\omega_{i-1})+\cdots+\beta_p(y_{i-p}+\omega_{i-p})+u_i+\mu(\omega_i)$, where $u_i=y_i-x_i^\top\beta$ and $\mu(\omega_i)=\omega_i-\beta_1\omega_{i-1}-\cdots-\beta_p\omega_{i-p}$, so that we reach
$$\ell_{\mathrm{complete}}(\Theta;\omega;Y_{\mathrm{complete}})=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\tau_i-\frac{(u_i+\mu(\omega_i))^2\tau_i}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda(u_i+\mu(\omega_i))\gamma_i\tau_i}{(1-\delta_\lambda^2)\sigma^2}-\frac{\tau_i\gamma_i^2}{2(1-\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\ln(\tau_i)\Big).$$
Thus, we get the perturbed Q-function by means of
(12) $$Q_{\Theta;\omega}=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\hat{s}_{1i}-\frac{(u_i+\mu(\omega_i))^2\hat{s}_{1i}}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda(u_i+\mu(\omega_i))\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}-\frac{\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\hat{s}_{4i}\Big),$$
where $u_i=y_i-x_i^\top\beta$ and $\hat{s}_{1i},\hat{s}_{2i},\hat{s}_{3i},\hat{s}_{4i}$ are as given in (5)-(8).

Theorem 3

For the data perturbation strategy, we get the $(p+3)\times(R-p)$ matrix whose $i$-th column is given by
(13) $$\Delta=\frac{\partial^2Q_{\Theta;\omega}}{\partial\Theta\,\partial\omega^\top}=\begin{pmatrix}\dfrac{\hat{s}_{1i}}{(1-\hat\delta_\lambda^2)\hat\sigma^2}\,x_i\\[2ex]\dfrac{(y_i-x_i^\top\hat\beta)\hat{s}_{1i}-\hat\delta_\lambda\hat{s}_{2i}}{(1-\hat\delta_\lambda^2)\hat\sigma^4}\\[2ex]\dfrac{(\hat\delta_\lambda^2+1)\hat{s}_{2i}-2\hat\delta_\lambda(y_i-x_i^\top\hat\beta)\hat{s}_{1i}}{(1-\hat\delta_\lambda^2)^2\hat\sigma^2}\\[2ex]0\end{pmatrix},$$
evaluated at $\Theta=\widehat\Theta$ and $\omega=\omega_0$, where $\hat{s}_{1i},\hat{s}_{2i},\hat{s}_{3i},\hat{s}_{4i}$ are as given in (5)-(8).

3.2.3. Perturbation of variance

As in [Citation32], we replace the variance $\sigma^2$ by $\omega_i^{-1}\sigma^2$, that is, $u_i\sim\mathrm{ST}(0,\omega_i^{-1}\sigma^2,\lambda,\varsigma)$. Consider the $(R-p)\times1$ vectors $\omega=(\omega_{p+1},\dots,\omega_R)^\top$ and $\omega_0=(1,\dots,1)^\top$. Thus, we reach
$$\ell_{\mathrm{complete}}(\Theta;\omega;Y_{\mathrm{complete}})=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\tau_i-\frac{\omega_iu_i^2\tau_i}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda\omega_iu_i\gamma_i\tau_i}{(1-\delta_\lambda^2)\sigma^2}-\frac{\omega_i\tau_i\gamma_i^2}{2(1-\delta_\lambda^2)\sigma^2}-\ln\Big(\frac{\sigma^2}{\omega_i}\Big)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\ln(\tau_i)\Big).$$
Then, we get the perturbed Q-function by means of
(14) $$Q_{\Theta;\omega}=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\hat{s}_{1i}-\frac{\omega_iu_i^2\hat{s}_{1i}}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda\omega_iu_i\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}-\frac{\omega_i\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^2}-\ln\Big(\frac{\sigma^2}{\omega_i}\Big)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\hat{s}_{4i}\Big),$$
where $u_i=y_i-x_i^\top\beta$ and $\hat{s}_{1i},\hat{s}_{2i},\hat{s}_{3i},\hat{s}_{4i}$ are as given in (5)-(8).

Theorem 4

For the variance perturbation strategy, we attain the $(p+3)\times(R-p)$ matrix whose $i$-th column is expressed as
(15) $$\Delta=\frac{\partial^2Q_{\Theta;\omega}}{\partial\Theta\,\partial\omega^\top}\bigg|_{\Theta=\widehat\Theta,\ \omega=\omega_0}=\begin{pmatrix}\dfrac{(y_i-x_i^\top\hat\beta)\hat{s}_{1i}-\hat\delta_\lambda\hat{s}_{2i}}{(1-\hat\delta_\lambda^2)\hat\sigma^2}\,x_i\\[2ex]\dfrac{(y_i-x_i^\top\hat\beta)^2\hat{s}_{1i}-2\hat\delta_\lambda(y_i-x_i^\top\hat\beta)\hat{s}_{2i}+\hat{s}_{3i}}{2(1-\hat\delta_\lambda^2)\hat\sigma^4}\\[2ex]\dfrac{-\hat\delta_\lambda(y_i-x_i^\top\hat\beta)^2\hat{s}_{1i}+(\hat\delta_\lambda^2+1)(y_i-x_i^\top\hat\beta)\hat{s}_{2i}-\hat\delta_\lambda\hat{s}_{3i}}{(1-\hat\delta_\lambda^2)^2\hat\sigma^2}\\[2ex]0\end{pmatrix},$$
where $\hat{s}_{1i},\hat{s}_{2i},\hat{s}_{3i},\hat{s}_{4i}$ are as given in (5)-(8).

3.2.4. Perturbation of skewness

In particular, due to the skewed nature of our proposed model, we study its impact by making a small modification in $\lambda$, that is, changing $\delta_\lambda$ to $\omega_i^{1/2}\delta_\lambda$. Consider the $(R-p)\times1$ vectors $\omega=(\omega_{p+1},\dots,\omega_R)^\top$ and $\omega_0=(1,\dots,1)^\top$. Then, we get
$$\ell_{\mathrm{complete}}(\Theta;\omega;Y_{\mathrm{complete}})=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\tau_i-\frac{u_i^2\tau_i}{2(1-\omega_i\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda\omega_i^{1/2}u_i\gamma_i\tau_i}{(1-\omega_i\delta_\lambda^2)\sigma^2}-\frac{\tau_i\gamma_i^2}{2(1-\omega_i\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\omega_i\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\ln(\tau_i)\Big).$$
Hence, the perturbed Q-function is given by
(16) $$Q_{\Theta;\omega}=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\hat{s}_{1i}-\frac{u_i^2\hat{s}_{1i}}{2(1-\omega_i\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda\omega_i^{1/2}u_i\hat{s}_{2i}}{(1-\omega_i\delta_\lambda^2)\sigma^2}-\frac{\hat{s}_{3i}}{2(1-\omega_i\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\omega_i\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\hat{s}_{4i}\Big),$$
where $u_i=y_i-x_i^\top\beta$ and $\hat{s}_{1i},\hat{s}_{2i},\hat{s}_{3i},\hat{s}_{4i}$ are as given in (5)-(8).

Theorem 5

For the skewness perturbation strategy, we get the $(p+3)\times(R-p)$ matrix whose $i$-th column, writing $\hat{u}_i=y_i-x_i^\top\hat\beta$, is formulated as
(17) $$\Delta=\frac{\partial^2Q_{\Theta;\omega}}{\partial\Theta\,\partial\omega^\top}\bigg|_{\Theta=\widehat\Theta,\ \omega=\omega_0}=\begin{pmatrix}\Big(\dfrac{\hat\delta_\lambda^2\hat{u}_i\hat{s}_{1i}-\hat\delta_\lambda^3\hat{s}_{2i}}{(1-\hat\delta_\lambda^2)^2\hat\sigma^2}-\dfrac{\hat\delta_\lambda\hat{s}_{2i}}{2(1-\hat\delta_\lambda^2)\hat\sigma^2}\Big)x_i\\[2ex]\dfrac{\hat\delta_\lambda^2\hat{u}_i^2\hat{s}_{1i}-\hat\delta_\lambda(\hat\delta_\lambda^2+1)\hat{u}_i\hat{s}_{2i}+\hat\delta_\lambda^2\hat{s}_{3i}}{2(1-\hat\delta_\lambda^2)^2\hat\sigma^4}\\[2ex]\dfrac{-2\hat\delta_\lambda^3\hat{u}_i^2\hat{s}_{1i}+4\hat\delta_\lambda^4\hat{u}_i\hat{s}_{2i}-2\hat\delta_\lambda^3\hat{s}_{3i}}{(1-\hat\delta_\lambda^2)^3\hat\sigma^2}+\dfrac{\hat{u}_i\hat{s}_{2i}+2\hat\delta_\lambda\hat\sigma^2}{2(1-\hat\delta_\lambda^2)\hat\sigma^2}+\dfrac{-\hat\delta_\lambda\hat{u}_i^2\hat{s}_{1i}+4\hat\delta_\lambda^2\hat{u}_i\hat{s}_{2i}-\hat\delta_\lambda\hat{s}_{3i}+\hat\delta_\lambda^3\hat\sigma^2}{(1-\hat\delta_\lambda^2)^2\hat\sigma^2}\\[2ex]0\end{pmatrix},$$
where $\hat{s}_{1i},\hat{s}_{2i},\hat{s}_{3i},\hat{s}_{4i}$ are as given in (5)-(8).

3.3. Benchmark of influential observations

To assess whether a case is influential, we use the benchmark $1/q+c\,\mathrm{SD}_{M(0)}$, where $q=n-p$, $n$ is the sample size, $c$ is a pre-chosen positive constant and $\mathrm{SD}_{M(0)}$ is the sample standard deviation (SD) of the values $M(0)_s$, for $s\in\{1,\dots,q\}$; see [Citation41]. Then, if a diagnostic value is greater than $1/q+c\,\mathrm{SD}_{M(0)}$, we identify the corresponding case as influential.
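In code, this benchmark amounts to a single comparison. The following illustrative Python sketch (function name assumed) flags case $i$ when its diagnostic value $M(0)_i$ exceeds $1/q+c\,\mathrm{SD}_{M(0)}$, with $c$ chosen by the analyst ($c=3$ is used in Section 5.2); the vector of $M(0)$ values is assumed to have been computed beforehand.

```python
import numpy as np

def influential_cases(M0, c=3.0):
    """Flag indices whose diagnostic value exceeds 1/q + c * SD of M(0)."""
    M0 = np.asarray(M0, dtype=float)
    q = M0.size                                # q = n - p perturbed cases
    benchmark = 1.0 / q + c * M0.std(ddof=1)   # sample SD of the M(0)_s values
    return np.nonzero(M0 > benchmark)[0], benchmark
```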

4. Numerical simulation

We present five simulation studies to examine our estimators and diagnostic results. The results in Sections 4 and 5 are calculated with the software Matlab [Citation44].

4.1. EM algorithm

The first simulation study assesses the EM estimation described in Section 2. For the STAR(p) model with $p\in\{1,2,3\}$, we choose sample sizes $n$ in $\{250,500,1000\}$ and parameters $\lambda\in\{0.2,0.15,0.1\}$ and $\varsigma\in\{3,4,5\}$. The results are presented in Tables 1 and 2. Note that the mean and median of the estimates are consistent with the true values. In addition, the standard errors (SEs) are small, indicating that the estimates are stable. In short, the results support the suitability of our proposal.

Table 1. Values of the SE, median, and mean with n{250,500,1000}, λ{0.1,0.15,0.2} and ς{3,4,5} using the STAR model.

Table 2. Values of the SE, median, and mean with n{250,500,1000}, λ=0.1 and ς{3,4,5} using the STAR model.

4.2. Influence diagnostic analysis

The second study proceeds through the following steps:

[S1]

The STAR(1) model is $y_t=u_t+\beta y_{t-1}$, where $u_t\sim\mathrm{ST}(0,\sigma^2,\lambda,\varsigma)$, with $\varsigma=3$, $\lambda=0.2$, $\beta=0.12$, $\sigma^2=0.1$. Note that this AR(1) model is stationary according to Lemma 1.

[S2]

The model in [S1] is used to generate 1000 samples of 400 observations each.

[S3]

A perturbation scalar $\epsilon$ is added to the 200th observation of each sample obtained in [S2]. That is, $y_t$ is perturbed to $y_t+\epsilon$, where $\epsilon\in\{0,0.2,0.4,\dots,2\}$ and $t=200$, so that 1000 samples are obtained in which the 200th observation of each sample is a possible influential observation.

[S4]

Our influence diagnostics are used to detect influential observations in each of the 1000 samples obtained in [S3]. Under each perturbation strategy, 1000 diagnostic values are calculated by inspecting the eigenvector linked to the largest eigenvalue. Then, the index of the largest element in absolute value of this vector is registered as an influential observation.

[S5]

The performance of our diagnostics is examined in terms of how many of the 1000 samples in [S3] fulfill the above criterion as $\epsilon$ runs from 0 to 2, that is, the number of times the 200th position is identified is counted. The number of correct identifications versus $\epsilon$ for the four schemes is displayed in Figure 1 (left).

Figure 1. Number of correct identifications related to the four perturbation strategies for the STAR(1) (left) and STAR(2) (right) models.


From Figure 1 (left), we see clearly that the number of samples with correct identification of the influential observation increases with the size of the perturbation under all four strategies. When $\epsilon=2$, the differences among the four strategies become apparent: out of 1000 samples, the influential observation is correctly identified 634, 658, 642 and 269 times under the four schemes, respectively. These results are as expected.

Next, we further study the STAR(2) model stated as $y_t=\beta_1y_{t-1}+\beta_2y_{t-2}+u_t$, where $u_t\sim\mathrm{ST}(0,\sigma^2,\lambda,\varsigma)$, with $\beta_1=0.15$, $\beta_2=0.2$, $\sigma^2=0.1$, $\lambda=0.2$, $\varsigma=3$. It is easy to verify that this AR(2) model is stationary. We apply the same five steps as for the AR(1) model to obtain Figure 1 (right).

From Figure 1 (right), we note that the number of samples with correct identification of the influential observation again increases with the size of the perturbation under the four schemes. When $\epsilon=2$, the four schemes become apparent: out of 1000 samples, the influential observation is correctly identified 525, 375, 561 and 277 times, respectively. From both figures, the pattern for the AR(1) model appears stronger than that for the AR(2) model. However, the diagnostic abilities of the four schemes differ, so they should be used in combination in practice.

4.3. Student-t versus Gaussian models

Liu et al. [Citation26] investigated a methodology to detect influential points in an AR model under the normal distribution. Here, we generate 1000 samples following the procedure provided in Section 4.2 (with $\epsilon=2$ in [S3]) based on the ST distribution with $\beta=0.12$, $\sigma^2=0.1$, $\lambda=0.2$, $\varsigma=3$, and then carry out an influence analysis under the ST and normal distributions to compare our results with those provided in [Citation26]. Using the method stated in [S5] of Section 4.2, the comparison is made in Table 3.

Table 3. Comparison between the ST and normal distributions with ε = 2.

There are both similarities and differences. The similarity is that, under both distributions, the numbers of correct detections are mostly large, namely 634, 658, 642 (ST) and 603, 596, 122 (normal) out of 1000, indicating that the diagnostics work well. However, there are also differences. In the case of $\epsilon=2$, the numbers of samples (out of 1000) with correctly identified influential points are 634 and 603 under the case-weights strategy; 658 and 596 under the data strategy; and 642 and 122 under the variance strategy. In terms of diagnostics, the ST model is better than the normal model. Utilizing a correctly specified model when analyzing data is particularly important.

4.4. Skew-student-t versus student-t models

Liu et al. [Citation34] diagnosed influential points in an AR model with the t distribution. We generate 1000 samples following the procedure provided in Section 4.2 (with $\epsilon=2$ in [S3]) based on the ST distribution with $\beta=0.12$, $\sigma^2=0.1$, $\lambda=0.2$, $\varsigma=3$ to evaluate the performance of the local influence analysis under the ST and t distributions. The comparison is reported in Table 4. The numbers of samples in which the influential observation is detected are all large, namely 634, 658 (ST) and 637, 613 (t) out of 1000, implying that the method works well. There are, however, differences between the ST and t distributions. In the case of $\epsilon=2$, the numbers of samples with correctly identified influential observations are 634 and 637 under the case-weights strategy, and 658 and 613 under the data strategy, both out of 1000 samples. The diagnostic effect under the ST distribution is better than under the t distribution.

Table 4. Comparison between the ST and t distributions with ε = 2.

4.5. Skew-student-t versus skew-normal models

Liu et al. [Citation31,Citation32] detected influential observations in an SNAR model. We generate 1000 samples following the procedure provided in Section 4.2 (with $\epsilon=2$ in [S3]) based on the ST distribution with $\beta=0.12$, $\sigma^2=0.1$, $\lambda=0.2$, $\varsigma=3$, and then apply a local influence analysis under the ST and SN distributions. The comparison is provided in Table 5.

Table 5. Comparison between the ST and SN distributions with ϵ=2.

The similarities are noticeable in the local diagnostic study under the ST and SN models, as the numbers of samples with correctly identified influential points are large for most perturbation strategies: 634, 658, 642, 269 (ST) and 611, 609, 124, 46 (SN) out of 1000, meaning that the diagnostics are effective. However, there are also differences. In the case of $\epsilon=2$, these can be observed for the variance and skewness perturbation strategies, for which the numbers of correctly identified influential observations are 642 and 124 under the variance strategy, and 269 and 46 under the skewness strategy, both out of 1000 samples. The local influence analysis performs better assuming an ST distribution than an SN distribution, as shown in the previous cases.

5. Empirical analysis

In this section, we use the results derived in Section 3 to analyze financial data and discuss the performance of our proposed methodology. The Brent crude futures (BIPE hereafter) daily log-return data from 16 January 2007 to 11 March 2021 are chosen to construct an STAR model and perform our diagnostic analysis.

5.1. STAR model for BIPE

Figure 2 (left) shows the BIPE daily log-return time series from 16 January 2007 to 11 March 2021. An exploratory data analysis based on basic statistics of the daily financial returns is the following: n = 3442 (sample size); minimum and maximum returns of -0.30856 and 0.15449; first and third quartiles of -0.010586 and 0.011026; sample mean and median of 0.0000669 and 0.00013246; sample coefficient of skewness of -0.93637; and sample excess kurtosis of 18.812. Figures 2 (right) and 3 present a histogram and a kernel plot employing the t model for the BIPE daily data. From this exploratory data analysis, we detect an asymmetric distributional feature and a high kurtosis level in the data, and we observe that the adjustment with the t model is unsuitable. Furthermore, we use the D'Agostino skewness test [Citation10] and the Anscombe-Glynn kurtosis test [Citation1] to assess the skewness and kurtosis of our data; the D'Agostino statistic is -19.25 and the Anscombe-Glynn statistic is 27.754, with both p-values significantly less than 0.01. These results indicate that the distribution of BIPE is skewed, with an obvious peak and two fat tails. We first select an STAR model and determine its order using the following steps, as stated in [Citation31,Citation32,Citation47] for their t and SN models.
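Both tests are available in standard statistical software. As an illustrative sketch only (our analysis uses Matlab; in Python, scipy's skewtest implements D'Agostino's procedure and kurtosistest the Anscombe-Glynn one), the checks applied to the return vector can be coded as follows.

```python
import numpy as np
from scipy import stats

def asymmetry_checks(returns):
    """Summarize skewness/kurtosis and run the two tests used in the text."""
    returns = np.asarray(returns, dtype=float)
    skew_stat, skew_p = stats.skewtest(returns)       # D'Agostino skewness test
    kurt_stat, kurt_p = stats.kurtosistest(returns)   # Anscombe-Glynn kurtosis test
    return {"skewness": stats.skew(returns),
            "excess_kurtosis": stats.kurtosis(returns),
            "skewtest": (skew_stat, skew_p),
            "kurtosistest": (kurt_stat, kurt_p)}
```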

Figure 2. Time-series (left) and histogram with kernel PDF (right) of BIPE daily returns.


Figure 3. BIPE daily returns with t and ST PDFs.


[S1] Consider the sequence of AR(p) models
(18) $y_t=u_t+\beta_1y_{t-1},\quad\dots,\quad y_t=u_t+\beta_1y_{t-1}+\cdots+\beta_py_{t-p},$
with $p\in\{1,2,\dots\}$.

[S2] In the $i$th equation of (18), that is, the AR($i$) model, the ordinary least squares estimate of $\beta_j$ is $\hat\beta_j^{(i)}$, the residual is $\hat{u}_t^{(i)}=y_t-\hat\beta_1^{(i)}y_{t-1}-\cdots-\hat\beta_i^{(i)}y_{t-i}$, and the estimate of $\sigma_i^2$ is $\hat\sigma_i^2=\frac{1}{R-2i-1}\sum_{t=i+1}^{R}\big(\hat{u}_t^{(i)}\big)^2$.

[S3] The $(i-1)$th and $i$th equations in (18) are used to test whether the coefficient $\beta_i$ is equal to zero or not, that is, to compare the AR($i-1$) and AR($i$) models. The statistic for this hypothesis test is $T_i=-(R-i-2.5)\ln\big(\hat\sigma_i^2/\hat\sigma_{i-1}^2\big)$, which follows an asymptotic $\chi^2(1)$ distribution. Values of $T_i$ for $i\in\{1,\dots,7\}$ are presented in Table 6. As the 95% quantile of the $\chi^2(1)$ distribution is 3.841, we use the empirical values of $T_i$ presented in Table 6 to determine the order $p$ of the AR($p$) model. Hence, $p=1$ because $T_1=3.863>3.841$. Using the EM algorithm, we calculate $\widehat\Theta=(\hat\beta_1,\hat\sigma^2,\hat\lambda,\hat\varsigma)^\top=(0.07232,0.000201,0.024942,3)^\top$. Then, as the absolute value of $\hat\beta_1$ is less than one, the fitted structure is consistent with stationarity of the BIPE data. From Figure 3, we find that the ST distribution fits better than the t distribution. Thus, we fit the STAR(1) model $\hat{y}_t=0.07232\,y_{t-1}+\hat{u}_t$, with $\hat\sigma^2=0.000201$, $\hat\lambda=0.024942$, and $\hat\varsigma=3$. According to Lemma 1, the data fitted by the STAR(1) model are stationary.
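The order-selection recursion in [S1]-[S3] can be coded directly. The sketch below is illustrative only (ordinary least squares via numpy; the function name and the AR(0) baseline variance are assumptions) and returns the sequence of $T_i$ values to be compared with the $\chi^2(1)$ cutoff of 3.841, as in Table 6.

```python
import numpy as np

def ar_order_statistics(y, p_max=7):
    """Compute the statistics T_i of [S3], i = 1, ..., p_max, where sigma_i^2
    is the OLS residual variance of the AR(i) fit in [S2]; sigma_0^2 is taken
    as the sample variance (an assumption made here for the baseline)."""
    y = np.asarray(y, dtype=float)
    R = y.size
    sigma2 = [np.var(y, ddof=1)]                      # AR(0) baseline
    T = []
    for i in range(1, p_max + 1):
        X = np.column_stack([y[i - j - 1:R - j - 1] for j in range(i)])
        beta_hat = np.linalg.lstsq(X, y[i:], rcond=None)[0]
        resid = y[i:] - X @ beta_hat
        sigma2.append(resid @ resid / (R - 2 * i - 1))
        T.append(-(R - i - 2.5) * np.log(sigma2[i] / sigma2[i - 1]))
    return T   # compare each T_i with the 95% chi-square(1) quantile 3.841
```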

Table 6. Empirical values of Ti, with i{1,,7} for BIPE daily data.

5.2. Diagnostic analysis for BIPE

We conduct the local influence analysis in the STAR(1) model for BIPE under the four perturbation strategies proposed. Based on the idea given in Section 3.3 and the results in [Citation31,Citation32], we choose $c=3$ and $1/3441+3\,\mathrm{SD}_{M(0)}$ as the benchmark, obtaining the values 0.0021, 0.0022, 0.0015 and 0.0012 for the four respective perturbation strategies. In each plot of Figure 4, the red line marks the benchmark used to determine whether an observation is potentially influential, that is, when its value lies beyond the red line. The influential observations are listed in Table 7, where * denotes a case identified as potentially influential for BIPE. We can observe that these cases are mainly related to the 2008–2009 Great Recession, as well as to the 2020–2021 SARS-CoV-2 pandemic, which led to large volatility. For example, on 3 October 2008, the U.S. government signed a financial rescue plan totaling about 700 billion dollars. On 24 February 2020, global stock markets fell after the global number of COVID-19 cases increased significantly. In summary, Figure 4 and Table 7 justify the effectiveness and practicability of the diagnostic methods for an STAR model.

Figure 4. BIPE diagnostics under skewness, variance, case-weights, and data perturbations.


Table 7. Summary of the diagnostic analysis by perturbation strategy.

Finally, we compare in Table 8 the two fitted models: the AR(1) model with outliers and the STAR(1) model without outliers. The mean square errors of their predicted values are 0.0014874 and 0.0013266, respectively. It can be seen that the predictions obtained after removing the outliers are better than those obtained before.

Table 8. Predicted results by the listed structure.

6. Conclusions

In this article, our STAR model was studied with an ML estimation methodology. Its validation was performed using local diagnostic analysis based on the EM algorithm, which allowed us to obtain normal curvatures for four perturbation strategies of interest. Our model was compared with alternative models based on the skew-normal, normal, and Student-t distributions. Our findings showed that the proposed STAR model was more accurate and applicable, and better able to diagnose long time series and identify abnormal cases.

The curvature results for our STAR(p) model with four perturbation strategies, including the newly proposed perturbation of skewness, were presented. Monte Carlo simulation studies were conducted to assess the adequacy of our methodology. Approximate numerical benchmark values for detecting possible influential observations were employed to analyze the daily log-returns of Brent crude futures over a period covering events related to the 2008 financial crisis and the COVID-19 pandemic. Many of the influential observations in the BIPE data were identified as related to historical events. Our methodology and findings are shown to be effective.

Further work relates to the study of statistical structures generated from settings associated with functional data, partial least squares, quantile, multivariate, and spatial regression frameworks [Citation16,Citation38,Citation39,Citation43]. Similarly, considering censored data may also be of interest in the context of the current investigation [Citation19]. We plan to conduct studies on these issues in the future.

Acknowledgments

We would like to thank the Editor, Professor Jie Chen, the Associate Editor, and the reviewers for their constructive comments, which led to an improved presentation of this article.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

The research of Y. Liu was supported by the National Social Science Fund of China [grant No. 19BTJ036]. The research of V. Leiva was partially funded by the National Agency for Research and Development (ANID) [project grant number FONDECYT 1200525] of the Chilean government under the Ministry of Science, Technology, Knowledge, and Innovation.

References

  • F.J. Anscombe and W.J. Glynn, Distribution of the kurtosis statistic b2 for normal samples, Biometrika 70 (1983), pp. 227–234.
  • A. Azzalini, A class of distributions which includes the normal ones, Scand. J. Stat. 12 (1985), pp. 171–178.
  • A. Azzalini, The Skew-Normal and Related Families, Cambridge University Press, Cambridge UK, 2014.
  • A. Azzalini, An overview on the progeny of the skew-normal family – a personal perspective, J. Multivar. Anal. 188 (2022), 104851.
  • A. Azzalini and A. Capitanio, Distributions generated by perturbation of symmetry with emphasis on a multivariate Student-t distribution, J. R. Stat. Soc. 65 (2003), pp. 367–389.
  • V.G. Cancho, V.H. Lachos, and E.M.M. Ortega, A nonlinear regression model with skew-normal errors, Stat. Pap. 51 (2010), pp. 547–558.
  • C.Z. Cao, J.G. Lin, and J.Q. Shi, Diagnostics on nonlinear model with scale mixtures of skew-normal and first-order autoregressive errors, Statistics 48 (2014), pp. 1033–1047.
  • B. Carmichael and A. Coën, Asset pricing with skewed-normal return, Finance Res. Lett. 10 (2013), pp. 50–57.
  • R.D. Cook, Assessment of local influence, J. R. Stat. Soc. B 48 (1986), pp. 133–169.
  • R.B. D'Agostino, Transformation to normality of the null distribution of g1, Biometrika 57 (1970), pp. 679–681.
  • A.P. Dempster, N.M. Laird, and D.B. Rubin, Maximum likelihood from incomplete data via the EM algorithm, J. R. Stat. Soc. B 39 (1977), pp. 1–38.
  • M. Eling, Fitting asset returns to skewed distributions: Are the skew-normal and skew-Student good models? Insur. Mathem. Econom. 59 (2014), pp. 45–56.
  • C.S. Ferreira, G.A. Paula, and G.C. Lana, Estimation and diagnostic for partially linear models with first-order autoregressive skew-normal errors, Comput. Stat. 37 (2022), pp. 445–468.
  • A.M. Garay, V.H. Lachos, F.V. Labra, and E.M.M. Ortega, Statistical diagnostics for nonlinear regression models based on scale mixtures of skew-normal distributions, J. Stat. Comput. Simul. 84 (2014), pp. 1761–1778.
  • H.J. Ho and T.I. Lin, Robust linear mixed models using the skew-t distribution with application to schizophrenia data, Biom. J. 52 (2010), pp. 449–469.
  • M. Huerta, V. Leiva, S. Liu, M. Rodriguez, and D. Villegas, On a partial least squares regression model for asymmetric data with a chemical application in mining, Chemom. Intell. Lab. Syst. 190 (2019), pp. 55–68.
  • L. Jin, X. Dai, A. Shi, and L. Shi, Detection of outliers in mixed regressive-spatial autoregressive models, Commun. Stat.: Theory Methods 45 (2016), pp. 5179–5192.
  • T. Kollo and D. von Rosen, Advanced Multivariate Statistics with Matrices, Springer, Dordrecht, the Netherlands, 2005.
  • J. Leao, V. Leiva, H. Saulo, and V. Tomazella, Incorporation of frailties into a cure rate regression model and its diagnostics and application to melanoma data, Stat. Med. 37 (2018), pp. 4421–4440.
  • V. Leiva, S. Liu, L. Shi, and F.J.A. Cysneiros, Diagnostics in elliptical regression models with stochastic restrictions applied to econometrics, J. Appl. Stat. 43 (2016), pp. 627–642.
  • V. Leiva, H. Saulo, R. Souza, R.G. Aykroyd, and R. Vila, A new BISARMA time series model for forecasting mortality using weather and particulate matter data, J. Forecast. 40 (2021), pp. 346–364.
  • W.K. Li, Diagnostic Checks in Time Series, Chapman and Hall/CRC, Boca Raton, 2004.
  • T.I. Lin, J.C. Lee, and W.J. Hsieh, Robust mixture modeling using the skew t distribution, Stat. Comput. 17 (2007), pp. 81–92.
  • S. Liu, On diagnostics in conditionally heteroskedastic time series models under elliptical distributions, J. Appl. Probab. 41A (2004), pp. 393–405.
  • S. Liu and C.C. Heyde, On estimation in conditional heteroskedastic time series models under non-normal distributions, Stat. Pap. 49 (2008), pp. 455–469.
  • Y. Liu, G. Ji, and S. Liu, Influence diagnostics in a vector autoregressive model, J. Stat. Comput. Simul. 85 (2015), pp. 2632–2655.
  • S. Liu, V. Leiva, T. Ma, and A.H. Welsh, Influence diagnostic analysis in the possibly heteroskedastic linear model with exact restrictions, Stat. Methods Appl. 25 (2016), pp. 227–249.
  • S. Liu, V. Leiva, D. Zhuang, T. Ma, and J. Figueroa-Zuniga, Matrix differential calculus with applications in the multivariate linear model and its diagnostics, J. Multivar. Anal. 188 (2022), 104849.
  • T. Liu, S. Liu, and L. Shi, Time Series Analysis Using SAS Enterprise Guide, Springer, Singapore, 2020.
  • S. Liu, T. Ma, A. SenGupta, K. Shimizu, and M.Z. Wang, Influence diagnostics in possibly asymmetric circular-linear multivariate regression models, Sankhya B 79 (2017), pp. 76–93.
  • Y. Liu, G. Mao, V. Leiva, S. Liu, and A. Tapia, Diagnostic analytics for an autoregressive model under the skew-normal distribution, Mathematics 8 (2020), 693.
  • Y. Liu, C. Mao, V. Leiva, S. Liu, and W.A. Silva Neto, Asymmetric autoregressive models: Statistical aspects and a financial application under COVID-19 pandemic, J. Appl. Stat. 49 (2022), pp. 1323–1347.
  • S. Liu and H. Neudecker, On pseudo maximum likelihood estimation for multivariate time series models with conditional heteroskedasticity, Math. Comput. Simul. 79 (2009), pp. 2556–2565.
  • Y. Liu, R. Sang, and S. Liu, Diagnostic analysis for a vector autoregressive model under t distributions, Stat. Neerl. 71 (2017), pp. 86–114.
  • S. Liu and A.H. Welsh, Regression diagnostics, in International Encyclopedia of Statistical Science, M. Lovric, ed., Springer, Berlin, Heidelberg, 2011, pp. 1206–1208
  • J. Lu, L. Shi, and F. Chen, Outlier detection in time series models using local influence method, Commun. Stat. Theory Methods 41 (2012), pp. 2202–2220.
  • J.R. Magnus and H. Neudecker, Matrix Differential Calculus with Applications in Statistics and Econometrics, 3rd ed. Wiley, Chichester, 2019.
  • C. Marchant and V. Leiva, Robust multivariate control charts based on Birnbaum–Saunders distributions, J. Stat. Comput. Simul. 88 (2018), pp. 182–202.
  • S. Martinez, R. Giraldo, and V. Leiva, Birnbaum–Saunders functional regression models for spatial data, Stoch. Environ. Res. Risk Assess. 33 (2019), pp. 1765–1780.
  • G.A. Paula, Influence diagnostics for linear models with first-order autoregressive elliptical errors, Stat. Probab. Lett. 79 (2009), pp. 339–346.
  • W.Y. Poon and Y.S. Poon, Conformal normal curvature and assessment of local influence, J. R. Stat. Soc. Ser B 61 (1999), pp. 51–61.
  • C.B. Zeller, V.H. Lachos, and F.E. Vilca-Labra, Local influence analysis for regression models with scale mixtures of skew-normal distributions, J. Appl. Stat. 38 (2011), pp. 348–363.
  • L. Sanchez, V. Leiva, M. Galea, and H. Saulo, Birnbaum–Saunders quantile regression models with application to spatial data, Mathematics 8 (2020), 1000.
  • The MathWorks Inc. MATLAB Version: 9.13.0 (R2022b), The MathWorks Inc, Natick, Massachusetts, 2022. Available at https://www.mathworks.com.
  • A. Tapia, V. Leiva, M.P. Diaz, and V. Giampaoli, Influence diagnostics in mixed effects logistic regression models, TEST 28 (2019), pp. 920–942.
  • A. Tapia, V. Giampaoli, M.P. Diaz, and V. Leiva, Sensitivity analysis of longitudinal count responses: A local influence approach and application to medical data, J. Appl. Stat. 46 (2019), pp. 1021–1042.
  • R.S. Tsay, Analysis of Financial Time Series, Wiley, New York, 2010.
  • R.S. Tsay, An Introduction to Analysis of Financial Data with R, Wiley, New York, 2013.
  • I. Vidal and L.M. Castro, Influential observations in the independent student-t measurement error model with weak nondifferential error, Chil. J. Stat. 1 (2010), pp. 17–34.
  • F.C. Xie, J.G. Lin, and B.C. Wei, Diagnostics for skew-normal nonlinear regression models with AR(1) errors, Comput. Stat. Data Anal. 53 (2009), pp. 4403–4416.
  • F. Zhu, S. Liu, and L. Shi, Local influence analysis for poisson autoregression with an application to stock transaction data, Stat. Neerl. 70 (2016), pp. 4–25.
  • F. Zhu, L. Shi, and S. Liu, Influence diagnostics in log-linear integer-valued GARCH models, AStA Adv. Stat. Anal. 99 (2015), pp. 311–335.

Appendix

In this appendix, we obtain the matrix derivatives involved in our diagnostic analytics [Citation18,Citation28,Citation37].

A.1. Proof (Theorem 1)

Proof.

The Q-function is established as
$$Q_\Theta=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\hat{s}_{1i}-\frac{u_i^2\hat{s}_{1i}}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda u_i\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}-\frac{\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\hat{s}_{4i}\Big),$$
where $u_i=y_i-x_i^\top\beta$. Its first derivatives are expressed as
$$\frac{\partial Q_\Theta}{\partial\beta}=\sum_{i=p+1}^{R}\frac{u_i\hat{s}_{1i}-\delta_\lambda\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}\,x_i,\qquad \frac{\partial Q_\Theta}{\partial\sigma^2}=\sum_{i=p+1}^{R}\Big(\frac{u_i^2\hat{s}_{1i}-2\delta_\lambda u_i\hat{s}_{2i}+\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^4}-\frac{1}{\sigma^2}\Big),$$
$$\frac{\partial Q_\Theta}{\partial\delta_\lambda}=\sum_{i=p+1}^{R}\Big(\frac{\delta_\lambda}{1-\delta_\lambda^2}+\frac{-\delta_\lambda u_i^2\hat{s}_{1i}+2\delta_\lambda^2u_i\hat{s}_{2i}+(1-\delta_\lambda^2)u_i\hat{s}_{2i}-\delta_\lambda\hat{s}_{3i}}{(1-\delta_\lambda^2)^2\sigma^2}\Big),\qquad \frac{\partial Q_\Theta}{\partial\varsigma}=\frac{1}{2}\sum_{i=p+1}^{R}\Big(1-\hat{s}_{1i}+\hat{s}_{4i}+\ln\Big(\frac{\varsigma}{2}\Big)-G\Big(\frac{\varsigma}{2}\Big)\Big).$$
The second derivatives are formulated as
$$\frac{\partial^2Q_\Theta}{\partial\beta\,\partial\beta^\top}=-\sum_{i=p+1}^{R}\frac{\hat{s}_{1i}}{(1-\delta_\lambda^2)\sigma^2}\,x_ix_i^\top,\qquad \frac{\partial^2Q_\Theta}{\partial\beta\,\partial\sigma^2}=\sum_{i=p+1}^{R}\frac{\delta_\lambda\hat{s}_{2i}-u_i\hat{s}_{1i}}{(1-\delta_\lambda^2)\sigma^4}\,x_i,$$
$$\frac{\partial^2Q_\Theta}{\partial\beta\,\partial\delta_\lambda}=\sum_{i=p+1}^{R}\frac{2\delta_\lambda u_i\hat{s}_{1i}-(\delta_\lambda^2+1)\hat{s}_{2i}}{(1-\delta_\lambda^2)^2\sigma^2}\,x_i,\qquad \frac{\partial^2Q_\Theta}{\partial(\sigma^2)^2}=\sum_{i=p+1}^{R}\Big(\frac{-u_i^2\hat{s}_{1i}+2\delta_\lambda u_i\hat{s}_{2i}-\hat{s}_{3i}}{(1-\delta_\lambda^2)\sigma^6}+\frac{1}{\sigma^4}\Big),$$
$$\frac{\partial^2Q_\Theta}{\partial\sigma^2\,\partial\delta_\lambda}=\sum_{i=p+1}^{R}\frac{\delta_\lambda u_i^2\hat{s}_{1i}-(\delta_\lambda^2+1)u_i\hat{s}_{2i}+\delta_\lambda\hat{s}_{3i}}{(1-\delta_\lambda^2)^2\sigma^4},$$
$$\frac{\partial^2Q_\Theta}{\partial\delta_\lambda^2}=\sum_{i=p+1}^{R}\Big(\frac{\delta_\lambda^2+1}{(1-\delta_\lambda^2)^2}+\frac{-u_i^2\hat{s}_{1i}+6\delta_\lambda u_i\hat{s}_{2i}-\hat{s}_{3i}}{(1-\delta_\lambda^2)^2\sigma^2}+\frac{-4\delta_\lambda^2u_i^2\hat{s}_{1i}+8\delta_\lambda^3u_i\hat{s}_{2i}-4\delta_\lambda^2\hat{s}_{3i}}{(1-\delta_\lambda^2)^3\sigma^2}\Big),$$
$$\frac{\partial^2Q_\Theta}{\partial\delta_\lambda\,\partial\varsigma}=\frac{\partial^2Q_\Theta}{\partial\beta\,\partial\varsigma}=\frac{\partial^2Q_\Theta}{\partial\sigma^2\,\partial\varsigma}=0,\qquad \frac{\partial^2Q_\Theta}{\partial\varsigma^2}=\frac{R-p}{4}\Big(\frac{2}{\varsigma}-G'\Big(\frac{\varsigma}{2}\Big)\Big),$$
where $G'$ denotes the derivative of the digamma function $G$. Hence, we obtain $\ddot{Q}_{\widehat\Theta}$ with $\widehat\Theta=(\hat\beta^\top,\hat\sigma^2,\hat\delta_\lambda,\hat\varsigma)^\top$.

A.2. Proof (Theorem 2)

Proof.

The perturbed Q-function is stated as
$$Q_{\Theta;\omega}=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\hat{s}_{1i}-\frac{\omega_i^2u_i^2\hat{s}_{1i}}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda\omega_iu_i\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}-\frac{\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\hat{s}_{4i}\Big).$$
Taking the differentials of the perturbed Q-function with respect to $\beta$, $\sigma^2$, $\delta_\lambda$, $\varsigma$ and then $\omega_i$, for $i\in\{p+1,\dots,R\}$, the first derivatives are calculated as
$$\frac{\partial Q_{\Theta;\omega}}{\partial\beta}=\sum_{i=p+1}^{R}\frac{\omega_i^2u_i\hat{s}_{1i}-\delta_\lambda\omega_i\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}\,x_i,\qquad \frac{\partial Q_{\Theta;\omega}}{\partial\sigma^2}=\sum_{i=p+1}^{R}\Big(\frac{\omega_i^2u_i^2\hat{s}_{1i}-2\delta_\lambda\omega_iu_i\hat{s}_{2i}+\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^4}-\frac{1}{\sigma^2}\Big),$$
$$\frac{\partial Q_{\Theta;\omega}}{\partial\delta_\lambda}=\sum_{i=p+1}^{R}\Big(\frac{\delta_\lambda}{1-\delta_\lambda^2}+\frac{-\delta_\lambda\omega_i^2u_i^2\hat{s}_{1i}+2\delta_\lambda^2\omega_iu_i\hat{s}_{2i}+(1-\delta_\lambda^2)\omega_iu_i\hat{s}_{2i}-\delta_\lambda\hat{s}_{3i}}{(1-\delta_\lambda^2)^2\sigma^2}\Big),\qquad \frac{\partial Q_{\Theta;\omega}}{\partial\varsigma}=\frac{1}{2}\sum_{i=p+1}^{R}\Big(1-\hat{s}_{1i}+\hat{s}_{4i}+\ln\Big(\frac{\varsigma}{2}\Big)-G\Big(\frac{\varsigma}{2}\Big)\Big),$$
and the second derivatives are stated as
$$\frac{\partial^2Q_{\Theta;\omega}}{\partial\beta\,\partial\omega_i}=\frac{2\omega_iu_i\hat{s}_{1i}-\delta_\lambda\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}\,x_i,\qquad \frac{\partial^2Q_{\Theta;\omega}}{\partial\sigma^2\,\partial\omega_i}=\frac{\omega_iu_i^2\hat{s}_{1i}-\delta_\lambda u_i\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^4},$$
$$\frac{\partial^2Q_{\Theta;\omega}}{\partial\delta_\lambda\,\partial\omega_i}=\frac{-2\delta_\lambda\omega_iu_i^2\hat{s}_{1i}+(\delta_\lambda^2+1)u_i\hat{s}_{2i}}{(1-\delta_\lambda^2)^2\sigma^2},\qquad \frac{\partial^2Q_{\Theta;\omega}}{\partial\varsigma\,\partial\omega_i}=0.$$
Letting $\Theta=\widehat\Theta=(\hat\beta^\top,\hat\sigma^2,\hat\delta_\lambda,\hat\varsigma)^\top$ and $\omega=\omega_0=(1,\dots,1)^\top$, we obtain the expression presented in (11).

A.3. Proof (Theorem 3)

Proof.

We get the perturbed Q-function
$$Q_{\Theta;\omega}=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\hat{s}_{1i}-\frac{(u_i+\mu(\omega_i))^2\hat{s}_{1i}}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda(u_i+\mu(\omega_i))\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}-\frac{\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\hat{s}_{4i}\Big).$$
Taking the differentials of the perturbed Q-function with respect to $\beta$, $\sigma^2$, $\delta_\lambda$, $\varsigma$ and then $\omega_i$, for $i\in\{p+1,\dots,R\}$, the first derivatives are found to be
$$\frac{\partial Q_{\Theta;\omega}}{\partial\beta}=\sum_{i=p+1}^{R}\frac{(u_i+\mu(\omega_i))\hat{s}_{1i}-\delta_\lambda\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}\,\big(x_i+(\omega_{i-1},\dots,\omega_{i-p})^\top\big),$$
$$\frac{\partial Q_{\Theta;\omega}}{\partial\sigma^2}=\sum_{i=p+1}^{R}\Big(\frac{(u_i+\mu(\omega_i))^2\hat{s}_{1i}-2\delta_\lambda(u_i+\mu(\omega_i))\hat{s}_{2i}+\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^4}-\frac{1}{\sigma^2}\Big),$$
$$\frac{\partial Q_{\Theta;\omega}}{\partial\delta_\lambda}=\sum_{i=p+1}^{R}\Big(\frac{\delta_\lambda}{1-\delta_\lambda^2}+\frac{-\delta_\lambda(u_i+\mu(\omega_i))^2\hat{s}_{1i}+(\delta_\lambda^2+1)(u_i+\mu(\omega_i))\hat{s}_{2i}-\delta_\lambda\hat{s}_{3i}}{(1-\delta_\lambda^2)^2\sigma^2}\Big),\qquad \frac{\partial Q_{\Theta;\omega}}{\partial\varsigma}=\frac{1}{2}\sum_{i=p+1}^{R}\Big(1-\hat{s}_{1i}+\hat{s}_{4i}+\ln\Big(\frac{\varsigma}{2}\Big)-G\Big(\frac{\varsigma}{2}\Big)\Big),$$
and the second derivatives are established as
$$\frac{\partial^2Q_{\Theta;\omega}}{\partial\beta\,\partial\omega_i}=\frac{\hat{s}_{1i}}{(1-\delta_\lambda^2)\sigma^2}\,\big(x_i+(\omega_{i-1},\dots,\omega_{i-p})^\top\big),\qquad \frac{\partial^2Q_{\Theta;\omega}}{\partial\sigma^2\,\partial\omega_i}=\frac{(u_i+\mu(\omega_i))\hat{s}_{1i}-\delta_\lambda\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^4},$$
$$\frac{\partial^2Q_{\Theta;\omega}}{\partial\delta_\lambda\,\partial\omega_i}=\frac{(\delta_\lambda^2+1)\hat{s}_{2i}-2\delta_\lambda(u_i+\mu(\omega_i))\hat{s}_{1i}}{(1-\delta_\lambda^2)^2\sigma^2},\qquad \frac{\partial^2Q_{\Theta;\omega}}{\partial\varsigma\,\partial\omega_i}=0.$$
Noting that $\Theta=\widehat\Theta=(\hat\beta^\top,\hat\sigma^2,\hat\delta_\lambda,\hat\varsigma)^\top$ and $\omega=\omega_0=(0,\dots,0)^\top$, we reach the formula given in (13).

A.4. Proof (Theorem 4)

Proof.

We attain the perturbed Q-function
$$Q_{\Theta;\omega}=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\hat{s}_{1i}-\frac{\omega_iu_i^2\hat{s}_{1i}}{2(1-\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda\omega_iu_i\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}-\frac{\omega_i\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^2}-\ln\Big(\frac{\sigma^2}{\omega_i}\Big)-\frac{1}{2}\ln(1-\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\hat{s}_{4i}\Big).$$
Taking the differentials of the perturbed Q-function with respect to $\beta$, $\sigma^2$, $\delta_\lambda$, $\varsigma$ and then $\omega_i$, for $i\in\{p+1,\dots,R\}$, the first derivatives are stated as
$$\frac{\partial Q_{\Theta;\omega}}{\partial\beta}=\sum_{i=p+1}^{R}\frac{\omega_iu_i\hat{s}_{1i}-\delta_\lambda\omega_i\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}\,x_i,\qquad \frac{\partial Q_{\Theta;\omega}}{\partial\sigma^2}=\sum_{i=p+1}^{R}\Big(\frac{\omega_iu_i^2\hat{s}_{1i}-2\delta_\lambda\omega_iu_i\hat{s}_{2i}+\omega_i\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^4}-\frac{1}{\sigma^2}\Big),$$
$$\frac{\partial Q_{\Theta;\omega}}{\partial\delta_\lambda}=\sum_{i=p+1}^{R}\Big(\frac{\delta_\lambda}{1-\delta_\lambda^2}+\frac{-\delta_\lambda\omega_iu_i^2\hat{s}_{1i}+(\delta_\lambda^2+1)\omega_iu_i\hat{s}_{2i}-\delta_\lambda\omega_i\hat{s}_{3i}}{(1-\delta_\lambda^2)^2\sigma^2}\Big),\qquad \frac{\partial Q_{\Theta;\omega}}{\partial\varsigma}=\frac{1}{2}\sum_{i=p+1}^{R}\Big(1-\hat{s}_{1i}+\hat{s}_{4i}+\ln\Big(\frac{\varsigma}{2}\Big)-G\Big(\frac{\varsigma}{2}\Big)\Big),$$
and the second derivatives are presented as
$$\frac{\partial^2Q_{\Theta;\omega}}{\partial\beta\,\partial\omega_i}=\frac{u_i\hat{s}_{1i}-\delta_\lambda\hat{s}_{2i}}{(1-\delta_\lambda^2)\sigma^2}\,x_i,\qquad \frac{\partial^2Q_{\Theta;\omega}}{\partial\sigma^2\,\partial\omega_i}=\frac{u_i^2\hat{s}_{1i}-2\delta_\lambda u_i\hat{s}_{2i}+\hat{s}_{3i}}{2(1-\delta_\lambda^2)\sigma^4},$$
$$\frac{\partial^2Q_{\Theta;\omega}}{\partial\delta_\lambda\,\partial\omega_i}=\frac{-\delta_\lambda u_i^2\hat{s}_{1i}+(\delta_\lambda^2+1)u_i\hat{s}_{2i}-\delta_\lambda\hat{s}_{3i}}{(1-\delta_\lambda^2)^2\sigma^2},\qquad \frac{\partial^2Q_{\Theta;\omega}}{\partial\varsigma\,\partial\omega_i}=0.$$
Noting that $\Theta=\widehat\Theta=(\hat\beta^\top,\hat\sigma^2,\hat\delta_\lambda,\hat\varsigma)^\top$ and $\omega=\omega_0=(1,\dots,1)^\top$, we get the expression formulated in (15).

A.5. Proof (Theorem 5)

Proof.

We obtain the perturbed Q-function
$$Q_{\Theta;\omega}=\sum_{i=p+1}^{R}\Big(-\frac{\varsigma}{2}\hat{s}_{1i}-\frac{u_i^2\hat{s}_{1i}}{2(1-\omega_i\delta_\lambda^2)\sigma^2}+\frac{\delta_\lambda\omega_i^{1/2}u_i\hat{s}_{2i}}{(1-\omega_i\delta_\lambda^2)\sigma^2}-\frac{\hat{s}_{3i}}{2(1-\omega_i\delta_\lambda^2)\sigma^2}-\ln(\sigma^2)-\frac{1}{2}\ln(1-\omega_i\delta_\lambda^2)+\frac{\varsigma}{2}\ln\Big(\frac{\varsigma}{2}\Big)-\ln\Gamma\Big(\frac{\varsigma}{2}\Big)+\frac{\varsigma}{2}\hat{s}_{4i}\Big).$$
Taking the differentials of the perturbed Q-function with respect to $\beta$, $\sigma^2$, $\delta_\lambda$, $\varsigma$ and then $\omega_i$, for $i\in\{p+1,\dots,R\}$, we obtain the first derivatives
$$\frac{\partial Q_{\Theta;\omega}}{\partial\beta}=\sum_{i=p+1}^{R}\frac{u_i\hat{s}_{1i}-\delta_\lambda\omega_i^{1/2}\hat{s}_{2i}}{(1-\omega_i\delta_\lambda^2)\sigma^2}\,x_i,\qquad \frac{\partial Q_{\Theta;\omega}}{\partial\sigma^2}=\sum_{i=p+1}^{R}\Big(\frac{u_i^2\hat{s}_{1i}-2\delta_\lambda\omega_i^{1/2}u_i\hat{s}_{2i}+\hat{s}_{3i}}{2(1-\omega_i\delta_\lambda^2)\sigma^4}-\frac{1}{\sigma^2}\Big),$$
$$\frac{\partial Q_{\Theta;\omega}}{\partial\delta_\lambda}=\sum_{i=p+1}^{R}\Big(\frac{-\delta_\lambda\omega_iu_i^2\hat{s}_{1i}+\omega_i^{1/2}(1+\omega_i\delta_\lambda^2)u_i\hat{s}_{2i}-\omega_i\delta_\lambda\hat{s}_{3i}}{(1-\omega_i\delta_\lambda^2)^2\sigma^2}+\frac{\omega_i\delta_\lambda}{1-\omega_i\delta_\lambda^2}\Big),\qquad \frac{\partial Q_{\Theta;\omega}}{\partial\varsigma}=\frac{1}{2}\sum_{i=p+1}^{R}\Big(1-\hat{s}_{1i}+\hat{s}_{4i}+\ln\Big(\frac{\varsigma}{2}\Big)-G\Big(\frac{\varsigma}{2}\Big)\Big),$$
and the second derivatives are
$$\frac{\partial^2Q_{\Theta;\omega}}{\partial\beta\,\partial\omega_i}=\Big(\frac{\delta_\lambda^2u_i\hat{s}_{1i}-\delta_\lambda^3\omega_i^{1/2}\hat{s}_{2i}}{(1-\omega_i\delta_\lambda^2)^2\sigma^2}-\frac{\delta_\lambda\hat{s}_{2i}}{2\omega_i^{1/2}(1-\omega_i\delta_\lambda^2)\sigma^2}\Big)x_i,$$
$$\frac{\partial^2Q_{\Theta;\omega}}{\partial\sigma^2\,\partial\omega_i}=\frac{\delta_\lambda^2\omega_i^{1/2}u_i^2\hat{s}_{1i}-\delta_\lambda(\omega_i\delta_\lambda^2+1)u_i\hat{s}_{2i}+\delta_\lambda^2\omega_i^{1/2}\hat{s}_{3i}}{2\omega_i^{1/2}(1-\omega_i\delta_\lambda^2)^2\sigma^4},\qquad \frac{\partial^2Q_{\Theta;\omega}}{\partial\varsigma\,\partial\omega_i}=0,$$
$$\frac{\partial^2Q_{\Theta;\omega}}{\partial\delta_\lambda\,\partial\omega_i}=\frac{-2\delta_\lambda^3\omega_iu_i^2\hat{s}_{1i}+4\delta_\lambda^4\omega_i^{3/2}u_i\hat{s}_{2i}-2\delta_\lambda^3\omega_i\hat{s}_{3i}}{(1-\omega_i\delta_\lambda^2)^3\sigma^2}+\frac{-\delta_\lambda u_i^2\hat{s}_{1i}+4\delta_\lambda^2\omega_i^{1/2}u_i\hat{s}_{2i}-\delta_\lambda\hat{s}_{3i}+\omega_i\delta_\lambda^3\sigma^2}{(1-\omega_i\delta_\lambda^2)^2\sigma^2}+\frac{u_i\hat{s}_{2i}+2\delta_\lambda\omega_i^{1/2}\sigma^2}{2\omega_i^{1/2}(1-\omega_i\delta_\lambda^2)\sigma^2}.$$
Using $\Theta=\widehat\Theta=(\hat\beta^\top,\hat\sigma^2,\hat\delta_\lambda,\hat\varsigma)^\top$ and $\omega=\omega_0=(1,\dots,1)^\top$, we get the expression introduced in (17).