Full article: Estimation of variance of the difference-cum-ratio-type exponential estimator in simple random sampling

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

In this article, we have suggested a class of estimators for the estimation of the population variance of the variable of interest. The proposed estimators used some certain known information of the auxiliary variable, such as kurtosis, coefficient of variation, and the minimum and maximum values. The properties of the suggested class of estimators such as the bias and mean squared error (MSE) are obtained up to the first order of approximation. In order to check the performances of the estimators and to verify the theoretical results, we conducted a simulation study. The results of the simulation study show that the proposed class of estimators have lower MSE than other existing estimators. This holds for all simulation scenarios. In the application part, we used data from Statistical Bureau of Pakistan, and from the Textbook of Cochran, which also confirms that the suggested class of estimators is more efficient than the usual unbiased variance estimator, ratio estimator, traditional regression estimator, and other existing estimators in survey literature.

KEYWORDS:

1. Introduction

The purpose of survey sampling is to get accurate information about the characteristics of the population for improving the efficiency of the estimators under study at the lowest costs, less time and human efforts (for more details, see Yang et al. (Citation2020)). In several populations, there has been a few extreme values and to estimate the unknown population parameters without including this information is very sensitive. In which case, the results will be underestimated or overestimated. To solve this issue, it is important to use this information in estimating the population parameters. Isaki (Citation1983), Bahl and Tuteja (Citation1991), Upadhyaya and Singh (Citation1999), Kadilar and Cingi (Citation2006), Dubey and Sharma (Citation2008), H. Singh and Chandra (Citation2008), Shabbir and Gupta (Citation2010), H. P. Singh and Solanki (Citation2013), and Yadav et al. (Citation2015) have all suggested some wider classes of estimators for estimating finite population variance. Consider a finite population $U = (U_{1}, U_{2}, U_{3}, \dots, U_{N})$ of size $N$ units. Let $y_{i}$ and $x_{i}$ be the values of the study variable $Y$ and the auxiliary variable $X$ for the $i t h$ units respectively. Let $\overset{ˉ}{Y} = (1 / N) \sum_{i = 1}^{N} Y_{i}$ and $\overset{ˉ}{X} = (1 / N) \sum_{i = 1}^{N} X_{i}$ be the population mean of the study and the auxiliary variable, respectively. It is further assumed that $S_{y}^{2} = (1 / N - 1) \sum_{i = 1}^{N} {(Y_{i} - \overset{ˉ}{Y})}^{2}$ and $S_{x}^{2} = (1 / N - 1) \sum_{i = 1}^{N} {(X_{i} - \overset{ˉ}{X})}^{2}$ be the population variances of the study as well as auxiliary variable, respectively.

To estimate the unknown population parameter $\overset{ˉ}{Y}$ , we select a random sample of size $n$ units from the population by using simple random sampling without replacement (SRSWOR). Let $\overset{ˉ}{y} = (1 / n) \sum_{i = 1}^{n} y_{i}$ and $\overset{ˉ}{x} = (1 / n) \sum_{i = 1}^{n} x_{i}$ be the sample means of the study and the auxiliary variables, respectively, and their corresponding sample variances are ${\hat{S}}_{y}^{2} = (1 / n - 1) \sum_{i = 1}^{n} {(y_{i} - \overset{ˉ}{y})}^{2}$ and ${\hat{S}}_{x}^{2} = (1 / n - 1) \sum_{i = 1}^{n} {(x_{i} - \overset{ˉ}{x})}^{2}$ , respectively.

To find the bias and MSE for different estimators, we define the following terms. Let $e_{0} = (\frac{s_{y}^{2} - S_{y}^{2}}{S_{y}^{2}})$ , $e_{1} = (\frac{s_{x}^{2} - S_{x}^{2}}{S_{x}^{2}})$ and $e_{2} = (\frac{\overset{ˉ}{x} - \overset{ˉ}{X}}{\overset{ˉ}{X}})$ such that $E (e_{i}) = 0$ for i = 0, 1, 2. $E (e_{0}^{2}) = θ λ_{40}^{*}, E (e_{1}^{2}) = θ λ_{04}^{*}, E (e_{2}^{2}) = θ C_{x}^{2}, E (e_{0} e_{1}) = θ λ_{22}^{*}, E (e_{0} e_{2}) = θ C_{x} λ_{21}$

E (e_{1} e_{2}) = θ C_{x} λ_{03},

where $λ_{40}^{*} = (λ_{40} - 1)$ , $λ_{04}^{*} = (λ_{04} - 1)$ , $λ_{22}^{*} = (λ_{22} - 1)$ , $θ = (\frac{1}{n} - \frac{1}{N})$ . Also $λ_{r s} = \frac{μ_{r s}}{μ_{20}^{r / 2} μ_{02}^{s / 2}}$ , where $μ_{r s} = \frac{\sum_{i = 1}^{N} {(Y_{i} - \overset{ˉ}{Y})}^{r} {(X_{i} - \overset{ˉ}{X})}^{s}}{N - 1}$ . Here $λ_{40} = β_{2 (y)}$ and $λ_{04} = β_{2 (x)}$ are the population coefficients of kurtosis.

The usual variance estimator of ${\hat{S}}_{y}^{2} = s_{y}^{2}$ [1] for population variance is given by

(1.1)

V a r ({\hat{S}}_{y}^{2}) = θ S_{y}^{4} λ_{40}^{*} .

(1.1)

Isaki (Citation1983) suggested a ratio-type estimator for the variance of the study variable $Y$ , which is denoted by ${\hat{S}}_{R}^{2}$ [2], and is given by

(1.2)

{\hat{S}}_{R}^{2} = s_{y}^{2} (\frac{S_{x}^{2}}{s_{x}^{2}}),

(1.2)

Expressions for bias and MSE of ${\hat{S}}_{R}^{2}$ , in sample random sampling (SRS) are given by

(1.3)

B i a s ({\hat{S}}_{R}^{2}) ≅ θ S_{y}^{4} (λ_{04}^{*} - λ_{22}^{*}),

(1.3)

and

(1.4)

M S E ({\hat{S}}_{R}^{2}) ≅ θ S_{y}^{4} (λ_{40}^{*} + λ_{04}^{*} - 2 λ_{22}^{*}) .

(1.4)

The classical regression estimator ${\hat{S}}_{l r}^{2}$ [3] in SRS is given by

(1.5)

{\hat{S}}_{l r}^{2} = s_{y}^{2} + b_{(s_{y}^{2}, s_{x}^{2})} (S_{x}^{2} - s_{x}^{2}),

(1.5)

where $b_{(s_{y}^{2}, s_{x}^{2})} = \frac{s_{y}^{2} {\hat{λ}}_{22}^{*}}{s_{x}^{2} {\hat{λ}}_{04}^{*}}$ is the sample regression coefficient. The MSE of the estimator ${\hat{S}}_{l r}^{2}$ , is given by

(1.6)

M S E ({\hat{S}}_{l r}^{2}) ≅ θ S_{y}^{4} λ_{40}^{*} (1 - ρ^{* 2}),

(1.6)

where

(1.7)

ρ^{*} = \frac{λ_{22}^{*}}{\sqrt{λ_{40}^{*}} \sqrt{λ_{04}^{*}}}

(1.7)

Bahl and Tuteja (Citation1991) suggested an exponential ratio-type estimator for the population variance of the study variable $Y$ , which is denoted by ${\hat{S}}_{B T}^{2}$ [4] and is given by:

(1.8)

{\hat{S}}_{B T}^{2} = s_{y}^{2} e x p (\frac{S_{x}^{2} - s_{x}^{2}}{S_{x}^{2} + s_{x}^{2}}),

(1.8)

Expressions for bias and MSE respectively of ${\hat{S}}_{B T}^{2}$ , are given by

(1.9)

B i a s ({\hat{S}}_{B T}^{2}) ≅ \frac{1}{2} θ S_{y}^{2} (\frac{3 λ_{04}^{*}}{4} - λ_{22}^{*}),

(1.9)

and

(1.10)

M S E ({\hat{S}}_{B T}^{2}) ≅ θ S_{y}^{4} (λ_{40}^{*} + \frac{λ_{04}^{*}}{4} - λ_{22}^{*}) .

(1.10)

Upadhyaya and Singh (Citation1999) proposed a ratio-type estimator ${\hat{S}}_{U S}^{2}$ [5], that uses the kurtosis of an auxiliary variable in SRS, given by

(1.11)

{\hat{S}}_{U S}^{2} = s_{y}^{2} (\frac{S_{x}^{2} + λ_{04}}{s_{x}^{2} + λ_{04}}),

(1.11)

Expressions for bias and MSE respectively of ${\hat{S}}_{U S}^{2}$ , are given by

(1.12)

B i a s ({\hat{S}}_{U S}^{2}) ≅ θ S_{y}^{2} g_{0} (g_{0} λ_{04}^{*} - λ_{22}^{*}),

(1.12)

and

(1.13)

M S E ({\hat{S}}_{U S}^{2}) ≅ θ S_{y}^{4} (λ_{40}^{*} + g_{0}^{2} λ_{04}^{*} - 2 g_{0} λ_{22}^{*}),

(1.13)

where

(1.14)

g_{0} = \frac{S_{x}^{2}}{S_{x}^{2} + λ_{04}}

(1.14)

Kadilar and Cingi (Citation2006) suggested a class of ratio estimators ${\hat{S}}_{K C i}^{2}$ [6–8] which are given by

(1.15)

{\hat{S}}_{K C 1}^{2} = s_{y}^{2} (\frac{S_{x}^{2} + C_{x}}{s_{x}^{2} + C_{x}}),

(1.15)

(1.16)

{\hat{S}}_{K C 2}^{2} = s_{y}^{2} (\frac{λ_{04} S_{x}^{2} + C_{x}}{λ_{04} s_{x}^{2} + C_{x}}),

(1.16)

(1.17)

{\hat{S}}_{K C 3}^{2} = s_{y}^{2} (\frac{C_{x} S_{x}^{2} + λ_{04}}{C_{x} s_{x}^{2} + λ_{04}}),

(1.17)

where $C_{x} = \frac{S_{x}}{\overset{ˉ}{X}}$ is the population coefficient of variation.

Expressions for bias and MSE’s respectively of ${\hat{S}}_{K C i}^{2} (i = 1, 2, 3)$ , in SRS are given by

(1.18)

B i a s ({\hat{S}}_{K C i}^{2}) ≅ θ S_{y}^{2} g_{i} (g_{i} λ_{04}^{*} - λ_{22}^{*}),

(1.18)

and

(1.19)

M S E ({\hat{S}}_{K C i}^{2}) ≅ θ S_{y}^{4} (λ_{40}^{*} + g_{i}^{2} λ_{04}^{*} - 2 g_{i} λ_{22}^{*}),

(1.19)

where

(1.20)

g_{1} = \frac{S_{x}^{2}}{S_{x}^{2} + C_{x}}, g_{2} = \frac{λ_{04} S_{x}^{2}}{λ_{04} S_{x}^{2} + C_{x}}, g_{3} = \frac{C_{x} S_{x}^{2}}{C_{x} S_{x}^{2} + λ_{04}} .

(1.20)

2. Proposed estimators

Motivated by Daraz et al. (Citation2018), we proposed an improved class of estimators for estimating the finite population variance $S_{y}^{2}$ using certain known population parameters under simple random sampling scheme. The proposed estimator is given by

(2.1)

{\hat{S}}_{D}^{2} = [k_{1} s_{y}^{2} {(\frac{S_{x}^{2}}{s_{x}^{2}})}^{α_{1}} + k_{2} (\overset{ˉ}{X} - \overset{ˉ}{x}) {(\frac{S_{x}^{2}}{s_{x}^{2}})}^{α_{2}}] exp (\frac{a_{1} (s_{x}^{2} - S_{x}^{2})}{a_{1} (s_{x}^{2} + S_{x}^{2}) + 2 b_{1}}),

(2.1)

where $k_{1}$ and $k_{2}$ are the unknown constants whose values are to be determined such that the MSE’s are minimum, $a_{1}$ and $b_{1}$ are the parameters of the auxiliary variables. Also, $α_{1}$ and $α_{2}$ are the scalar quantities which contain the values (0, −1, 1) from (2.1) we can generate the different classes of proposed estimator which are given in .

Table 1. Some classes of the proposed estimator

Display Table

where

(2.2)

L = exp (\frac{a_{1} (s_{x}^{2} - S_{x}^{2})}{a_{1} (s_{x}^{2} + S_{x}^{2}) + 2 b_{1}})

(2.2)

Properties of the proposed estimator

Rewriting (2.1) in term of errors, we have

(2.2)

\begin{aligned} {\hat{S}}_{D}^{2} = [k_{1} S_{y}^{2} (1 + e_{0}) {(1 + e_{1})}^{- α_{1}} - k_{2} \overset{ˉ}{X} e_{2} {(1 + e_{1})}^{- α_{2}}] \\ exp [\frac{- g_{4} e_{1}}{2} {(1 + \frac{g_{4}}{2} e_{1})}^{- 1}] \end{aligned}

(2.2)

where

(2.3)

g_{4} = \frac{a_{1} S_{x}^{2}}{a_{1} S_{x}^{2} + b_{1}}

(2.3)

By using Taylor series up to the first order of approximation, we have

(2.3)

\begin{matrix} {\hat{S}}_{D}^{2} - S_{y}^{2} ≅ - S_{y}^{2} + k_{1} S_{y}^{2} [1 + e_{0} - e_{1} (α_{1} + \frac{g_{4}}{2}) + e_{1}^{2} (\frac{α_{1} g_{4}}{2} + \frac{3 g_{4}^{2}}{8} + \frac{α_{1} (α_{1} + 1)}{2}) - e_{0} e_{1} (α_{1} + \frac{g_{4}}{2})] \\ - k_{2} \overset{ˉ}{X} [e_{2} - e_{1} e_{2} (α_{2} + \frac{g_{4}}{2})] \end{matrix}

(2.3)

Using (2.3), the bias of ${\hat{S}}_{D}^{2}$ , is given by

(2.4)

B i a s ({\hat{S}}_{D}^{2}) ≅ - [S_{y}^{2} - k_{1} S_{y}^{2} D - k_{2} G],

(2.4)

where $D = [1 + θ \{λ_{04}^{*} (\frac{3 g_{4}^{2} + 4 α_{1} (g_{4} + α_{1} + 1)}{8}) - λ_{22}^{*} (\frac{α_{1} + g_{4}}{2})\}]$ , and $G = θ S_{x} λ_{03} (\frac{α_{2} + g_{4}}{2})$ . By squaring and taking expectation on both sides of EquationEquation (2.3)(2.3) $\begin{matrix} {\hat{S}}_{D}^{2} - S_{y}^{2} ≅ - S_{y}^{2} + k_{1} S_{y}^{2} [1 + e_{0} - e_{1} (α_{1} + \frac{g_{4}}{2}) + e_{1}^{2} (\frac{α_{1} g_{4}}{2} + \frac{3 g_{4}^{2}}{8} + \frac{α_{1} (α_{1} + 1)}{2}) - e_{0} e_{1} (α_{1} + \frac{g_{4}}{2})] \\ - k_{2} \overset{ˉ}{X} [e_{2} - e_{1} e_{2} (α_{2} + \frac{g_{4}}{2})] \end{matrix}$ (2.3) , we get the mean squared error by using the first order of approximation, which is given by

(2.5)

M S E ({\hat{S}}_{D}^{2}) ≅ [S_{y}^{4} + k_{1}^{2} S_{y}^{4} A + k_{2}^{2} B - 2 k_{1} S_{y}^{4} D - 2 k_{2} S_{y}^{2} G + 2 k_{1} k_{2} S_{y}^{2} F],

(2.5)

where

A = [1 + θ \{λ_{40}^{*} + λ_{04}^{*} \{{(α_{1} + \frac{g_{4}}{2})}^{2} + (α_{1} g_{4} + \frac{3 g_{4}^{2}}{4} + \frac{α_{1} (α_{1} + 1)}{2})\} - 2 λ_{22}^{*} (2 α_{1} + g_{4})\}]

$B = θ S_{x}^{2}$ , and $F = θ S_{x} [λ_{03} (α_{1} + α_{2} + g_{4}) - λ_{21}]$ .

The optimum values of $k_{1}$ and $k_{2}$ obtained by minimizing (2.5) are $k_{1 (o p t)} = \frac{B D - F G}{A B - F^{2}}$ , and $k_{2 (o p t)} = \frac{S_{y}^{2} (A G - D F)}{A B - F^{2}}$ . By substituting the optimum values of $k_{1}$ and $k_{2}$ in (2.5), we get the minimum MSE of ${\hat{S}}_{D}^{2}$ , which is given below:

(2.6)

M S E {({\hat{S}}_{D}^{2})}_{m i n} ≅ S_{y}^{4} [1 - \frac{(A G^{2} + B D^{2} - 2 D F G)}{A B - F^{2}}] .

(2.6)

3. Mathematical comparison

In this section, we compare the suggested class of estimator ${\hat{S}}_{D}^{2}$ with the existing estimators ${\hat{S}}_{y}^{2}, {\hat{S}}_{R}^{2}, {\hat{S}}_{l r}^{2}, {\hat{S}}_{B T}^{2}, {\hat{S}}_{U S}^{2}$ , and ${\hat{S}}_{K C i}^{2}$ .

Condition (i): By (1.1) and (2.6), $V a r ({\hat{S}}_{y}^{2}) > M S E {({\hat{S}}_{D}^{2})}_{m i n}$ if

θ λ_{40}^{*} + (\frac{A G^{2} + B D^{2} - 2 D F G}{A B - F^{2}}) > 1.

Condition (ii): By (1.4) and (2.6), $M S E ({\hat{S}}_{R}^{2}) > M S E {({\hat{S}}_{D}^{2})}_{m i n}$ if

θ (λ_{40}^{*} + λ_{04}^{*} - 2 λ_{22}^{*}) + (\frac{A G^{2} + B D^{2} - 2 D F G}{A B - F^{2}}) > 1.

Condition (iii): By (1.6) and (2.6), $M S E ({\hat{S}}_{l r}^{2}) > M S E {({\hat{S}}_{D}^{2})}_{m i n}$ if

θ λ_{40}^{*} (1 - ρ^{* 2}) + (\frac{A G^{2} + B D^{2} - 2 D F G}{A B - F^{2}}) > 1.

Condition (iv): By (1.9) and (2.6), $M S E ({\hat{S}}_{B T}^{2}) > M S E {({\hat{S}}_{D}^{2})}_{m i n}$ if

θ (λ_{40}^{*} + \frac{λ_{04}^{*}}{4} - λ_{22}^{*}) + (\frac{A G^{2} + B D^{2} - 2 D F G}{A B - F^{2}}) > 1.

Condition (v): By (1.12) and (2.6), $M S E ({\hat{S}}_{U S}^{2}) > M S E {({\hat{S}}_{D}^{2})}_{m i n}$ if

θ (λ_{40}^{*} + g_{0}^{2} λ_{04}^{*} - 2 g_{0} λ_{22}^{*}) + (\frac{A G^{2} + B D^{2} - 2 D F G}{A B - F^{2}}) > 1.

Condition (vi): By (1.17) and (2.6), $M S E ({\hat{S}}_{K C i}^{2}) > M S E {({\hat{S}}_{D}^{2})}_{m i n}$ if

θ (λ_{40}^{*} + g_{i}^{2} λ_{04}^{*} - 2 g_{i} λ_{22}^{*}) + (\frac{A G^{2} + B D^{2} - 2 D F G}{A B - F^{2}}) > 1.

4. Simulation study

In order to verify the theoretical results in Section 3, we have conducted a simulation study by using the idea from Agarwal et al. (Citation2012). We generated six different artificial populations of the auxiliary variable $X$ by using the following probability distributions.

• $X \sim E x p o n e n t i a l (λ = 3)$ and $X \sim E x p o n e n t i a l (λ = 7)$ , • $X \sim U n i f o r m (b_{3} = 0, b_{4} = 1)$ and $X \sim U n i f o r m (b_{3} = 3, b_{4} = 5)$ ,

• $X \sim G a m m a (α_{3} = 4, α_{4} = 6)$ and $X \sim G a m m a (α_{3} = 8, α_{4} = 10)$ .

After that, the study variable $Y$ is computed as $Y = r_{y x} \times X + e$ , taking $r_{y x} = 0.80$ , where $r_{y x}$ is the correlation coefficient between the study and the auxiliary variables and $e \sim N (0, 1)$ is the error term.

We considered the following steps in R-Software to obtain the MSE’s of the proposed class of estimators:

Step 1: In the first step, we generated a population of size 1000 using a certain type of probability distributions.

Step 2: We obtained population total, minimum and maximum values of the auxiliary variable from Step 1. We also computed the optimum values of the unknown constants of the proposed estimator.

Step 3: We considered different sample sizes for each population to generate the samples using SRSWOR.

Step 4: For each sample size, the values of bias’s and MSE’s are computed for all the estimators considered in this paper.

Step 5: The process in Step 3 and Step 4 is repeated 50,000 times and the results for artificial populations are reported in , whereas the results of the real data sets are summarized in .

Table 2. Mean squared error (MSE) of the estimators using the artificial populations

Display Table

Table 3. Mean squared error (MSE) of the estimators using empirical data sets

Display Table

Finally, the MSE’s of the estimators over all replications are obtained by using the following formula. $M S E ({\hat{S}}_{k}^{2})_{min} = \frac{\sum_{g = 1}^{50000} {({\hat{S}}_{k}^{2} - S_{y}^{2})}^{2}}{50000}$ , for $k = R, l r, B T, U S, K C i, D 1, D 2, \dots, D 8$ .

Figure 1. Graphical display of the MSE’s results of the estimators using the artificial data

Note: The vertical line of the figures shows the MSE’s of the estimators, while the horizontal line indicates the corresponding estimators. For easiness, we denote the estimators by different numbers starting from 1 to 16. For more details, see . Source: Own computations.

Figure 2. Graphical display of the MSE’s results of the estimators using the artificial data

5. Numerical examples

To check the performances of the suggested class of estimators, we used three real data sets to compare the MSE’s of different estimators. The description and summary statistics are given by

Data 1. (Bureau of Statistics (Citation2013), p. 135)

$Y$ : Total number of students enrolls in 2012,

$X$ : Total number of government primary and secondary schools for boys and girls 2012.

The summary statistics are given below:

\begin{matrix} N = 36, n = 15, \overset{ˉ}{Y} = 148718.70, \overset{ˉ}{X} = 1054.39, S_{y} = 182315.10, S_{x} = 402.61, \\ X_{M} = 2370, X_{m} = 388, C_{x} = 0.38, C_{y} = 1.23, λ_{40} = 2365, λ_{04} = 4698, λ_{03} = 4697, \\ λ_{21} = 4698, λ_{22} = 8975, ρ_{y x} = 0.18. \end{matrix}

Data 2. (Bureau of Statistics (Citation2013), p. 226)

$Y$ : Employment level in 2012 by divisions,

$X$ : Number of registered factories in 2012 by divisions.

The summary statistics are given below:

\begin{matrix} N = 36, n = 15, \overset{ˉ}{Y} = 52432.86, \overset{ˉ}{X} = 335.78, S_{y} = 178201.10, S_{x} = 451.14, \\ X_{M} = 2055, X_{m} = 24, C_{x} = 1.34, C_{y} = 3.3986, ρ_{y x} = 0.39, λ_{40} = 2365, \\ λ_{04} = 4698, λ_{03} = 4697, λ_{21} = 4698, λ_{22} = 8975. \end{matrix}

Data 3. ((Cochran (Citation1963), p. 24)

$Y$ : Food cost of families employment,

$X$ : Weekly income of families.

The summary statistics are given below:

\begin{matrix} N = 33, n = 5, \overset{ˉ}{Y} = 27.49, \overset{ˉ}{X} = 72.55, S_{y} = 10.13, S_{x} = 10.58, X_{M} = 95, \\ X_{m} = 58, C_{x} = 0.15, C_{y} = 0.37, ρ_{y x} = 0.25, λ_{40} = 5.55, λ_{04} = 2.08, λ_{03} = 0.51, \\ λ_{21} = 0.54, λ_{22} = 2.22. \end{matrix}

Figure 3. Graphical display of the MSE’s results of the estimators using the empirical data

Note: The vertical line of the Figures shows the MSE’s of the estimators, while the horizontal line indicates the corresponding estimators. For easiness, we denote the estimators by different numbers starting from 1 to 16. For more details, see . Source: Own computations.

6. Conclusion

In this paper, we proposed a class of estimators for estimating the population variance of the study variable using some known information of the auxiliary variable. The properties of the proposed class of estimators are compared with other existing estimators. For this purpose, we reported some theoretical conditions in Section 3 under which the proposed estimators are more efficient than the existing estimators. These theoretical conditions are verified through the help of a simulation study and some empirical data sets. MSE’s results of various estimators over the simulation setup are demonstrated in . In comparing the MSE’s of the estimators, it is clear from the table that the proposed class of estimators performs the best over the cited existing estimators. The MSE’s results of various estimators in are plotted in which demonstrates that the MSE’s of the purposed class of estimators are significantly smaller than the MSE’s of other estimators. Similar results are obtained from the empirical data, which also confirms the theoretical results in Section 3. The empirical results are displayed in , which is then graphically shown in . Hence, based on our simulation results as well as through empirical results, we observed that the proposed class of estimators ${\hat{S}}_{D i}^{2} (i = 9, 10, 11, \dots, 16)$ are more efficient than the other considered estimators. Among the suggested class of estimators, ${\hat{S}}_{D 8}^{2}$ is preferable because of its least MSE.

PUBLIC INTEREST STATEMENT

In this article, we have suggested a class of estimators for the estimation of the population variance by using the maximum and minimum values of independent variable. In order to check the performance of the estimators and to verify the theoretical results we conducted a simulation study from different distribution and also used the data sets from real life application and display it graphically which confirmed that the suggested class of estimator is more efficient than the existent estimators because it’s least mean squared errors.

Acknowledgment

This work was supported by NSFC of China with grant [12071329]. We are very thankful to the two unknown referees, and the editor for their insightful comments and suggestions which greatly improved this paper.

Disclosure statement

The authors declare that there is no conflict of interests regarding the publication of this article.

Additional information

Notes on contributors

Umer Daraz

Umer Daraz received his M.Phil degree in Survey Sampling from Quaid-i-Azam University, Islamabad, Pakistan in 2016. He is currently pursuing his PhD degree under the supervision of Prof. Tang Yu at Soochow University, Suzhou, China. His research interests lie in the survey sampling, design experiment and combination design.

Mursala Khan

Mursala Khan got his Doctorate degree from Free University Berlin, Germany. His field of specialization is survey sampling. Currently, he is working as an assistant professor in the Department of Mathematics and Statistics, Riphah International University, Islamabad, Pakistan.

References

Agarwal, G. K., Allende, S. M., & Bouza, C. (2012). Double sampling with ranked set selection in the second phase with nonresponse: Analytical results and Monte Carlo experiences. Journal of Probability and Statistics, 2012, 1–8. https://doi.org/https://doi.org/10.1155/2012/214959
Google Scholar
Bahl, S., & Tuteja, R. (1991). Ratio and product type exponential estimators. Journal of Information & Optimization Sciences, 12(1), 159–164. https://www.tandfonline.com/doi/abs/ https://doi.org/10.1080/02522667.1991.10699058
Google Scholar
Bureau of Statistics. (2013). Punjab development statistics government of the Punjab, Lahore, Pakistan: Pakistan Bureau of Statistics. Retrieved from http://www.bos.gop.pk/system/files/Dev-2013.pdf.
Google Scholar
Cochran, W. B. (1963). Sampling techniques. John Wiley and Sons.
Google Scholar
Daraz, U., Shabbir, J., & Khan, H. (2018). Estimation of finite population mean by using minimum and maximum values in stratified random sampling. Journal of Modern Applied Statistical Methods, 17(1), 1–15. https://digitalcommons.wayne.edu/jmasm/vol17/iss1/20/
Web of Science ®Google Scholar
Dubey, V., & Sharma, H. (2008). On estimating population variance using auxiliary information. Statistics in Transition New Series, 9(11), 7–18.
Google Scholar
Isaki, C. T. (1983). Variance estimation using auxiliary information. Journal of the American Statistical Association, 78(381), 117–123. https://doi.org/https://doi.org/10.1080/01621459.1983.10477939
Web of Science ®Google Scholar
Kadilar, C., & Cingi, H. (2006). Ratio estimators for the population variance in simple and stratified random sampling. Applied Mathematics and Computation, 173(2), 1047–1059. https://www.sciencedirect.com/science/article/pii/S0096300305004108?via%3Dihub
Web of Science ®Google Scholar
Shabbir, J., & Gupta, S. (2010). Some estimators of finite population variance of stratified sample mean. Communications in Statistics-Theory and Methods, 39(16), 3001–3008. https://doi.org/https://doi.org/10.1080/03610920903170384
Web of Science ®Google Scholar
Singh, H., & Chandra, P. (2008). An alternative to ratio estimator of the population variance in sample surveys. Journal of Transportation and Statistics, 9(1), 89–103.
Google Scholar
Singh, H. P., & Solanki, R. S. (2013). A new procedure for variance estimation in simple random sampling using auxiliary information. Journal of Statistical Papers, 54(2), 479–497. https://doi.org/https://doi.org/10.1007/s00362-012-0445-2
Web of Science ®Google Scholar
Upadhyaya, L., & Singh, H. (1999). An estimator for population variance that utilizes the kurtosis of an auxiliary variable in sample surveys. Vikram Mathematical Journal, 19(1), 14–17.
Google Scholar
Yadav, S. K., Kadilar, C., Shabbir, J., & Gupta, S. (2015). Improved family of estimators of population variance in simple random sampling. Journal of Statistical Theory and Practice, 9(2), 219–226. https://doi.org/https://doi.org/10.1080/15598608.2013.856359
Web of Science ®Google Scholar
Yang, R., Chen, W., Yao, D., Long, C., Dong, Y., & Shen, B. (2020). The efficiency of ranked set sampling design for parameter estimation for the log-extended exponential-geometric distribution. Iranian Journal of Science and Technology, Transactions A: Science, 44(2), 497–507. https://doi.org/https://doi.org/10.1007/s40995-020-00855-x
Web of Science ®Google Scholar

Estimation of variance of the difference-cum-ratio-type exponential estimator in simple random sampling

ABSTRACT

1. Introduction

2. Proposed estimators

Table 1. Some classes of the proposed estimator

Properties of the proposed estimator

3. Mathematical comparison

4. Simulation study

Table 2. Mean squared error (MSE) of the estimators using the artificial populations

Table 3. Mean squared error (MSE) of the estimators using empirical data sets

5. Numerical examples

6. Conclusion

PUBLIC INTEREST STATEMENT

Acknowledgment

Disclosure statement

Notes on contributors

Umer Daraz

Mursala Khan

References

Information for

Open access

Opportunities

Help and information

Estimation of variance of the difference-cum-ratio-type exponential estimator in simple random sampling

ABSTRACT

1. Introduction

2. Proposed estimators

Table 1. Some classes of the proposed estimator

Properties of the proposed estimator

3. Mathematical comparison

4. Simulation study

Table 2. Mean squared error (MSE) of the estimators using the artificial populations

Table 3. Mean squared error (MSE) of the estimators using empirical data sets

5. Numerical examples

6. Conclusion

PUBLIC INTEREST STATEMENT

Acknowledgment

Disclosure statement

Additional information

Notes on contributors

Umer Daraz

Mursala Khan

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date