![MathJax Logo](/templates/jsp/_style2/_tandf/pb2/images/math-jax.gif)
Abstract
This paper proposed an improved generalized class of estimator for estimating population variance using auxiliary variables based on simple random sampling without replacement. The expression of mean square error of the proposed estimator is obtained up to the first order of approximation. We have derived the conditions for the parameters under which the proposed estimator performs better compared to the usual estimator and other existing estimators. An empirical study and simulation study are also carried out with the support of theoretical results.
Public Interest Statement
Variance estimation plays an important role in survey sampling. It is used for two purposes. One is the analytic purpose such as for constructing confidence intervals or performing hypothesis testing and another one is for descriptive purpose such as to evaluate the efficiency of the survey designs or to provide estimates for planning surveys. Several authors have paid their attention towards formation of estimators for the estimation of population variance using auxiliary information. Motivated by their works, we have proposed an improved generalized class of estimators for estimation of population variance using auxiliary information. The expression of mean square error is derived up to the first order of approximation. We have compared our proposed estimator with the other existing estimators. The efficiencies of the estimators are validated using real population data-sets and simulation study.
1. Introduction
Through sampling literature it is well established that the use of Auxiliary information in Sampling Survey at the estimation stage improve the efficiency of the estimators of the population parameters. viz. population mean, variance, median, population correlation coefficient, etc. Ratio, regression and product method of estimation are good examples in this context. Several author including Ahmed, Raman, and Hossain (Citation2000), Gupta and Shabbir (Citation2008), Singh and Solanki (Citation2009–2010, Citation2013), Yadav and Kadilar (Citation2013), Solanki and Singh (Citation2013) proposed estimators of population variance using information of auxiliary variable under different sampling schemes. Asghar, Sanaullah, and Hanif (Citation2014) proposed generalized exponential-type estimators using information on auxiliary variable for estimating the population variance . Singh, Chauhan, Sawan, and Smarandache (Citation2009) and Adichwal, Sharma, and Singh (Citation2015) proposed estimator of population variance using two auxiliary variables and suggested to use two auxiliary variables, if they are made available. In this paper, we have also proposed an improved estimator for the population variance
based on simple random sampling without replacement utilizing information of two auxiliary variables X and Z.
Let us consider a simple random sample (SRS) of size n is drawn from the given population of N units. Let the value of the study variable Y and the auxiliary variables X and Z for the ith units (i = 1, 2, 3, …, N) of the population be denoted by Yi, Xi, and Zi for the ith unit in the sample (i = 1, 2, 3, …, n) by yi, xi and zi, respectively. From the sample observation we have
Let us define
such that E(∊x) = E(∊y) = E(∊z) = 0
where, ,
; p, q, r being the non-negative integers.
2. Estimators in literature
In order to estimate population variance of the study variable Y, using the information of two auxiliary variables X and Z, Singh et al. (Citation2009) proposed a general class of exponential estimator given as(2.1)
(2.1)
Adichwal et al. (Citation2015) proposed the following two generalized class of estimators for estimation of population variance using two auxiliary variables as(2.2)
(2.2)
(2.3)
(2.3)
The mean square error (MSE) expressions of the estimators t1, tN and tM are, respectively, given by(2.4)
(2.4)
where, (2.5)
(2.5)
where, (2.6)
(2.6)
where,
3. Proposed estimators
In this paper, we have proposed a generalized exponential-type estimator for population variance for the study variable Y based on simple random sampling without replacement using information of two auxiliary variable X and Z given by(3.1)
(3.1)
where η, ψ, a, and b are suitable chosen constants to be determined such that the MSE of t is minimum.
Expanding Equation (3.1) in terms of e’s up to the first order of approximation, we have(3.2)
(3.2)
(3.3)
(3.3)
(3.4)
(3.4)
where, and
Squaring both sides of Equation (3.4) and taking its expectations we get the MSE expressions of the estimator t as,(3.5)
(3.5)
Partially differentiating Equation (3.5) with respect to λ, η*, and ψ* and equating it to zero, we get the optimum value of λ, η* and ψ* as
Substituting the optimum value of ,
and
in Equation (3.5), we obtain the minimum MSE associated with the estimators t as,
(3.6)
(3.6)
4. Efficiency comparison
In this section, we are comparing the minimum MSE of the proposed estimator t with usual estimator and other existing estimators.
The variance of the usual estimator under SRSWOR is given by
(4.1)
(4.1)
(4.2)
(4.2)
(4.3)
(4.3)
(4.4)
(4.4)
When the conditions (4.2) to (4.4) are satisfied, our suggested estimator t will be more efficient as compared to , t1, tN and tM respectively.
5. Empirical study
To illustrate the performance of various estimators of , we consider the following data sets
Population I [Source: Sarjinder Singh (Citation2003, p. 1116)].
y: Fish caught in year 1995, x: Fish caught in year 1993, z: Fish caught in year 1994,
N = 69, n = 25.
∂400 = 7.7685, ∂040 = 9.9860, ∂004 = 9.9851, ∂220 = 8.3107, ∂202 = 8.1715, ∂022 = 9.6631, .
Population II [Source: Murthy (Citation1967, p. 399)].
y: Area under wheat in 1964, x: Area under wheat in 1963, z: Cultivated area in 1961,
N = 34, n = 15.
∂400 = 3.7879, ∂040 = 2.9123, ∂004 = 2.8082, ∂220 = 3.1046, ∂202 = 2.9790, ∂022 = 2.7379, .
From Table , it is clear that the MSE of the estimator “t” is less as compared to the estimators , tN, tM and t1. In terms of PRE’s, it is observed that the PRE of the proposed estimator t is higher as compared to PRE’s of the estimators
, tN, tM, and t1 for both the given data-sets. So, the estimator t is more efficient than the estimator
and other existing estimators.
Table 1. PRE of various estimators with respect to ![](//:0)
![](//:0)
6. Simulation study
This section presents the computational procedure for the comparison of proposed estimator with other existing estimators. The simulation study is based on the algorithm proposed by Reddy, Rao, and Boiroju (Citation2010) to illustrate the performance of various estimators of . The following algorithm explain the simulation procedure used in this paper.
Step-1: Generate two independent random variable X from N and X1 from N
using box-Muller method (Jhonson, Citation1987).
Step-2: Set where 0 < ρ = 0.4, 0.6, 0.8 < 1.
Step-3: return the pair (Y, X).
Step-4: Consider the population-I with the parameters μ = 5,σ = 3, μ1 = 5 and σ1 = 3 in step-1 and repeat the steps 1 to 3 for 1000 times. This population will contain the same variance for the variable Y and X.
Step-5: Similarly, generate the population-II with the parameters μ = 3, σ = 2, μ1 = 5 and σ1 = 3 in step-1 and repeat the steps 1 to 3 for 1000 times. This population will have different variances for the variable Y and X.
Step-6: From the population of size N = 1000, draw 2000 SRSs without replacement of size n = 40, 50 and 60.
Step-7: The Average mean squared error (MSE) of the estimators are defined by
The PRE of an estimator t with respect to the usual estimator is defined by
The obtained results of the simulation study are as follows.
From Tables and we observe that as the sample size and the value of correlation coefficient increases, the average PRE’s of the estimators also increases for both the population I and II. The average PRE of the estimator “t” is higher in all cases indicating that the proposed estimator t is more efficient as compared to usual estimator and other existing estimators tN, tM, and t1.
Table 2. Comparison of proposed estimator t with other existing estimators for Population-I
Table 3. Comparison of proposed estimator t with other existing estimators for Population-II
7. Conclusion
This paper proposed an improved generalized class of estimator for estimating population variance based on simple random sampling without replacement using information of two auxiliary variables. The performance of the proposed estimator is verified by using two real population data-sets and by simulation study. Tables clearly show that the proposed estimator t is more efficient as compared to the usual estimator and other existing estimators. Hence, it is recommended for use in practice.
Funding
The authors received no direct funding for this research.
Additional information
Notes on contributors
Rajesh Singh
Nitesh K. Adichwal is a research fellow at the Banaras Hindu University in Varanasi. His research interest is in the area of sampling survey and demography. He has completed his MSc in Statistics from Purvanchal University, Jaunpur in 2010. He has also completed his MSc in Population Studies from International institute for population sciences, Mumbai in 2014. This paper has also published research papers in the area of sampling survey and demography in various national and international journals.
References
- Adichwal, N. K., Sharma, P., & Singh, R. (2015). Generalized class of estimators for population variance using information on two auxiliary variables. International Journal of Applied and Computational Mathematics, 1–11.
- Ahmed, M. S., Raman, M. S., & Hossain, M. I. (2000). Some competitive estimators of finite population variance multivariate auxiliary information. Information and Management Sciences, 11(1), 49–54.
- Asghar, A., Sanaullah, A., & Hanif, M. (2014). Generalized exponential- type estimator for population variance in survey sampling. Revista Colombiana de Estadística, 37(1), 211–222.
- Gupta, S., & Shabbir, J. (2008). Variance estimation in simple random sampling using auxiliary information. Hacettepe Journal of Mathematics and Statistics, 37, 57–67.
- Jhonson, M. E. (1987). Multivariate Statistical Simulation. New York, NY: John Wiley & Sons.
- Murthy, M. N. (1967). Sampling theory and methods. Calcutta: Statistical Publishing Society.
- Reddy, M. K., Rao, K. R., & Boiroju, N. K. (2010). Comparison of ratio estimators using Monte Carlo simulation. International Journal of Agriculture and Statistical Sciences, 6(2), 517–527.
- Singh, H. P., & Solanki, R. S. (2013). A new procedure for variance estimation in simple random sampling using auxiliary information. Statistical Papers, 54(2), 479–497.10.1007/s00362-012-0445-2
- Singh, H. P., & Solanki, R. S. (2009–2010). Estimation of finite population variance using auxiliary information in presence of random non-response. Gujarat Statistical Review, 36 & 37 (1 & 2), 46–58.
- Singh, R., Chauhan, P., Sawan, N., & Smarandache, F. (2009). Improved exponential estimator for population variance using two auxiliary variables. Studies in Statistical Inference, Sample Techniques and Demography, 36–44.
- Singh, S. (2003). Advanced sampling theory with applications: How Michael”“ Selected”“ Amy(Vol. 2). Springer Science & Business Media.10.1007/978-94-007-0789-4
- Solanki, R. S., & Singh, H. P. (2013). An Improved class of estimators for the population variance. Model Assisted Statistics and Applications, 8(3), 229–238.
- Yadav, S. K., & Kadilar, C. (2013). Improved exponential type ratio estimator of population variance. Revista Colombiana de Estadística, 36(1), 145–152.