![MathJax Logo](/templates/jsp/_style2/_tandf/pb2/images/math-jax.gif)
ABSTRACT
Recently, Haq et al. [A new estimator of finite population mean based on the dual use of the auxiliary information. Commun Stat Theory Methods. 2017;46(9):4425–4436] utilized the dual auxiliary information under simple random sampling only. Motivated by their idea, we initiated the dual use of auxiliary variable under a stratified random sampling scheme. Dual use of auxiliary variable consists: (1) the original auxiliary information and (2) the ranked auxiliary information. We proposed new optimal exponential-type estimators for the estimation of the finite population mean. Mathematical properties such as bias and mean squared error of the proposed estimators are derived. Monte Carlo simulation studies are included to successfully validate the theoretical results. Moreover, the applicability of the proposed estimators is highlighted through empirical interpretation with the help of a real-life data set. It is clearly identified from the numerical results that our proposed estimators are more efficient over the competitors.
1. Introduction
One of the objectives of sample survey theory is to estimate the unknown population parameters of the study variable such as population total, mean, proportion, ratio and variance etc. A procedure is desirable that provides a precise estimator of the parameter of interest by surveying a suitably chosen sample of individuals. Supplementary/additional information provided by an auxiliary variable which is correlated with the study variable enhances the precision of the estimators. Survey statisticians take advantage of this information whenever it is available to explore the efficient estimators. Ratio, product, regression and their modified estimators are best examples in this regard.
An elaborate literature has grown for identifying more efficient estimators under different sampling designs, e.g. simple random sampling, stratified random sampling, cluster sampling, systematic sampling and etc. Simple random sampling does not produce administrative convenience and representative sample for a heterogeneous population. As it does not capture the diversity which is likely to be mined through stratified random sampling. Stratified random sampling is one of the possible ways to increase the precision of the estimates. It is a powerful and flexible method that is widely used in practice. Many researchers, such as Kadilar and Cingi [Citation1,Citation2], Koyuncu and Kadilar [Citation3,Citation4], Singh and Vishwakarma [Citation5], Shabbir and Gupta [Citation6], Haq and Shabbir [Citation7], Singh and Solanki [Citation8], Yadav et al. [Citation9], Solanki and Singh [Citation10,Citation11], Aslam [Citation12], Bhatti et al. [Citation13], Javed et al. [Citation14], Marin et al. [Citation15–17], etc. have contributed to estimate the finite population mean under stratified random sampling scheme. All these contributions and alike published work under a stratified random sampling scheme are based on only the utilization of original auxiliary information. None of them tried the dual use of auxiliary information to enhance the estimation procedure.
Recently, Haq et al. [Citation18] used an additional information of the auxiliary variable called ranked auxiliary variable to develop efficient estimators for the estimation of mean. These estimators are developed only to cope with the simple random sampling scheme.
Here, comes a new challenge/idea for exploring more optimal estimators using dual use of auxiliary information to deal with the stratified random sampling scheme. This challenge is successfully meet and new optimal estimators for finite population mean are developed under a stratified random sampling scheme in this article.
The remaining part of the paper is organized as follows: In section 2, procedures, notations and various estimators under stratified random sampling are introduced. In section 3, proposed estimators for estimating finite population mean using the original and ranked auxiliary information are defined. In section 4, an empirical study is carried out to evaluate the performance of the proposed estimators. Monte Carlo simulation studies are included to successfully validate the theoretical results in section 5. Finally, concluding remarks are enclosed in the last section.
2. Procedure, notations and review of literature
Consider be a finite population of size
and is divided into L homogenous strata with
stratum containing
units with the condition that
Under the condition
, a sample of size
is drawn under simple random sampling without replacement (SRSWOR) from
stratum. Let
We define the following relative error terms and their expectations to drive the expressions for bias, MSE and minimum MSE of the proposed estimators.
such that
Let us define,
(2.1)
(2.1)
Using (2.1), we can write as:
(2.2)
(2.2) and
(2.3)
(2.3) where
Some well-known estimators for population mean under stratified random sampling scheme are detailed below. All these estimators are based on only original auxiliary information.
2.1. Usual unbiased, combined ratio and combined regression estimators are detailed below
(2.4)
(2.4)
(2.5)
(2.5)
(2.6)
(2.6)
2.2. Haq and Shabbir [7] proposed two exponential ratio-type families of estimators detailed below
(2.7)
(2.7)
(2.8)
(2.8) where η is the suitable constant,
and b
st are either real numbers or functions of known parameters of the auxiliary variable.
2.3. Singh and Solanki [8] proposed a family of estimators as given below
(2.9)
(2.9)
where and
are defined earlier.
Remark 2.1:
reduces to the ratio-type
, product-type
and ratio-cum-product-type
estimators by placing the suitable values of the constants as:
, respectively.
2.4. Given below is the class of estimators suggested by Solanki and Singh [9]
(2.10)
(2.10)
where and
are defined earlier.
Remark 2.2:
reduces the following different estimators by placing different values of
in (2.10) as:
for
.
for
.
for
.
. for
.
for
.
for
.
2.5. Recently, Solanki and Singh [10] defined an improved estimation given as
(2.11)
(2.11) where
are either real number to parameters related to auxiliary variate X.
Remark 2.3:
For obtaining different class of estimators , assume the different values of the constants
. in Equation (2.11) as:
for
.
for
.
for
.
Remark 2.4:
The optimal weights of are determined for minimizing the MSE’s of estimators mentioned in (2.7)–(2.11).
where
Remark 2.5:
By placing the suitable weights in corresponding estimators, we have the following minimum MSE’s of above-said estimators.
(2.12)
(2.12)
(2.13)
(2.13)
(2.14)
(2.14)
(2.15)
(2.15)
(2.16)
(2.16)
(2.17)
(2.17)
(2.18)
(2.18)
(2.19)
(2.19)
3. Proposed estimators
In this section, two new exponential-type estimators are proposed for the estimation of population mean using dual auxiliary information in stratified random sampling. Dual auxiliary information refers to the double use of auxiliary variable (i) the original/actual measurements of the auxiliary variable and (ii) the use of ranks of the auxiliary variable. Mathematical properties such as bias and mean square error (MSE) of the proposed estimators are derived up to first order of approximation. The bias of an estimator is the difference between the estimator's expected value and the true value of the parameter being estimated i.e. and MSE can be defined as the divergence of the estimator values from the true parameter value i.e.
3.1. First proposed estimator
(3.1)
(3.1) where
are the suitably chosen weights.
The bias and MSE of are given below
(3.2)
(3.2) and
(3.3)
(3.3) where
The optimal weights are obtained by minimizing Equation (3.3), so
Inserting optimal weights of in Equation (3.3), the minimum MSE of the proposed estimator is
(3.4)
(3.4) where
3.2. Second proposed estimator
(3.5)
(3.5) where
are the suitably chosen weights.
The bias and MSE of are given below
(3.6)
(3.6) and
(3.7)
(3.7)
By minimizing Equation (3.7), the optimal weights are as under:
Inserting optimal weights of in Equation (3.7), the minimum MSE of the proposed estimator is
(3.8)
(3.8) where
4. Application on a real data
In this section, we compare the performance of newly proposed estimators with the traditional unbiased, combined ratio and combined regression estimators and existing estimators, i.e. Haq and Shabbir [Citation7], Singh and Solanki [Citation8] and Solanki and Singh [Citation10,Citation11]. We considered a real-life data set of Turkey (2007) used by Koyuncu and Kadilar [Citation3]. For the remaining characteristics of the data set, interested readers may refer to Koyuncu and Kadilar [Citation3]. Necessary data statistics are given in Table .
Table 1. Data statistics.
We calculated the MSEs of the proposed and competing exponential-type estimators and are presented in Table . Table reveals that the proposed estimators have smaller MSE values i.e. (57.0590 and 67.9338) among all the reviewed exponential-type estimators i.e. .
5. Simulation study based on real data
In the previous section, it is clearly observed that proposed estimators are efficient over the competing estimators. In addition, this superiority is assessed through a Monte Carlo simulation study using R software. Again, the real population presented in Table is used. We considered different sample sizes through the proportional allocation method. The steps of a simulation study to find the average MSE of an estimator are as follows:
Step 1: Select a bivariate stratified sample of size using SRSWOR from the bivariate stratified population.
Step 2: Use sample data from step 1 to find the MSE of all the estimators under study.
Step 3: The whole procedure is repeated 30,000 times and obtain 30,000 values i.e. for MSEs.
Step 4: Average MSE of each estimator is calculated as:
Tables – present the minimum mean square errors provided by the simulation study. It is quite obvious, as in the previous section, that the proposed estimators have the least MSEs over all the competing estimators under study in different sample sizes i.e.
The sequel of the above findings, the performance of the proposed estimators is the best among all the reviewed estimators under study.
6. Concluding remarks
Several estimators for the estimation of finite population mean based only on original auxiliary information under stratified random sampling are available in the literature. Haq et al. [Citation18] built up a family of estimators for evaluating the population mean under simple random sampling scheme by using additional information of the auxiliary variable called ranked auxiliary variable. First time in this manuscript, new optimal estimators are suggested for the estimation of population mean by using the original and the ranked auxiliary information under a stratified random sampling scheme. Mathematical properties such as bias, mean square error (MSE) and minimum MSE of the proposed estimators are derived up to the first degree of approximation. Both real-life applications and simulation studies are performed to access the potentiality of the proposed estimators over the competitors. Numerical findings confirmed that the proposed estimators have the minimum mean square errors than all the other existing estimators such as usual unbiased, combined ratio, combined regression, Haq and Shabbir [Citation7], Singh and Solanki [Citation8] and Solanki and Singh [Citation10,Citation11]. Therefore, new proposed estimators under stratified random sampling are very attractive to the survey statisticians.
The possible extension of this current work to estimate the: (1) finite population mean under other sampling designs like stratified double sampling and different rank set sampling schemes, etc.; (2) other unknown finite population parameters including median, variance, interquartile range and proportions, etc.; (3) population mean of a sensitive variable in the presence of sensitive and non-sensitive auxiliary information.
Disclosure statement
No potential conflict of interest was reported by the author(s).
References
- Kadilar C, Cingi H. Ratio estimator in stratified random sampling. Biom J. 2003;45:218–225. doi: 10.1002/bimj.200390007
- Kadilar C, Cingi H. A new estimator in stratified random sampling. Commun Stat Theory Methods. 2005;34:597–602. doi: 10.1081/STA-200052156
- Koyuncu N, Kadilar C. Ratio and product estimators in stratified random sampling. J Stat Plan Inference. 2009;139:2552–2558. doi: 10.1016/j.jspi.2008.11.009
- Koyuncu N, Kadilar C. On improvement in estimating population mean in stratified random sampling. J Appl Stat. 2010;37(6):999–1013. doi: 10.1080/02664760903002675
- Singh HP, Vishwakarma GK. A family of estimators of population mean using auxiliary information in stratified random sampling. Commun Stat Theory Methods. 2008;37:1038–1050. doi: 10.1080/03610920701713237
- Shabbir J, Gupta S. On estimating finite population mean in simple and stratified random sampling. Commun Stat Theory Methods. 2011;40(2):199–212. doi: 10.1080/03610920903411259
- Haq A, Shabbir J. Improved family of ratio estimators in simple and stratified random sampling. Commun Stat Theory Methods. 2013;42(5):782–799. doi: 10.1080/03610926.2011.579377
- Singh HP, Solanki RS. An efficient class of estimators for the population mean using auxiliary information. Commun Stat Theory Methods. 2013;42:145–163. doi: 10.1080/03610926.2011.575519
- Yadav R, Upadhyaya LN, Singh HP, et al. Improved ratio and product exponential type estimators for finite population mean in stratified random sampling. Commun Stat Theory Methods. 2014;43(15):3269–3285. doi: 10.1080/03610926.2012.694547
- Solanki RS, Singh HP. An efficient class of estimators for the population mean using auxiliary information in stratified random sampling. Commun Stat Theory Methods. 2014;43:3380–3401. doi: 10.1080/03610926.2012.700378
- Solanki RS, Singh HP. An improved estimation in stratified random sampling. Commun Stat Theory Methods. 2016;45(7):2056–2070. doi: 10.1080/03610926.2013.826367
- Aslam M. Design of the Bartlett and Hartley tests for homogeneity of variances under indeterminacy environment. J Taibah Univ Sci. 2020;14(1):6–10. doi: 10.1080/16583655.2019.1700675
- Bhatti MM, Ellahi R, Zeeshan A, et al. Numerical study of heat transfer and hall current impact on peristaltic propulsion of particle-fluid suspension with compliant wall properties. Mod Phys Lett B. 2019;33(35):1950439. doi: 10.1142/S0217984919504396
- Javed M, Irfan M, Pang T. Utilizing bivariate auxiliary information for enhanced estimation of population mean under simple and stratified random sampling schemes. J Natl Sci Found Sri. 2019;47(2):199–211.
- Marin M, Vlase M, Paun M. Considerations on double porosity structure for micropolar bodies. AIP Adv. 2015;5(3):037113, doi:10.1063/1.4914912.
- Marin M, Ellahi R, Chirila A. On solutions of Saint-Venant's problem for elastic dipolar bodies with voids. Carpathian J Math. 2017;33(2):219–232.
- Marin M, Vlase S, Ellahi R, et al. On the partition of energies for the backward in time problem of thermoelastic materials with a dipolar structure. Symmetry. 2019;11(7):863, doi:10.3390/sym11070863.
- Haq A, Khan M, Hussain Z. A new estimator of finite population mean based on the dual use of the auxiliary information. Commun Stat Theory Methods. 2017;46(9):4425–4436. doi: 10.1080/03610926.2015.1083112