Abstract
Calibration methods play a vital role in improving the accuracy of parameter estimates by effectively integrating information from various data sources. In the context of population parameter estimation, memory-type statistics—such as the exponentially weighted moving average (EWMA), extended exponentially weighted moving average (EEWMA), and hybrid exponentially weighted moving average (HEWMA)—leverage both current and historical data. This study proposes new ratio and product estimators within a calibration framework that utilizes these memory-type statistics. A simulation study is conducted to evaluate the performance of the proposed estimators. The mean squared error (MSE) and relative efficiency (RE) are computed, accompanied by graphical representations to illustrate the behavior of the estimators. The performance of the proposed estimators is compared with existing memory-type estimators. Furthermore, a real-world application is presented to validate the effectiveness of the proposed methods. The comparison, based on MSE and RE across different values of the smoothing constant, demonstrates that the calibration-based memory-type estimators outperform the existing approaches. The results indicate that the proposed estimators consistently achieve lower MSE and higher RE, confirming their superiority over traditional methods.
Similar content being viewed by others
Introduction
Parameter estimation is crucial for heterogeneous populations in survey sampling. Past information is considered highly significant and beneficial in improving the estimation process. When the past (auxiliary) information is positively and linearly correlated with the study variable, the ratio estimator is generally preferred for estimating population parameters. Conversely, if the relationship between the auxiliary variable and the study variable is negatively linear, the product estimator becomes more appropriate. The simple ratio estimator, as proposed by Cochran1, is given by:
The approximate mean square error (MSE) expression for ratio estimator is presented as
The simple product estimator, proposed by Robson2, is given by:
The MSE for product estimator has the form
By incorporating both past and current information, the exponentially weighted moving average (EWMA) statistic was first introduced by Roberts3 to enhance the efficiency of estimators. The EWMA statistic is defined as follows:
where \(\:t>0\). As the weight parameter \(\:\lambda\:\) varies from 0 to 1, greater emphasis is placed on the current study information, while the influence of past (auxiliary) data correspondingly decreases. The EWMA statistic, as defined in Eq. (5), reduces to the simple sample average when the smoothing constant \(\:\lambda\:=1\). In this context, \(\:{Z}_{t-1}\) in Eq. (5) represents the previous value of the EWMA statistic. The initial value \(\:{Z}_{0}\) is typically set as the expected average. The expressions for the mean and variance of the EWMA statistic are given as follows:
and
As time t approaches to infinity (t → ∞), the variance of \(\:{Z}_{t}\) reduces to
The EWMA statistic in Eq. (5) was used by Noor-ul-Amin4 to advise the memory type estimators. The EWMA statistic for study variable \(\:y\) in stratified sampling is denoted by \(\:{Z}_{st}\) as follows:
The EWMA statistic for the auxiliary variable \(\:x\), as proposed by Aslam et al.5, is denoted by \(\:{Q}_{st}\) and is given by:
The projected memory type EWMA ratio and product estimators under stratified sampling can be written as:
and
The estimated MSE expressions for the stratified memory-type ratio and product estimators are given, respectively, as follows:
and
The extended exponentially weighted moving average (EEWMA) statistic for the study variable \(\:y\), as proposed by Naveed et al.6, is denoted by \(\:{Z}_{te}\) and is defined as follows:
The EEWMA statistic for the auxiliary variable \(\:x,\) symbolized by \(\:{Q}_{te}\), takes the form
.
The variance for EEWMA statistic is presented as
The memory-type ratio and product estimators based on the EEWMA statistic, as developed by Zahid et al.7, are expressed as follows:
and
The MSEs for both memory type ratio and product estimators can be presented as
and
Further, the hybrid exponentially weighted moving average (HEWMA) statistic by Haq8 is defined as:
Furthermore, the hybrid exponentially weighted moving average (HEWMA) statistic, introduced by Haq8, is defined as follows:
and
The MSE expressions for the memory-type ratio and product estimators, as presented by Noor-ul-Amin21, are given as follows:
Survey sampling has established that the linear regression estimator performs efficiently when the regression line of the study variable (\(\:Y\)) passes through the origin and the variance of the study variable is proportional to the auxiliary variable (\(\:X\)). Under these conditions, it often outperforms both ratio and product estimators9.
Bhushan et al.10,11 proposed effective combined and separate-type estimators for population mean estimation under a stratified sampling framework, supported by real data applications. In a related development, Song and Kawai12 introduced an adaptive approach within stratified sampling for estimating failure probabilities. Additionally, Pandey et al.13 developed a calibration estimator for population variance under two-phase stratified sampling by incorporating non-sampling errors through calibrated weights.
The use of auxiliary information has been widely recognized in the literature for improving the efficiency of point estimators. Several researchers have contributed to this area. For instance, Koyuncu and Kadilar15 extended a general class of estimators to the stratified random sampling technique. Yasmeen et al.16 proposed an exponential estimator for estimating the population mean using a transmuted ancillary variable, while Malik and Singh17 introduced exponential-type estimators utilizing two auxiliary variables in a stratified context.
Kumar et al.18 studied the estimation of the population mean in the presence of non-response and measurement error, using two auxiliary variables. Raza et al.19 proposed a ratio-type regression estimator that is robust to outliers through the use of re-descending M-estimators. Saini and Kumar20 applied stratified and ranked set sampling techniques to improve population mean estimation under stratification.
Noor-ul-Amin21 was among the first to propose memory-type ratio and product estimators based on the exponentially weighted moving average (EWMA) and hybrid EWMA (HEWMA) statistics. Kumar et al.22 later extended this approach by introducing generalized ratio and product estimators using the HEWMA statistic under simple random sampling (SRS) without replacement. Bhushan et al.11 addressed the estimation of the population mean using EWMA statistics under SRS and proposed memory-type logarithmic estimators, while Bhushan et al.23 further evaluated the performance of these log memory-type estimators under the same sampling scheme.
However, in certain situations, ratio and product-type estimators may not outperform the traditional linear regression estimator. As a result, these estimators are often limited by the requirement that their efficiency should not be lower than that of the regression estimator. In much of the previous literature, it has been observed that the performance of such estimators depends on specific conditions that must be satisfied to achieve improved estimation.
To address these limitations, a widely adopted approach known as calibration has been introduced. Calibration techniques are commonly used in survey sampling to enhance the estimation of population parameters. Deville and Särndal24 were the first to propose the calibration estimator, which has since been extended by numerous researchers. The calibration method relies on minimizing a distance function to reduce the discrepancy between the original sampling weights and the new calibrated weights, while satisfying a set of calibration constraints.
Furthermore, Wu and Sitter25 generalized the calibration estimation framework by developing a pseudo-empirical likelihood method that incorporates model-based calibration. This approach provides efficient estimators for quadratic and other second-order finite population functions.
Furthermore, Hidiroglou and Särndal26 applied the calibration procedure within a two-phase sampling framework using a two-step approach, and discussed its application for estimating variances and domain-specific parameters. To estimate the parameters of linear equations, particularly in cases where the equations have no exact solution, Berge27 employed the calibration technique as an alternative method. Additionally, Harms and Duchesne28 constructed an estimator based on population quantiles in the presence of auxiliary data and compared its performance with other quantile-based calibration estimators.
Calibration techniques are frequently used to incorporate historical data, enhancing the accuracy of population parameter estimation, as noted by Kim et al.29. Koyuncu and Kadilar30 also proposed a novel estimator within the calibration framework for estimating the population mean. Sud et al.31 introduced an estimator for the population total under the assumption that the auxiliary and study variables are negatively correlated. More recently, Jabeen et al.32 developed a new strategy under the randomized response technique, integrating a calibration scheme that minimizes the distance between the original and calibrated weights.
Further contributions to calibration methodology under stratified sampling were made by Singh et al.33, followed by several other researchers, including34,35,36,37,38. The latter proposed two calibrated estimators within the stratified sampling framework that incorporate both the expected value and the coefficient of variation information for each stratum. The main objective of defining calibrated weights is to minimize the chi-squared distance measure, subject to newly defined calibration constraints. In stratified random sampling, the calibration approach is commonly used to determine optimal weights across strata. Additionally, Clement et al.39 proposed estimators for domain totals using the calibration technique under a stratified sampling design.
The primary objective of this study is to develop new memory-type calibrated estimators for estimating the population mean, extending the methodologies introduced by Noor-ul-Amin4,5,21. Specifically, the study proposes memory-type ratio and product estimators within the framework of calibration estimation, employing a chi-square distance function and subject to calibration constraints. To assess the efficiency of the proposed estimators, a comprehensive simulation study is carried out, comparing their performance with existing estimators using MSE and RE, supported by graphical illustrations. Additionally, a real-life dataset is analyzed to demonstrate the practical applicability and effectiveness of the proposed estimators.
Simple ratio estimator in stratified sampling
This section provides an overview of existing ratio estimators in stratified random sampling. The sample means of the study and auxiliary variables in stratified random sampling, as defined by Kadilar and Cingi40, are given by:
and
where k denotes the number of strata. The ratio estimator in stratification is given by
Considering that each stratum may have a different population mean, the ratio estimator in stratified random sampling can be expressed as:
The mean of the estimator presented in Eq. (22) is given by
The expression for variance can be written as
Stratified memory type estimators
The memory type stratified ratio and product estimators were suggested by Aslam et al.5 to estimate the population mean. The simple memory type ratio and product estimators by Noor-ul-Amin4 are defined as
The stratified memory type ratio estimator can be written as
where\(\:\:{Z}_{th}=\lambda\:{\stackrel{-}{y}}_{st}+(1-\lambda\:){Z}_{th-1}\) and \(\:{Q}_{th}=\lambda\:{\stackrel{-}{x}}_{st}+(1-\lambda\:){Q}_{th-1}\) are the EWMA statistic for under study and ancillary variables respectively. The expected value of the stratified estimator can be obtained as
The MSE for the estimator \(\:{y}^{*}\) is given by
Likewise, the MSE of stratified product memory type estimator is defined by
Proposed calibration estimation of memory type estimators under stratified sampling design
The discrepancy between the calibrated and original weights, as well as the associated distance measure, can be minimized through the calibration process. The primary objective of the calibration method is to provide more precise and accurate estimates compared to conventional estimators. By leveraging various relationships between the study and auxiliary variables, we develop calibrated memory-type ratio and product estimators under the stratified random sampling scheme to estimate the population mean of the study variable. Specifically, we modify the estimators proposed by Aslam et al.5 and introduce the following improved versions.
Calibrated memory type estimators using EWMA statistic
The calibrated memory-type ratio and product estimators in stratified random sampling, assuming that each stratum has a different population mean, are defined as follows:
and
The calibration constraint, which reflects the association between the auxiliary variable and the study variable, is given as follows:
To minimize the calibrated weights, the following distance measure is employed:
Using the distance measure and calibration constraint specified in Eqs. (32) and (31), respectively, the Lagrange multiplier can be derived as follows:
Differentiating Eq. (32) with respect to \(\:{W}_{h}^{*}\) and equating it to zero, we obtain
Substituting the expression of \(\:{W}_{h}^{\text{*}}\) into the calibration constraint given in Eq. (31) yields:
By simplifying the above expression, the resulting value of \(\:\lambda\:\) is obtained as follows:
.
Substituting the value of \(\:\lambda\:\) into Eq. (34) yields the calibrated weight:
Substituting the value of \(\:{W}_{h}^{*}\) from Eq. (34) into Eq. (29), the expression for the memory-type ratio estimator in stratified sampling is obtained as follows:
After simplification, the memory-type calibrated ratio estimator in stratified sampling is obtained as:
Similarly, the calibrated memory-type product estimator in stratified sampling is given by:
Calibrated memory type estimators using EEWMA statistic
Assuming that each stratum has a different population mean, the suggested calibrated memory-type ratio and product estimators based on the Exponentially Weighted Moving Average (EEWMA) statistic are defined as follows:
-
Calibrated memory-type ratio estimator using EEWMA:
-
Calibrated memory-type product estimator using EEWMA:
where \(\:{Z}_{teh}={\lambda\:}_{1}{\stackrel{-}{y}}_{st}{-\lambda\:}_{2}{\stackrel{-}{y}}_{st-1}+\left(1-{\lambda\:}_{1}+{\lambda\:}_{2}\right){Z}_{teh-1}\) and \(\:{Q}_{teh}={\lambda\:}_{1}{\stackrel{-}{x}}_{st}{-\lambda\:}_{2}{\stackrel{-}{x}}_{t-1}+(1-{\lambda\:}_{1}+{\lambda\:}_{2}){Q}_{teh-1}\) are EEWMA statistic for study as well as assisting variables respectively.
Calibrated memory type estimators using HEWMA statistic
Using the HEWMA statistic, the calibrated memory-type ratio and product estimators in stratified random sampling—assuming each stratum has a different population mean—are defined as follows:
-
Calibrated memory-type ratio estimator (HEWMA-based):
-
Calibrated memory-type product estimator (HEWMA-based):
where \(\:{Z}_{thh}\) and \(\:{Q}_{thh}\) are the HEWMA statistics for the study and auxiliary variables in the hth stratum, respectively. These are recursively defined as:
-
$$\:{E}_{tyh}={\lambda\:}_{2}{\stackrel{-}{y}}_{st}+\left(1-{\lambda\:}_{2}\right){E}_{tyh-1},$$
-
$$\:{Z}_{thh}={\lambda\:}_{1}{E}_{tyh}+\left(1-{\lambda\:}_{1}\right){Z}_{thh-1},$$
-
$$\:{E}_{txh}={\lambda\:}_{2}{\stackrel{-}{x}}_{st}+\left(1-{\lambda\:}_{2}\right){E}_{txh-1},$$
-
$$\:{Q}_{thh}={\lambda\:}_{1}{E}_{txh}+\left(1-{\lambda\:}_{1}\right){Q}_{thh-1},$$
where \(\:{\lambda\:}_{1}\) and \(\:{\lambda\:}_{2}\) are smoothing parameters such that \(\:0<{\lambda\:}_{1},{\lambda\:}_{2}\le\:1,\) \(\:{\stackrel{-}{y}}_{st}\) and \(\:{\stackrel{-}{x}}_{st}\) are the stratified sample means for the study and auxiliary variables, respectively, and \(\:{x}_{h}\), \(\:{q}_{h}\), \(\:{W}_{h}\), and \(\:{\mu\:}_{xh}\) are defined as per earlier notation for the hth stratum.
Simulation study
The simulation study is carried out to evaluate the performance of the proposed calibrated memory-type ratio and product estimators. Specifically, we compare the proposed calibrated estimators—\(\:{\widehat{y}}_{mr,}{\widehat{y}}_{mp},{\widehat{y}}_{mre},{\widehat{y}}_{mpe},{\widehat{y}}_{mrh},{\widehat{y}}_{mph}\)—with the memory-type estimator developed by Noor-ul-Amin4. Additionally, a detailed comparative analysis is performed between the proposed calibrated estimators and the simple stratified memory-type estimators introduced by Aslam et al.5, under the assumption that the auxiliary variable has different means across strata.
The MSE and RE of the proposed estimators are computed based on 50,000 replications. The values of the correlation coefficient (ρ) for each stratum are set at 0.05,0.25,0.50,0.750.05, 0.25, 0.50, 0.750.05,0.25,0.50,0.75, and 0.950.950.95, while the smoothing parameter \(\:\lambda\:\) takes values 0.1, 0.15, 0.25, 0.50, 0.75 and 1.00, following the simulation settings of Noor-ul-Amin4.
The MSEs for each estimator are computed using the following formula:
The MSE and RE values for both the simple and stratified memory-type estimators, as well as the proposed calibrated memory-type ratio and product estimators based on EWMA, EEWMA, and HEWMA statistics, are reported in Tables 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 and 17, respectively. These tables illustrate the comparative performance of all considered estimators under various correlation levels and smoothing parameter settings.
The RE of an estimator is computed using the following formula:
where \(\:{y}^{h}={y}^{*},{y}^{**},\:{{\stackrel{-}{y}}^{M}}_{rt},\:{{\stackrel{-}{y}}^{M}}_{pt}\&\:\widehat{y}={\widehat{y}}_{mr},{\widehat{y}}_{mp},{\widehat{y}}_{mpe},{\widehat{y}}_{mre},{\widehat{y}}_{mph},{\widehat{y}}_{mrh}\)
Algorithm for computing the MSE and RE of the proposed estimators.
Following the approach of Noor-ul-Amin4:
-
1.
Generate a population of size 5000 from a bivariate normal distribution.
-
2.
Select appropriate values for the parameter \(\:\lambda\:\).
-
3.
Draw samples of sizes ni = 10, 20, 30, 50, 200, and 500 units from each stratum separately.
-
4.
Based on the samples obtained in Step 3, generate 50,000 values for each estimator.
-
5.
For each sample size, compute the MSE of the estimators using Eq. (44).
-
6.
Compute the RE of the proposed estimators using Eq. (45).
The results of the simulation study, in terms of MSEs and REs, are summarized in Tables 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 and 17. Tables 1 and 2 present the MSEs, while Table 3 reports the REs of both the proposed and existing estimators. Tables 4 and 5 display the MSEs and REs for the stratified and suggested calibrated memory-type product estimators. Furthermore, Tables 6, 7, 8, 9, 10 and 11 illustrate the MSEs and REs of the simple memory-type estimators and the proposed calibrated memory-type estimators based on the EWMA statistic. Additionally, Tables 12, 13, 14, 15, 16 and 17 provide the corresponding results using the EEWMA and HEWMA statistics, respectively.
Overall, it is evident from the tables that the proposed calibrated memory-type ratio and product estimators consistently yield lower MSE values compared to the other estimators considered. These results demonstrate that the calibrated memory-type estimators outperform their counterparts in terms of estimation efficiency.
Similarly, the MSE results for the previously developed memory-type estimators by Noor-ul-Amin4 and the proposed calibrated memory-type estimators based on different statistics are presented in the corresponding tables alongside their REs. It is evident that the MSE values for the calibrated memory-type ratio and product estimators constructed using EWMA, EEWMA, and HEWMA statistics are consistently lower compared to the existing memory-type estimators.
Moreover, it is observed that increasing the correlation coefficient (ρ) from 0.00 to 0.95 leads to a noticeable reduction in MSEs, thereby enhancing the efficiency of the proposed calibrated estimators. This trend is further supported by Figs. 1, 2, 3, 4, 5 and 6, which depict the MSEs at various smoothing constants and correlation coefficient values, highlighting the superior performance of the proposed estimators over the existing ones. Additionally, Figs. 7, 8, 9, 10, 11 and 12 illustrate the REs of the estimators, reaffirming the improved performance of the suggested calibrated memory-type estimators.
Real life application
In this section, we apply the proposed estimators to a real-time dataset involving apple production, originally considered by Kadilar and Cingi14. The dataset includes information on apple production (\(\:Y\)) and the number of apple trees (\(\:X\)) across 854 communities in Turkey for the year 1999. The data were collected by the Turkish Ministry of Education to explore practical applications. The dataset is stratified into six regions—Marmara, Aegean, Mediterranean, Central Anatolia, Black Sea, and East and Southeast Anatolia—each treated as a separate stratum. The performance of the proposed calibrated memory-type ratio estimators is evaluated by assessing the strength of association between the study and auxiliary variables in this real-world application.
We consider a sample size of \(\:n=140\), with smoothing parameters λ = 0.25, λ1 = 0.25, λ2 = 0.05. The observed statistics for the population, strata, and sample sizes are presented in Table 18.
Table 19 presents the estimated values obtained from the proposed calibrated memory type estimators, while Fig. 13 illustrates the corresponding MSEs of both existing stratified memory type estimators and the proposed calibrated estimators. From the results in Table 19 and the trends depicted in Fig. 13, it is evident that the proposed calibrated memory type ratio estimators yield more accurate and reliable estimates compared to the existing stratified memory type estimators when using the EWMA, EEWMA, and HEWMA statistics. Notably, the MSE values associated with the proposed estimators are consistently lower than those of the existing ones, highlighting the improved efficiency of the calibration approach. These findings demonstrate that incorporating memory type estimators into the calibration framework enhances estimation accuracy over time.
Conclusion
The estimation of parameters plays a crucial role in sampling techniques. The use of auxiliary information significantly enhances the accuracy and reliability of the estimates. Moreover, calibration techniques, particularly those incorporating prior information, are vital in the estimation process. These techniques utilize distance measures to minimize the discrepancy between the calibrated and original sampling weights under various calibration constraints. In this study, we proposed calibrated ratio and product-type estimators that incorporate both past sample data and current information using EWMA, EEWMA, and HEWMA statistics. A simulation study was carried out to evaluate the performance of the proposed estimators. Based on the simulation results (Tables 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 and 17), it was observed that the proposed memory-type calibrated estimators are more efficient and accurate in estimating the population mean compared to existing estimators, as reflected in their lower mean squared error (MSE) and higher relative efficiency (RE) values. Additionally, an empirical study was conducted to validate the findings of the simulation. The results, presented in Table 19, further confirmed that the proposed ratio and product-type memory estimators outperform previous estimators, exhibiting the minimum MSE values among the compared methods.
Future studies may extend the proposed memory-type calibrated estimators to more complex sampling designs such as stratified or cluster sampling. Incorporating multiple auxiliary variables could further enhance estimator efficiency and flexibility. It is also recommended to explore the robustness of these estimators in the presence of outliers or missing data. Additionally, integrating alternative memory-based statistics or adaptive calibration techniques could improve performance under dynamic conditions. Finally, applying the proposed methods to real-world datasets across various domains would help validate their practical applicability.
Data availability
The simulated facts and data statistics used to support the findings of this study are included in the article.
References
Cochran, W. The Estimation of the yields of cereal experiments by sampling for the ratio of grain to total produce. J. Agric. Sci. 30 (2), 262–275 (1940).
Robson, D. S. Applications of multivariate Polykays to the theory of unbiased ratio-type Estimation. J. Am. Stat. Assoc. 52 (280), 511–522 (1957).
Roberts, S. W. Control chart tests based on geometric moving averages. Technometrics 1 (3), 239–250 (1959).
Noor-ul-Amin, M. Memory type estimators of population mean using exponentially weighted moving averages for time scaled surveys. Commun. Stat. Theory Methods. 50 (12), 2747–2758 (2021).
Aslam, I., Noor-ul-Amin, M., Yasmeen, U. & Hanif, M. Memory type ratio and product estimators in stratified sampling. J. Reliab. Stat. Stud. 13 (1), 1–20. https://doi.org/10.13052/jrss0974-8024.1311 (2020).
Naveed, M., Azam, M., Khan, N. & Aslam, M. Design of a control chart using extended EWMA statistic. Technologies 6 (4), 1–15 (2018).
Zahid, R. et al. Combination of memory type ratio and product estimators under extended EWMA statistic with application to wheat production. Sci. Rep. 13, 13547 (2023).
Haq, A. A new hybrid exponentially weighted moving average control chart for monitoring process mean. Qual. Reliab. Eng. Int. 29 (7), 1015–1025 (2013).
Scheaffer, R. L., William, M. & Lyman, O. Elementary survey sampling. PWS-Kent Publ Co (1990).
Bhushan, S., Kumar, A. & Singh, S. Some efficient classes of estimators under stratified sampling. Commun. Stat. Theory Methods. 52 (6), 1767–1796 (2021).
Bhushan, S., Kumar, A., Al-Omari, A. I. & Alomani, G. A. Mean Estimation for time-based surveys using memory-type logarithmic estimators. Mathematics 11 (9), 2125 (2023).
Song, C. & Kawai, R. Adaptive stratified sampling for structural reliability analysis. Struct. Saf. 101, 1–16 (2023).
Pandey, M. K. et al. A general class of improved population variance estimators under non-sampling errors using calibrated weights in stratified sampling. Sci. Rep. 14, 2948 (2024).
Kadilar, C. & Cingi, H. Ratio estimators in stratified random sampling. Biometrical J. 45 (2), 218–225 (2003).
Koyuncu, N. & Kadilar, C. Ratio and product estimators in stratified random sampling. J. Stat. Plann. Inference. 139 (8), 2552–2558 (2009).
Yasmeen, U., Noor-ul-Amin, M. & Hanif, M. Generalized exponential estimators of finite population mean using transformed auxiliary variables. Int. J. Appl. Comput. Math. 1 (4), 589–598 (2015).
Malik, S. & Singh, R. A new estimator for population mean using two auxiliary variables in stratified random sampling. J. Inf. Optim. Sci. 38 (8), 1243–1252 (2017).
Kumar, S., Trehan, M. & Joorel, J. P. S. A simulation study: Estimation of population mean using two auxiliary variables in stratified random sampling. J. Stat. Comput. Simul. 88 (18), 3694–3707 (2018).
Raza, A., Noor-ul-Amin, M. & Hanif, M. Regression in ratio estimators in the re-descending M-estimator. J. Reliab. Stat. Stud. 12 (2), 1–10 (2019).
Saini, M. & Kumar, A. Ratio estimators using stratified random sampling and stratified ranked set sampling. Life Cycle Reliab. Saf. Eng. 8 (1), 85–89 (2019).
Noor-ul-Amin, M. Memory type ratio and product estimators for population mean for time-based surveys. J. Stat. Comput. Simul. 90 (17), 3080–3092 (2020).
Kumar, S., Chhaparwal, P., Kumar, K. & Kumar, P. Generalized memory-type estimators for time-based surveys: simulation experience and empirical results with birth weight dataset. Life Cycle Reliab. Saf. Eng. 13, 15–23 (2024).
Bhushan, S. et al. Evaluating the performance of memory type logarithmic estimators using simple random sampling. PloS ONE. 17 (12), e0278264 (2022).
Deville, J. C. & Sarndal, C. E. Calibration estimators in survey sampling. J. Am. Stat. Assoc. 87, 376–382 (1992).
Wu, C. & Sitter, R. R. A model-calibration approach to using complete auxiliary information from survey data. J. Am. Stat. Assoc. 96, 185–193 (2001).
Hidiroglou, M. A. & Sarndal, C. E. Use of auxiliary information for two phase sampling. Surv. Methodol. 24, 11–20 (1998).
Berge, A. Extension of calibration estimators in survey sampling. J. Am. Stat. Assoc. 94, 635–644 (1999).
Harms, T. & Duchesne, P. On calibration Estimation for quintiles. Surv. Methodol. 32, 37–52 (2006).
Kim, J. M., Sungur, E. A. & Heo, T. Y. Calibration approach estimators in stratified sampling. Stat. Probab. Lett. 77 (1), 99–103 (2007).
Koyuncu, N. & Kadilar, C. Calibration estimators using different measures in stratified random sampling. Int. J. Mod. Eng. Res. 3 (1), 415–419 (2013).
Sud, U. C., Chandra, H. & Gupta, V. K. Calibration approach-based regression-type estimator for inverse relationship between study and auxiliary variable. J. Theory Pract. 8 (4), 707–721 (2014).
Jabeen, R., Akhtar, A. & Zaka, A. Modified calibrated control charts for monitoring the population mean under stratified sampling. Qual Reliab. Eng. Int 1–21 (2022).
Singh, S., Horn, S. & Yu, F. Estimation of variance of the general regression estimator: higher level calibration approach. Surv. Methodol. 24, 41–50 (1998).
Tracy, D. S., Singh, S. & Arnab, R. Note on calibration in stratified and double sampling. Surv. Methodol. 29, 99–104 (2003).
Singh, S. Advanced Sampling Theory with Applications: How Michael ‘Selected’ Amy1 & 21–1247 (Kluwer Acad. Publ., 2003).
Singh, G. K., Rao, D. K. & Khan, M. G. M. Calibration estimator of population mean in stratified sampling. Proc. Asia-Pacific World Congr. Comput. Sci. Eng., 1–5 (2014).
Rao, D. K., Khan, M. G. M. & Khan, S. Mathematical programming on multivariate calibration Estimation in stratified sampling. World Acad. Sci. Eng. Technol. 72, 58–62 (2012).
Rao, D. K., Khan, M. G. M. & Reddy, K. G. Stratified calibration estimator of population mean using multi-auxiliary information. Proc. Asia-Pacific World Congr. Comput. Sci. Eng., (2015).
Clement, E. P., Udofia, G. A. & Enang, E. I. Sample design for domain calibration estimators. Int. J. Probab. Stat. 3 (1), 8–14 (2014).
Kadilar, C. & Cingi, H. Ratio estimators in simple random sampling. Appl. Math. Comput. 151 (3), 893–902 (2004).
Funding
The authors received no specific funding for this study.
Author information
Authors and Affiliations
Contributions
All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix: table of notations and symbols
Appendix: table of notations and symbols
To aid the reader’s understanding of the mathematical formulations presented in the paper, we provide below a comprehensive table summarizing the notations and symbols used throughout the manuscript. Each symbol is defined along with its corresponding description and, where applicable, its role within the proposed methodology.
\(\:\stackrel{-}{y}\) and \(\:\stackrel{-}{x}\) | The sample means under study and auxiliary variable. |
Cx and Cy | The coefficients of deviation for the study and auxiliary variable. |
ρ | The correlation coefficient among current and past information. |
\(\:\stackrel{-}{{y}_{t}}\) | Considered as average of sample at time t. |
λ | The weight or smoothing constant of the data values. |
\(\:\stackrel{-}{Y}\) and \(\:{{S}^{2}}_{y}\) | The average and variance of study variable y. |
\(\:{\stackrel{-}{y}}_{st}\) and \(\:{\stackrel{-}{x}}_{st}\) | Are the data’s sample averages in stratified random sampling structure at t. |
\(\:\stackrel{-}{X}\) | Represents the average of population of the auxiliary data. |
\(\:{W}_{h}=\frac{{N}_{h}}{N}\) | The weight of each stratum. |
N and Nh | Are the number of population units and the number of observations in each stratum. |
\(\:{\stackrel{-}{y}}_{h}=\frac{\sum\:_{i}^{{n}_{h}}{y}_{hi}}{{n}_{h}}\) | As the sample average of study variable in hth stratum. |
\(\:{\stackrel{-}{x}}_{h}\) | Is auxiliary variable’s sample average in hth stratum. |
\(\:{\stackrel{-}{X}}_{h}\) | Described as the population average of ancillary information for hth stratum. |
\(\:var\left({\stackrel{-}{x}}_{h}\right)=\frac{{N}_{h}-{n}_{h}}{{N}_{h}{n}_{h}}.\:\:{{s}^{2}}_{hx}\) | As the sample variance of auxiliary variable in hth stratum. |
\(\:\text{v}ar\left({\stackrel{-}{y}}_{h}\right)=\frac{{N}_{h}-{n}_{h}}{{N}_{h}{n}_{h}}.\:\:{{s}^{2}}_{hy}\) | Is the sample variance of study variable in hth stratum. |
\(\:{\mu\:}_{xh}\) | Is considered as mean of population of assisting variable in each stratum. |
\(\:{R}_{h}=\frac{{\stackrel{-}{Y}}_{h}}{{\stackrel{-}{X}}_{h}}\) | The population ratio of means in kth stratum. |
\(\:\theta\:=\left(\frac{1}{n}-\frac{1}{N}\right)\:,\:\phi\:=\frac{{\lambda\:}_{1}^{2}+{\lambda\:}_{2}^{2}-2a{\lambda\:}_{1}{\lambda\:}_{2}}{2\left({\lambda\:}_{1}-{\lambda\:}_{2}\right)-{\left({\lambda\:}_{1}{-\lambda\:}_{2}\right)}^{2}}\:\text{a}\text{n}\text{d}\:\gamma\:=\frac{{\left(1-{\lambda\:}_{1}\right)}^{2}-(1-{\left(1-{\lambda\:}_{1}\right)}^{2t})}{1-{\left(1-{\lambda\:}_{1}\right)}^{2}}\) \(\:+\frac{{\left(1-{\lambda\:}_{2}\right)}^{2}-(1-{\left(1-{\lambda\:}_{2}\right)}^{2t})}{1-{\left(1-{\lambda\:}_{2}\right)}^{2}}-\frac{2(1-{\lambda\:}_{1})(1-{\lambda\:}_{2})(1-{\left(1-{\lambda\:}_{1}\right)}^{t}{\left(1-{\lambda\:}_{2}\right)}^{t})}{1-(1-{\lambda\:}_{1})(1-{\lambda\:}_{2})}.\) | |
\(\:var\left({\stackrel{-}{y}}_{s}\right)=\sum\:_{h=1}^{L}{W}_{h}^{2}{\theta\:}_{h}{S}_{yh}^{2},\:var\left({\stackrel{-}{x}}_{s}\right)=\sum\:_{h=1}^{L}{W}_{h}^{2}{\theta\:}_{h}{S}_{xh}^{2}\) \(\text{a}\text{n}\text{d}\:\text{c}ov\left({\stackrel{-}{y}}_{s},\:{\stackrel{-}{x}}_{s}\right)=\sum\:_{h=1}^{L}{W}_{h}^{2}{\theta\:}_{h}{S}_{yxh}^{2}.\:\) |
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Minhas, K.S., Jabeen, R., Zaka, A. et al. Innovative memory-type calibration estimators for better survey accuracy in stratified sampling. Sci Rep 15, 34470 (2025). https://doi.org/10.1038/s41598-025-17917-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-17917-y